Cognee releases with highlights and links to the full release notes on GitHub.
Working on
Changes queued for the next release. This section is updated as unreleased work is merged and is folded into a versioned release section when the release is published.
Highlights
- Adds ChromaDB vector search support for `include_payload=False`, so callers can omit metadata payloads from returned `ScoredResult` values when they only need ids and scores.
- Adds ChromaDB `node_name` filtering for `search()` and `batch_search()`, including `OR` and `AND` semantics through `node_name_filter_operator`.
- Prevents Entity and EntityType node id collisions by namespacing generated ids by node category.
- Improves ontology parsing for file-like inputs with filename/content-type detection, RDFLib fallback formats, and clearer initialization errors when parsing fails.
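
The id-namespacing change above can be illustrated with a minimal sketch. This is not Cognee's actual implementation: the helper name `namespaced_node_id`, the `category:name` key format, and the choice of `uuid.NAMESPACE_OID` are assumptions made only to show why prefixing the node category keeps an Entity and an EntityType with the same name from sharing an id.

```python
import uuid


def namespaced_node_id(category: str, name: str) -> uuid.UUID:
    """Derive a deterministic id from the node category plus the node name.

    Hypothetical helper: the key format and namespace are illustrative
    assumptions, not Cognee's real id scheme.
    """
    # Normalize the name so " Person " and "person" map to the same id.
    key = f"{category}:{name.strip().lower()}"
    return uuid.uuid5(uuid.NAMESPACE_OID, key)


# The same name under two categories yields two distinct ids.
entity_id = namespaced_node_id("Entity", "person")
entity_type_id = namespaced_node_id("EntityType", "person")
```

Because the category is part of the hashed key, the ids stay deterministic across runs while no longer colliding across categories.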
Notes
- These changes are unreleased and should not be added to `v1.1.2` release notes.
- The ontology parser update improves file-like parsing behavior; upload endpoint format restrictions should be documented separately if they change.
v1.1.2
View on GitHub
Patch release with a refreshed public frontend, improved Cloud UI workflows, and a Postgres graph adapter compatibility fix for asyncpg/PostgreSQL 16.
Highlights
- Syncs the public frontend with the SaaS application, bringing updated dashboard, search, dataset, connection, onboarding, knowledge graph, and graph model editor experiences.
- Adds conversation-based search history and refreshed multi-dataset search flows in the frontend.
- Improves connection and onboarding flows with a connection modal, step-by-step agent setup guidance, new quickstart assets, and updated loading visuals.
- Adds memory customization UI support for datasets, including graph models, custom prompts, and ontology-related configuration.
- Fixes Postgres graph neighborhood expansion under asyncpg/PostgreSQL 16 by casting recursive CTE seed parameters to `text[]`.
Notable Changes
- Bumps the package version from `1.1.1` to `1.1.2` and refreshes lockfiles.
- Aligns frontend API routes and local development behavior with the OSS backend.
- Updates API key, tenant, configuration, dataset, ingestion, ontology, search-history, session, analytics, and user frontend modules.
- Adds frontend assets for quickstarts, agent integrations, loading states, and graph previews.
- Adds regression coverage for Postgres graph neighborhood seed array typing and retries a flaky usage-logger e2e path in CI.
Fixes and Improvements
- Postgres neighborhood query parameter typing: The Postgres graph adapter’s `get_neighborhood` query now casts the seed parameter to `text[]` (`unnest(CAST(:seeds AS text[]))`) in its recursive CTE seed row. Deployments using `GRAPH_DATABASE_PROVIDER=postgres` with asyncpg/PostgreSQL 16 should no longer hit parameter type inference errors when expanding neighbors from seed node ids.
- Cloud UI refresh: Dashboard, dataset, dataset detail, connections, search, onboarding, knowledge graph, and graph model editor screens were refreshed and aligned with current Cloud workflows.
- Search and dataset workflows: Search now supports conversation history and multi-dataset recall flows, while dataset pages add improved status polling, graph access, and memory customization entry points.
- Connect Agent flow: The frontend adds clearer connection setup prompts, modal-based setup guidance, and integration visual assets.
- Frontend resilience: Error handling, loading states, analytics logging, tenant context, user configuration, and local fetch behavior were updated across the public frontend.
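
The seed-typing fix can be sketched as a recursive CTE whose seed row unnests an explicitly typed array parameter. Only the `unnest(CAST(:seeds AS text[]))` expression comes from the notes above; the table name `graph_edge`, its columns, the `:max_depth` parameter, and the helper `build_params` are hypothetical and do not reflect the adapter's real schema.

```python
# Illustrative query only: table and column names are assumptions.
# The explicit CAST gives the driver a concrete type for the bound list,
# so the server does not have to infer it.
NEIGHBORHOOD_QUERY = """
WITH RECURSIVE neighborhood(node_id, depth) AS (
    -- Seed row: one row per seed id, with the parameter typed as text[].
    SELECT seed, 0
    FROM unnest(CAST(:seeds AS text[])) AS seed
    UNION
    -- Recursive step: follow outgoing edges up to max_depth hops.
    SELECT e.target_id, n.depth + 1
    FROM neighborhood AS n
    JOIN graph_edge AS e ON e.source_id = n.node_id
    WHERE n.depth < :max_depth
)
SELECT DISTINCT node_id FROM neighborhood
"""


def build_params(seed_ids, max_depth=1):
    """Build the bind parameters; seeds are passed as a plain list of strings."""
    return {"seeds": [str(s) for s in seed_ids], "max_depth": max_depth}
```

With SQLAlchemy, a query like this would typically be wrapped in `text(...)` and executed with the dictionary returned by `build_params`.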
v1.1.1
View on GitHub
Patch release that promotes accumulated `dev` work after `v1.1.1.dev0`, with agent-management APIs, graph visualization updates, custom graph-model support in `remember`, and backend stability fixes.
Highlights
- Adds agent management and connection endpoints for listing, creating, inspecting, registering, unregistering, and deleting agents and their active connections.
- Reworks graph visualization with a pipeline-aware Story layout, Schema view, improved labels, legends, and modular visualization components.
- Adds `graph_model` support to the `remember` REST endpoint, letting API callers pass a JSON-serialized graph schema into ingestion.
- Expands graph and retrieval behavior with local Neo4j dataset handling, global context graph bucketing, improved edge text, and `node_name` filtering for chunk retrieval.
- Improves LLM, PGVector, remember/session, prune, forget, and graph-projection error handling.
Notable Changes
- Bumps the package version from `1.1.0` to `1.1.1` and refreshes the release lockfiles.
- Splits agent lifecycle and connection handling into dedicated modules and API routes, including persisted agent connection state and agent-session names.
- Adds SDK/API support for retrieving specific agent configuration and for inspecting current agent connections.
- Adds local Neo4j dataset database handling and updates graph database selection to recognize that handler.
- Reworks global context index internals with graph bucketing, scoring, build, update, load, summarize, and persistence flows.
- Improves edge indexing and rendering by preserving natural edge descriptions, generating fallback edge text from metadata, and rendering relationship labels inside edge markup.
- Updates CI and test coverage across database adapters, agents, visualization, global context indexing, retrieval filters, and LLM configuration.
Fixes and Improvements
- Remember custom graph models: The `remember` REST endpoint now accepts an optional `graph_model` form field, parses the JSON schema into a graph model, and forwards it into the ingestion flow.
- Agent lifecycle and connections: Agent endpoints now separate agent resources from agent connections, support agent-session names, persist connection metadata, mark unregistering agents inactive, and expose connection detail.
- Graph visualization: Story view spacing, column pinning, schema rendering, edge-label rendering, and fallback labeling were improved so generated graph views are easier to inspect.
- Graph ingestion and retrieval: Edges with unprojectable endpoints are skipped instead of failing graph projection, `KnowledgeGraph` subclasses follow the knowledge-graph integration path, chunk retrieval receives `node_name` filters, and `forget` can handle dataset values that are string UUIDs.
- PGVector metadata consistency: `create_collection` now reflects SQLAlchemy metadata only after the table-creation transaction commits, avoiding stale metadata entries when table creation rolls back.
- LLM adapters: Generic LLM API transcription and Ollama image transcription now raise clear `ValueError` messages for empty responses, Mistral guards against `None` messages before reading content, and OpenAI instructor mode is honored.
- Session remember routing: `remember(session_id=...)` now routes through the JSON `/entry` endpoint in API mode, and using `custom_prompt` with `session_id` raises a clear `ValueError`.
- Operational stability: Prune errors and dataset lookup issues are handled more defensively, brittle batch-query test settings were adjusted, and optional LLM configuration can be passed through CI.
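
As a rough illustration of the `graph_model` form field, the sketch below serializes a schema to JSON before placing it in a form payload. Only the field name `graph_model` and the fact that it carries a JSON-serialized schema come from the notes above; the `data` field name, the helper `build_remember_form`, and the JSON-Schema-like shape of the example schema are assumptions, so check the API reference for the real request format.

```python
import json


def build_remember_form(text, graph_model_schema=None):
    """Assemble form fields for a remember request (hypothetical helper).

    The "data" key is an illustrative assumption; "graph_model" is optional
    and is sent as a JSON string rather than a nested object.
    """
    form = {"data": text}
    if graph_model_schema is not None:
        form["graph_model"] = json.dumps(graph_model_schema)
    return form


# Hypothetical schema shape, shown only to have something to serialize.
example_schema = {
    "title": "Person",
    "type": "object",
    "properties": {"name": {"type": "string"}},
}

form_fields = build_remember_form("Alice leads the data team.", example_schema)
```

Omitting the schema leaves the form without a `graph_model` key, which matches the field being optional.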
v1.1.0.dev1
View on GitHub
Developer preview release on the way to `v1.1.0`. This release includes API, retrieval, permissions, storage-runtime, and backend consistency changes.
Highlights
- Adds database subprocess workers for LanceDB and Kuzu so native database work can run outside the main Cognee process. The wheel now includes the `cognee_db_workers` package.
- Exposes more ingestion controls through the public API and remote client paths, including chunk sizing and background execution options for `remember()` and `cognify()`.
- Adds `dataset_ids` support to `recall()`, making shared-dataset retrieval more reliable when dataset names are not owned by the calling user.
- Expands permission management with DELETE endpoints for dataset permissions, roles, and user-role membership.
- Improves session visibility so parent users can see sessions created by child-agent users where appropriate.
Notable Changes
- Adds `graph_database_subprocess_enabled` and `vector_db_subprocess_enabled` configuration, plus Kuzu tuning variables for threads, buffer pool size, and max DB size.
- Keeps `belongs_to_set` metadata consistent across dataset deletion and shared-node/vector upserts in LanceDB, PGVector, and Neo4j paths.
- Adds `include_payload` behavior to Neptune Analytics vector search.
- Improves Postgres hybrid batching by respecting embedding-engine batch size.
- Improves infer-schema text sampling and prompting.
- Rewrites the examples README into a fuller index and adds performance-testing support with Locust.
- Deprecates `.env.example` as the canonical template in favor of `.env.template`.
- Bumps the package version from `1.0.9` to `1.1.0.dev1` and refreshes lockfiles.
v1.0.3
View on GitHub
Patch release with bug fixes and stability improvements on top of v1.0.2.
Highlights
- Promotes accumulated `dev` work to `main` for the `v1.0.3` release
- Adds session lifecycle APIs, unified memory/session handling, and dashboard support
- Introduces dataset queueing for async context management and ingestion flows
- Ships new relational migrations, including session lifecycle tables and `parent_user_id`
- Expands recall/remember and cloud routing behavior, plus frontend onboarding and Connect Agent updates
Notable Changes
- Added session endpoints, metrics, and supporting persistence work
- Added dataset queue infrastructure and follow-up fixes for background processing
- Added database migrations for new tables and user/dataset ownership handling
- Updated recall, remember, improve, and search-related API behavior
- Added frontend work for Connect Agent, dashboard/activity views, API keys, and onboarding
- Included guide updates, workflow/tooling changes, and dependency updates such as `litellm` and `onnxruntime`
Bug Fixes
- PostgreSQL null-byte compatibility: Embedded null bytes (`\x00`) in node or edge string fields no longer cause errors when using PostgreSQL as the relational backend. Null bytes are now automatically stripped from all string values (including nested attributes) before writes to the relational store. This sanitization is transparent: affected strings are silently cleaned rather than rejected.
- Fixed duplicate graph nodes caused by `DataPoint.id` being regenerated during graph construction. The original `id` is now preserved when converting DataPoint instances into graph nodes, ensuring node identity is stable across graph extraction passes.
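
The null-byte sanitization described above can be approximated with a small recursive helper. This is a sketch of the general technique, not Cognee's code: the function name `strip_null_bytes` and the decision to recurse into dicts, lists, and tuples are assumptions about what "nested attributes" covers.

```python
def strip_null_bytes(value):
    """Return a copy of value with "\x00" removed from every string inside it.

    Hypothetical helper: recurses into dicts, lists, and tuples and leaves
    every other type (numbers, None, and so on) untouched.
    """
    if isinstance(value, str):
        return value.replace("\x00", "")
    if isinstance(value, dict):
        return {key: strip_null_bytes(item) for key, item in value.items()}
    if isinstance(value, (list, tuple)):
        # Rebuild the same container type with cleaned members.
        return type(value)(strip_null_bytes(item) for item in value)
    return value


node = {
    "name": "Ali\x00ce",
    "attributes": {"tags": ["a\x00", "b"], "score": 3},
}
clean_node = strip_null_bytes(node)
```

The input is not mutated, so the original payload is still available for logging or comparison.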
v1.0.2
View on GitHub
Patch release with bug fixes and stability improvements on top of v1.0.1.
Bug Fixes
- LanceDB schema migration: “contained null values” errors (raised when old rows lack a field required by a newer DataPoint schema) are now treated as recoverable schema drift. The affected table is automatically rebuilt from the current schema instead of raising a hard failure.
- cognee-mcp Docker image build: Added missing `build-essential` and `libpq-dev` system packages to the builder stage so that `cognee[postgres]` can compile `psycopg2` from source on Linux.
Dependency Updates
- Bumped `llama-index-core` requirement from `>=0.13.0,<0.14` to `>=0.14.20,<0.15` for the `llama-index` extra.
- Pinned `nltk>=3.9.3,<4` explicitly in the `docs` extra to satisfy `unstructured`’s dependency until `unstructured` v0.21.0.
v1.0.1
View on GitHub
Patch release with bug fixes on top of v1.0.0.
v1.0.0
View on GitHub
Highlights
- New high-level API: `remember`, `recall`, `improve`, and `forget` cover the full memory lifecycle in four operations
- Session-aware memory via `session_id`: short-term context that can be promoted into the permanent graph
- Unified `recall` replaces the previous `search` call with automatic retrieval strategy selection
- Legacy operations (`add`, `cognify`, `search`, `memify`) remain available as lower-level building blocks
New Features
- `cognee.remember(data, session_id=...)`: ingest and graph in one call; supports permanent or session memory
- `cognee.recall(query, session_id=...)`: query across both the permanent graph and session cache
- `cognee.improve(...)`: enrich an existing graph with feedback-based weighting and session promotion
- `cognee.forget(dataset=..., session_id=...)`: delete data, datasets, or full session memory
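
The calls listed above might be combined as in the sketch below. The call shapes follow the signatures shown in these notes, but several details are assumptions: that the functions are awaitable like the rest of Cognee's async API, that `forget` accepts `session_id` on its own, and that the sample text, query, and session name (all made up here) are acceptable inputs. `improve` is left out because its arguments are not specified here.

```python
import asyncio


async def lifecycle_demo(session_id: str = "demo-session"):
    """Walk through remember -> recall -> forget for one session.

    Assumes the high-level functions are coroutines; requires cognee to be
    installed and configured (LLM keys, storage) before it can run.
    """
    # Imported lazily so this module can be loaded without cognee present.
    import cognee

    # Ingest a fact into session memory.
    await cognee.remember(
        "Alice joined the platform team in March.", session_id=session_id
    )

    # Query across the permanent graph and the session cache.
    results = await cognee.recall(
        "Which team did Alice join?", session_id=session_id
    )
    print(results)

    # Clear the session memory again (assumes session_id alone is accepted).
    await cognee.forget(session_id=session_id)
```

To try it, call `asyncio.run(lifecycle_demo())` in an environment where Cognee is installed and configured.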
v0.5.4.dev1
Released: March 5, 2026
View on GitHub
Highlights
- Developer preview release focused on quality, performance, and developer ergonomics
- Faster ingestion and sync
- Improved search relevance and new filtering options
- Stability fixes for memory creation, deletion, and CLI workflows
- Internal refactoring and dependency upgrades
New Features
- Bulk import CLI for faster batched ingestion
- Search filters for tags and date ranges
- Optional per-collection ingestion throttling
Improvements
- Lower latency for ingestion and sync
- Better search ranking
- More robust deletion and duplicate handling
- Clearer CLI messages and debug logs
Bug Fixes
- Fixed duplicate memories under concurrent ingestion
- Fixed partial state after deletion
- Fixed CLI export formatting issues
- Fixed intermittent retrieval failures under load
v0.5.3
Released: February 27, 2026
View on GitHub
Highlights
- New graph visualization improvements
- Expanded permissions and user management work
- SessionManager and cache/session persistence work
- Search and graph retrieval improvements
- Multiple stability and CI/CD fixes
Notable Changes
- Added role-based permission checks and permission endpoints
- Added graph visualization updates, including note set coloring
- Added return type hints to API functions
- Added chunk associations for the memify pipeline
- Added vector filtering based on node sets
- Fixed delete flow bugs, health check issues, MCP issues, and several config/integration issues
v0.5.3.dev1
Released: February 20, 2026
View on GitHub
Highlights
- Added vector filtering based on node sets
- Added principal Cognee configuration
- Fixed health check issues
- Fixed FalkorDB adapter port bug
- Fixed Ollama image ingestion argument issue
Notes
- Includes a small set of targeted fixes and feature work on top of `v0.5.3.dev0`
- Introduced one new contributor in this release