Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.cognee.ai/llms.txt

Use this file to discover all available pages before exploring further.

Cognee releases with highlights and links to the full release notes on GitHub.

Working on

Changes queued for the next release. This section is updated as unreleased work is merged and is folded into a versioned release section when the release is published.

Highlights

  • Adds ChromaDB vector search support for include_payload=False, so callers can omit metadata payloads from returned ScoredResult values when they only need ids and scores.
  • Adds ChromaDB node_name filtering for search() and batch_search(), including OR and AND semantics through node_name_filter_operator.
  • Prevents Entity and EntityType node id collisions by namespacing generated ids by node category.
  • Improves ontology parsing for file-like inputs with filename/content-type detection, RDFLib fallback formats, and clearer initialization errors when parsing fails.

Notes

  • These changes are unreleased and should not be added to v1.1.2 release notes.
  • The ontology parser update improves file-like parsing behavior; upload endpoint format restrictions should be documented separately if they change.

v1.1.2

View on GitHub Patch release with a refreshed public frontend, improved Cloud UI workflows, and a Postgres graph adapter compatibility fix for asyncpg/PostgreSQL 16.

Highlights

  • Syncs the public frontend with the SaaS application, bringing updated dashboard, search, dataset, connection, onboarding, knowledge graph, and graph model editor experiences.
  • Adds conversation-based search history and refreshed multi-dataset search flows in the frontend.
  • Improves connection and onboarding flows with a connection modal, step-by-step agent setup guidance, new quickstart assets, and updated loading visuals.
  • Adds memory customization UI support for datasets, including graph models, custom prompts, and ontology-related configuration.
  • Fixes Postgres graph neighborhood expansion under asyncpg/PostgreSQL 16 by casting recursive CTE seed parameters to text[].

Notable Changes

  • Bumps the package version from 1.1.1 to 1.1.2 and refreshes lockfiles.
  • Aligns frontend API routes and local development behavior with the OSS backend.
  • Updates API key, tenant, configuration, dataset, ingestion, ontology, search-history, session, analytics, and user frontend modules.
  • Adds frontend assets for quickstarts, agent integrations, loading states, and graph previews.
  • Adds regression coverage for Postgres graph neighborhood seed array typing and retries a flaky usage-logger e2e path in CI.

Fixes and Improvements

  • Postgres neighborhood query parameter typing: The Postgres graph adapter’s get_neighborhood query now casts the seed parameter to text[] (unnest(CAST(:seeds AS text[]))) in its recursive CTE seed row. Deployments using GRAPH_DATABASE_PROVIDER=postgres with asyncpg/PostgreSQL 16 should no longer hit parameter type inference errors when expanding neighbors from seed node ids.
  • Cloud UI refresh: Dashboard, dataset, dataset detail, connections, search, onboarding, knowledge graph, and graph model editor screens were refreshed and aligned with current Cloud workflows.
  • Search and dataset workflows: Search now supports conversation history and multi-dataset recall flows, while dataset pages add improved status polling, graph access, and memory customization entry points.
  • Connect Agent flow: The frontend adds clearer connection setup prompts, modal-based setup guidance, and integration visual assets.
  • Frontend resilience: Error handling, loading states, analytics logging, tenant context, user configuration, and local fetch behavior were updated across the public frontend.

v1.1.1

View on GitHub Patch release that promotes accumulated dev work after v1.1.1.dev0, with agent-management APIs, graph visualization updates, custom graph-model support in remember, and backend stability fixes.

Highlights

  • Adds agent management and connection endpoints for listing, creating, inspecting, registering, unregistering, and deleting agents and their active connections.
  • Reworks graph visualization with a pipeline-aware Story layout, Schema view, improved labels, legends, and modular visualization components.
  • Adds graph_model support to the remember REST endpoint, letting API callers pass a JSON-serialized graph schema into ingestion.
  • Expands graph and retrieval behavior with local Neo4j dataset handling, global context graph bucketing, improved edge text, and node_name filtering for chunk retrieval.
  • Improves LLM, PGVector, remember/session, prune, forget, and graph-projection error handling.

Notable Changes

  • Bumps the package version from 1.1.0 to 1.1.1 and refreshes the release lockfiles.
  • Splits agent lifecycle and connection handling into dedicated modules and API routes, including persisted agent connection state and agent-session names.
  • Adds SDK/API support for retrieving specific agent configuration and for inspecting current agent connections.
  • Adds local Neo4j dataset database handling and updates graph database selection to recognize that handler.
  • Reworks global context index internals with graph bucketing, scoring, build, update, load, summarize, and persistence flows.
  • Improves edge indexing and rendering by preserving natural edge descriptions, generating fallback edge text from metadata, and rendering relationship labels inside edge markup.
  • Updates CI and test coverage across database adapters, agents, visualization, global context indexing, retrieval filters, and LLM configuration.

Fixes and Improvements

  • Remember custom graph models: The remember REST endpoint now accepts an optional graph_model form field, parses the JSON schema into a graph model, and forwards it into the ingestion flow.
  • Agent lifecycle and connections: Agent endpoints now separate agent resources from agent connections, support agent-session names, persist connection metadata, mark unregistering agents inactive, and expose connection detail.
  • Graph visualization: Story view spacing, column pinning, schema rendering, edge-label rendering, and fallback labeling were improved so generated graph views are easier to inspect.
  • Graph ingestion and retrieval: Edges with unprojectable endpoints are skipped instead of failing graph projection, KnowledgeGraph subclasses follow the knowledge-graph integration path, chunk retrieval receives node_name filters, and forget can handle dataset values that are string UUIDs.
  • PGVector metadata consistency: create_collection now reflects SQLAlchemy metadata only after the table-creation transaction commits, avoiding stale metadata entries when table creation rolls back.
  • LLM adapters: Generic LLM API transcription and Ollama image transcription now raise clear ValueError messages for empty responses, Mistral guards against None messages before reading content, and OpenAI instructor mode is honored.
  • Session remember routing: remember(session_id=...) now routes through the JSON /entry endpoint in API mode, and using custom_prompt with session_id raises a clear ValueError.
  • Operational stability: Prune errors and dataset lookup issues are handled more defensively, brittle batch-query test settings were adjusted, and optional LLM configuration can be passed through CI.

v1.1.0.dev1

View on GitHub Developer preview release on the way to v1.1.0dev1. This release includes API, retrieval, permissions, storage-runtime, and backend consistency changes.

Highlights

  • Adds database subprocess workers for LanceDB and Kuzu so native database work can run outside the main Cognee process. The wheel now includes the cognee_db_workers package.
  • Exposes more ingestion controls through the public API and remote client paths, including chunk sizing and background execution options for remember() and cognify().
  • Adds dataset_ids support to recall(), making shared-dataset retrieval more reliable when dataset names are not owned by the calling user.
  • Expands permission management with DELETE endpoints for dataset permissions, roles, and user-role membership.
  • Improves session visibility so parent users can see sessions created by child-agent users where appropriate.

Notable Changes

  • Adds graph_database_subprocess_enabled and vector_db_subprocess_enabled configuration, plus Kuzu tuning variables for threads, buffer pool size, and max DB size.
  • Keeps belongs_to_set metadata consistent across dataset deletion and shared-node/vector upserts in LanceDB, PGVector, and Neo4j paths.
  • Adds include_payload behavior to Neptune Analytics vector search.
  • Improves Postgres hybrid batching by respecting embedding-engine batch size.
  • Improves infer-schema text sampling and prompting.
  • Rewrites the examples README into a fuller index and adds performance-testing support with Locust.
  • Deprecates .env.example as the canonical template in favor of .env.template.
  • Bumps the package version from 1.0.9 to 1.1.0.dev1 and refreshes lockfiles.

v1.0.3

View on GitHub Patch release with bug fixes and stability improvements on top of v1.0.2.

Highlights

  • Promotes accumulated dev work to main for the v1.0.3 release
  • Adds session lifecycle APIs, unified memory/session handling, and dashboard support
  • Introduces dataset queueing for async context management and ingestion flows
  • Ships new relational migrations, including session lifecycle tables and parent_user_id
  • Expands recall/remember and cloud routing behavior, plus frontend onboarding and Connect Agent updates

Notable Changes

  • Added session endpoints, metrics, and supporting persistence work
  • Added dataset queue infrastructure and follow-up fixes for background processing
  • Added database migrations for new tables and user/dataset ownership handling
  • Updated recall, remember, improve, and search-related API behavior
  • Added frontend work for Connect Agent, dashboard/activity views, API keys, and onboarding
  • Included guide updates, workflow/tooling changes, and dependency updates such as litellm and onnxruntime

Bug Fixes

  • PostgreSQL null-byte compatibility: Embedded null bytes (\x00) in node or edge string fields no longer cause errors when using PostgreSQL as the relational backend. Null bytes are now automatically stripped from all string values (including nested attributes) before writes to the relational store. This sanitization is transparent — affected strings are silently cleaned rather than rejected.
  • Fixed duplicate graph nodes caused by DataPoint.id being regenerated during graph construction. The original id is now preserved when converting DataPoint instances into graph nodes, ensuring node identity is stable across graph extraction passes.

v1.0.2

View on GitHub Patch release with bug fixes and stability improvements on top of v1.0.1.

Bug Fixes

  • LanceDB schema migration: “contained null values” errors (raised when old rows lack a field required by a newer DataPoint schema) are now treated as recoverable schema drift. The affected table is automatically rebuilt from the current schema instead of raising a hard failure.
  • cognee-mcp Docker image build: Added missing build-essential and libpq-dev system packages to the builder stage so that cognee[postgres] can compile psycopg2 from source on Linux.

Dependency Updates

  • Bumped llama-index-core requirement from >=0.13.0,<0.14 to >=0.14.20,<0.15 for the llama-index extra.
  • Pinned nltk>=3.9.3,<4 explicitly in the docs extra to satisfy unstructured’s dependency until unstructured v0.21.0.

v1.0.1

View on GitHub Patch release with bug fixes on top of v1.0.0.

v1.0.0

View on GitHub

Highlights

  • New high-level API: remember, recall, improve, and forget cover the full memory lifecycle in four operations
  • Session-aware memory via session_id — short-term context that can be promoted into the permanent graph
  • Unified recall replaces the previous search call with automatic retrieval strategy selection
  • Legacy operations (add, cognify, search, memify) remain available as lower-level building blocks

New Features

  • cognee.remember(data, session_id=...) — ingest and graph in one call; supports permanent or session memory
  • cognee.recall(query, session_id=...) — query across both the permanent graph and session cache
  • cognee.improve(...) — enrich an existing graph with feedback-based weighting and session promotion
  • cognee.forget(dataset=..., session_id=...) — delete data, datasets, or full session memory

v0.5.4.dev1

Released: March 5, 2026
View on GitHub

Highlights

  • Developer preview release focused on quality, performance, and developer ergonomics
  • Faster ingestion and sync
  • Improved search relevance and new filtering options
  • Stability fixes for memory creation, deletion, and CLI workflows
  • Internal refactoring and dependency upgrades

New Features

  • Bulk import CLI for faster batched ingestion
  • Search filters for tags and date ranges
  • Optional per-collection ingestion throttling

Improvements

  • Lower latency for ingestion and sync
  • Better search ranking
  • More robust deletion and duplicate handling
  • Clearer CLI messages and debug logs

Bug Fixes

  • Fixed duplicate memories under concurrent ingestion
  • Fixed partial state after deletion
  • Fixed CLI export formatting issues
  • Fixed intermittent retrieval failures under load

v0.5.3

Released: February 27, 2026
View on GitHub

Highlights

  • New graph visualization improvements
  • Expanded permissions and user management work
  • SessionManager and cache/session persistence work
  • Search and graph retrieval improvements
  • Multiple stability and CI/CD fixes

Notable Changes

  • Added role-based permission checks and permission endpoints
  • Added graph visualization updates, including note set coloring
  • Added return type hints to API functions
  • Added chunk associations for the memify pipeline
  • Added vector filtering based on node sets
  • Fixed delete flow bugs, health check issues, MCP issues, and several config/integration issues

v0.5.3.dev1

Released: February 20, 2026
View on GitHub

Highlights

  • Added vector filtering based on node sets
  • Added principal Cognee configuration
  • Fixed health check issues
  • Fixed FalkorDB adapter port bug
  • Fixed Ollama image ingestion argument issue

Notes

  • Includes a small set of targeted fixes and feature work on top of v0.5.3.dev0
  • Introduced one new contributor in this release