Skip to main content

Introduction

Cognee is an open source tool and platform that transforms your raw data into intelligent, searchable memory. It combines vector search with graph databases to make your data both searchable by meaning and connected by relationships.
Dual storage architecture gives you both semantic search and structural reasoning
Modular design composes Tasks, Pipelines, and DataPoints
Main operations handle the complete workflow from ingestion to search: add, cognify, memify, search.

Table of Contents

Cognee uses three complementary storage systems, each playing a different role:
  • Relational store — Tracks documents, chunks, and provenance (where data came from and how it’s linked)
  • Vector store — Holds embeddings for semantic similarity (numerical representations that find conceptually related content)
  • Graph store — Captures entities and relationships in a knowledge graph (nodes and edges that show connections between concepts)
This architecture makes your data both searchable (via vectors) and connected (via graphs). Cognee ships with lightweight defaults that run locally, and you can swap in production-ready backends when needed.For detailed information about the storage architecture, see Architecture.
Cognee’s processing system is built from three fundamental components:
  • DataPoints — Structured data units that become graph nodes, carrying both content and metadata for indexing
  • Tasks — Individual processing units that transform data, from text analysis to relationship extraction
  • Pipelines — Orchestration of Tasks into coordinated workflows, like assembly lines for data transformation
These building blocks work together to create a flexible system where you can:
  • Use built-in Tasks for common operations
  • Create custom Tasks for domain-specific logic by extending DataPoints
  • Compose Tasks into Pipelines that match your workflow
Cognee provides four main operations that users interact with:
  • Add — Ingest and prepare data for processing, handling various file formats and data sources
  • Cognify — Create knowledge graphs from processed data through cognitive processing and entity extraction
  • Memify — Optional semantic enrichment of the graph for enhanced understanding (coming soon)
  • Search — Query and retrieve information using semantic similarity, graph traversal, or hybrid approaches
Note: Search works great with just the basic Add → Cognify → Search workflow. Memify is an optional enhancement that will provide additional semantic enrichment when available.
Beyond the core workflow, Cognee offers advanced features for sophisticated knowledge management:
  • Node Sets — Tagging and organization system that helps categorize and filter your knowledge base content
  • Ontologies — External knowledge grounding through RDF/XML ontologies that connect your data to established knowledge structures
These concepts extend Cognee’s capabilities for:
  • Organization — Managing growing knowledge bases with systematic tagging
  • Knowledge grounding — Connecting your data to external, validated knowledge sources
  • Domain expertise — Leveraging existing ontologies for specialized fields like medicine, finance, or research

Next steps

A good way to learn Cognee is to start with its architecture, move on to building blocks, practice the main operations, and finally explore advanced features.
I