Skip to main content
Cognee is designed for flexible deployment across development and production environments, with configurable data storage backends that scale with your needs.

Data Storage Architecture

Cognee operates on a three-tier data storage model, each optimized for specific data types and query patterns:

Graph Database

Relationships & EntitiesStores knowledge graph structure, entity relationships, and semantic connections.

Vector Database

Embeddings & SearchHandles semantic embeddings for similarity search and content retrieval.

Relational Database

Metadata & StateManages datasets, user permissions, pipeline state, and operational data.
Each storage layer can be deployed as managed services, self-hosted servers, or file-based systems (like S3 buckets), giving you complete flexibility over your infrastructure.

Deployment Options

Choose the deployment strategy that matches your requirements:
  • Development
  • Production
  • Hybrid
Local & Testing
  • Docker: Containerized local deployment with embedded databases
  • MCP: Direct integration with code editors and IDEs
  • File-based: SQLite, local files, and embedded vector stores

Storage Configuration Examples

Local Development

Embedded & File-based
# All data stored locally
GRAPH_DATABASE=networkx
VECTOR_DATABASE=lancedb
RELATIONAL_DATABASE=sqlite://./cognee.db
Multi-Agent Limitation: Default Kuzu graph store uses file-based locking and is not suitable for concurrent access from multiple agents. Use Neo4j or FalkorDB for multi-agent deployments.
Managed Services
# Fully managed cloud services
GRAPH_DATABASE=neo4j://your-aura-instance
VECTOR_DATABASE=pinecone://your-index
RELATIONAL_DATABASE=postgresql://your-rds-instance
S3 + Managed Databases
# Vector data in S3, databases managed
VECTOR_DATABASE=s3://your-bucket/vectors/
GRAPH_DATABASE=neo4j://managed-instance
RELATIONAL_DATABASE=postgresql://rds-instance

Quick Start Guide

1

Choose Deployment

Select your deployment method based on scale and requirements
2

Configure Storage

Set up your preferred combination of graph, vector, and relational databases
3

Deploy & Test

Launch Cognee and verify connectivity to all storage backends
4

Scale

Adjust storage and compute resources based on usage patterns

Deployment Methods

Architecture Benefits

Flexible Data Tiers: Each storage layer can be independently scaled, managed, or migrated without affecting others.
Cost Optimization: Use file-based storage (S3) for archival data and managed services for active workloads.
Security: Ensure proper network security and access controls across all storage tiers in production deployments.

Need Help?

Join Our Community

Get deployment support, share configurations, and connect with other Cognee users.