Data Storage Architecture
Cognee operates on a three-tier data storage model, with each tier optimized for specific data types and query patterns:
Graph Database
Relationships & Entities
Stores knowledge graph structure, entity relationships, and semantic connections.
Vector Database
Embeddings & Search
Handles semantic embeddings for similarity search and content retrieval.
Relational Database
Metadata & State
Manages datasets, user permissions, pipeline state, and operational data.
Each storage layer can be deployed as a managed service, a self-hosted server, or a file-based system (such as an S3 bucket), so you can mix backends to fit your infrastructure.
Deployment Options
Choose the deployment strategy that matches your requirements:
- Development
- Production
- Hybrid
Local & Testing
- Docker: Containerized local deployment with embedded databases
- MCP: Direct integration with code editors and IDEs
- File-based: SQLite, local files, and embedded vector stores
Storage Configuration Examples
Local Development
Embedded & File-based
Multi-Agent Limitation: The default Kuzu graph store uses file-based locking and is not suitable for concurrent access from multiple agents. Use Neo4j or FalkorDB for multi-agent deployments.
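The limitation above can be expressed as a small pre-flight check. This is a minimal sketch and not part of Cognee's API: `StorageProfile`, `FILE_LOCKED_GRAPH_STORES`, and `check_multi_agent` are hypothetical names, and the provider strings are illustrative labels rather than confirmed configuration values.

```python
from dataclasses import dataclass

# Hypothetical set: graph stores that, per the note above, rely on
# file-based locking and should not be shared by several agents at once.
FILE_LOCKED_GRAPH_STORES = {"kuzu"}


@dataclass(frozen=True)
class StorageProfile:
    """Illustrative description of the three storage tiers (not a Cognee class)."""
    graph: str       # e.g. "kuzu", "neo4j", "falkordb"
    vector: str      # placeholder label for the vector store
    relational: str  # e.g. "sqlite" or a managed relational database


def check_multi_agent(profile: StorageProfile, agent_count: int) -> list[str]:
    """Return human-readable warnings for a profile shared by `agent_count` agents."""
    warnings = []
    if agent_count > 1 and profile.graph.lower() in FILE_LOCKED_GRAPH_STORES:
        warnings.append(
            f"Graph store '{profile.graph}' uses file-based locking; "
            "use Neo4j or FalkorDB for multi-agent deployments."
        )
    return warnings


# Two example profiles with placeholder vector-store labels.
local = StorageProfile(graph="kuzu", vector="embedded-vector-store", relational="sqlite")
shared = StorageProfile(graph="neo4j", vector="embedded-vector-store", relational="sqlite")
```

Calling `check_multi_agent(local, 2)` returns one warning, while `check_multi_agent(local, 1)` and `check_multi_agent(shared, 4)` return empty lists. A check like this could run at startup so that an unsuitable graph store is caught before agents begin writing concurrently.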
Cloud Production
Managed Services
Hybrid S3
S3 + Managed Databases
Quick Start Guide
1. Choose Deployment: Select your deployment method based on scale and requirements.
2. Configure Storage: Set up your preferred combination of graph, vector, and relational databases.
3. Deploy & Test: Launch Cognee and verify connectivity to all storage backends.
4. Scale: Adjust storage and compute resources based on usage patterns.
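For the connectivity check in step 3, one option is a plain TCP reachability probe against each server-based backend. The sketch below uses only the Python standard library and is not a Cognee utility: `can_connect` and `check_backends` are hypothetical helpers, and the hosts and ports in the usage section are placeholders to replace with your own endpoints.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


def check_backends(backends: dict[str, tuple[str, int]]) -> dict[str, bool]:
    """Map each backend name to whether its endpoint accepted a TCP connection."""
    return {name: can_connect(host, port) for name, (host, port) in backends.items()}


if __name__ == "__main__":
    # Placeholder endpoints (assumptions): substitute your real hosts and ports.
    endpoints = {
        "graph": ("localhost", 7687),
        "vector": ("localhost", 6333),
        "relational": ("localhost", 5432),
    }
    for name, reachable in check_backends(endpoints).items():
        print(f"{name}: {'reachable' if reachable else 'unreachable'}")
```

This only confirms that a port accepts connections; it does not validate credentials or schema, and file-based backends have no port to probe. A successful run of a small Cognee pipeline remains the more complete test.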
Deployment Methods
Modal Deployment
Serverless & Auto-scaling
Perfect for variable workloads with automatic resource management.
Kubernetes (Helm)
Enterprise & Production
Container orchestration with full control and high availability.
EC2 Deployment
Traditional Cloud
Standard server deployment with custom configurations.
Architecture Benefits
Flexible Data Tiers: Each storage layer can be independently scaled, managed, or migrated without affecting others.
Cost Optimization: Use file-based storage (S3) for archival data and managed services for active workloads.
Security: Ensure proper network security and access controls across all storage tiers in production deployments.
Need Help?
Join Our Community
Get deployment support, share configurations, and connect with other Cognee users.