Datasets are the organizational unit for all data in Cognee Cloud. Each dataset maintains its own knowledge graph and vector store. See Datasets for the underlying concept.

List datasets

GET /api/v1/datasets/ — List all datasets accessible to the authenticated user.
curl https://your-tenant.aws.cognee.ai/api/v1/datasets/ \
  -H "X-Api-Key: your-key"

Create a dataset

POST /api/v1/datasets/ — Create a new dataset or return the existing one if the name already exists.
curl -X POST https://your-tenant.aws.cognee.ai/api/v1/datasets/ \
  -H "X-Api-Key: your-key" \
  -H "Content-Type: application/json" \
  -d '{"name": "my_dataset"}'
Datasets are also created implicitly when you call add or remember with a dataset_name that does not yet exist.
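For scripted use, the same create call can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the helper name `build_create_dataset_request` is hypothetical, and the tenant URL and key are the placeholders from the curl example. The function only builds the request, so it can be inspected before anything is sent.

```python
import json
import urllib.request

# Placeholders copied from the curl example; replace with real values.
BASE_URL = "https://your-tenant.aws.cognee.ai"
API_KEY = "your-key"


def build_create_dataset_request(name: str) -> urllib.request.Request:
    """Build (but do not send) the POST request that creates a dataset.

    Hypothetical helper for illustration; it mirrors the curl call above.
    """
    body = json.dumps({"name": name}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/datasets/",
        data=body,
        method="POST",
        headers={
            "X-Api-Key": API_KEY,
            "Content-Type": "application/json",
        },
    )
```

Passing the result to `urllib.request.urlopen(...)` sends it. Because the endpoint returns the existing dataset when the name is already taken, repeating the call with the same name should not create a duplicate.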

Dataset status

GET /api/v1/datasets/status — Get the processing status of all datasets. Returns the pipeline state for each dataset: whether cognify is pending, running, or completed.
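A status check could be sketched as follows. The response schema is not described here, so the second helper rests on a loud assumption: that the decoded response can be reduced to a mapping of dataset ID to a state string containing "pending", "running", or "completed". Both function names are hypothetical, and the tenant URL and key are placeholders.

```python
import urllib.request

# Placeholders; replace with real values.
BASE_URL = "https://your-tenant.aws.cognee.ai"
API_KEY = "your-key"


def build_status_request() -> urllib.request.Request:
    """Build (but do not send) the GET request for dataset status."""
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/datasets/status",
        method="GET",
        headers={"X-Api-Key": API_KEY},
    )


def unfinished_datasets(states: dict[str, str]) -> list[str]:
    """Return the sorted IDs of datasets whose state is not completed.

    ASSUMPTION: `states` maps dataset ID -> state string. The real
    response shape and exact state values may differ, so adapt the
    parsing before relying on this.
    """
    return sorted(
        dataset_id
        for dataset_id, state in states.items()
        if "completed" not in state.lower()
    )
```

A polling loop would send the request with `urllib.request.urlopen(...)`, convert the body into such a mapping, and repeat until `unfinished_datasets(...)` returns an empty list.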

Dataset data

GET /api/v1/datasets/{dataset_id}/data — List all data items in a dataset.
GET /api/v1/datasets/{dataset_id}/data/{data_id}/raw — Download the original file for a specific data item.
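The two data paths differ only in their suffix, so a small sketch of URL builders may help when scripting against them. The function names are hypothetical and the tenant URL and key are placeholders. Percent-encoding the IDs is a defensive choice made here, not something this page requires.

```python
import urllib.parse
import urllib.request

# Placeholders; replace with real values.
BASE_URL = "https://your-tenant.aws.cognee.ai"
API_KEY = "your-key"


def data_items_url(dataset_id: str) -> str:
    """URL that lists all data items in one dataset."""
    safe_id = urllib.parse.quote(dataset_id, safe="")
    return f"{BASE_URL}/api/v1/datasets/{safe_id}/data"


def raw_file_url(dataset_id: str, data_id: str) -> str:
    """URL that downloads the original file for one data item."""
    safe_data_id = urllib.parse.quote(data_id, safe="")
    return f"{data_items_url(dataset_id)}/{safe_data_id}/raw"


def build_get_request(url: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated GET request."""
    return urllib.request.Request(
        url=url, method="GET", headers={"X-Api-Key": API_KEY}
    )
```

For example, `build_get_request(raw_file_url(dataset_id, data_id))` can be passed to `urllib.request.urlopen(...)`, and the response body written to disk in binary mode.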

Delete

DELETE /api/v1/datasets/{dataset_id} — Delete a dataset and all its contents.
DELETE /api/v1/datasets/{dataset_id}/data/{data_id} — Delete a specific data item from a dataset.
Deleting a dataset removes all associated documents, knowledge graph data, and embeddings. This cannot be undone.
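Because deletion is irreversible, a script may want an explicit guard before it builds the request. The sketch below shows one way to do that. The `confirm` flag and both function names are hypothetical client-side conventions, not part of the API, and the tenant URL and key are placeholders.

```python
import urllib.parse
import urllib.request

# Placeholders; replace with real values.
BASE_URL = "https://your-tenant.aws.cognee.ai"
API_KEY = "your-key"


def build_delete_dataset_request(
    dataset_id: str, *, confirm: bool = False
) -> urllib.request.Request:
    """Build (but do not send) the DELETE request for a whole dataset.

    Raises ValueError unless confirm=True. This guard is a local
    safety convention only; the server knows nothing about it.
    """
    if not confirm:
        raise ValueError("deleting a dataset cannot be undone; pass confirm=True")
    safe_id = urllib.parse.quote(dataset_id, safe="")
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/datasets/{safe_id}",
        method="DELETE",
        headers={"X-Api-Key": API_KEY},
    )


def build_delete_data_item_request(
    dataset_id: str, data_id: str
) -> urllib.request.Request:
    """Build (but do not send) the DELETE request for one data item."""
    safe_id = urllib.parse.quote(dataset_id, safe="")
    safe_data_id = urllib.parse.quote(data_id, safe="")
    return urllib.request.Request(
        url=f"{BASE_URL}/api/v1/datasets/{safe_id}/data/{safe_data_id}",
        method="DELETE",
        headers={"X-Api-Key": API_KEY},
    )
```

Keeping request construction separate from sending lets a caller log or review the target URL before passing the request to `urllib.request.urlopen(...)`.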