> ## Documentation Index
> Fetch the complete documentation index at: https://docs.cognee.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Datasets

> Create, manage, and upload documents to datasets

The Datasets page lists all datasets in your workspace. Each dataset is a container for documents and all subsequent operations — see [Datasets](/core-concepts/further-concepts/datasets) for the underlying concept.

## Dataset list

Each row shows:

* **Name** — The dataset identifier.
* **Documents** — Number of files in the dataset.
* **Created** — Creation timestamp.
* **Status** — `Ready` (has documents) or `Empty`.

Use the search bar to filter datasets by name.

## Create a dataset

Click **New Dataset**, enter a name, and confirm. The dataset is created immediately and appears in the list.

## Upload documents

1. Select a dataset from the list (or create one).
2. On the dataset detail page, drag files into the upload area or click to browse.
3. Supported formats: PDF, CSV, TXT, Markdown, JSON, DOCX.

After upload, the [add](/core-concepts/main-operations/legacy-operations/add) and [cognify](/core-concepts/main-operations/legacy-operations/cognify) pipeline runs in the background. The header shows **Processing...** while the knowledge graph is being built. When processing finishes, the header updates to show a **Last synced** timestamp.

## Sync a dataset

For datasets connected to an external source, the dataset detail page includes a **Sync** button that re-runs the cognify pipeline against the current dataset contents. Use this to rebuild the knowledge graph after source data changes without uploading new files again.

While syncing, the button shows a spinning icon and the label **Syncing...**. The **Last synced** timestamp updates when the sync completes.

## Dataset breadcrumb

The breadcrumb at the top of the UI follows the structure **Workspace / Dataset / Page**. The dataset selector appears only on pages where filtering by dataset is relevant: the Dashboard, Search, Knowledge Graph, and dataset detail pages. On other pages the breadcrumb shows only the workspace and current page name.

Selecting a dataset in the breadcrumb is global — it filters content across all applicable pages:

* The **Search** page queries only that dataset.
* The **Knowledge Graph** page visualizes only that dataset's graph.
* The **Dashboard** scopes metrics and search to that dataset.

To search or visualize across all datasets, set the breadcrumb back to **All datasets**.

On a dataset detail page the breadcrumb shows **Workspace / \[Dataset] / Documents**.

## Delete a dataset

Open the context menu on a dataset row and select **Delete**. This removes the dataset, all its documents, and the associated knowledge graph. This action cannot be undone.

<Note>
  Dataset operations are also available through the [REST API](/cognee-cloud/functionality/dataset-management) and the [Cloud SDK](/cognee-cloud/connections/cloud-sdk).
</Note>
