datasets
Get Dataset Status
Get the processing status of datasets.
This endpoint retrieves the current processing status of one or more datasets,
indicating whether they are being processed, have completed processing, or
encountered errors during pipeline execution.
## Query Parameters
- **dataset** (List[UUID]): List of dataset UUIDs to check status for
- **pipeline** (List[str], optional): One or more pipeline names to check.
- If omitted, defaults to **cognify_pipeline** (backward-compatible behavior)
- If one pipeline is provided, response is a flat map
- If multiple pipelines are provided, response is nested per dataset and pipeline
- **Available options: add_pipeline, cognify_pipeline**
## Response
Returns status information in one of two shapes:
- Single pipeline (default): {dataset_id: status}
- Multiple pipelines: {dataset_id: {pipeline_name: status}}
Status values:
- **pending**: Dataset is queued for processing
- **running**: Dataset is currently being processed
- **completed**: Dataset processing completed successfully
- **failed**: Dataset processing encountered an error
## Error Codes
- **500 Internal Server Error**: Error retrieving status information
GET
Get Dataset Status
Documentation Index
Fetch the complete documentation index at: https://docs.cognee.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Response
Successful Response
- Response Get Dataset Status Api V1 Datasets Status Get
- Response Get Dataset Status Api V1 Datasets Status Get
Available options:
DATASET_PROCESSING_INITIATED, DATASET_PROCESSING_STARTED, DATASET_PROCESSING_COMPLETED, DATASET_PROCESSING_ERRORED Previous
Get Raw DataDownload the raw data file for a specific data item.
This endpoint allows users to download the original, unprocessed data file
for a specific data item within a dataset. The file is returned as a direct
download with appropriate headers.
## Path Parameters
- **dataset_id** (UUID): The unique identifier of the dataset containing the data
- **data_id** (UUID): The unique identifier of the data item to download
## Response
Returns the raw data file as a downloadable response.
## Error Codes
- **404 Not Found**: Dataset or data item doesn't exist, or user doesn't have access
- **500 Internal Server Error**: Error accessing the raw data file
Next
Get Dataset Status