Datasets
Every transaction belongs to a dataset. Datasets control where the transaction is stored and how it appears in the UI. To target a specific dataset, set dataset_id when creating a transaction via the Multi-Step API.
| Dataset | dataset_id value | Description |
|---|---|---|
| Production | "production" | The default dataset. Transactions appear in the Work Queue for review and production use. |
| Playground | "playground" | For testing and experimentation. Transactions appear in the Playground tab. |
| Memory | "memory" | Used for few-shot learning examples that help improve AI accuracy. |
| Evaluation | "{dataset_uuid}" | User-created datasets for quality evaluation. To find the UUID, open the dataset selector dropdown on the Evaluation tab, choose Manage Evaluation Datasets, and use the Copy ID command from the dataset's context menu. |
When dataset_id is not specified, transactions are created in the Production dataset.
Production (Work Queue)
The default. Transactions land in the Work Queue for human review and production processing.
{
"dataset_id": "production"
}
Use this when:
- Processing real documents in a production pipeline
- Documents should appear in the Work Queue for review
- Building an automated document processing integration
Playground
For testing and experimentation without affecting production data.
{
"dataset_id": "playground"
}
Use this when:
- Testing new workflow settings or extraction fields
- Debugging document processing issues
- Running experiments before deploying to production
Memory (Few-Shot Learning)
For adding documents as training examples that improve future AI accuracy.
{
"dataset_id": "memory"
}
Use this when:
- Building a library of correctly-labeled examples
- Improving extraction accuracy through few-shot learning
- Curating training data for specific document types
Evaluation
User-created datasets for measuring and comparing processing quality. Each evaluation dataset has its own UUID.
{
"dataset_id": "your-evaluation-dataset-uuid"
}
Use this when:
- Measuring extraction, classification, or splitting accuracy against ground truth
- Comparing processing results across different project settings versions
- Running quality benchmarks before deploying settings changes
Open the Evaluation tab in the UI, click the dataset selector dropdown, and choose Manage Evaluation Datasets. Right-click a dataset (or open its context menu) and select Copy ID to copy the dataset UUID to your clipboard. Use this UUID as the dataset_id when submitting transactions via the API.