Datasets

Every transaction belongs to a dataset. Datasets control where the transaction is stored and how it appears in the UI. To target a specific dataset, set dataset_id when creating a transaction via the Multi-Step API.

Dataset	`dataset_id` value	Description
Production	`"production"`	The default dataset. Transactions appear in the Work Queue for review and production use.
Playground	`"playground"`	For testing and experimentation. Transactions appear in the Playground tab.
Memory	`"memory"`	Used for few-shot learning examples that help improve AI accuracy.
Evaluation	`"{dataset_uuid}"`	User-created datasets for quality evaluation. To find the UUID, open the dataset selector dropdown on the Evaluation tab, choose Manage Evaluation Datasets, and use the Copy ID command from the dataset's context menu.

Default Dataset

When dataset_id is not specified, transactions are created in the Production dataset.

Production (Work Queue)

The default. Transactions land in the Work Queue for human review and production processing.

{
  "dataset_id": "production"
}

Use this when:

Processing real documents in a production pipeline
Documents should appear in the Work Queue for review
Building an automated document processing integration

Playground

For testing and experimentation without affecting production data.

{
  "dataset_id": "playground"
}

Use this when:

Testing new workflow settings or extraction fields
Debugging document processing issues
Running experiments before deploying to production

Memory (Few-Shot Learning)

For adding documents as training examples that improve future AI accuracy.

{
  "dataset_id": "memory"
}

Use this when:

Building a library of correctly-labeled examples
Improving extraction accuracy through few-shot learning
Curating training data for specific document types

Evaluation

User-created datasets for measuring and comparing processing quality. Each evaluation dataset has its own UUID.

{
  "dataset_id": "your-evaluation-dataset-uuid"
}

Use this when:

Measuring extraction, classification, or splitting accuracy against ground truth
Comparing processing results across different project settings versions
Running quality benchmarks before deploying settings changes

Finding your Evaluation Dataset ID

Open the Evaluation tab in the UI, click the dataset selector dropdown, and choose Manage Evaluation Datasets. Right-click a dataset (or open its context menu) and select Copy ID to copy the dataset UUID to your clipboard. Use this UUID as the dataset_id when submitting transactions via the API.

Production (Work Queue)​

Playground​

Memory (Few-Shot Learning)​

Evaluation​

Production (Work Queue)

Playground

Memory (Few-Shot Learning)

Evaluation