Datasets are curated collections of agent interactions (sessions and traces) used for evaluation, regression testing, and quality assurance. They can also act as golden sets for validating agent behavior against known scenarios or edge cases.

Why Datasets Matter

Datasets help teams:
  • Organize training, testing, and production samples.
  • Ensure portability by exporting datasets to external systems for analysis or model training.
  • Focus evaluations on a specific, well-defined set of sessions.

Core Capabilities

  • Centralized view for managing all datasets within a project.
  • Clear distinction between Static and Auto-Update datasets.
  • Export options for offline workflows and reporting.
  • Tools for analyzing session content and running evaluations directly on a dataset.

Key Use Cases

| Use Case | Description |
|---|---|
| Policy Evaluation | Test policies against a controlled dataset before enabling them on live traffic. |
| Manual Evaluation | Run ad-hoc evaluations, including LLM-as-a-Judge or numeric checks, on selected sessions. |
| Regression Testing | Check traces and telemetry from updated versions of the agentic system (for example, after prompt updates or model upgrades) before deploying to production. |
| Benchmarking | Compare multiple agent or model versions using comparable datasets designed to measure performance across scenarios. |

Dataset Types

Agent Management Platform supports two dataset types.
| | Static Dataset | Auto-Update Dataset |
|---|---|---|
| Description | Manually curated, fixed collection of sessions. | Dynamically populates sessions based on saved filter conditions. Refreshes as new matching sessions arrive. |
| Best For | Regression testing, hand-curated benchmark sets, comparing model or prompt versions, reproducible quality checks. | Continuous monitoring, automatically collecting failures or anomalies, tracking emerging patterns such as negative sentiment or long latency traces. |
| Session Management | Manually add or remove sessions at any time. | Cannot manually add sessions. Filter criteria can be edited at any time. |
| Auto-Refresh | Does not update automatically. | Sessions refresh automatically as new telemetry arrives. |
Example Auto-Update filter: Last 7 days of sessions with error rate > 5%.
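
A filter like the example above can be read as a predicate applied to each session. The following is a minimal sketch of that idea, not the platform's API: the Session fields (started_at, error_rate) and the matches_filter function are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone


@dataclass
class Session:
    # Hypothetical session record; the field names are assumptions.
    session_id: str
    started_at: datetime
    error_rate: float  # fraction between 0.0 and 1.0


def matches_filter(session, now, window_days=7, min_error_rate=0.05):
    """Return True if the session started within the last `window_days`
    and its error rate is strictly above `min_error_rate`."""
    in_window = now - timedelta(days=window_days) <= session.started_at <= now
    return in_window and session.error_rate > min_error_rate


now = datetime(2024, 1, 15, tzinfo=timezone.utc)
sessions = [
    Session("s1", now - timedelta(days=2), 0.12),   # recent, high error rate
    Session("s2", now - timedelta(days=3), 0.01),   # recent, low error rate
    Session("s3", now - timedelta(days=30), 0.40),  # outside the 7-day window
]

# Re-evaluating the predicate as new sessions arrive is what keeps an
# Auto-Update dataset current in this sketch.
auto_update_members = [s.session_id for s in sessions if matches_filter(s, now)]
print(auto_update_members)
```

Only s1 satisfies both conditions here. A Static dataset, by contrast, would simply be a fixed list of session IDs that changes only when someone edits it.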

Creating a Dataset

  1. Go to Evaluations > Datasets.
  2. Select Create Dataset.
  3. Choose a dataset type: Static or Auto-Update.
  4. Configure the dataset details:
    • Enter a name and description.
    • For Static datasets: Manually select sessions.
    • For Auto-Update datasets: Define filter criteria (date range, metrics, tags).
  5. Review the selected sessions in preview mode.
  6. Select Save.
After creation, the dataset appears in the Dataset Manager and is available for evaluations, policy testing, and export.

Working with Datasets

Selecting a dataset opens its detail view, where you can explore and work with all included sessions.

Dataset Detail View

| Area | Description |
|---|---|
| Sessions List | View all sessions in the dataset, including metadata such as timestamp, duration, and status. |
| Inspect Sessions | Open a session’s detailed view to review input, output, traces, metadata, and evaluation results. |
| Overview Tab | Analyze aggregated metrics such as success rate, duration, and performance trends. |
| Filter Criteria | View the filter rules used to automatically populate sessions. Auto-Update datasets only. |
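
To make the aggregated metrics concrete, here is a minimal sketch of how a success rate and an average duration could be computed from session records. The record shape (status, duration_ms) and the summarize function are assumptions for illustration; the source does not describe how the platform computes these values.

```python
from statistics import mean

# Hypothetical session records; keys and values are illustrative only.
dataset_sessions = [
    {"id": "s1", "status": "success", "duration_ms": 1200},
    {"id": "s2", "status": "failed", "duration_ms": 3400},
    {"id": "s3", "status": "success", "duration_ms": 800},
    {"id": "s4", "status": "success", "duration_ms": 1000},
]


def summarize(sessions):
    """Aggregate session count, success rate, and mean duration."""
    total = len(sessions)
    if total == 0:
        # Avoid dividing by zero for an empty dataset.
        return {"count": 0, "success_rate": None, "avg_duration_ms": None}
    successes = sum(1 for s in sessions if s["status"] == "success")
    return {
        "count": total,
        "success_rate": successes / total,
        "avg_duration_ms": mean(s["duration_ms"] for s in sessions),
    }


summary = summarize(dataset_sessions)
print(summary)
```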

Add Sessions (Static Datasets Only)

Use this option to expand your dataset with additional sessions.
  1. Select Add Sessions from the top bar.
  2. Browse and select sessions from the project.
  3. Confirm to include them in the dataset.

Run Policies on a Dataset

Evaluate dataset sessions against one or more policies.
  1. Select Run Policies on Dataset.
  2. Choose one or more policies.
  3. Run evaluations across all sessions.
Multiple policies can run in parallel, allowing you to test different rules or metrics at the same time.
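
As a rough illustration of running several policies over the same sessions at once, the sketch below uses a thread pool. Everything in it is hypothetical: the two policy functions, the session fields, and the run_policies helper are invented for the example, and the platform's actual execution model is not described in this document.

```python
from concurrent.futures import ThreadPoolExecutor


def max_latency_policy(session):
    # Hypothetical numeric check: pass if latency is at most 2000 ms.
    return session["latency_ms"] <= 2000


def non_empty_output_policy(session):
    # Hypothetical content check: pass if the agent produced any output.
    return bool(session["output"].strip())


def run_policy(name, policy, sessions):
    """Evaluate one policy across all sessions; return (name, per-session results)."""
    return name, {s["id"]: policy(s) for s in sessions}


def run_policies(policies, sessions):
    """Submit every policy to a thread pool and collect results by policy name."""
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(run_policy, name, fn, sessions)
            for name, fn in policies.items()
        ]
        return dict(f.result() for f in futures)


sessions = [
    {"id": "s1", "latency_ms": 900, "output": "Your order has shipped."},
    {"id": "s2", "latency_ms": 4200, "output": "   "},
]
policies = {
    "max_latency": max_latency_policy,
    "non_empty_output": non_empty_output_policy,
}

results = run_policies(policies, sessions)
print(results)
```

Each policy produces its own pass/fail map, so results from different rules can be compared side by side for the same session.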

Managing Datasets

Edit a Dataset

Select the dataset you want to modify. Available edit options depend on the dataset type:
  • Static — Add or remove sessions.
  • Auto-Update — Edit filter criteria.

Delete a Dataset

  • Deletion removes only the dataset reference. The underlying telemetry data is not affected.
  • Datasets linked to active policies or evaluations cannot be deleted.
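
The two deletion rules above can be sketched as a guard around a reference removal. This is an illustrative model only: the in-memory dictionaries, the DatasetInUseError exception, and the delete_dataset function are hypothetical and do not represent the platform's implementation.

```python
class DatasetInUseError(Exception):
    """Raised when a dataset is still linked to an active policy or evaluation."""


def delete_dataset(datasets, active_links, name):
    """Remove the dataset reference unless something active still depends on it."""
    if active_links.get(name):
        linked = ", ".join(active_links[name])
        raise DatasetInUseError(f"{name} is linked to: {linked}")
    # Only the reference (a list of session IDs) is removed.
    del datasets[name]


# Hypothetical stores: telemetry holds the session data, datasets hold references.
telemetry = {"s1": {"status": "success"}, "s2": {"status": "failed"}}
datasets = {"golden-set": ["s1", "s2"], "nightly-failures": ["s2"]}
active_links = {"nightly-failures": ["latency-policy"]}

delete_dataset(datasets, active_links, "golden-set")  # allowed: no active links

try:
    delete_dataset(datasets, active_links, "nightly-failures")
    blocked = False
except DatasetInUseError as err:
    blocked = True
    print(err)

print(sorted(datasets), sorted(telemetry))
```

After the first call the telemetry store still contains both sessions, which mirrors the rule that deleting a dataset does not affect the underlying data.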

Export a Dataset

You can export a dataset for offline workflows, model training, or external analysis.
  1. Select the more options menu (⋮) in the top-right corner of the dataset.
  2. Select an export format: JSON, CSV, or JSONL.
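
The three formats differ mainly in how records are laid out. The sketch below serializes the same records each way using only the Python standard library. The record fields are hypothetical; the actual export schema is not specified in this document.

```python
import csv
import io
import json

# Hypothetical exported records; real exports may contain different fields.
records = [
    {"session_id": "s1", "status": "success", "duration_ms": 1200},
    {"session_id": "s2", "status": "failed", "duration_ms": 3400},
]


def to_json(rows):
    """JSON: one document containing an array of records."""
    return json.dumps(rows, indent=2)


def to_jsonl(rows):
    """JSONL: one JSON object per line, convenient for streaming."""
    return "\n".join(json.dumps(r) for r in rows)


def to_csv(rows):
    """CSV: a header row followed by one line per record."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


print(to_json(records))
print(to_jsonl(records))
print(to_csv(records))
```

JSONL tends to suit line-by-line processing and model-training pipelines, CSV suits spreadsheets and flat reporting, and JSON suits tools that load the whole export at once.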