OneRun is an open-source platform for evaluating AI agent performance through realistic conversation simulations. Deploy it yourself, customize it for your needs, and build better agents with complete control over your evaluation process.

What OneRun Does

  • Test your agents against diverse, AI-generated personas in realistic scenarios
  • Measure performance with custom objectives and scoring criteria (see the sketch after this list)
  • Track improvements over time as you iterate on your agent
  • Deploy anywhere - self-hosted, on-premise, or in your preferred cloud
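
As a rough illustration of the "custom objectives and scoring criteria" idea above, the sketch below defines one objective with weighted criteria and combines per-criterion ratings into a single score. The structure and names (`support_objective`, `weighted_score`) are illustrative assumptions for this page, not OneRun's actual schema or API.

```python
# Hypothetical sketch: the objective structure and names below are
# illustrative assumptions, not OneRun's actual schema.
support_objective = {
    "name": "resolve_billing_issue",
    "description": "The agent diagnoses and resolves the customer's billing problem.",
    "criteria": [
        # Each criterion carries a weight; ratings use a 1-5 scale in this example.
        {"name": "accuracy",   "weight": 0.5},
        {"name": "tone",       "weight": 0.2},
        {"name": "resolution", "weight": 0.3},
    ],
}

def weighted_score(ratings: dict[str, float], objective: dict) -> float:
    """Combine per-criterion ratings (e.g. 1-5) into one weighted score."""
    return sum(c["weight"] * ratings[c["name"]] for c in objective["criteria"])

# Example: ratings produced for one simulated conversation.
print(weighted_score({"accuracy": 4, "tone": 5, "resolution": 3}, support_objective))  # 3.9
```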

How OneRun Works

1. Define Your Agent
   Create an agent profile that represents your AI system and its intended purpose.
2. Set Evaluation Objectives
   Define what success looks like with custom scoring criteria for different aspects of performance.
3. Create Test Scenarios
   Design realistic simulation scenarios that reflect your agent’s real-world use cases.
4. Generate Diverse Personas
   Automatically create varied conversation participants with unique backgrounds and characteristics.
5. Run Conversations
   Execute realistic dialogues between your agent and generated personas through your custom worker (see the sketch after these steps).
6. Analyze Results
   Review detailed performance metrics, identify patterns, and discover improvement opportunities.
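
The worker in step 5 is the piece you own: it receives each persona message and returns your agent's reply, so OneRun stays agnostic about how the agent is built. The sketch below is a minimal, hypothetical version of that idea; the `respond` interface and function names are assumptions for illustration, not OneRun's actual worker API.

```python
# Hypothetical worker sketch: the interface below (a respond() callback that
# maps conversation history to the agent's next reply) is an assumption for
# illustration, not OneRun's actual worker API.
from typing import Callable

Message = dict[str, str]  # e.g. {"role": "persona" or "agent", "content": "..."}

def run_conversation(
    respond: Callable[[list[Message]], str],   # your agent, however it is built
    persona_turns: list[str],                  # messages a generated persona would send
) -> list[Message]:
    """Alternate persona and agent turns, returning the full transcript."""
    history: list[Message] = []
    for turn in persona_turns:
        history.append({"role": "persona", "content": turn})
        history.append({"role": "agent", "content": respond(history)})
    return history

# Example: wire in any agent - a framework-backed one, a direct API call, or a stub.
def echo_agent(history: list[Message]) -> str:
    return f"I hear you: {history[-1]['content']}"

transcript = run_conversation(echo_agent, ["Hi, I was double-charged last month."])
for msg in transcript:
    print(f"{msg['role']}: {msg['content']}")
```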

Why Choose OneRun?

  • Open Source & Self-Hosted: Full control over your data, evaluations, and deployment. No vendor lock-in.
  • Framework Agnostic: Works with any AI framework - LangChain, direct API calls, or custom solutions (see the snippet below).
  • Production Ready: Built for teams serious about AI quality with systematic evaluation processes.
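
The framework-agnostic point amounts to this: as long as your worker can turn conversation history into a reply string, it does not matter what sits behind it. The snippet below sketches two interchangeable backends matching the hypothetical callback shape from the earlier worker sketch; the wrapper functions are illustrative, while the LangChain and OpenAI calls themselves are standard usage of those libraries.

```python
# Two interchangeable agent backends behind the same hypothetical callback shape
# (conversation history in, reply string out). The wrapper functions are
# illustrative; only the LangChain / OpenAI calls are standard library usage.
from langchain_openai import ChatOpenAI
from openai import OpenAI

def langchain_agent(history: list[dict]) -> str:
    """Back the agent with a LangChain chat model."""
    llm = ChatOpenAI(model="gpt-4o-mini")
    return llm.invoke(history[-1]["content"]).content

def direct_api_agent(history: list[dict]) -> str:
    """Back the agent with a direct OpenAI API call."""
    client = OpenAI()
    reply = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": history[-1]["content"]}],
    )
    return reply.choices[0].message.content
```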

Get Started Today

Ready to Build Better Agents?

OneRun transforms agent development from guesswork into a systematic, data-driven process. Since it’s open source, you have complete control over your evaluation process and data.
OneRun is designed for teams serious about AI agent quality. Deploy it anywhere and customize it for your specific needs.