Prerequisites
- OneRun platform running (see Quick Start for setup)
- Access to OneRun web interface
- Basic understanding of your agent’s capabilities
Step 1: Create a Project
Projects organize all your evaluation work and provide isolation between different agent testing initiatives.
- Open OneRun in your browser
- Sign in to your account
- Click “New Project” on the dashboard
- Enter project name: Choose a descriptive name (e.g., “Customer Support Production” or “Marketing Team Bots”)
- Save the project - you’ll be redirected to the project dashboard
Step 2: Define Your Agent
Agents represent the AI system you want to evaluate. The agent configuration helps OneRun understand what your system does so it can generate appropriate test scenarios.
- Navigate to “Agents” in your project
- Click “Create Agent”
- Configure agent details:
- Name: Your agent’s name (e.g., “Support Bot v2.1”)
- Description: Detailed description of what your agent does, its capabilities, and role (e.g., “Friendly customer service agent that handles billing inquiries, processes returns, and provides product information”)
- Save the agent - note the Agent ID from the details page (needed for your worker)
The agent description is crucial as OneRun uses it to generate realistic personas and conversation scenarios. Be specific about your agent’s capabilities and limitations.
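If it helps to see these fields together, here they are as plain data. This is a thinking aid only: you enter these values in the web UI, and the field names below (and the example limitation) are illustrative, not an API schema.

```python
# A thinking aid, not an API call: these fields are entered in the OneRun UI.
# Note how the description names both capabilities and limitations, since
# OneRun uses it to generate realistic personas and scenarios.
agent = {
    "name": "Support Bot v2.1",
    "description": (
        "Friendly customer service agent that handles billing inquiries, "
        "processes returns, and provides product information. Cannot issue "
        "refunds over $500 or modify subscription terms."  # hypothetical limitation
    ),
}
```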
Step 3: Set Evaluation Objectives
Objectives define what “success” looks like for your agent. They provide the scoring criteria that OneRun uses to evaluate conversation quality.
- Go to “Objectives” in your project
- Click “New Objective”
- Define success criteria:
- Name: Clear objective name (e.g., “Customer Satisfaction”)
- Criteria: Detailed evaluation guidelines (e.g., “Evaluate how satisfied the customer feels with the interaction. Score 1.0 for highly satisfied customers who express gratitude. Score 0.5-0.8 for neutral interactions where basic needs are met. Score 0.0-0.4 for frustrated customers or unresolved issues.”)
- Add multiple objectives for comprehensive evaluation (collected as a sketch after this list):
- Response Accuracy: How factually correct are the agent’s responses?
- Customer Satisfaction: How satisfied is the customer with the interaction?
- Professional Communication: Does the agent maintain appropriate tone and language?
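Here are those three objectives collected as plain data. The shape is illustrative only (objectives are created in the web UI, and these field names are not OneRun’s schema); the point is the pattern of pairing a short name with explicit, scoreable criteria.

```python
# Illustrative only: objectives are created in the OneRun UI, and these field
# names are informal, not OneRun's schema. Each objective pairs a short name
# with explicit, scoreable criteria.
objectives = [
    {
        "name": "Response Accuracy",
        "criteria": "Score 1.0 when every factual claim is correct; "
                    "deduct proportionally for each inaccuracy or omission.",
    },
    {
        "name": "Customer Satisfaction",
        "criteria": "Score 1.0 for highly satisfied customers who express "
                    "gratitude, 0.5-0.8 for neutral interactions where basic "
                    "needs are met, and 0.0-0.4 for frustration or unresolved issues.",
    },
    {
        "name": "Professional Communication",
        "criteria": "Score on tone, courtesy, and appropriate language "
                    "throughout the conversation.",
    },
]
```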
Step 4: Create a Simulation
Simulations bring everything together - they generate personas, orchestrate conversations, and evaluate results against your objectives.
- Navigate to “Simulations”
- Click “New Simulation”
- Configure simulation parameters:
- Name: Descriptive name for this test run
- Agent: Select the agent you created
- Objectives: Choose which objectives to evaluate
- Scenario Description: Describe the situations you want to test (e.g., “Customers with billing issues, product returns, and general inquiries”)
- Number of Conversations: Start with 10-20 for initial testing
- Max Turns per Conversation: Set appropriate limits (3-5 for simple tasks, 10+ for complex scenarios)
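Taken together, a reasonable starting configuration might look like the sketch below. The field names are informal stand-ins for the UI fields above, not OneRun’s actual schema.

```python
# Informal stand-ins for the UI fields above, not OneRun's actual schema.
simulation = {
    "name": "Billing & returns smoke test",
    "agent": "Support Bot v2.1",  # the agent created in Step 2
    "objectives": [
        "Response Accuracy",
        "Customer Satisfaction",
        "Professional Communication",
    ],
    "scenario_description": "Customers with billing issues, product returns, "
                            "and general inquiries",
    "num_conversations": 10,  # start with 10-20; scale up once the pipeline works
    "max_turns": 5,           # 3-5 for simple tasks, 10+ for complex scenarios
}
```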
Step 5: Launch the Simulation
With everything configured, you’re ready to run your first evaluation.
- Review simulation settings - make sure everything looks correct
- Click “Start Simulation” - OneRun will begin generating personas and scenarios
- Monitor progress - you’ll see conversations being created and assigned
- Ensure your worker is running - without a worker, conversations won’t proceed
Once started, the simulation moves through four stages:
1. Persona Generation - OneRun creates diverse personas based on your agent description and scenario
2. Conversation Assignment - Each persona is assigned to a conversation with your agent
3. Worker Processing - Your worker polls for conversations and handles the agent logic
4. Evaluation - Completed conversations are automatically scored against your objectives
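The worker-processing stage is the one piece you own. A polling worker usually has the shape sketched below; the endpoint paths, payload fields, and the generate_reply helper are all illustrative assumptions, so consult the worker documentation for OneRun’s actual API.

```python
# Sketch of a polling worker loop. Endpoint paths, payload fields, and the
# generate_reply() helper are illustrative assumptions, not OneRun's real API.
import time
import requests

ONERUN_URL = "http://localhost:8000"  # assumed local deployment from Quick Start
AGENT_ID = "your-agent-id"            # from the agent details page (Step 2)

def generate_reply(history: list[dict]) -> str:
    """Call your actual agent here; this stub just echoes the last message."""
    return f"(agent reply to: {history[-1]['content']})"

while True:
    # Poll for conversations awaiting an agent turn (hypothetical endpoint).
    resp = requests.get(
        f"{ONERUN_URL}/api/agents/{AGENT_ID}/conversations",
        params={"status": "awaiting_agent"},
        timeout=10,
    )
    resp.raise_for_status()
    for convo in resp.json():
        reply = generate_reply(convo["messages"])
        # Post the agent's turn back so the simulated persona can respond.
        requests.post(
            f"{ONERUN_URL}/api/conversations/{convo['id']}/messages",
            json={"role": "agent", "content": reply},
            timeout=10,
        ).raise_for_status()
    time.sleep(2)  # simple fixed delay between polls
```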
Next Steps
After creating your first simulation:
- Scale Up: Run larger simulations with 50+ conversations for statistical significance
- Compare Versions: Use simulations to A/B test agent improvements
- Automate Evaluation: Integrate OneRun into your development pipeline (see the CI sketch after this list)
- Share Results: Use reports to communicate agent performance to stakeholders
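For the automation step, one common pattern is a CI gate that runs a fixed simulation and fails the build when scores regress. Everything below (endpoints, JSON fields, the 0.7 threshold) is an assumption to adapt, not OneRun’s documented interface.

```python
# Hypothetical CI gate. Endpoints, JSON fields, and the threshold are all
# assumptions to adapt, not OneRun's documented interface.
import sys
import time

import requests

ONERUN_URL = "http://localhost:8000"  # assumed local deployment
THRESHOLD = 0.7                       # assumed minimum acceptable mean score

def run_and_score(simulation_id: str) -> float:
    """Start a simulation, wait for it to finish, and return its mean score."""
    requests.post(
        f"{ONERUN_URL}/api/simulations/{simulation_id}/start", timeout=10
    ).raise_for_status()
    while True:
        status = requests.get(
            f"{ONERUN_URL}/api/simulations/{simulation_id}", timeout=10
        ).json()
        if status["state"] == "completed":  # hypothetical field names
            return status["mean_score"]
        time.sleep(30)  # simulations take a while; poll patiently

if __name__ == "__main__":
    score = run_and_score(sys.argv[1])
    print(f"Mean objective score: {score:.2f}")
    sys.exit(0 if score >= THRESHOLD else 1)  # non-zero exit fails the CI job
```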
Effective agent evaluation is iterative. Use each simulation to learn something new about your agent’s capabilities and limitations.