Quick Start

Follow these steps to create and run your first evaluation.

Install AgentV globally:

npm install -g agentv

Initialize your project:

agentv init

The init command creates a .env.example file in your project root.

  1. Copy .env.example to .env
  2. Fill in your API keys, endpoints, and other configuration values
  3. Update the environment variable names in .agentv/targets.yaml to match those defined in your .env file
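
As a quick sanity check for steps 1 and 2, a short script can report which keys from .env.example are still missing or empty in .env. This is a minimal sketch and not part of AgentV: the helper names parse_env and missing_keys are hypothetical, and it assumes simple KEY=VALUE lines with no multi-line values or export prefixes.

```python
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("\"'")
    return values


def missing_keys(example_text: str, env_text: str) -> list[str]:
    """Return keys listed in the example file that are absent or empty in .env."""
    example = parse_env(example_text)
    actual = parse_env(env_text)
    return [key for key in example if not actual.get(key)]


if __name__ == "__main__":
    example_path, env_path = Path(".env.example"), Path(".env")
    if example_path.exists() and env_path.exists():
        for key in missing_keys(example_path.read_text(), env_path.read_text()):
            print(f"Missing or empty in .env: {key}")
```

Run it from the project root after copying the file. It prints nothing when every key has a value.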

Create ./evals/example.yaml:

description: Math problem solving evaluation
execution:
  target: default
evalcases:
  - id: addition
    expected_outcome: Correctly calculates 15 + 27 = 42
    input_messages:
      - role: user
        content: What is 15 + 27?
    expected_messages:
      - role: assistant
        content: "42"
    execution:
      evaluators:
        - name: math_check
          type: code_judge
          script: ./validators/check_math.py
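
The eval file points at ./validators/check_math.py, which this page does not show. The following is one possible sketch of such a script, not the actual AgentV contract. It assumes the judge receives a JSON object on stdin with the agent's reply under a candidate_answer field, and that it should print a JSON object with score and reasoning fields. Those field names, the stdin/stdout convention, and the judge helper are all assumptions; check the code judge documentation for the real interface.

```python
import json
import re
import sys

EXPECTED = 42  # The answer to "What is 15 + 27?" from the eval case.


def judge(payload: dict) -> dict:
    """Score 1.0 if any number in the candidate answer equals EXPECTED, else 0.0."""
    # ASSUMPTION: the agent's reply arrives under "candidate_answer".
    answer = str(payload.get("candidate_answer", ""))
    numbers = re.findall(r"-?\d+(?:\.\d+)?", answer)
    passed = any(float(n) == EXPECTED for n in numbers)
    reasoning = (
        f"Found {EXPECTED} in the answer."
        if passed
        else f"Expected {EXPECTED}, found numbers: {numbers}"
    )
    # ASSUMPTION: the evaluator reads "score" and "reasoning" from stdout JSON.
    return {"score": 1.0 if passed else 0.0, "reasoning": reasoning}


if __name__ == "__main__":
    # Read piped input only; an interactive terminal yields an empty payload.
    raw = "" if sys.stdin.isatty() else sys.stdin.read()
    payload = json.loads(raw) if raw.strip() else {}
    print(json.dumps(judge(payload)))
```

Matching on parsed numbers rather than a substring means an answer such as "420" does not pass by accident.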

Run the evaluation:

agentv eval ./evals/example.yaml

Results appear in .agentv/results/eval_<timestamp>.jsonl with scores, reasoning, and execution traces.
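
Because the results file is JSONL (one JSON object per line), it can be summarized with a few lines of Python. This is a rough sketch under stated assumptions: it assumes each record carries a numeric score field, which this page does not confirm, and the summarize helper is hypothetical. Records without a numeric score are skipped.

```python
import json
from pathlib import Path


def summarize(lines) -> dict:
    """Count scored records and average the assumed numeric `score` field."""
    scores = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        score = record.get("score")  # ASSUMPTION: field name is "score".
        if isinstance(score, (int, float)):
            scores.append(float(score))
    mean = sum(scores) / len(scores) if scores else None
    return {"count": len(scores), "mean_score": mean}


if __name__ == "__main__":
    files = list(Path(".agentv/results").glob("eval_*.jsonl"))
    if files:
        # Pick the most recently modified results file.
        latest = max(files, key=lambda p: p.stat().st_mtime)
        with latest.open() as handle:
            print(latest.name, summarize(handle))
```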