Quick Start

Follow these steps to create and run your first evaluation.

1. Install AgentV

npm install -g agentv

2. Initialize your workspace

agentv init

3. Configure environment variables

The init command creates a .env.example file in your project root.

- Copy .env.example to .env
- Fill in your API keys, endpoints, and other configuration values
- Update the environment variable names in .agentv/targets.yaml to match those defined in your .env file
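
Before running an eval, it can help to confirm that no value copied from .env.example was left blank. The following is a minimal sketch, not part of AgentV: it assumes .env uses plain KEY=VALUE lines, and the helper name unset_variables is hypothetical.

```python
from pathlib import Path


def unset_variables(env_text: str) -> list[str]:
    """Return names of variables whose value is empty in dotenv-style text."""
    missing = []
    for line in env_text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not KEY=VALUE.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        name = name.strip()
        # Tolerate the optional shell-style "export " prefix.
        if name.startswith("export "):
            name = name[len("export "):].strip()
        # Treat empty or empty-quoted values as unset.
        if value.strip().strip("'\"") == "":
            missing.append(name)
    return missing


if __name__ == "__main__":
    env_path = Path(".env")
    if not env_path.exists():
        print("No .env found; copy .env.example to .env first.")
    else:
        names = unset_variables(env_path.read_text(encoding="utf-8"))
        print("Unset variables:", ", ".join(names) if names else "none")
```

Run it from the project root after filling in .env. Any name it reports still needs a value, or needs to be removed from .agentv/targets.yaml.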

4. Create an eval

Create ./evals/example.yaml:

description: Math problem solving evaluation
execution:
  target: default

evalcases:
  - id: addition
    expected_outcome: Correctly calculates 15 + 27 = 42

    input_messages:
      - role: user
        content: What is 15 + 27?

    expected_messages:
      - role: assistant
        content: "42"

    execution:
      evaluators:
        - name: math_check
          type: code_judge
          script: ./validators/check_math.py
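
The eval above points at ./validators/check_math.py, which this guide does not show. Below is one possible sketch of such a script. The input and output contract is an assumption, not confirmed here: it supposes the script receives a JSON object on stdin with the model's reply under a candidate_answer key, and reports a JSON object with score and reasoning on stdout. Check the custom evaluators documentation for the actual field names before relying on it.

```python
import json
import re
import sys

EXPECTED = 42  # 15 + 27, taken from the eval case above


def judge(answer_text: str, expected: float = EXPECTED) -> dict:
    """Score 1.0 if the expected number appears in the answer, else 0.0."""
    # Extract every integer or decimal number mentioned in the answer.
    numbers = [float(n) for n in re.findall(r"-?\d+(?:\.\d+)?", answer_text)]
    passed = float(expected) in numbers
    return {
        "score": 1.0 if passed else 0.0,
        "reasoning": (
            f"Found expected value {expected}."
            if passed
            else f"Expected value {expected} not found in answer."
        ),
    }


def main() -> None:
    # ASSUMPTION: the payload arrives as JSON on stdin.
    raw = "" if sys.stdin.isatty() else sys.stdin.read()
    try:
        payload = json.loads(raw) if raw.strip() else {}
    except json.JSONDecodeError:
        payload = {}
    if not isinstance(payload, dict):
        payload = {}
    # ASSUMPTION: the model's reply is stored under "candidate_answer".
    answer = str(payload.get("candidate_answer", ""))
    print(json.dumps(judge(answer)))


if __name__ == "__main__":
    main()
```

Matching on extracted numbers rather than exact string equality lets replies such as "15 + 27 = 42" pass while "420" still fails.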

5. Run the eval

agentv eval ./evals/example.yaml

Results are written to .agentv/results/eval_<timestamp>.jsonl, one JSON record per line, with scores, reasoning, and execution traces.
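
Because the results file is JSON Lines, a few lines of Python are enough to summarize it. This is a rough sketch under stated assumptions: that each record stores its numeric score under a top-level score key, and that the timestamps in the file names sort lexicographically. The helper names load_results and mean_score are hypothetical.

```python
import json
from pathlib import Path


def load_results(path) -> list:
    """Parse one JSON value per non-empty line of a .jsonl file."""
    records = []
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            if line.strip():
                records.append(json.loads(line))
    return records


def mean_score(records, key: str = "score"):
    """Average the numeric values under `key`; None if no record has one."""
    # ASSUMPTION: scores live under a top-level "score" key.
    scores = [
        r[key]
        for r in records
        if isinstance(r, dict) and isinstance(r.get(key), (int, float))
    ]
    return sum(scores) / len(scores) if scores else None


if __name__ == "__main__":
    files = sorted(Path(".agentv/results").glob("eval_*.jsonl"))
    if not files:
        print("No result files found under .agentv/results")
    else:
        latest = files[-1]  # assumes timestamps sort lexicographically
        records = load_results(latest)
        print(f"{latest}: {len(records)} records, mean score {mean_score(records)}")
```

Records without a numeric score are skipped rather than counted as zero, so the mean reflects only cases that were actually scored.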

Next Steps

Learn about eval file formats
Configure targets for different providers
Create custom evaluators