Generate
The generate command creates structured evaluation criteria from your eval cases.
Generate Rubrics
Section titled “Generate Rubrics”Auto-generate rubrics from expected outcomes:
agentv generate rubrics evals/my-eval.yamlThis analyzes each eval case’s expected_outcome field and creates structured rubric criteria with appropriate weights.
How It Works
Section titled “How It Works”- Reads each eval case’s
expected_outcome - Uses an LLM to decompose the expected outcome into individual checkable criteria
- Assigns weights based on importance
- Writes the rubrics back to the eval file
Example
Section titled “Example”Before:
evalcases: - id: quicksort expected_outcome: Explains quicksort with time complexity and examples input_messages: - role: user content: Explain quicksortAfter running agentv generate rubrics:
evalcases: - id: quicksort expected_outcome: Explains quicksort with time complexity and examples input_messages: - role: user content: Explain quicksort rubrics: - Explains divide-and-conquer approach - Describes partition step - States O(n log n) average time complexity - Provides a concrete example