Overview
The `geval diff` command compares eval results between two runs to identify changes and regressions.
Usage
Options
| Option | Description |
|---|---|
| `-p, --previous <path>` | Path to previous eval results (required) |
| `-c, --current <path>` | Path to current eval results (required) |
| `--json` | Output results as JSON |
| `--no-color` | Disable colored output |
Example
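For scripted use, such as a CI check, the command line can be assembled from the options listed above. The following is a minimal Python sketch, not an official wrapper: the helper name `build_diff_command` and the file paths are hypothetical, and it assumes the executable is named `geval` and is available on `PATH`.

```python
import shlex


def build_diff_command(previous, current, json_output=False, color=True):
    """Assemble a `geval diff` argument list from the documented options.

    The function name and parameters are illustrative, not part of geval.
    """
    cmd = ["geval", "diff", "--previous", str(previous), "--current", str(current)]
    if json_output:
        cmd.append("--json")      # documented flag: output results as JSON
    if not color:
        cmd.append("--no-color")  # documented flag: disable colored output
    return cmd


# Hypothetical result paths, used only for illustration.
cmd = build_diff_command("results/previous.json", "results/current.json", json_output=True)

# Print the shell-quoted command line that would be executed.
print(shlex.join(cmd))
```

The resulting list can be passed directly to `subprocess.run(cmd)`. Building a list rather than a single string avoids shell quoting problems when paths contain spaces.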
Output
The diff command shows:

- Metrics that improved
- Metrics that regressed
- Metrics that remained unchanged
- Detailed comparison for each metric
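
The three categories above can be illustrated with a small Python sketch. This is not geval's implementation: it assumes each run is a flat mapping of metric name to numeric score, that higher scores are better, and that a change within a small tolerance counts as unchanged. The function name `classify_metrics` and the sample values are hypothetical.

```python
def classify_metrics(previous, current, tolerance=1e-9):
    """Bucket metrics shared by both runs into improved / regressed / unchanged.

    Assumes higher scores are better; an illustration, not geval's logic.
    """
    report = {"improved": {}, "regressed": {}, "unchanged": {}}
    for name in sorted(previous.keys() & current.keys()):
        delta = current[name] - previous[name]
        if delta > tolerance:
            bucket = "improved"
        elif delta < -tolerance:
            bucket = "regressed"
        else:
            bucket = "unchanged"
        # Keep both values and the delta as the per-metric detail.
        report[bucket][name] = {
            "previous": previous[name],
            "current": current[name],
            "delta": delta,
        }
    return report


# Made-up scores for illustration only.
previous = {"accuracy": 0.80, "f1": 0.75, "recall": 0.70}
current = {"accuracy": 0.85, "f1": 0.70, "recall": 0.70}

report = classify_metrics(previous, current)
for bucket, metrics in report.items():
    print(bucket, sorted(metrics))
```

Metrics where lower is better (for example, latency or error rate) would need the comparison inverted, and metrics present in only one run are ignored by this sketch.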