What is Geval?
Geval turns evaluation results into automated pass/fail decisions in CI/CD pipelines. It’s a lightweight, framework-agnostic tool that enforces quality contracts on your AI applications. Geval is:- ✅ Format-agnostic - Works with CSV, JSON, JSONL from any eval tool
- ✅ Contract-based - Define quality requirements as code
- ✅ CI-native - Exit codes and JSON output for automation
- ✅ Framework-agnostic - Works with Promptfoo, LangSmith, OpenEvals, or custom tools
- ❌ An eval runner (it consumes eval outputs)
- ❌ A monitoring tool (it’s for release gates)
- ❌ A testing framework (it validates existing results)
Quick Start
Get up and running with Geval in minutes. Install, create your first contract, and run your first check.
Key Features
Multi-Format Support
Works with CSV, JSON, and JSONL files from any eval tool. No vendor lock-in.
CI/CD Integration
Exit codes and JSON output for seamless integration with any CI/CD system.
Baseline Comparisons
Compare against fixed thresholds, previous runs, or main branch baselines.
Contract-Based
Define quality requirements as code. Version it. Review it. Commit it.
How It Works
- Define - Write quality contracts in YAML
- Integrate - Add to your CI/CD pipeline
- Enforce - Block bad releases automatically
View Examples
See real-world examples for performance monitoring, safety checks, and more.