Geval Documentation

What is Geval?

Geval turns evaluation results into automated pass/fail decisions in CI/CD pipelines. It’s a lightweight, framework-agnostic tool that enforces quality contracts on your AI applications. Geval is:

✅ Format-agnostic - Works with CSV, JSON, JSONL from any eval tool
✅ Contract-based - Define quality requirements as code
✅ CI-native - Exit codes and JSON output for automation
✅ Framework-agnostic - Works with Promptfoo, LangSmith, OpenEvals, or custom tools

Geval is not:

❌ An eval runner (it consumes eval outputs)
❌ A monitoring tool (it’s for release gates)
❌ A testing framework (it validates existing results)

Quick Start

Get up and running with Geval in minutes. Install, create your first contract, and run your first check.

Key Features

Multi-Format Support

Works with CSV, JSON, and JSONL files from any eval tool. No vendor lock-in.

CI/CD Integration

Exit codes and JSON output for seamless integration with any CI/CD system.

Baseline Comparisons

Compare against fixed thresholds, previous runs, or main branch baselines.

Contract-Based

Define quality requirements as code. Version it. Review it. Commit it.

How It Works

Your Evals → Geval Contract → CI Pass/Block Decision

Define - Write quality contracts in YAML
Integrate - Add to your CI/CD pipeline
Enforce - Block bad releases automatically

View Examples

See real-world examples for performance monitoring, safety checks, and more.

Get Started

CLI Reference

Contracts

Integration

API Reference

Examples

What is Geval?

Quick Start

Key Features

Multi-Format Support

CI/CD Integration

Baseline Comparisons

Contract-Based

How It Works

View Examples

Get Started

CLI Reference

Contracts

Integration

API Reference

Examples

​What is Geval?

Quick Start

​Key Features

Multi-Format Support

CI/CD Integration

Baseline Comparisons

Contract-Based

​How It Works

View Examples

What is Geval?

Key Features

How It Works