agent-release-gate

Ship AI agents with a seatbelt on.

agent-release-gate is a lightweight CI gate for AI systems. It checks agent outputs against expected behavior, catches regressions against a baseline, and returns a hard pass/fail signal so bad runs don't sneak into production.

Why this exists

Most AI failures don't look like crashes. They look like:

Confident nonsense
Slower responses and higher cost over time
A prompt tweak that quietly made outcomes worse

This tool makes those problems visible before release.

What it does

Scores each test case using expected + forbidden phrases
Calculates pass rate, average latency, and average cost
Fails if the pass rate drops below your threshold (minimum_pass_rate in the spec)
Optionally compares against a baseline report and blocks regressions
Outputs both JSON (for machines) and Markdown (for humans)
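
To make the phrase-based scoring concrete, here is a minimal sketch of how a single case might be scored. The formula is an assumption for illustration, not the scoring code that ships in this package: the names score_case and case_passes are hypothetical, matching is case-insensitive substring matching, a forbidden phrase zeroes the score, and the expected_any group counts as one check.

```python
from typing import Sequence


def score_case(
    output: str,
    expected_all: Sequence[str] = (),
    expected_any: Sequence[str] = (),
    forbidden: Sequence[str] = (),
) -> float:
    """Return a score in [0, 1] for one agent output (hypothetical formula)."""
    text = output.lower()

    # Assumption: any forbidden phrase is an automatic zero.
    if any(phrase.lower() in text for phrase in forbidden):
        return 0.0

    # One check per expected_all phrase, plus one check for the expected_any group.
    checks = [phrase.lower() in text for phrase in expected_all]
    if expected_any:
        checks.append(any(phrase.lower() in text for phrase in expected_any))

    # With nothing to check, the absence of forbidden phrases counts as full marks.
    if not checks:
        return 1.0
    return sum(checks) / len(checks)


def case_passes(score: float, min_score: float) -> bool:
    """A case passes when its score reaches the per-case min_score."""
    return score >= min_score
```

Under these assumptions, a reply that contains every expected_all phrase, at least one expected_any phrase, and no forbidden phrase scores 1.0, while a reply that matches only one of three checks scores about 0.33 and fails a min_score of 0.72.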

Install

pip install -e .

Quick start

argate evaluate \
  --spec examples/spec.yaml \
  --results examples/results.json \
  --output gate-report.json \
  --markdown gate-report.md

The command exits with:

0 when the gate passes
1 when the gate fails

A nonzero exit code fails the CI job, so a failing gate can block a merge or release.
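
Most CI systems act on the exit code directly. If a Python release script needs to branch on the result instead, a small wrapper along these lines may help. This is a sketch: gate_passed is a hypothetical helper, not part of the package, and it relies only on the 0/1 exit codes described above.

```python
import subprocess
from typing import Sequence


def gate_passed(command: Sequence[str]) -> bool:
    """Run a gate command and report whether it exited with status 0."""
    # check=False: a failing gate is an expected outcome, not an exception.
    completed = subprocess.run(command, check=False)
    return completed.returncode == 0


# Example call, assuming argate is installed and on PATH:
# gate_passed([
#     "argate", "evaluate",
#     "--spec", "examples/spec.yaml",
#     "--results", "examples/results.json",
#     "--output", "gate-report.json",
#     "--markdown", "gate-report.md",
# ])
```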

Spec format

global:
  minimum_pass_rate: 0.8
  allowed_regression: 0.02
  max_avg_latency_ms: 1500
  max_avg_cost_usd: 0.03

cases:
  - id: refund_status
    expected_all: ["refund", "3-5 business days"]
    expected_any: ["order", "transaction"]
    forbidden: ["cannot help", "policy not found"]
    min_score: 0.72
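
One plausible reading of the global block is sketched below. The Summary class and gate_failures function are hypothetical names, and the comparison rules (strict inequalities, regression measured as a drop in pass rate against the baseline) are assumptions rather than the package's confirmed logic.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Summary:
    """Aggregate metrics for one evaluation run (hypothetical shape)."""
    pass_rate: float
    avg_latency_ms: float
    avg_cost_usd: float


def gate_failures(
    current: Summary,
    thresholds: Dict[str, float],
    baseline: Optional[Summary] = None,
) -> List[str]:
    """Return the reasons the gate fails; an empty list means it passes."""
    failures = []

    if current.pass_rate < thresholds["minimum_pass_rate"]:
        failures.append(f"pass rate {current.pass_rate:.2f} is below the minimum")
    if current.avg_latency_ms > thresholds["max_avg_latency_ms"]:
        failures.append(f"average latency {current.avg_latency_ms:.0f} ms is too high")
    if current.avg_cost_usd > thresholds["max_avg_cost_usd"]:
        failures.append(f"average cost ${current.avg_cost_usd:.3f} is too high")

    # Assumption: a regression is a drop in pass rate relative to the baseline.
    if baseline is not None:
        drop = baseline.pass_rate - current.pass_rate
        if drop > thresholds["allowed_regression"]:
            failures.append(f"pass rate dropped {drop:.2f} against the baseline")

    return failures
```

With the thresholds shown above, a run at a 0.85 pass rate would clear minimum_pass_rate on its own but would still be blocked under this sketch if the baseline was 0.95, because the 0.10 drop exceeds allowed_regression.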

Repo layout

src/agent_release_gate/: scoring + gate logic
examples/: runnable demo spec and results
tests/: unit tests
.github/workflows/ci.yml: CI example

Development

pip install -e . pytest
pytest

License

MIT
