Skip to content

youBencha Documentation

Benchmark AI Coding Agents with Confidence

youBencha is an open-source framework for benchmarking AI coding agents. It provides a structured, reproducible way to evaluate how well AI agents perform real-world coding tasks.

CLI Reference

Complete reference for all youBencha CLI commands including yb run, yb report, and more.

View Commands β†’

Adapters

Connect youBencha to different AI agents with adapters like Copilot CLI.

View Adapters β†’

Examples

Real-world examples including CI/CD integration and Slack notifications.

View Examples β†’

Troubleshooting

Common issues and solutions when running youBencha evaluations.

Get Help β†’