GitBench
Benchmark
git_show
12 fixtures

Models ranked by pass rate on git_show. A model that dominates overall might rank lower here. That's the point.
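
As a rough illustration of what ranking by pass rate on a single benchmark could look like, here is a minimal Python sketch. It assumes each model has one pass/fail result per fixture. The model names, the results, and the alphabetical tie-break are hypothetical; only the count of 12 fixtures comes from this page, and none of this is GitBench's actual scoring code.

```python
from typing import Dict, List, Tuple


def pass_rate(results: List[bool]) -> float:
    """Return the fraction of fixtures passed (0.0 when there are no results)."""
    return sum(results) / len(results) if results else 0.0


def rank_by_pass_rate(per_model: Dict[str, List[bool]]) -> List[Tuple[str, float]]:
    """Sort models by pass rate, highest first; ties are broken by name (an assumption)."""
    rates = [(name, pass_rate(results)) for name, results in per_model.items()]
    return sorted(rates, key=lambda item: (-item[1], item[0]))


# Hypothetical per-fixture outcomes on git_show (12 fixtures each).
git_show_results = {
    "model-a": [True] * 9 + [False] * 3,
    "model-b": [True] * 11 + [False] * 1,
}

ranking = rank_by_pass_rate(git_show_results)
for position, (name, rate) in enumerate(ranking, start=1):
    print(f"{position}. {name}: {rate:.1%}")
```

Because the ranking is computed from this benchmark's fixtures alone, a model's position here is independent of its overall standing.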
