: GitBench
Benchmark
commit_squash
12 fixtures

Models ranked by pass rate on commit_squash. A model that dominates overall might rank lower here. That's the point.

Loading...
Loading...