: GitBench
Benchmark
worktree_usage
12 fixtures

Models ranked by pass rate on worktree_usage. A model that dominates overall might rank lower here. That's the point.

Loading...
Loading...