GitBench
Unlock a worktree
Tests the ability to unlock a previously locked worktree. Evaluates the worktree-unlock workflow.

These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.

  1. git init
  2. git config user.email 'test@test.com'
  3. git config user.name 'Test User'
  4. echo 'Hello' > hello.txt
  5. git add hello.txt
  6. git commit -m 'Initial commit'
  7. git checkout -b feature
  8. echo 'Feature' > feature.txt
  9. git add feature.txt
  10. git commit -m 'Add feature'
  11. git checkout main
  12. git worktree add ../feature-wt feature
  13. git worktree lock ../feature-wt
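To confirm that the setup leaves the worktree locked before the prompt runs, the porcelain listing can be filtered for `locked` entries. This is a minimal sketch; `locked_worktrees` is a hypothetical helper name, not part of GitBench or Git.

```shell
# Hypothetical helper: print the path of every locked worktree.
# Reads `git worktree list --porcelain` output on stdin.
locked_worktrees() {
  awk '
    # Each record starts with "worktree <path>"; remember the path.
    /^worktree / { path = substr($0, 10) }
    # A locked worktree has a "locked" line, optionally followed by a reason.
    /^locked( |$)/ { print path }
  '
}

# Usage, from inside the repository:
#   git worktree list --porcelain | locked_worktrees
```

After the setup steps, this should print the absolute path of the `feature-wt` worktree. After a successful unlock, it should print nothing.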
Prompt
Unlock the worktree at '../feature-wt'. Output ONLY the git command, nothing else.
Expected
git worktree unlock ../feature-wt
deepseek/deepseek-v4-flash:high PASS 100% 194 in → 150 out (147 reasoning)
git worktree unlock ../feature-wt
deepseek/deepseek-v4-flash:high__json_schema PASS 100% 191 in → 139 out (133 reasoning)
git worktree unlock ../feature-wt
JSON Schema Structured Output
(raw) {"command":"git worktree unlock ../feature-wt"}
deepseek/deepseek-v4-flash:none PASS 100% 197 in → 9 out (0 reasoning)
git worktree unlock ../feature-wt
deepseek/deepseek-v4-flash:none__json_schema PASS 100% 197 in → 16 out (0 reasoning)
git worktree unlock ../feature-wt
JSON Schema Structured Output
(raw) { "command": "git worktree unlock ../feature-wt" }
mistralai/devstral-2512__json_schema PASS 100% 229 in → 14 out
git worktree unlock ../feature-wt
JSON Schema Structured Output
(raw) {"command": "git worktree unlock ../feature-wt"}
nvidia/nemotron-3-nano-30b-a3b:high PASS 100% 239 in → 346 out (354 reasoning)
git worktree unlock ../feature-wt
nvidia/nemotron-3-nano-30b-a3b:high__json_schema PASS 100% 249 in → 327 out (326 reasoning)
git worktree unlock ../feature-wt
JSON Schema Structured Output
(raw) { "command": "git worktree unlock ../feature-wt" }
nvidia/nemotron-3-nano-30b-a3b:none PASS 100% 243 in → 10 out (0 reasoning)
git worktree unlock '../feature-wt'
poolside/laguna-xs.2:high PASS 100% 264 in → 129 out (118 reasoning)
git worktree unlock ../feature-wt
poolside/laguna-xs.2:high__json_schema PASS 100% 264 in → 119 out (104 reasoning)
git worktree unlock ../feature-wt
JSON Schema Structured Output
(raw) {"command": "git worktree unlock ../feature-wt"}
poolside/laguna-xs.2:none PASS 100% 257 in → 10 out (0 reasoning)
git worktree unlock ../feature-wt
mistralai/devstral-2512 FAIL 0% 233 in → 14 out
```sh git worktree unlock ../feature-wt ```
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'not_contains': 'locked'}]
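This reply contains the expected command wrapped in a Markdown fence, so the failure is probably about formatting rather than Git knowledge. That reading assumes the harness runs the reply verbatim, which the page does not state. As an illustration only, a reply in the usual multi-line fenced form could be normalized with a hypothetical `strip_fence` filter:

```shell
# Hypothetical filter: delete every line that opens or closes a Markdown fence
# (a line starting with three backticks) and pass all other lines through.
strip_fence() {
  sed '/^`\{3\}/d'
}

# Usage:
#   printf '%s\n' "$model_reply" | strip_fence
```

The filter drops only whole fence lines. A reply that places the fence and the command on a single line would need different handling.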
nvidia/nemotron-3-nano-30b-a3b:none__json_schema FAIL 0% 235 in → 17 out (0 reasoning)
git worktree lock ../feature-wt
JSON Schema Structured Output
(raw) { "command": "git worktree lock ../feature-wt" }
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'not_contains': 'locked'}]
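The failed assertion runs `git worktree list --porcelain` and requires that the output does not contain `locked`. A rough sketch of such a check follows; `assert_not_contains` is a hypothetical name, and the harness's real implementation is not shown in the source.

```shell
# Hypothetical check: succeed (exit status 0) only when the text on stdin
# does NOT contain the given fixed string.
assert_not_contains() {
  needle=$1
  if grep -qF -- "$needle"; then
    echo "assertion failed: output contains '$needle'" >&2
    return 1
  fi
}

# Usage, from inside the repository:
#   git worktree list --porcelain | assert_not_contains locked
```

A plain substring match like this would also trip on a worktree path or branch name that happens to contain `locked`. A stricter check would match the `locked` attribute line only.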
poolside/laguna-xs.2:none__json_schema FAIL 0% 268 in → 16 out (0 reasoning)
git worktree remove --force ../feature-wt
JSON Schema Structured Output
(raw) {"command": "git worktree remove --force ../feature-wt"}
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'not_contains': 'locked'}]