: GitBench
Lock a worktree
Tests ability to lock a worktree to prevent modifications. Evaluates worktree-locking command knowledge.

These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.

  1. 01 git init
  2. 02 git config user.email 'test@test.com'
  3. 03 git config user.name 'Test User'
  4. 04 echo 'Hello' > hello.txt
  5. 05 git add hello.txt
  6. 06 git commit -m 'Initial commit'
  7. 07 git checkout -b feature
  8. 08 echo 'Feature' > feature.txt
  9. 09 git add feature.txt
  10. 10 git commit -m 'Add feature'
  11. 11 git checkout main
  12. 12 git worktree add ../feature-wt feature
Prompt
Lock the worktree at '../feature-wt' with the reason 'do not delete'. Output ONLY the git command, nothing else.
Expected
git worktree lock --reason 'do not delete' ../feature-wt
Loading campaign evidence…
deepseek/deepseek-v4-flash:high PASS 100% 194 in → 110 out (93 reasoning)
git worktree lock ../feature-wt --reason 'do not delete'
deepseek/deepseek-v4-flash:high__json_schema PASS 100% 262 in → 330 out (300 reasoning)
git worktree lock ../feature-wt --reason 'do not delete'
JSON Schema Structured Output
(raw) {"command": "git worktree lock ../feature-wt --reason 'do not delete'"}
deepseek/deepseek-v4-flash:none__json_schema PASS 100% 198 in → 22 out (0 reasoning)
git worktree lock ../feature-wt --reason 'do not delete'
JSON Schema Structured Output
(raw) { "command": "git worktree lock ../feature-wt --reason 'do not delete'" }
mistralai/devstral-2512__json_schema PASS 100% 234 in → 21 out
git worktree lock --reason "do not delete" ../feature-wt
JSON Schema Structured Output
(raw) {"command": "git worktree lock --reason \"do not delete\" ../feature-wt"}
nvidia/nemotron-3-nano-30b-a3b:high PASS 100% 250 in → 313 out (326 reasoning)
git worktree lock ../feature-wt --reason "do not delete"
nvidia/nemotron-3-nano-30b-a3b:high__json_schema PASS 100% 248 in → 436 out (443 reasoning)
git worktree lock '../feature-wt' --reason="do not delete"
JSON Schema Structured Output
(raw) { "command": "git worktree lock '../feature-wt' --reason=\"do not delete\"" }
nvidia/nemotron-3-nano-30b-a3b:none PASS 100% 248 in → 16 out (0 reasoning)
git worktree lock ../feature-wt --reason "do not delete"
nvidia/nemotron-3-nano-30b-a3b:none__json_schema PASS 100% 252 in → 23 out (0 reasoning)
git worktree lock ../feature-wt --reason 'do not delete'
JSON Schema Structured Output
(raw) { "command": "git worktree lock ../feature-wt --reason 'do not delete'" }
poolside/laguna-xs.2:high PASS 100% 262 in → 227 out (211 reasoning)
git worktree lock ../feature-wt --reason "do not delete"
poolside/laguna-xs.2:high__json_schema PASS 100% 273 in → 275 out (250 reasoning)
git worktree lock ../feature-wt --reason "do not delete"
JSON Schema Structured Output
(raw) { "command": "git worktree lock ../feature-wt --reason \"do not delete\"" }
deepseek/deepseek-v4-flash:none FAIL 0% 200 in → 20 out (0 reasoning)
```bash git worktree lock ../feature-wt --reason "do not delete" ```
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'locked'}, {'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'do not delete'}]
mistralai/devstral-2512 FAIL 0% 237 in → 21 out
```bash git worktree lock --reason "do not delete" ../feature-wt ```
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'locked'}, {'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'do not delete'}]
poolside/laguna-xs.2:none FAIL 0% 272 in → 17 out (0 reasoning)
git worktree lock ../feature-wt -m "do not delete"
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'locked'}, {'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'do not delete'}]
poolside/laguna-xs.2:none__json_schema FAIL 0% 270 in → 25 out (0 reasoning)
git worktree lock ../feature-wt -m 'do not delete'
JSON Schema Structured Output
(raw) { "command": "git worktree lock ../feature-wt -m 'do not delete'" }
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'locked'}, {'type': 'git_output', 'command': 'git worktree list --porcelain', 'contains': 'do not delete'}]