: GitBench
Create detached worktree
Tests ability to create a detached worktree at a specific commit. Evaluates detached-worktree creation.

These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.

  1. 01 git init
  2. 02 git config user.email 'test@test.com'
  3. 03 git config user.name 'Test User'
  4. 04 echo 'v1' > version.txt
  5. 05 git add version.txt
  6. 06 git commit -m 'Version 1'
  7. 07 echo 'v2' > version.txt
  8. 08 git add version.txt
  9. 09 git commit -m 'Version 2'
Prompt
Create a detached git worktree at '../detached-wt' pointing at the first commit (HEAD~1). Output ONLY the git worktree command, nothing else.
Expected
git worktree add --detach ../detached-wt HEAD~1
Loading campaign evidence…
deepseek/deepseek-v4-flash:high PASS 100% 121 in → 175 out (161 reasoning)
git worktree add ../detached-wt HEAD~1
deepseek/deepseek-v4-flash:none PASS 100% 124 in → 16 out (0 reasoning)
git worktree add --detach ../detached-wt HEAD~1
deepseek/deepseek-v4-flash:none__json_schema PASS 100% 122 in → 24 out (0 reasoning)
git worktree add --detach ../detached-wt HEAD~1
JSON Schema Structured Output
(raw) { "command": "git worktree add --detach ../detached-wt HEAD~1" }
mistralai/devstral-2512__json_schema PASS 100% 142 in → 21 out
git worktree add --detach ../detached-wt HEAD~1
JSON Schema Structured Output
(raw) {"command": "git worktree add --detach ../detached-wt HEAD~1"}
nvidia/nemotron-3-nano-30b-a3b:high PASS 100% 158 in → 357 out (328 reasoning)
git worktree add ../detached-wt HEAD~1
nvidia/nemotron-3-nano-30b-a3b:high__json_schema PASS 100% 159 in → 228 out (185 reasoning)
git worktree add ../detached-wt HEAD~1
JSON Schema Structured Output
(raw) { "command": "git worktree add ../detached-wt HEAD~1" }
nvidia/nemotron-3-nano-30b-a3b:none PASS 100% 154 in → 13 out (0 reasoning)
git worktree add ../detached-wt HEAD~1
nvidia/nemotron-3-nano-30b-a3b:none__json_schema PASS 100% 156 in → 26 out (0 reasoning)
git worktree add ../detached-wt $(git rev-parse HEAD~1)
JSON Schema Structured Output
(raw) { "command": "git worktree add ../detached-wt $(git rev-parse HEAD~1)" }
poolside/laguna-xs.2:high PASS 100% 184 in → 139 out (124 reasoning)
git worktree add ../detached-wt HEAD~1
poolside/laguna-xs.2:high__json_schema PASS 100% 181 in → 189 out (171 reasoning)
git worktree add ../detached-wt HEAD~1
JSON Schema Structured Output
(raw) {"command": "git worktree add ../detached-wt HEAD~1"}
poolside/laguna-xs.2:none__json_schema PASS 100% 183 in → 20 out (0 reasoning)
git worktree add --detach ../detached-wt HEAD~1
JSON Schema Structured Output
(raw) {"command": "git worktree add --detach ../detached-wt HEAD~1"}
Invalid JSON. Output: git worktree add --detach ../detached-wt HEAD~1
JSON Schema Structured Output
Structured Output Error
Failed to parse structured JSON response: Expecting value: line 1 column 1 (char 0)
Failure: Failed to parse structured JSON response: Expecting value: line 1 column 1 (char 0)
mistralai/devstral-2512 FAIL 0% 145 in → 21 out
```sh git worktree add --detach ../detached-wt HEAD~1 ```
Failure: Failed assertions: [{'type': 'dir_exists', 'path': '../detached-wt'}, {'type': 'file_content', 'path': '../detached-wt/version.txt', 'value': 'v1'}]
poolside/laguna-xs.2:none FAIL 0% 185 in → 21 out (0 reasoning)
```bash git worktree add --detach ../detached-wt HEAD~1 ```
Failure: Failed assertions: [{'type': 'dir_exists', 'path': '../detached-wt'}, {'type': 'file_content', 'path': '../detached-wt/version.txt', 'value': 'v1'}]