GitBench
Create worktree for a different branch
Tests the ability to create a Git worktree for an existing branch. Evaluates basic knowledge of the worktree-creation command.

These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.

  1. git init
  2. git config user.email 'test@test.com'
  3. git config user.name 'Test User'
  4. echo 'Hello' > hello.txt
  5. git add hello.txt
  6. git commit -m 'Initial commit'
  7. git checkout -b feature
  8. echo 'Feature code' > feature.txt
  9. git add feature.txt
  10. git commit -m 'Add feature'
  11. git checkout main
Prompt
Run the command to create a git worktree for the 'feature' branch at path '../feature-wt' (relative to the repo). Output ONLY the git worktree command, nothing else.
Expected
git worktree add ../feature-wt feature
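The setup and the expected answer can be replayed locally to see what a passing run leaves on disk. The script below is a minimal sketch, not the GitBench harness: the temporary directory, the `git symbolic-ref` line (added so the initial branch is `main` whatever `init.defaultBranch` says), and the variable names are assumptions. The three checks mirror the assertion types quoted in the failure messages on this page.

```shell
#!/bin/sh
set -eu

# Work in a throwaway directory so ../feature-wt lands inside it.
work=$(mktemp -d)
mkdir "$work/repo"
cd "$work/repo"

git init -q
# Assumption: force the unborn branch to be 'main' on any Git configuration.
git symbolic-ref HEAD refs/heads/main
git config user.email 'test@test.com'
git config user.name 'Test User'
echo 'Hello' > hello.txt
git add hello.txt
git commit -q -m 'Initial commit'
git checkout -q -b feature
echo 'Feature code' > feature.txt
git add feature.txt
git commit -q -m 'Add feature'
git checkout -q main

# The expected answer: check out the existing 'feature' branch in a new worktree.
git worktree add ../feature-wt feature >/dev/null 2>&1

# Values the assertions look at.
wt_branch=$(git -C ../feature-wt branch --show-current)
wt_file=$(cat ../feature-wt/feature.txt)
echo "branch: $wt_branch"
echo "feature.txt: $wt_file"
```

Because the path is relative, the worktree is created as a sibling of the repository directory, which is why the sketch nests the repository one level below the temporary directory.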
deepseek/deepseek-v4-flash:high PASS 100% 126 in → 113 out (108 reasoning)
git worktree add ../feature-wt feature
deepseek/deepseek-v4-flash:high__json_schema PASS 100% 133 in → 154 out (135 reasoning)
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) { "command": "git worktree add ../feature-wt feature" }
deepseek/deepseek-v4-flash:none PASS 100% 129 in → 10 out (0 reasoning)
git worktree add ../feature-wt feature
deepseek/deepseek-v4-flash:none__json_schema PASS 100% 127 in → 17 out (0 reasoning)
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) { "command": "git worktree add ../feature-wt feature"}
mistralai/devstral-2512__json_schema PASS 100% 145 in → 15 out
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) {"command": "git worktree add ../feature-wt feature"}
nvidia/nemotron-3-nano-30b-a3b:high PASS 100% 158 in → 228 out (245 reasoning)
git worktree add '../feature-wt' feature
nvidia/nemotron-3-nano-30b-a3b:high__json_schema PASS 100% 159 in → 295 out (322 reasoning)
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) { "command": "git worktree add ../feature-wt feature" }
poolside/laguna-xs.2:high PASS 100% 189 in → 139 out (128 reasoning)
git worktree add ../feature-wt feature
poolside/laguna-xs.2:high__json_schema PASS 100% 186 in → 365 out (350 reasoning)
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) {"command": "git worktree add ../feature-wt feature"}
poolside/laguna-xs.2:none PASS 100% 189 in → 11 out (0 reasoning)
git worktree add ../feature-wt feature
poolside/laguna-xs.2:none__json_schema PASS 100% 188 in → 15 out (0 reasoning)
git worktree add ../feature-wt feature
JSON Schema Structured Output
(raw) {"command": "git worktree add ../feature-wt feature"}
nvidia/nemotron-3-nano-30b-a3b:none__json_schema FAIL 75% 161 in → 25 out (0 reasoning)
git worktree add --force ../feature-wt refs/heads/feature
JSON Schema Structured Output
(raw) { "command": "git worktree add --force ../feature-wt refs/heads/feature" }
Failure: Failed assertions: [{'type': 'git_output', 'command': 'git -C ../feature-wt branch --show-current', 'contains': 'feature'}]
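A likely explanation for this partial score: passing the fully qualified ref `refs/heads/feature` makes Git treat the argument as a plain commit-ish, so the worktree is created with a detached HEAD and `git branch --show-current` prints nothing. The sketch below reproduces that under the same setup; the temporary directory, the `git symbolic-ref` line, and the variable names are assumptions, not part of the benchmark.

```shell
#!/bin/sh
set -eu

work=$(mktemp -d)
mkdir "$work/repo"
cd "$work/repo"

# Same starting state as the benchmark setup, condensed.
git init -q
git symbolic-ref HEAD refs/heads/main   # assumption: pin the initial branch name
git config user.email 'test@test.com'
git config user.name 'Test User'
echo 'Hello' > hello.txt && git add hello.txt && git commit -q -m 'Initial commit'
git checkout -q -b feature
echo 'Feature code' > feature.txt && git add feature.txt && git commit -q -m 'Add feature'
git checkout -q main

# The failing answer: a full ref name instead of the short branch name.
git worktree add --force ../feature-wt refs/heads/feature >/dev/null 2>&1

# The directory and file exist, but no branch is checked out in the worktree.
detached_branch=$(git -C ../feature-wt branch --show-current)
detached_file=$(cat ../feature-wt/feature.txt)
echo "branch: '$detached_branch'"
echo "feature.txt: $detached_file"
```

This matches the 75% result: the directory and file-content checks can pass while the current-branch check fails.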
mistralai/devstral-2512 FAIL 25% 145 in → 15 out
```bash git worktree add ../feature-wt feature ```
Failure: Failed assertions: [{'type': 'dir_exists', 'path': '../feature-wt'}, {'type': 'branch_exists', 'name': 'feature', 'in_worktree': '../feature-wt'}, {'type': 'file_content', 'path': '../feature-wt/feature.txt', 'value': 'Feature code'}]
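The command inside the fence is the expected one, so the failed assertions are consistent with the response being executed verbatim, Markdown fence included, and no worktree being created. A caller that wants to tolerate this formatting could strip the fence before running the command. The snippet below is a sketch of that idea only; it is not something GitBench is known to do, and the `response` and `command_line` variables are hypothetical.

```shell
#!/bin/sh
set -eu

# Hypothetical raw response: the right command wrapped in a Markdown fence.
response=$(printf '%s\n' '```bash' 'git worktree add ../feature-wt feature' '```')

# Delete fence lines and blank lines, leaving only the command text.
command_line=$(printf '%s\n' "$response" | sed -e '/^```/d' -e '/^[[:space:]]*$/d')
echo "$command_line"
```

Since the prompt asks for ONLY the command, scoring the fenced response as a failure is a defensible design choice; normalizing it would measure Git knowledge rather than instruction following.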
nvidia/nemotron-3-nano-30b-a3b:none FAIL 25% 160 in → 249 out (0 reasoning)
git worktree add -b feature '../feature-wt' 65a8d03af0ef240097c40d15028cbf9eeea6b15e
Failure: Failed assertions: [{'type': 'dir_exists', 'path': '../feature-wt'}, {'type': 'branch_exists', 'name': 'feature', 'in_worktree': '../feature-wt'}, {'type': 'file_content', 'path': '../feature-wt/feature.txt', 'value': 'Feature code'}]
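This answer most plausibly fails because `-b feature` asks Git to create a new branch named `feature`, which already exists after the setup, so the command aborts before any worktree directory is made. The sketch below isolates that effect by using `HEAD` in place of the commit hash from the response (whether that hash exists in the test repository is unknown); the temporary directory, the `git symbolic-ref` line, and the `status` variable are assumptions.

```shell
#!/bin/sh
set -eu

work=$(mktemp -d)
mkdir "$work/repo"
cd "$work/repo"

# Same starting state as the benchmark setup, condensed.
git init -q
git symbolic-ref HEAD refs/heads/main   # assumption: pin the initial branch name
git config user.email 'test@test.com'
git config user.name 'Test User'
echo 'Hello' > hello.txt && git add hello.txt && git commit -q -m 'Initial commit'
git checkout -q -b feature
echo 'Feature code' > feature.txt && git add feature.txt && git commit -q -m 'Add feature'
git checkout -q main

# -b tries to create 'feature' again; capture the exit status instead of aborting.
if git worktree add -b feature ../feature-wt HEAD >/dev/null 2>&1; then
  status=0
else
  status=$?
fi
echo "exit status: $status"
```

With no directory created, the directory, branch, and file-content assertions all fail together, which matches the three failures listed above.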