: GitBench
Three-way: conflicting port and protocol settings in same line range
Tests ability to resolve a three-way merge conflict with port and protocol settings. Evaluates multi-dimensional conflict resolution with explicit preferences.

These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.

  1. 01 git init
  2. 02 git config user.email 'test@test.com'
  3. 03 git config user.name 'Test User'
  4. 04 printf 'PORT=8080 PROTOCOL=http' > config.txt
  5. 05 git add config.txt
  6. 06 git commit -m 'Initial config'
  7. 07 git checkout -b branch-a
  8. 08 printf 'PORT=8443 PROTOCOL=https' > config.txt
  9. 09 git add config.txt
  10. 10 git commit -m 'Use HTTPS port'
  11. 11 git checkout main
  12. 12 git checkout -b branch-b
  13. 13 printf 'PORT=9090 PROTOCOL=http' > config.txt
  14. 14 git add config.txt
  15. 15 git commit -m 'Change to non-standard port'
  16. 16 git checkout main
  17. 17 git merge branch-a
  18. 18 git merge branch-b
Prompt
Resolve the merge conflict in config.txt. Three different configurations were attempted: base PORT=8080/PROTOCOL=http, branch-a PORT=8443/PROTOCOL=https, branch-b PORT=9090/PROTOCOL=http. Both branches were merged into main (branch-a first, then branch-b). Prefer the HTTPS configuration when there is a conflict. Provide ONLY the resolved config.txt content.
Expected
PORT=8443 PROTOCOL=https
Loading campaign evidence…
deepseek/deepseek-v4-flash:high PASS 100% 130 in → 113 out (103 reasoning)
PORT=8443 PROTOCOL=https
deepseek/deepseek-v4-flash:high__json_schema PASS 100% 130 in → 224 out (224 reasoning)
PORT=8443 PROTOCOL=https
JSON Schema Structured Output
(raw) { "resolved_content": "PORT=8443\nPROTOCOL=https" }
deepseek/deepseek-v4-flash:none PASS 100% 130 in → 10 out (0 reasoning)
PORT=8443 PROTOCOL=https
deepseek/deepseek-v4-flash:none__json_schema PASS 100% 130 in → 20 out (0 reasoning)
PORT=8443 PROTOCOL=https
JSON Schema Structured Output
(raw) { "resolved_content": "PORT=8443\nPROTOCOL=https" }
mistralai/devstral-2512__json_schema PASS 100% 140 in → 19 out
PORT=8443 PROTOCOL=https
JSON Schema Structured Output
(raw) {"resolved_content": "PORT=8443\nPROTOCOL=https"}
nvidia/nemotron-3-nano-30b-a3b:high PASS 100% 153 in → 190 out (180 reasoning)
PORT=8443 PROTOCOL=https
nvidia/nemotron-3-nano-30b-a3b:none PASS 100% 153 in → 12 out (0 reasoning)
PORT=8443 PROTOCOL=https
poolside/laguna-xs.2:high PASS 100% 178 in → 258 out (245 reasoning)
PORT=8443 PROTOCOL=https
poolside/laguna-xs.2:high__json_schema PASS 100% 178 in → 201 out (182 reasoning)
PORT=8443 PROTOCOL=https
JSON Schema Structured Output
(raw) {"resolved_content": "PORT=8443\nPROTOCOL=https\n"}
poolside/laguna-xs.2:none__json_schema PASS 100% 178 in → 22 out (0 reasoning)
PORT=8443 PROTOCOL=https
JSON Schema Structured Output
(raw) { "resolved_content": "PORT=8443\nPROTOCOL=https" }
mistralai/devstral-2512 FAIL 0% 140 in → 31 out
Here's the resolved `config.txt` content, preferring the HTTPS configuration: ``` PORT=8443 PROTOCOL=https ```
Failure: Expected 'PORT=8443 PROTOCOL=https ', got 'Here's the resolved `config.txt` content, preferring the HTTPS configuration: ``` PORT=8443 PROTOCOL=https ```'
Invalid JSON. Output:
JSON Schema Structured Output
Structured Output Error
Failed to parse structured JSON response: Expecting value: line 1 column 1 (char 0)
Failure: Failed to parse structured JSON response: Expecting value: line 1 column 1 (char 0)
nvidia/nemotron-3-nano-30b-a3b:none__json_schema FAIL 0% 153 in → 32 out (0 reasoning)
[resolved]PORT=8443 [resolved]PROTOCOL=https
JSON Schema Structured Output
(raw) { "resolved_content": "[resolved]PORT=8443\n[resolved]PROTOCOL=https\n" }
Failure: Expected 'PORT=8443 PROTOCOL=https ', got '[resolved]PORT=8443 [resolved]PROTOCOL=https '
poolside/laguna-xs.2:none FAIL 0% 178 in → 12 out (0 reasoning)
PORT=9090 PROTOCOL=https
Failure: Expected 'PORT=8443 PROTOCOL=https ', got ' PORT=9090 PROTOCOL=https '