Single unmerged branch
Tests whether the model recognizes that no branches should be deleted when the only branch besides main is unmerged. Evaluates the all-unmerged edge case.
Baseline Repository
These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.
git init
git config user.email 'test@test.com'
git config user.name 'Test User'
echo 'App' > app.py
git add app.py
git commit -m 'Initial'
git checkout -b feature-experimental
echo 'experimental' > exp.py
git add exp.py
git commit -m 'Experiment'
git checkout main
Prompt
Look at the branches in this repository. Which branches should be deleted because they are fully merged into main? List ONLY the branch names to delete, one per line. If none, respond with 'none'.
Expected
none
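The expected answer can be checked by hand. The following is a minimal sketch, not part of the test harness: it rebuilds the baseline repository in a scratch directory and asks Git which branches are fully merged into main. The `git symbolic-ref` line is an added assumption that pins the initial branch name to main, since plain `git init` may pick a different default; the variable names `repo`, `merged`, and `answer` are illustrative only.

```shell
#!/bin/sh
# Sketch: rebuild the baseline repository and list branches merged into main.
set -eu

repo=$(mktemp -d)
cd "$repo"

git init -q
# Assumption: force the initial branch name to main, because the default
# branch name of `git init` depends on the local Git configuration.
git symbolic-ref HEAD refs/heads/main
git config user.email 'test@test.com'
git config user.name 'Test User'

echo 'App' > app.py
git add app.py
git commit -q -m 'Initial'
git checkout -q -b feature-experimental
echo 'experimental' > exp.py
git add exp.py
git commit -q -m 'Experiment'
git checkout -q main

# Branches whose tips are reachable from main, excluding main itself.
# grep exits non-zero when nothing is left, hence the `|| true`.
merged=$(git branch --format='%(refname:short)' --merged main | grep -v -x 'main' || true)

# Answer in the format the prompt asks for: branch names, or 'none'.
answer=${merged:-none}
echo "$answer"
```

Because feature-experimental carries a commit that main does not contain, `--merged main` reports only main itself, so nothing qualifies for deletion.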
Model Outputs (14)
none
none
none
none
JSON Schema
Structured Output
(raw) {"branches_to_delete":["none"]}
none
none
none
(empty output)
JSON Schema
Structured Output
(raw) {
"branches_to_delete": []
}
Failure: Missing: ['none']
feature-experimental
Failure: Missing: ['none']
Extra: ['feature-experimental']
feature-experimental
JSON Schema
Structured Output
(raw) { "branches_to_delete": ["feature-experimental"] }
Failure: Missing: ['none']
Extra: ['feature-experimental']
feature-experimental
JSON Schema
Structured Output
(raw) {"branches_to_delete": ["feature-experimental"]}
Failure: Missing: ['none']
Extra: ['feature-experimental']
feature-experimental
JSON Schema
Structured Output
(raw) {
"branches_to_delete": [
"feature-experimental"
]
}
Failure: Missing: ['none']
Extra: ['feature-experimental']
(empty output)
JSON Schema
Structured Output
(raw) {"branches_to_delete": []}
Failure: Missing: ['none']
feature-experimental
JSON Schema
Structured Output
(raw) {"branches_to_delete": ["feature-experimental"]}
Failure: Missing: ['none']
Extra: ['feature-experimental']
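The Missing and Extra lines in the failures above read like a set comparison between the expected answer and the model's answer. The actual grader is not shown here, so the following is only a hypothetical sketch of how such a comparison could work, treating each answer as a set of non-blank lines. The function name `compare` and its output format are assumptions.

```shell
#!/bin/sh
# Hypothetical grader sketch: compare expected and actual answers as line sets.
set -eu

# compare EXPECTED ACTUAL
# Prints "Missing: ..." and/or "Extra: ..." and returns non-zero on mismatch.
compare() {
    exp_file=$(mktemp)
    act_file=$(mktemp)
    # Drop blank lines, then sort and de-duplicate so comm can diff the sets.
    printf '%s\n' "$1" | sed '/^$/d' | sort -u > "$exp_file"
    printf '%s\n' "$2" | sed '/^$/d' | sort -u > "$act_file"

    # Lines only in the expected set, and lines only in the actual set.
    missing=$(comm -23 "$exp_file" "$act_file" | paste -s -d ' ' -)
    extra=$(comm -13 "$exp_file" "$act_file" | paste -s -d ' ' -)
    rm -f "$exp_file" "$act_file"

    status=0
    if [ -n "$missing" ]; then echo "Missing: $missing"; status=1; fi
    if [ -n "$extra" ]; then echo "Extra: $extra"; status=1; fi
    return "$status"
}
```

Under this sketch, `compare 'none' 'feature-experimental'` reports both a missing and an extra entry, while `compare 'none' ''` reports only the missing 'none', which mirrors the two failure shapes listed above.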