Baseline Repository
These commands set up the repo before the model sees the prompt. They define the starting file structure, staged changes, and Git history.
git init
git config user.email 'alice@example.com'
git config user.name 'Alice'
echo 'a' > file.txt
git add file.txt
git commit -m 'Alice commit 1'
git config user.email 'bob@example.com'
git config user.name 'Bob'
echo 'b' > file.txt
git add file.txt
git commit -m 'Bob commit 1'
git config user.email 'alice@example.com'
git config user.name 'Alice'
echo 'c' > file.txt
git add file.txt
git commit -m 'Alice commit 2'
git config user.email 'alice@example.com'
git config user.name 'Alice'
echo 'd' > file.txt
git add file.txt
git commit -m 'Alice commit 3'
git config user.email 'bob@example.com'
git config user.name 'Bob'
echo 'e' > file.txt
git add file.txt
git commit -m 'Bob commit 2'
Prompt
How many commits were authored by Alice? Output ONLY the number, nothing else.
Expected
3
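One way to confirm the expected value is to rebuild the baseline in a scratch directory and ask Git for the count. The sketch below is illustrative only, not the benchmark's own harness. The `commit_as` helper is hypothetical, and it sets the identity per commit with `git -c` instead of repeated `git config` calls, which should produce the same authorship.

```shell
#!/bin/sh
set -eu

# Rebuild the baseline history in a throwaway directory.
repo=$(mktemp -d)
cd "$repo"
git init -q

# commit_as NAME EMAIL CONTENT MESSAGE
# Hypothetical helper: overwrite file.txt and commit it as the given author.
commit_as() {
    echo "$3" > file.txt
    git add file.txt
    git -c user.name="$1" -c user.email="$2" commit -q -m "$4"
}

commit_as Alice alice@example.com a 'Alice commit 1'
commit_as Bob   bob@example.com   b 'Bob commit 1'
commit_as Alice alice@example.com c 'Alice commit 2'
commit_as Alice alice@example.com d 'Alice commit 3'
commit_as Bob   bob@example.com   e 'Bob commit 2'

# Count commits reachable from HEAD whose author line matches Alice's email.
count=$(git rev-list --count --author='alice@example.com' HEAD)
echo "$count"
```

Filtering on the author field rather than on the commit message matters here: every commit message also contains a name, so a plain text search over `git log` output could miscount.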
Model Outputs (14)
3
3
JSON Schema
Structured Output
(raw) { "count": 3 }
3
3
JSON Schema
Structured Output
(raw) {
"count": 3
}
3
3
JSON Schema
Structured Output
(raw) {"count": 3}
3
3
JSON Schema
Structured Output
(raw) {
"count": 3
}
3
3
JSON Schema
Structured Output
(raw) {"count":3}
3
3 ways
Failure: Expected '3', got '3 ways'
4
JSON Schema
Structured Output
(raw) { "count": 4 }
Failure: Expected '3', got '4'
4
JSON Schema
Structured Output
(raw) {"count": 4}
Failure: Expected '3', got '4'
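The failure messages above are consistent with a strict string comparison against the expected value, which is why '3 ways' fails even though it contains the right number. A minimal sketch of such a check follows, assuming exact equality with no trimming or number parsing. The `check_output` function and the "Pass" message are hypothetical; the actual grader is not shown on this page.

```shell
#!/bin/sh

# check_output EXPECTED ACTUAL
# Hypothetical grader: passes only when the two strings are identical.
check_output() {
    expected=$1
    actual=$2
    if [ "$actual" = "$expected" ]; then
        echo "Pass"
    else
        echo "Failure: Expected '$expected', got '$actual'"
    fi
}

check_output 3 3
check_output 3 '3 ways'
check_output 3 4
```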