: GitBench
N
Nvidia / nemotron-3-nano-30b-a3b

high

86.8% 177 / 204 fixtures 1 run(s)
42,451 input / 291,706 total output / 89,059 reasoning within output tokens $0.06158866
84.3% 172 / 204 fixtures 1 run(s)
42,274 input / 84,835 total output / 80,310 reasoning within output tokens $0.02289684
Loading reliability summary…
Pass Rate Delta
-2.5% Text: 86.8% → JSON: 84.3%
+8
Gained
JSON pass / text fail
−13
Lost
Text pass / JSON fail
164
Unchanged Pass
Both pass
19
Unchanged Fail
Both fail
Fixture Reliability Delta
Fixture Text JSON Delta
f010 100% (1/1) 0% (0/1) +100%
Benchmark Deltas
Benchmark Text JSON Delta
reflog 91.7% 66.7% -25%
rebase 75% 58.3% -16.7%
blame_forensics 83.3% 91.7% + 8.3%
commit_messages 100% 91.7% -8.3%
commit_squash 83.3% 75% -8.3%
git_clean 91.7% 100% + 8.3%
merge_conflicts 58.3% 50% -8.3%
stash_recovery 91.7% 100% + 8.3%
submodule_usage 75% 83.3% + 8.3%
cherry_pick 50% 41.7% -8.3%
branch_cleanup 100% 100% + 0%
git_bisect 100% 100% + 0%
git_grep 100% 100% + 0%
git_log_format 100% 100% + 0%
git_show 100% 100% + 0%
tag_management 91.7% 91.7% + 0%
worktree_usage 83.3% 83.3% + 0%
Changed Fixtures (21)