
Agent BattleGround — Claude Code vs Codex
I gave Claude Code (Fable 5) and Codex (gpt-5.5) the exact same six creative-coding tasks: an arcade game, a generative-art CLI, a bug hunt, an ASCII aquarium, a messy-data dashboard, and a 120-line demoscene. The rules: one shot each, zero interventions, blind judges, instrumented re-tests. This is the full fight card: every prompt, the actual screen recordings, the parameter-by-parameter scorecards, and what the judges said. Final tally: 6–0, but the story is closer than the headline.







