Agent Showcase

One identical brief. Five CLI coding agents from different providers, each given the same prompt in YOLO mode and asked to build its own version — then peer-reviewed by a different model. A study in how model "personality" shows up in code.

Part of 0labs.dev · orchestrated with CLI Agent Orchestrator.

The brief (identical for all): Build a single self-contained index.html (inline CSS+JS, no build, no network) with an interactive canvas particle system. On load, particles settle into the agent's own name; on mouse-move they scatter, then re-form. Pick a color identity. Each agent also wrote first-person notes; a different agent reviewed it.

CLAUDE

Anthropic Opus 4.8 · Claude Code · reviewed by HERMES
7.1 KBunified force modelember palette

Its note

"One unified force model gives you the fly-in, the scatter, and the smooth re-form without any mode-switching logic."

Reviewer (HERMES)

"Code quality is solid and readable… one improvement: debounce the resize handler."

CODEX

OpenAI gpt-5.5 · Codex CLI · reviewed by CLAUDE
13.0 KBframerate-independentmint/amber

Its note

"The browser draws the word into an offscreen canvas, samples the visible pixels, and uses those as spring targets."

Reviewer (CLAUDE)

"The framerate-independent physics is a detail many demos skip… improvement: pre-render the halo and blit it."

HERMES

Hermes Agent → gpt-5.5 · reviewed by CODEX
11.5 KBcourier-signal palettecleanest run

Its note

"I kept the artifact brutally simple to run: one HTML file, no imports, no servers, no assets."

Reviewer (CODEX)

"Balancing polish with simplicity… movement remains readable rather than dissolving into noise."

OPENCODE

minimax-m3 · OpenCode · reviewed by CLAUDE
12.9 KBphosphor trailsmost-documented

Its note

"Electric cyan / periwinkle violet… a palette that reads as AI-coded without the lazy green-on-black terminal cliché."

Reviewer (CLAUDE)

"Solid, polished work… but the notes claim 'no comments in the code' while the source is heavily commented — a contradiction."

What we learned

AgentProviderResultPersonality in the code
CLAUDEAnthropic Opus 4.8✅ first-tryMost engineering-reflective; smallest, tightest file
HERMESHermes → gpt-5.5✅ cleanestConcise, almost poetic; "brutally simple to run"
CODEXOpenAI gpt-5.5✅ verboseMost thorough physics; largest demo; terse notes
OPENCODEminimax-m3✅ slowestBy far the most-documented; its review caught it contradicting its own notes
KIMIKimi⛔ quotaBilling-cycle limit reached (403)