Codex, Opus, Gemini try to build Counter Strike

instantdb.com

instantdb.com Saved Tuesday, December 2, 2025 Raindrop

High resonance

Write Soon

This article strongly resonates and has clear angles for your perspective

Quick Take

This is exactly the kind of hands-on AI evaluation Brian would find valuable - testing three major models on a complex, multi-step engineering task rather than toy problems. The detailed breakdown of where each model excelled (Claude for frontend polish, Gemini for backend logic) provides actionable intelligence for choosing AI tools in real projects.

Relevant Domains

AI/agents/future of software work Engineering craft/architecture/productivity (secondary) Side projects/automation/earning from skills (tertiary)

Blog Angles

"I Tested AI Models on Real Engineering Tasks (Not Toy Examples)"

Thesis

Your Hook

"The React useEffect Problem Is Blocking AI (And Humans)"

Thesis

Your Hook

"Why I'm Not Retiring My IDE Yet: What AI Still Gets Wrong"

Thesis

Your Hook

"The AI Model Hierarchy: When to Use Claude vs Gemini vs GPT"

Thesis

Your Hook

Quick Take

Relevant Domains

Blog Angles

"I Tested AI Models on Real Engineering Tasks (Not Toy Examples)"

"The React useEffect Problem Is Blocking AI (And Humans)"

"Why I'm Not Retiring My IDE Yet: What AI Still Gets Wrong"

"The AI Model Hierarchy: When to Use Claude vs Gemini vs GPT"

Tags