AI News & Strategy Daily with Nate B. Jones
Claude Code vs Codex: The Decision That Compounds Every Week You Delay
What's really happening inside AI coding tools that nobody's comparing? The common story is that Claude vs. ChatGPT is a model competition. But the model is the least important part.
In this video, I share the inside scoop on why the AI harness matters more than the model:
- Why the same Claude model scored 78% vs. 42% on identical benchmarks
- How Claude Code and Codex embody opposite philosophies of AI collaboration
- What harness lock-in actually costs teams who switch tools later
- Where non-technical leaders are making the wrong procurement decisions
The teams getting this right are choosing the architecture that matches how they work, and that decision compounds every quarter.
Chapters
00:00 The harness vs. the model — what everyone gets wrong
01:45 Why nobody compares AI harnesses
03:20 Same model, double the performance: the benchmark that proves it
04:50 How Anthropic built Claude Code's harness
07:10 How OpenAI built Codex's harness
09:30 Five ways the harnesses are diverging
13:45 State and memory: where institutional knowledge lives
16:20 Context management and tool integration
19:00 Multi-agent coordination: collaboration vs. isolation
21:30 Harness lock-in: the cost nobody is pricing in
24:00 What this means for engineers and engineering leaders
26:30 Why non-technical leaders need to understand this now
Subscribe for daily AI strategy and news.
Full Story w/ Prompts: https://natesnewsletter.substack.com/p/same-model-78-vs-42-the-harness-made
For deeper playbooks and analysis: https://natesnewsletter.substack.com/
My site: https://natebjones.com
___________________
More episodes

Wall Street Just Bet $285 Billion on AI Agents. The Best One Barely Works.
22:28 | What's really happening with AI agents that claim to do the work for you? The common story is that outcome-focused AI agents have finally arrived. The reality is that most of them still can't answer three basic questions.
In this video, I share the inside scoop on which AI agents actually deliver outcomes and which are still living on demo energy:
- Why verifiability is the hidden foundation of every real agent
- How three questions separate genuine agents from expensive hype
- What Lindy, Google Opal, Sauna, and Obvious actually get right
- Where the three-layer architecture points for builders who want control
Operators and builders who apply these three questions before committing will avoid the hype cycle and invest in tools that compound value over time.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/every-ai-agent-you-use-has-the-same?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
I Broke Down Anthropic's $2.5 Billion Leak. Your Agent Is Missing 12 Critical Pieces.
26:52 | What's really happening inside the $2.5 billion run rate product when Anthropic accidentally leaks the entire Claude Code architecture? The common story is that the leak reveals upcoming features. But the reality is that the secret sauce is 12 boring primitives that make agents actually work at scale, and most teams skip half of them.
In this video, I share the inside scoop on what Claude Code teaches us about building production agents:
- Why tool registries with metadata-first design are day one non-negotiables
- How an 18-module security architecture protects a single bash tool
- What session persistence and workflow state actually need to capture
- Where most agentic projects die from premature complexity
Builders who keep chasing the glamorous AI parts will keep shipping demos that crash. The leak proves that successful agents are 80% plumbing and 20% model.
Subscribe for daily AI strategy and news.
For deeper playbooks and analysis: https://natesnewsletter.substack.com/p/your-agent-has-12-blind-spots-you?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
Your Claude Limit Burns In 90 Minutes Because Of One ChatGPT Habit.
26:34 | What's really happening inside your AI costs when Jensen Huang says engineers will spend $250,000 a year on tokens? The common story is that frontier models are expensive. But the reality is that your habits cost more than the models ever will, and most users burn 8-10x what they need to.
In this video, I share the inside scoop on token efficiency before Mythos pricing hits:
- Why raw PDFs can turn 4,500 words into 100,000 tokens
- How conversation sprawl compounds waste with every turn
- What plugin overhead costs you before you type a word
- Where model mixing drops a $10 session to $1
Builders who keep burning tokens as a badge of honor will face a reckoning when cutting-edge models cost 10x what Opus costs today. The habits you build now determine whether you scale or stall.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/your-claude-sessions-cost-10x-what
Claude Mythos Changes Everything. Your AI Stack Isn't Ready.
31:19 | What's really happening inside Anthropic when Claude Mythos leaks and security researchers say it found zero-day vulnerabilities in a 50,000-star GitHub repo within minutes? The common story is that bigger models just mean better benchmarks. But the reality is that Mythos is a step change that will force you to simplify everything you've built around weaker models.
In this video, I share the inside scoop on how to prepare before Mythos drops:
- Why your 3,000-token system prompts are about to become liabilities
- How retrieval architecture shifts when the model fills its own context
- What hard-coded domain knowledge you can finally delete
- Where verification gates need to move in your pipeline
Builders who keep compensating for model limitations instead of simplifying toward outcomes will be left behind. The bitter lesson is that smarter models reward letting go.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/anthropic-just-built-a-model-that?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
Your iPhone Is About to Control Every AI App You Use. Here's What This Means For You.
22:11 | What's really happening inside Apple's AI strategy heading into WWDC? The common story is that Apple lost the AI race. The reality is more complicated.
In this video, I share the inside scoop on Apple's agentic play and what WWDC will actually signal:
- Why Siri is becoming Apple's default AI agent
- How app intents will open agentic development to the ecosystem
- What MCP integration means for builders on mobile
- Where Google, Samsung, and OpenAI fit into Apple's long game
Apple has for free what OpenAI is spending billions to build. But execution at WWDC will determine whether that advantage actually lands.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/the-company-everyone-says-lost-the?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
Anthropic, OpenAI, and Microsoft Just Agreed on One File Format. It Changes Everything.
26:19 | What's really happening inside the skills ecosystem when agents now call skills more often than humans do? The common story is that skills are just personal configuration files from October. But the reality is that skills have become organizational infrastructure, and most teams haven't updated their approach to match.
In this video, I share the inside scoop on how to build agent-readable skills that actually compound:
- Why the description field is where most skills go to die
- How agent-first design changes handoffs and contracts
- What three-tier skill architecture looks like for teams
- Where community repositories fill the domain-specific gap
Builders who keep treating skills as glorified prompts will miss the compounding advantage; the practitioners who version, test, and share skills are pulling ahead every week.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/your-ai-skills-fail-10-of-the-time?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true
48 Days. That's How Long Before the Helium Runs Out for AI Chips.
22:20 | What's really happening with the physical infrastructure behind AI? The common story is that AI spending is unstoppable, but the reality is more complicated.
In this video, I share the inside scoop on how a missile strike at a Qatari refinery is threatening the entire AI chip supply chain:
- Why helium is irreplaceable inside advanced semiconductor fabrication
- How the Ras Laffan shutdown flows directly into HBM and AI accelerator supply
- What LNG disruptions mean for energy costs at East Asian chip fabs
- Where China's geopolitical advantage in helium and energy is quietly compounding
The operators, planners, and builders betting on AI infrastructure need to understand this isn't a short-term blip; it's a structural cost and supply shock that will reprice everything from laptops to hyperscaler inference.
Subscribe for daily AI strategy and news.
For deeper playbooks and analysis: https://natesnewsletter.substack.com/
Anthropic Just Gave You 3 Tools That Work While You're Gone.
29:08 | What's really happening inside Anthropic's response to OpenClaw when they ship Dispatch and Computer Use in the same week? The common story is that these are just mobile chat features, but the reality is a complete orchestration layer that lets you spawn parallel agent sessions from your phone while your desktop executes work without you.
In this video, I share the inside scoop on the three primitives that finally make always-on agents real:
- Why scheduled tasks run on Anthropic's cloud without your laptop
- How Dispatch turns your phone into a command surface for parallel agents
- What Computer Use unlocks for apps that will never have MCP servers
- Where the management mindset separates real work from demo theater
Builders who keep expecting agents to create more work for them will miss the entire point: the only metric that matters is whether tasks get off your desk, not onto it.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/90-of-what-you-build-on-your-ai-agent?
A Markdown File Just Replaced Your Most Expensive Design Meeting. (Google Stitch)
29:34 | What's really happening inside the creative tools space when design, video, and 3D all move to the command line in the same month? The common story is that AI is replacing designers. But the reality is that three releases in the last few weeks collapsed the cost of creative exploration while raising the value of taste and judgment.
In this video, I share the inside scoop on how design is following development to the terminal:
- Why Google Stitch tanked Figma stock with free vibe design
- How Remotion turns video production into React components
- What Blender MCP does with 1,500 operators and natural language
- Where scheduled creative pipelines become the real unlock
Builders who combine these primitives with scheduling and workflows will produce at scales that were impossible six months ago. The floor dropped, but the ceiling for excellence didn't move.
Subscribe for daily AI strategy and news.
For playbooks and analysis: https://natesnewsletter.substack.com/p/a-0-design-sprint-used-to-be-impossible?r=1z4sm5&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true