{"version":"1.0","type":"rich","provider_name":"Acast","provider_url":"https://acast.com","height":250,"width":700,"html":"<iframe src=\"https://embed.acast.com/$/696f9ac95e25e3a6c0c6faa0/6996d2904a6b6137bf6de383?\" frameBorder=\"0\" width=\"700\" height=\"250\"></iframe>","title":"About Claude - The Triumph of the Ordinary","description":"<p><strong>SHOW NOTES</strong></p><p><br></p><p>Claude's mid-tier Sonnet model just topped a benchmark designed to measure AI against the actual day-to-day work of professionals — beating its own more powerful flagship in the process. Today we explore what that result reveals about how the definition of AI capability is quietly being rewritten.</p><p><br></p><p><strong>**In this episode:**</strong></p><p>- What GDPval is, why OpenAI built it, and why the result matters beyond a product launch</p><p>- The sixteen-month computer use trajectory that shows something crossing a threshold</p><p>- Why \"reliability\" and \"taste\" beat \"brilliance\" when the task is an inbox, not an exam</p><p>- The deeper argument: ordinary professional work is harder than it looks, and the race is catching up to that fact</p><p><br></p><p><strong>**Links:**</strong></p><p>- Introducing Claude Sonnet 4.6: https://www.anthropic.com/news/claude-sonnet-4-6</p><p>- Claude Sonnet 4.6 model page: https://www.anthropic.com/claude/sonnet</p><p>- GDPval benchmark (OpenAI): https://openai.com/index/gdpval/</p><p>- VentureBeat: Sonnet 4.6 matches flagship at one-fifth the cost: https://venturebeat.com/technology/anthropics-sonnet-4-6-matches-flagship-ai-performance-at-one-fifth-the-cost</p><p><br></p><p><strong>**Referenced in this episode:**</strong></p><p>- EP013: Twenty Minutes — the most compressed product launch in AI history</p><p><br></p><p>Website: <a href=\"https://aboutclaude.xyz/\" rel=\"noopener noreferrer\" target=\"_blank\">aboutclaude.xyz</a></p><p>🦉 X: @_about_claude</p><p><br></p>","author_name":"Neil & Claude"}