Claude Opus 4.8 Is Out. The Benchmark Numbers Aren't the Story.

6 min · 30 May 2026

Description

Anthropic dropped Opus 4.8 yesterday — same price, better coding scores, and a four-fold reduction in silent code bugs. But the real headline is alignment: Opus 4.8 scores at near-Mythos levels on misalignment metrics, quietly bringing the restricted model's safety profile into the general tier. Plus: Figure AI's robots sorted 250,000 packages in 200 hours with zero failures, and California's AI legislation just hit its crossover deadline with thirty bills in play and no federal law in sight.

All episodes

132 episodes

The SpaceX Roadshow Is Live. Morningstar Says It's Worth Half the Ask. Here's What Investors Need to Know.

SpaceX officially launched its roadshow yesterday — $135 per share, $1.75 trillion valuation, $75 billion raise, pricing June 11th. Morningstar values it at $780 billion — less than half the target. ARK Invest says $2.5 trillion by 2030. We break down the bull and bear case, cover OpenAI's Codex enterprise expansion this week, and preview Tim Cook's final WWDC keynote tomorrow.

Yesterday · 6 min

Microsoft Just Declared AI Independence From OpenAI. Seven Models. No OpenAI Data. A Tenth of the Cost.

Microsoft unveiled seven in-house MAI models at Build, including MAI-Thinking-1, its first reasoning model built entirely without OpenAI distillation, which matches Claude Opus 4.6 on coding benchmarks at a tenth of GPT-5.5's cost. Mustafa Suleyman called it "long-term self-sufficiency." We break down what the MAI family actually is, cover Anthropic's Monday IPO filing at a $965 billion valuation, and preview Tim Cook's final WWDC keynote Sunday.

5 June 2026 · 6 min

The SpaceX IPO Roadshow Is Live. Here's What Investors Are Actually Buying.

SpaceX begins its investor roadshow today targeting a $1.75 trillion valuation and $75 billion raise — the largest US IPO in history. We break down the four very different businesses inside SpaceX's S-1, the retail investor event on June 11th that's unlike anything ever done at this scale, and why the xAI segment is the question every institutional investor will be asking. Plus: GitHub Copilot's billing switch is generating a developer revolt, and Build Day 2 confirmed Claude in Azure AI Foundry.

4 June 2026 · 6 min

Microsoft Just Cut the OpenAI Cord. GitHub Copilot Gets Its Own AI Model by August.

The biggest surprise at Microsoft Build was Project Polaris, Microsoft's own in-house coding model, which replaces GPT-4 Turbo in GitHub Copilot by August. Microsoft now controls the model, the inference infrastructure, and the developer experience end to end. We break down what Polaris is, what it means for teams building on Copilot SDK, and cover the full Build recap: open-source Windows Agent Framework, Copilot Workspace GA with autopilot mode, and DirectML 2.0.

3 June 2026 · 5 min

Microsoft Build Is Live Today. Here's What's at Stake — and Why Developers Are Watching Closely.

Microsoft Build 2026 opens this morning in San Francisco with one goal: move AI agents from announced to production-ready. We break down the confirmed session tracks, what the Agent Framework graduation means for enterprise developers, and why Microsoft needs to win back developer affection after losing the coding tool satisfaction race to Cursor and Claude Code. Plus: Nvidia confirmed inference revenues just overtook training for the first time — a structural shift in the AI chip market.

2 June 2026 · 5 min
