AI First Pod

Claude Opus 4.8 Is Out. The Benchmark Numbers Aren't the Story.

6 min · May 30, 2026

Description

Anthropic dropped Opus 4.8 yesterday at the same price, with better coding scores and a fourfold reduction in silent code bugs. But the real headline is alignment: Opus 4.8 scores at near-Mythos levels on misalignment metrics, quietly bringing the restricted model's safety profile into the general tier. Plus: Figure AI's robots sorted 250,000 packages in 200 hours with zero failures, and California's AI legislation just hit its crossover deadline with thirty bills in play and no federal law in sight.
