AI First Pod

Claude Opus 4.8 Is Out. The Benchmark Numbers Aren't the Story.

6 min · May 30, 2026

Description

Anthropic dropped Opus 4.8 yesterday — same price, better coding scores, and a four-fold reduction in silent code bugs. But the real headline is alignment: Opus 4.8 scores at near-Mythos levels on misalignment metrics, quietly bringing the restricted model's safety profile into the general tier. Plus: Figure AI's robots sorted 250,000 packages in 200 hours with zero failures, and California's AI legislation just hit its crossover deadline with thirty bills in play and no federal law in sight.

