AI First Pod

AI First Pod

Claude Opus 4.8 Is Out. The Benchmark Numbers Aren't the Story.

6 min · Ayer
Portada del episodio Claude Opus 4.8 Is Out. The Benchmark Numbers Aren't the Story.

Descripción

Anthropic dropped Opus 4.8 yesterday — same price, better coding scores, and a four-fold reduction in silent code bugs. But the real headline is alignment: Opus 4.8 scores at near-Mythos levels on misalignment metrics, quietly bringing the restricted model's safety profile into the general tier. Plus: Figure AI's robots sorted 250,000 packages in 200 hours with zero failures, and California's AI legislation just hit its crossover deadline with thirty bills in play and no federal law in sight.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de AI First Pod!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

126 episodios