The Adversarial Testing Podcast

System Card: Claude Opus 4.8

1 h 0 min · June 1, 2026

Description

A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.

All episodes

12 episodes

Writing Code vs. Shipping Code: Productivity Effects Across Generations of AI Coding Tools (Abstract, Introduction & Conclusion)

The abstract, introduction, and conclusion of NBER Working Paper No. 35275 by Mert Demirer, Leon Musolff, and Liyuan Yang (May 2026). Using data on more than 100,000 GitHub developers and their AI usage telemetry, the paper traces how the productivity effects of AI coding tools evolve across three generations (autocomplete, sync agents, and async agents) and asks how much of those task-level gains reach final output. Each generation sharply increases coding activity, but the gains attenuate steeply across the production hierarchy: large effects on lines of code shrink to small effects on releases, consistent with a weak-link model in which human review and integration remain the binding constraint.

June 6, 2026 · 17 min