The Adversarial Testing Podcast
A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.
9 episoder
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af The Adversarial Testing Podcast-fællesskabet!