The Adversarial Testing Podcast
A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.
10 Episoder
Kommentarer
0Vær den første til å kommentere
Registrer deg nå og bli medlem av The Adversarial Testing Podcast sitt community!