The Adversarial Testing Podcast
A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.
9 Folgen
Kommentare
0Sei die erste Person, die kommentiert
Melde dich jetzt an und werde Teil der The Adversarial Testing Podcast-Community!