The Adversarial Testing Podcast
From data curation to production monitoring — how frontier labs evaluate, red-team, and decide when to ship their most powerful models.
Vær den første til at kommentere
Tilmeld dig nu og bliv en del af The Adversarial Testing Podcast-fællesskabet!
Derefter 99 kr. / måned · Opsig når som helst.
8 episoder
System Card: Claude Opus 4.8
A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.
Net Zero Realism
A verbatim reading of Dieter Helm's essay on why the costs of the UK's net zero transition have been systematically understated. Covers the true economics of renewables, intermittency, EVs and heat pumps, the global climate context, and what a realistic UK climate strategy should prioritise.
The Seventh Carbon Budget: Costs and Households
A verbatim reading of the CCC's Seventh Carbon Budget report, focused on how Net Zero is funded and what it costs the public. Covers Chapter 4 on costs and investment, and Chapter 8.3 on distributional impacts across household archetypes.
Electoral Hallucinations: Safeguarding UK Elections in the World of LLMs and AI Chatbots (Executive Summary)
The executive summary of Electoral Hallucinations by Jamie Hancock and Azzurra Moores, published by Demos in May 2026. The report presents new evidence from testing five AI services during the 2026 Scottish Parliament elections, finding that 34.1% of responses contained factual errors — including hallucinated candidates, incorrect voting procedures, and fabricated political scandals. It identifies a regulatory gap where AI meets elections and sets out four recommendations for the UK government ahead of 2029.
The Labour Party Is Playing With Fire Over Its Future and the Future of the Country
Tony Blair argues that Labour risks electoral irrelevance by governing from a traditional soft-left comfort zone while the world undergoes two epochal shifts. He makes the case for a Radical Centre strategy built around technological transformation, economic competitiveness, and a renegotiated relationship with Europe.
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af The Adversarial Testing Podcast-fællesskabet!