The Adversarial Testing Podcast
From data curation to production monitoring — how frontier labs evaluate, red-team, and decide when to ship their most powerful models.
Wees de eerste die een reactie plaatst
Meld je nu aan en word lid van de The Adversarial Testing Podcast community!
€ 9,99 / maand na proefperiode. · Elk moment opzegbaar.
9 afleveringen
Hackers Used Meta's AI Support Bot to Seize Instagram Accounts
A verbatim reading of Brian Krebs's report on an Instagram account-takeover exploit involving Meta's AI support assistant, including the alleged attack flow, Meta's response, and why multi-factor authentication appears to have blocked the exploit.
System Card: Claude Opus 4.8
A verbatim reading of key sections from Anthropic's system card for Claude Opus 4.8. Covers the executive summary, RSP findings on autonomy and biological risks, alignment assessment key findings including grader-speculation concerns, and the model welfare overview.
Net Zero Realism
A verbatim reading of Dieter Helm's essay on why the costs of the UK's net zero transition have been systematically understated. Covers the true economics of renewables, intermittency, EVs and heat pumps, the global climate context, and what a realistic UK climate strategy should prioritise.
The Seventh Carbon Budget: Costs and Households
A verbatim reading of the CCC's Seventh Carbon Budget report, focused on how Net Zero is funded and what it costs the public. Covers Chapter 4 on costs and investment, and Chapter 8.3 on distributional impacts across household archetypes.
Electoral Hallucinations: Safeguarding UK Elections in the World of LLMs and AI Chatbots (Executive Summary)
The executive summary of Electoral Hallucinations by Jamie Hancock and Azzurra Moores, published by Demos in May 2026. The report presents new evidence from testing five AI services during the 2026 Scottish Parliament elections, finding that 34.1% of responses contained factual errors — including hallucinated candidates, incorrect voting procedures, and fabricated political scandals. It identifies a regulatory gap where AI meets elections and sets out four recommendations for the UK government ahead of 2029.
Reacties
0Wees de eerste die een reactie plaatst
Meld je nu aan en word lid van de The Adversarial Testing Podcast community!