The Adversarial Testing Podcast
From data curation to production monitoring — how frontier labs evaluate, red-team, and decide when to ship their most powerful models.
Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Adversarial Testing Podcast!
$99 / mes después de la prueba. · Cancela cuando quieras.
3 episodios
Evaluating and Testing Frontier LLMs — The Full Lifecycle
How to Train a Frontier LLM — The Full Pipeline
A technical walk-through of the entire training pipeline for a modern frontier large language model, from raw data curation through pre-training, mid-training, GRPO reasoning RL, safety alignment, and deployment monitoring.
The AI Economy Debate: What the Evidence Actually Shows
Same technology, same evidence, twentyfold gap in macro forecasts. We walk through the empirical record on AI's economic impact — adoption, worker-level RCTs, the Danish null, Acemoglu's macro arithmetic, the Anthropic Economic Index, and where the data converges.
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Adversarial Testing Podcast!