The Adversarial Testing Podcast

How to Train a Frontier LLM — The Full Pipeline

24 min · 15 de may de 2026
Portada del episodio How to Train a Frontier LLM — The Full Pipeline

Descripción

A technical walk-through of the entire training pipeline for a modern frontier large language model, from raw data curation through pre-training, mid-training, GRPO reasoning RL, safety alignment, and deployment monitoring.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de The Adversarial Testing Podcast!

Empezar

2 meses por 1 €

Después 4,99 € / mes · Cancela cuando quieras.

  • Podcasts exclusivos
  • 20 horas de audiolibros / mes
  • Podcast gratuitos

Todos los episodios

12 episodios

Portada del episodio Writing Code vs. Shipping Code: Productivity Effects Across Generations of AI Coding Tools (Abstract, Introduction & Conclusion)

Writing Code vs. Shipping Code: Productivity Effects Across Generations of AI Coding Tools (Abstract, Introduction & Conclusion)

The abstract, introduction, and conclusion of NBER Working Paper No. 35275 by Mert Demirer, Leon Musolff, and Liyuan Yang (May 2026). Using data on more than 100,000 GitHub developers and their AI usage telemetry, the paper traces how the productivity effects of AI coding tools evolve across three generations - autocomplete, sync agents, and async agents - and asks how much of those task-level gains reach final output. Each generation sharply increases coding activity, but the gains attenuate steeply across the production hierarchy: large effects on lines of code shrink to small effects on releases, consistent with a weak-link model in which human review and integration remain the binding constraint.

6 de jun de 202617 min