Drinking with Einstein

From Visualizations to Circuits: The Origins of the Glass Box (The Glass Box Pt. 2)

23 min · 9 de feb de 2026
Portada del episodio From Visualizations to Circuits: The Origins of the Glass Box (The Glass Box Pt. 2)

Descripción

Is AI just a "giant math soup" that happens to work, or is it a machine with parts we can understand? In Part 2 of The Glass Box series, we dig into the origin story of Mechanistic Interpretability. We trace the 7-year arc from the first "camera" that let us see inside a neural network (Zeiler & Fergus, 2013) to the "microscope" of Feature Visualization (Olah et al., 2017) and the eventual "blueprint" of Circuits (2020). We break down the four foundational papers that gave us the instruction manual for auditing AI and the plot twist (Superposition) that reminded us that the brain of these neural networks is a compression machine. Papers covered: * Visualizing and Understanding Convolutional Networks (2013) * Feature Visualization (2017) * The Building Blocks of Interpretability (2018) * Zoom In: An Introduction to Circuits (2020) Grab a drink. It’s time to see how the tools were built.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de Drinking with Einstein!

Empezar

2 meses por 1 €

Después 4,99 € / mes · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros / mes
  • Podcast gratuitos

Todos los episodios

2 episodios

Portada del episodio From Visualizations to Circuits: The Origins of the Glass Box (The Glass Box Pt. 2)

From Visualizations to Circuits: The Origins of the Glass Box (The Glass Box Pt. 2)

Is AI just a "giant math soup" that happens to work, or is it a machine with parts we can understand? In Part 2 of The Glass Box series, we dig into the origin story of Mechanistic Interpretability. We trace the 7-year arc from the first "camera" that let us see inside a neural network (Zeiler & Fergus, 2013) to the "microscope" of Feature Visualization (Olah et al., 2017) and the eventual "blueprint" of Circuits (2020). We break down the four foundational papers that gave us the instruction manual for auditing AI and the plot twist (Superposition) that reminded us that the brain of these neural networks is a compression machine. Papers covered: * Visualizing and Understanding Convolutional Networks (2013) * Feature Visualization (2017) * The Building Blocks of Interpretability (2018) * Zoom In: An Introduction to Circuits (2020) Grab a drink. It’s time to see how the tools were built.

9 de feb de 202623 min