The Information Bottleneck
In this episode, we host Jürgen Schmidhuber - the man, the legend, one of the godfathers of modern AI. His lab worked out many ideas behind today’s systems (LSTM, world models, artificial curiosity, Transformer variants, and even GAN-style setups) decades before they became fashionable, and he’s just as well known for making sure people remember who did what first. This is the first of two conversations with him. We go back to his lab in the early 90s and ask how one small group came up with so many of the ideas that are now being scaled to a thousand billion dollars, back when compute was ten million times more expensive. A lot of the episode comes down to one distinction he keeps making: prediction vs. decision-making. His take is that LLMs are very good prediction machines that imitate the web, but that’s only half the problem. To actually act in the world, you need a controller that uses a world model to plan. He talks about his 1990 work on world models and artificial curiosity, where the controller gets rewarded for running experiments that improve its own model (an adversarial setup years before GANs), why planning millisecond by millisecond doesn’t scale, and why you need sub-goals instead. We also talk about compression as the core of understanding, from falling apples to Kepler to Einstein, and why we still don’t have a robot that can do what a plumber does, even though the AI behind the screen keeps getting better. Then the conversation moves to credit assignment: how “to Schmidhuber” became a verb, what he thinks is broken about the award system, and a long exchange on PMAX vs. JEPA. He ends on the real origins of deep learning and a prediction about self-replicating machines in space. ---------------------------------------- Timeline 00:00 Intro 00:55 1991 in Munich, and why that lab mattered 02:38 "I'm not very smart" and why compute getting 10× cheaper every 5 years changed everything 04:25 Chess as an AI proxy 08:27 Artificial curiosity in the 90s vs. today's RL exploration 09:10 Why RL is harder than supervised learning 20:48 Coding agents vs. robots, and how a baby learns its own hands 26:20 Compression as understanding 33:40 What's actually missing on the road to AGI 37:30 Why millisecond-by-millisecond planning is stupid 47:44 Convergence to LLMs, GPUs, and how far we still are from the Bremermann limit 51:49 Unsupervised learning, factorial codes, and predictability minimization 58:12 Credit assignment: the fights with LeCun and the Nobel critique 1:02:13 On his last name becoming a verb 1:05:17 The award system's missing peer review 1:07:03 Closed labs and the decline of open research 1:13:23 Audience questions 1:34:02 Closing: who really invented deep learning? ---------------------------------------- Music: * "Kid Kodi" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. * "Palms Down" - Blue Dot Sessions - via Free Music Archive - CC BY-NC 4.0. * Changes: trimmed ---------------------------------------- About: The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.
43 afleveringen
Reacties
0Wees de eerste die een reactie plaatst
Meld je nu aan en word lid van de The Information Bottleneck community!