Delta Podcast
Kevin Wang is the first author of the NeurIPS 2025 Best Paper, titled "1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities". He's currently a researcher at OpenAI, where he works on RL/reasoning. Before coming to OpenAI, Kevin studied CS at Princeton.Delta Institute (deltainstitutes.org) supports exceptional researchers and engineers, from academia to industry and beyond. They host technical events to bring great people together, a podcast that gives industry/academic leaders a platform to share their experiences, a small fellows program that builds a tight-knit community of exceptional people, and a grant program that provides compute/mentorship for research projects.Timestamps:00:00 Introduction00:26 Overview of the 1000 Layer Networks Paper00:42 Motivation and Background of the Research01:37 Self-Supervised Reinforcement Learning Paradigm04:16 Challenges and Innovations in Data Scaling06:23 Hindsight Experience Replay and Its Impact08:56 Classification vs Regression in Reinforcement Learning12:25 Training Stability and Architectural Components14:23 Key Results and Performance Gains17:23 Qualitative Behaviors and Representation Learning19:44 Scaling Depth and Batch Size23:06 Limits of Scaling in Reinforcement Learning23:55 Exploring Actor Loss and Layer Depth in Training24:51 Scaling Layers for Complex Tasks28:28 Challenges and Innovations in Deep Network Training30:36 Future Directions in Reinforcement Learning37:32 Personal Journey and Career Path
60 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de Delta Podcast!