TalkRL: The Reinforcement Learning Podcast
Danijar Hafner was a Research Scientist at Google DeepMind until recently.

Featured References

* Training Agents Inside of Scalable World Models [https://arxiv.org/abs/2509.24527] [ blog [https://danijar.com/project/dreamer4/] ]
  Danijar Hafner, Wilson Yan, Timothy Lillicrap
* One Step Diffusion via Shortcut Models [https://arxiv.org/abs/2410.12557]
  Kevin Frans, Danijar Hafner, Sergey Levine, Pieter Abbeel
* Action and Perception as Divergence Minimization [https://arxiv.org/abs/2009.01791] [ blog [https://danijar.com/project/apd/] ]
  Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

Additional References

* Mastering Diverse Domains through World Models [https://arxiv.org/abs/2301.04104v1] [ blog [https://danijar.com/project/dreamerv3/] ]
  DreamerV3; Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap
* Mastering Atari with Discrete World Models [https://arxiv.org/abs/2010.02193] [ blog [https://danijar.com/project/dreamerv2/] ]
  DreamerV2; Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba
* Dream to Control: Learning Behaviors by Latent Imagination [https://arxiv.org/abs/1912.01603] [ blog [https://danijar.com/project/dreamer/] ]
  Dreamer; Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi
* Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos [https://arxiv.org/abs/2206.11795] [ blog post [https://openai.com/research/vpt] ]
  Baker et al.
74 Episodes