Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RLC 2025

David Abel on the Science of Agency @ RLDM 2025

David Abel is a Senior Research Scientist at DeepMind on the Agency team, and an Honorary Fellow at the University of Edinburgh. His research blends computer science and philosophy, exploring foundational questions about reinforcement learning, definitions, and the nature of agency.

Featured References

* Plasticity as the Mirror of Empowerment [https://arxiv.org/pdf/2505.10361], David Abel, Michael Bowling, André Barreto, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh
* A Definition of Continual RL [https://arxiv.org/pdf/2307.11046], David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh
* Agency is Frame-Dependent [https://arxiv.org/pdf/2502.04403], David Abel, André Barreto, Michael Bowling, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh
* On the Expressivity of Markov Reward [https://arxiv.org/abs/2111.00876], David Abel, Will Dabney, Anna Harutyunyan, Mark Ho, Michael Littman, Doina Precup, Satinder Singh (Outstanding Paper Award, NeurIPS 2021)

Additional References

* Bidirectional Communication Theory [https://ieeexplore.ieee.org/abstract/document/1091610/similar#similar], Marko 1973
* Causality, Feedback and Directed Information [https://www.isiweb.ee.ethz.ch/archive/massey_pub/pdf/BI532.pdf], Massey 1990
* The Big World Hypothesis [https://openreview.net/forum?id=Sv7DazuCn8], Javed et al. 2024
* Loss of plasticity in deep continual learning [https://www.nature.com/articles/s41586-024-07711-7], Dohare et al. 2024
* Three Dogmas of Reinforcement Learning [https://david-abel.github.io/tdorl.pdf], Abel 2024
* Explaining dopamine through prediction errors and beyond [https://pubmed.ncbi.nlm.nih.gov/39054370/], Gershman et al. 2024
* David Abel Google Scholar [https://scholar.google.com/citations?user=lvBJlmwAAAAJ&hl=en]
* David Abel personal website [https://david-abel.github.io/]

Sep 8, 2025 · 59 min

