The Turing Podcast
In this conversation, Jonathan Siddharth explores the evolution and future of reinforcement learning (RL) in AI. The discussion also covers the significance of human interaction in AI training, the role of human feedback, and the construction of RL gym environments for training agents. The Era of Experience [ https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf]
5 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y forma parte de la comunidad de The Turing Podcast!