The Turing Podcast
In this conversation, Jonathan Siddharth explores the evolution and future of reinforcement learning (RL) in AI. The discussion also covers the significance of human interaction in AI training, the role of human feedback, and the construction of RL gym environments for training agents. The Era of Experience [https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf]
5 episodes