The Turing Podcast
In this conversation, Jonathan Siddharth explores the evolution and future of reinforcement learning (RL) in AI. The discussion also covers the significance of human interaction in AI training, the role of human feedback, and the construction of RL gym environments for training agents. The Era of Experience [ https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf]
5 Episoder
Kommentarer
0Vær den første til å kommentere
Registrer deg nå og bli medlem av The Turing Podcast sitt community!