Tic-Tac-Toe the Hard Way

Tic-Tac-Toe the Hard Way

Podcast door People + AI Research

Tijdelijke aanbieding

2 maanden voor € 1

Daarna € 9,99 / maandElk moment opzegbaar.

Phone screen with podimo app open surrounded by emojis

Meer dan 1 miljoen luisteraars

Je zult van Podimo houden en je bent niet de enige

4.7 sterren in de App Store

Over Tic-Tac-Toe the Hard Way

A writer and a software engineer from Google's People + AI Research team explore the human choices that shape machine learning systems by building competing tic-tac-toe agents.

Alle afleveringen

10 afleveringen
episode Lessons learned artwork
Lessons learned

What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning — also known as the world — works? For more information about the show, check out pair.withgoogle.com/thehardway/ [https://pair.withgoogle.com/thehardway/]. You can reach out to the hosts on Twitter: @dweinberger [https://twitter.com/dweinberger] and @tafsiri [https://twitter.com/tafsiri].

22 jul 2020 - 33 min
episode Head to Head: The Even Bigger ML Smackdown! artwork
Head to Head: The Even Bigger ML Smackdown!

Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results? See the agents play each other in Tic-Tac-Two [https://pair.withgoogle.com/thehardway/tic-tac-two/viewer/]! For more information about the show, check out pair.withgoogle.com/thehardway/ [https://pair.withgoogle.com/thehardway/]. You can reach out to the hosts on Twitter: @dweinberger [https://twitter.com/dweinberger] and @tafsiri [https://twitter.com/tafsiri].

22 jul 2020 - 24 min
episode Enter tic-tac-two artwork
Enter tic-tac-two

David’s variant of tic-tac-toe that we’re calling tic-tac-two is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn how to play, and  how to represent that data. For more information about the show, check out pair.withgoogle.com/thehardway/ [https://pair.withgoogle.com/thehardway/]. You can reach out to the hosts on Twitter: @dweinberger [https://twitter.com/dweinberger] and @tafsiri [https://twitter.com/tafsiri].

22 jul 2020 - 21 min
episode Head to Head: the Big ML Smackdown! artwork
Head to Head: the Big ML Smackdown!

David and Yannick’s tic-tac-toe ML agents face-off against each other in tic-tac-toe! See the agents play each other [https://pair.withgoogle.com/thehardway/tic-tac-toe/viewer/]! For more information about the show, check out pair.withgoogle.com/thehardway/ [https://pair.withgoogle.com/thehardway/]. You can reach out to the hosts on Twitter: @dweinberger [https://twitter.com/dweinberger] and @tafsiri [https://twitter.com/tafsiri].

22 jul 2020 - 25 min
episode Give that model a treat! : Reinforcement learning explained artwork
Give that model a treat! : Reinforcement learning explained

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning.  He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves. Resources: Deep Learning for JavaScript book [https://www.manning.com/books/deep-learning-with-javascript] Playing Atari with Deep Reinforcement Learning [https://arxiv.org/abs/1312.5602] Two Minute Papers episode on Atari DQN [https://www.youtube.com/watch?v=V1eYniJ0Rnk&vl=en] For more information about the show, check out pair.withgoogle.com/thehardway/ [https://pair.withgoogle.com/thehardway/]. You can reach out to the hosts on Twitter: @dweinberger [https://twitter.com/dweinberger] and @tafsiri [https://twitter.com/tafsiri].

22 jul 2020 - 26 min
Super app. Onthoud waar je bent gebleven en wat je interesses zijn. Heel veel keuze!
Super app. Onthoud waar je bent gebleven en wat je interesses zijn. Heel veel keuze!
Makkelijk in gebruik!
App ziet er mooi uit, navigatie is even wennen maar overzichtelijk.
Phone screen with podimo app open surrounded by emojis

4.7 sterren in de App Store

Tijdelijke aanbieding

2 maanden voor € 1

Daarna € 9,99 / maandElk moment opzegbaar.

Exclusieve podcasts

Advertentievrij

Gratis podcasts

Luisterboeken

20 uur / maand

Begin hier

Alleen bij Podimo

Populaire luisterboeken