2021 MIRI Conversations

Shah and Yudkowsky on alignment failures

2 h 45 min · 10. sept. 2025
episode Shah and Yudkowsky on alignment failures cover

Description

This is the final discussion log in the Late 2021 MIRI Conversations [https://www.lesswrong.com/s/n945eovrA3oDueqtq] sequence, featuring Rohin Shah and Eliezer Yudkowsky, with additional comments from Rob Bensinger, Nate Soares, Richard Ngo, and Jaan Tallinn. The discussion begins with summaries and comments on Richard and Eliezer's debate. Rohin's summary has since been revised and published in the Alignment Newsletter [https://www.lesswrong.com/posts/3vFmQhHBosnjZXuAJ/an-171-disagreements-between-alignment-optimists-and]. This was originally posted on 28th Feb 2022. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ]

Comments

0

Be the first to comment

Sign up now and become a member of the 2021 MIRI Conversations community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

13 episodes

episode Conversation on technology forecasting and gradualism artwork

Conversation on technology forecasting and gradualism

This post is a transcript of a multi-day discussion between Paul Christiano, Richard Ngo, Eliezer Yudkowsky, Rob Bensinger, Holden Karnofsky, Rohin Shah, Carl Shulman, Nate Soares, and Jaan Tallinn, following up on the Yudkowsky/Christiano debate in 1 [https://www.lesswrong.com/posts/vwLxd6hhFvPbvKmBH/yudkowsky-and-christiano-discuss-takeoff-speeds], 2 [https://www.lesswrong.com/posts/7MCqRnZzvszsxgtJi/christiano-cotra-shulman-and-yudkowsky-on-ai-progress], 3 [https://www.lesswrong.com/posts/sCCdCLPN9E3YvdZhj/shulman-and-yudkowsky-on-ai-progress], and 4 [https://www.lesswrong.com/posts/fS7Zdj2e2xMqE6qja/more-christiano-cotra-and-yudkowsky-on-ai-progress]. This was originally posted on 9th Dec 2021. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx]

10. sept. 20251 h 0 min