2021 MIRI Conversations

Shah and Yudkowsky on alignment failures

2 h 45 min · 10. sept. 2025
episode Shah and Yudkowsky on alignment failures cover

Beskrivelse

This is the final discussion log in the Late 2021 MIRI Conversations [https://www.lesswrong.com/s/n945eovrA3oDueqtq] sequence, featuring Rohin Shah and Eliezer Yudkowsky, with additional comments from Rob Bensinger, Nate Soares, Richard Ngo, and Jaan Tallinn. The discussion begins with summaries and comments on Richard and Eliezer's debate. Rohin's summary has since been revised and published in the Alignment Newsletter [https://www.lesswrong.com/posts/3vFmQhHBosnjZXuAJ/an-171-disagreements-between-alignment-optimists-and]. This was originally posted on 28th Feb 2022. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ]

Kommentarer

0

Vær den første til at kommentere

Tilmeld dig nu og bliv en del af 2021 MIRI Conversations-fællesskabet!

Kom i gang

1 måned kun 9 kr.

Derefter 99 kr. / måned · Opsig når som helst.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

Alle episoder

13 episoder

episode Conversation on technology forecasting and gradualism cover

Conversation on technology forecasting and gradualism

This post is a transcript of a multi-day discussion between Paul Christiano, Richard Ngo, Eliezer Yudkowsky, Rob Bensinger, Holden Karnofsky, Rohin Shah, Carl Shulman, Nate Soares, and Jaan Tallinn, following up on the Yudkowsky/Christiano debate in 1 [https://www.lesswrong.com/posts/vwLxd6hhFvPbvKmBH/yudkowsky-and-christiano-discuss-takeoff-speeds], 2 [https://www.lesswrong.com/posts/7MCqRnZzvszsxgtJi/christiano-cotra-shulman-and-yudkowsky-on-ai-progress], 3 [https://www.lesswrong.com/posts/sCCdCLPN9E3YvdZhj/shulman-and-yudkowsky-on-ai-progress], and 4 [https://www.lesswrong.com/posts/fS7Zdj2e2xMqE6qja/more-christiano-cotra-and-yudkowsky-on-ai-progress]. This was originally posted on 9th Dec 2021. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx]

10. sept. 20251 h 0 min