2021 MIRI Conversations

Shah and Yudkowsky on alignment failures

2 h 45 min · 10. syys 2025
jakson Shah and Yudkowsky on alignment failures kansikuva

Kuvaus

This is the final discussion log in the Late 2021 MIRI Conversations [https://www.lesswrong.com/s/n945eovrA3oDueqtq] sequence, featuring Rohin Shah and Eliezer Yudkowsky, with additional comments from Rob Bensinger, Nate Soares, Richard Ngo, and Jaan Tallinn. The discussion begins with summaries and comments on Richard and Eliezer's debate. Rohin's summary has since been revised and published in the Alignment Newsletter [https://www.lesswrong.com/posts/3vFmQhHBosnjZXuAJ/an-171-disagreements-between-alignment-optimists-and]. This was originally posted on 28th Feb 2022. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/tcCxPLBrEXdxN5HCQ]

Kommentit

0

Ole ensimmäinen kommentoija

Rekisteröidy nyt ja liity 2021 MIRI Conversations-yhteisöön!

Aloita maksutta

14 vrk ilmainen kokeilu

Kokeilun jälkeen 7,99 € / kuukausi. · Peru milloin tahansa.

  • Podimon podcastit
  • 20 kuunteluaikaa / kuukausi
  • Lataa offline-käyttöön

Kaikki jaksot

13 jaksot

jakson Conversation on technology forecasting and gradualism kansikuva

Conversation on technology forecasting and gradualism

This post is a transcript of a multi-day discussion between Paul Christiano, Richard Ngo, Eliezer Yudkowsky, Rob Bensinger, Holden Karnofsky, Rohin Shah, Carl Shulman, Nate Soares, and Jaan Tallinn, following up on the Yudkowsky/Christiano debate in 1 [https://www.lesswrong.com/posts/vwLxd6hhFvPbvKmBH/yudkowsky-and-christiano-discuss-takeoff-speeds], 2 [https://www.lesswrong.com/posts/7MCqRnZzvszsxgtJi/christiano-cotra-shulman-and-yudkowsky-on-ai-progress], 3 [https://www.lesswrong.com/posts/sCCdCLPN9E3YvdZhj/shulman-and-yudkowsky-on-ai-progress], and 4 [https://www.lesswrong.com/posts/fS7Zdj2e2xMqE6qja/more-christiano-cotra-and-yudkowsky-on-ai-progress]. This was originally posted on 9th Dec 2021. https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx [https://www.lesswrong.com/s/n945eovrA3oDueqtq/p/nPauymrHwpoNr6ipx]

10. syys 20251 h 0 min