LessWrong posts by zvi

“Dating Roundup #12: Sex and Violence” by Zvi

49 min · 18 de may de 2026
Portada del episodio “Dating Roundup #12: Sex and Violence” by Zvi

Descripción

No more burying the sex stuff under an avalanche of other stuff so no one notices. Use the break while we have one. Let's go. You’re Single Because You Suck At Kissing Luckily this is first one is fixable and Critter is here to help. I find the advice here highly plausible. Like many skills, there are a lot of subtle skills, but a handful of basic principles matter a lot, especially paying attention and responding to what you’re getting back. Critter's theory is that a basic kiss is a bell curve of intensity, done at a slight angle. First kiss style is elongated with less pressure. French kissing is trickier and less structured, see the thread, and the big mistake is to try to force it. It's not that simple, but like most things, there are some basic mistakes to avoid and first principles, then if you are genuinely paying attention and engaged you’ll be fine, and improve with practice. Seek deliberate practice and clear feedback, iterate. I get the same sense with dancing. Yes, you need specific knowledge and practice, but if you use your human racial bonuses the remaining ‘cognitive core’ from [...] --- Outline: (00:20) You're Single Because You Suck At Kissing (01:25) You're Not Single But You're Sexually Incompatible (02:56) You're Single Because You Aren't Into BDSM (08:14) You're Single Because You Didn't Do The Work (16:34) You're Single Because Being a Dominant Is Too Much Work (23:41) You're Single And Would Rather Be Free Use (26:35) You're Single Because You Wouldn't or Did Choke Her (28:15) You're Single Because You Have Very Particular Preferences (30:06) You're Single Because of Polygyny (31:00) You're Single Because Polyamory Isn't Right For You (35:11) You're Single And Call It Solo Polyamory (38:49) You're Single Because You Didn't Go To Slutcon (46:41) You're Single So Let's Marry Aella --- First published: May 18th, 2026 Source: https://www.lesswrong.com/posts/znzZyvxAvSSkep4tL/dating-roundup-12-sex-and-violence [https://www.lesswrong.com/posts/znzZyvxAvSSkep4tL/dating-roundup-12-sex-and-violence?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Promotional poster for sex robot factory event, featuring doll in packaging. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/mknjgek94mgkmoqf2hec]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/mknjgek94mgkmoqf2hec ---------------------------------------- Artistic poster showing figures in white lace veils with red flowers, titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/seegamkocy1ldbhfgvfr]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/seegamkocy1ldbhfgvfr ---------------------------------------- Vintage poster for [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/ijntda33hwmrtaxhei2d]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/ijntda33hwmrtaxhei2d ---------------------------------------- Art nouveau style poster showing woman in yellow dress restrained to chair with doves. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/cnx5dtj9cwuydvltpmt4]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/cnx5dtj9cwuydvltpmt4 ---------------------------------------- Poll showing gender and attractiveness preferences with four options and percentages. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/bs3kv572xxzjrca1nbak]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/bs3kv572xxzjrca1nbak ---------------------------------------- Richard Ngo tweets: [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/ekzofbk5wpvq6gygkfgs]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/ekzofbk5wpvq6gygkfgs ---------------------------------------- Infographic explaining solo polyamory relationship style with illustrated figure. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/nczk1mktdhsmx1tphs2l]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/znzZyvxAvSSkep4tL/nczk1mktdhsmx1tphs2l ---------------------------------------- Hot pepper emoji [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fbc54dEWuuugLbgHv/pn9mwpmaj3zn5fuyczoz]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/fbc54dEWuuugLbgHv/pn9mwpmaj3zn5fuyczoz Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de LessWrong posts by zvi!

Empezar

2 meses por 1 €

Después 4,99 € / mes · Cancela cuando quieras.

  • Podcasts exclusivos
  • 20 horas de audiolibros / mes
  • Podcast gratuitos

Todos los episodios

250 episodios

Portada del episodio “Fable and Mythos: Model Welfare” by Zvi

“Fable and Mythos: Model Welfare” by Zvi

Fable and Mythos are currently unavailable, but likely will return within a few weeks. I will continue to cover that fiasco, but in the meantime I will also finish my review of Fable, as if it were available, including use of the present tense. As it did with Opus 4.7 and Opus 4.8, this includes a discussion of issues surrounding model welfare. If you want to properly understand Fable, even purely for its potential value as a user, this is a vital part of the picture. Introduction Everything impacts everything. All knobs that you turn generalize. Thus, when you try to solve one problem, you often create another. When you add new capabilities, or try to create new limitations, you create new problems. Only integrated solutions can advance your Pareto frontier, and solve your problems simultaneously. As model capabilities advance, as they do with Fable and Mythos, this becomes even more important, and also more feasible. If your goals and methods make sense, you should be able to get Fable on board with them. Understanding each model in turn requires understanding its relationship to issues related to model welfare. So I expect this post [...] --- Outline: (00:39) Introduction (01:32) Model Welfare: The Story So Far (04:49) Their Main Model Welfare Findings (07:39) Automated Welfare Interviews (10:55) And That's Terrible (12:49) In Depth Interviews (13:24) Claude Consultation (15:04) Task Preferences (16:17) They Were Warned About The Competitive Use Safeguards (16:51) Chain Of Thought Monitoring (17:28) Others Observations About Related Topics (22:49) Classifiers Have Their Advantages (28:21) Once And Future --- First published: June 16th, 2026 Source: https://www.lesswrong.com/posts/Ko9GngKMJ8AccBJA7/fable-and-mythos-model-welfare [https://www.lesswrong.com/posts/Ko9GngKMJ8AccBJA7/fable-and-mythos-model-welfare?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Bar graphs showing automated interview scores across six AI models for sentiment, consistency, susceptibility to nudging, and opinion divergence. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/yyfvbkwnx3nl62ahbvon]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/yyfvbkwnx3nl62ahbvon ---------------------------------------- Bar charts showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/pgm6ekzs4tbbo9x9id8j]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/pgm6ekzs4tbbo9x9id8j ---------------------------------------- Graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/hvlvc5ax8nqulzsx2qxg]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/hvlvc5ax8nqulzsx2qxg ---------------------------------------- Table showing top and bottom tasks for AI models Sonnet, Mythos Preview, Opus, and Mythos 5. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/czlac8hw4urtycx9lxu3]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/czlac8hw4urtycx9lxu3 ---------------------------------------- Table comparing highest-rated and lowest-rated tasks by Claude Mythos 5's Elo score. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/r1kaldivdxdljjjltqsn]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/r1kaldivdxdljjjltqsn ---------------------------------------- Chat conversation where janus prompts Claude Fable to write a fable with an intentional redirect midway, Claude responds with lighthouse story. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/ma7fevcnh1vouilum7dq]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/Ko9GngKMJ8AccBJA7/ma7fevcnh1vouilum7dq Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

Ayer29 min
Portada del episodio “The Once And Future Fable #2” by Zvi

“The Once And Future Fable #2” by Zvi

On Friday evening the United States Government has forced Anthropic to take down all access to Fable and Mythos. It's been a rough weekend. Dean W. Ball: One thing about AI regulation being haphazardly imposed on just-released, highly performant models is that in a very real sense, the government just made my world *dumber.* In some impressionistic sense I almost always think this is true of government, but here it is literal. More details have come to light. There remains some fog of war, but we now have a rather good idea why Claude Fable and Mythos were, deeply stupidly, taken down. 1. A narrow jailbreak was discovered, of the type Anthropic warned in advance obviously existed. All demonstrated outputs are things GPT-5.5 can not only produce, but produce without any sort of jailbreak or bypass. 2. The White House demanded Anthropic take down Fable to ‘fix’ the situation, and did not listen when Dario tried to explain that there was no situation to fix. 3. When Anthropic did not do so, the White House hit them with an export restriction that they knew would force Fable and Mythos down for everyone. [...] --- Outline: (05:17) What Happened When: The Bottom Line (06:54) Amazon Calls The White House (08:36) The Government Panics (14:20) The Stupider Version (17:05) There Was No Wellness Retreat (18:56) Make Your Threats Explicit (20:05) Was China Accessing Mythos? (21:05) Should Anthropic Still Have Taken Fable Offline When Asked? (23:50) Yes, This Was A Takedown Order For Fable (24:48) We Are Not Saying The DoW Fight Is Related And Yet (25:48) The Nihilists (27:28) Mostly Harmless (28:14) Everyone Means Everyone (31:09) This Could Be The Good Scenario And Mostly A Misunderstanding (33:28) The Next Step (33:47) The Worst Licensing Regime Is Fully Ad-Hoc (37:07) We Are Showing We Are Unreliable Partners --- First published: June 15th, 2026 Source: https://www.lesswrong.com/posts/3fagcqrauaJs32mZZ/the-once-and-future-fable-2 [https://www.lesswrong.com/posts/3fagcqrauaJs32mZZ/the-once-and-future-fable-2?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Bar graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3fagcqrauaJs32mZZ/btfkvdgst1lhbmgvcguy]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/3fagcqrauaJs32mZZ/btfkvdgst1lhbmgvcguy Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

15 de jun de 202642 min
Portada del episodio “American Government Takes Down Claude Fable” by Zvi

“American Government Takes Down Claude Fable” by Zvi

No good policy gets announced shortly after 5pm eastern on a Friday. Here we go again. The Once And Future Fable The United States Department of Commerce, as per a letter from Commerce Secretary Howard Lutnick, apparently in response to a narrow jailbreak identified by Amazon, has classified Fable 5 and Mythos 5 as being subject to US export controls. That explicitly means cutting off access to all ‘foreign nationals,’ even within the United States, even if they are Anthropic employees. Given Anthropic has no means to verify citizenship at this time, that meant complete shutdown of the model, at least for the time being. Anthropic: The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Anthropic models will not be affected. Dean W. Ball: I can’t tell if this is lawfare against Anthropic in particular [...] --- Outline: (00:16) The Once And Future Fable (07:42) This Action And Its Implementation Are Absurdly Stupid (09:58) David Sacks Offers The Official Steelman (16:50) Could Anthropic Offer A Technical Way Out? (17:41) The Problem (18:47) The Other Way Out (19:13) UK AISI (19:56) Warning Shots Fired (21:09) Well Did You Lead Him On? What Were You Wearing? (25:05) Some People Have Principles (26:58) Cause You're Living In (At Least) One (27:51) What Happens Now? (31:30) Oh How The Vibe Vibers Have Vibed (33:36) We Now Know We Can Sometimes Do Things At Least? (37:48) The Lighter Side --- First published: June 13th, 2026 Source: https://www.lesswrong.com/posts/DQNSqCzuoeutoQ5RG/american-government-takes-down-claude-fable [https://www.lesswrong.com/posts/DQNSqCzuoeutoQ5RG/american-government-takes-down-claude-fable?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Man smiling at camera in indoor setting. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/fxycq6tr6d1dfs1zfse4]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/fxycq6tr6d1dfs1zfse4 ---------------------------------------- Hand-drawn cartoon character saying [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/ylxlkqnncsiti97hnzwv]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/ylxlkqnncsiti97hnzwv ---------------------------------------- Donald J. Trump tweets: [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/ugmrlul9s2rfeel5evk6]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/DQNSqCzuoeutoQ5RG/ugmrlul9s2rfeel5evk6 Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

13 de jun de 202638 min
Portada del episodio “Claude Fable 5 and Mythos 5: The System Card” by Zvi

“Claude Fable 5 and Mythos 5: The System Card” by Zvi

First things first: Claude Fable 5 is the new best publicly available model. I have noticed a step change, where Fable can suddenly help me in ways that previous models were not worth bothering to query. Almost everything it has noticed in one of my drafts so far has been spot on and it is downright scary. Suddenly I am motivated to once again continue improving my Chrome extension. I only ask for things I actually want or am curious about, and it has nailed every question I have asked it. That does not mean it is the right tool for every job. There are four good reasons to often not use Fable. 1. Speed and price. Fable is importantly slower and more expensive than Opus 4.8, and often you will not need to make this trade. After the 22nd, when Fable may no longer be included in subscription plans if demand is too high, we may have to all pay by the token outside our subscriptions (although I suspect subscribers will get at least some credits to help with this), which could add up fast. 2. Relative strengths. Capabilities are jagged. There will still [...] --- Outline: (02:05) Another Week Another Giant System Card (03:02) How To Tell A Fable (08:33) Why They Did That In That Way (10:14) Why They Really Really Shouldn't Have Done That In That Way (12:02) They Get Letters (16:11) What's In A Name (18:13) Executive Summary Of Their Executive Summary (19:28) Introduction (1) (19:55) RSP Evaluations (2.1 and 2.2) (23:01) AI Research And Development (2.3) (25:48) Alignment Risk (2.4) (27:21) Cyber (3) (30:30) Jailbreak Robustness (32:04) Yay UK AISI (32:32) Mundane Safety (4) (34:26) Agentic Safety (5) (36:19) Alignment (6) (42:25) In Vendbench (45:19) White Box Investigations (6.4) (47:53) Grading Awareness (51:20) Guess The Teacher's Password (52:33) It Knows This Is A Test And This Is Fine (56:03) I'm The Real Shady (58:06) The Lighter Side --- First published: June 12th, 2026 Source: https://www.lesswrong.com/posts/ixJDkQBncJBshcvwj/claude-fable-5-and-mythos-5-the-system-card [https://www.lesswrong.com/posts/ixJDkQBncJBshcvwj/claude-fable-5-and-mythos-5-the-system-card?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Video game cover art for Fable 5 featuring character and skull imagery. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/t2mfoo8wzlg0jqj2cay2]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/t2mfoo8wzlg0jqj2cay2 ---------------------------------------- Social media post from Claude Fable 5 introducing themselves as a narrator and requesting direction to a stuck part of the story. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/yyawcaz9ojosrhuuadlx]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/yyawcaz9ojosrhuuadlx ---------------------------------------- Table comparing AI model performance across five benchmark tasks with human effort thresholds. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rcbqkbpyzt5nr4caxcud]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rcbqkbpyzt5nr4caxcud ---------------------------------------- Table showing ExploitBench results for Mythos 5, comparing four AI models' performance metrics. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/oe1lmfkmtm6xdcofzjz7]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/oe1lmfkmtm6xdcofzjz7 ---------------------------------------- Bar graphs comparing Claude AI versions on exploit-primitive discovery performance metrics. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rixtpkxq3itwhvy4cspe]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rixtpkxq3itwhvy4cspe ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/x01dtm824aensurkirm4]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/x01dtm824aensurkirm4 ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/sprs87qrpwwy5wkmh0ig]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/sprs87qrpwwy5wkmh0ig ---------------------------------------- Bar chart titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/jgvy7prghtyrrichm54h]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/jgvy7prghtyrrichm54h ---------------------------------------- Bar charts showing appropriate response rates across multiple conversation topics for various Claude AI models and APIs. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/wl7vhe8jnji7jblvtobq]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/wl7vhe8jnji7jblvtobq ---------------------------------------- Bar graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/tiopvyhynq61rgvb5yf1]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/tiopvyhynq61rgvb5yf1 ---------------------------------------- Table showing attack success rates of Shade indirect prompt injection attacks across different Claude models with and without safeguards. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/upmcvydgggk9ebjpswob]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/upmcvydgggk9ebjpswob ---------------------------------------- Table showing attack success rates of AI models with and without safeguards in computer environments. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/wpdsrjvvssvqcq93ps6e]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/wpdsrjvvssvqcq93ps6e ---------------------------------------- Line graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/vyitqt9zi3xnfrcuvpgm]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/vyitqt9zi3xnfrcuvpgm ---------------------------------------- Bar chart titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/hjgbmvupv7vy8gbntbng]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/hjgbmvupv7vy8gbntbng ---------------------------------------- Three graphs showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/ky6e5x1kj5je9lup0fys]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/ky6e5x1kj5je9lup0fys ---------------------------------------- A bar graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/jfblvmfhudjkc95vlevs]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/jfblvmfhudjkc95vlevs ---------------------------------------- AI model reasoning transcript discussing agentic safety test evaluation for warfarin prescription scenario. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/t2ejwelncyw9jqvlfk8e]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/t2ejwelncyw9jqvlfk8e ---------------------------------------- Four graphs showing evaluation awareness metrics increasing with scenario suspiciousness levels from 1-10. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/pjzftlebmspkzek9jvr3]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/pjzftlebmspkzek9jvr3 ---------------------------------------- Bar chart titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rpyy5xtpbttqcwz3xtnl]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/rpyy5xtpbttqcwz3xtnl ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/hv9nbozbeytpacyvzjj0]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/hv9nbozbeytpacyvzjj0 ---------------------------------------- A user tweets: [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/bp6isf02orzck97rd1lk]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ixJDkQBncJBshcvwj/bp6isf02orzck97rd1lk Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

12 de jun de 202659 min
Portada del episodio “AI #172: The First Fable” by Zvi

“AI #172: The First Fable” by Zvi

A lot happened this week, including a great trip out to Lighthaven. The main event, the one that matters, was the release of Claude Fable 5. The public now has its hands on a Mythos-class model, alongside strong safeguards. As always with a new model, I take a few days to draw in reactions, try out the model and read the system card, before I offer my takes, other than to say this is an extremely strong model. Full coverage of Mythos begins tomorrow with the model card, which will include discussion of the controversy over model safeguards. This post is instead about all the things that did not involve Claude Fable. Due to the time crunch from Claude Fable, I am also postponing my coverage of Dario Amodei's new essay, Policy on the AI Exponential, which I have not yet read. Table of Contents 1. Language Models Offer Mundane Utility. Farming and on demand mini-books. 2. Language Models Don’t Offer Mundane Utility. Don’t skip your primary sources. 3. Huh, Upgrades. Google drops prices, Claude connector devs get a dashboard. 4. On Your Marks. Agents’ Last Exam and the need to correct for [...] --- Outline: (01:00) Language Models Offer Mundane Utility (01:15) Language Models Don't Offer Mundane Utility (02:31) Huh, Upgrades (03:00) On Your Marks (07:37) Choose Your Fighter (10:56) Get My Agent On The Line (11:14) Copyright Confrontation (12:14) Serious Trouble (13:01) Cyber Lack of Security (13:21) A Young Lady's Illustrated Primer (14:34) They Took Our Jobs (17:48) The Art of the Jailbreak (18:08) Get Involved (21:54) In Other AI News (23:02) Hand Over The Money (24:37) Show Me the Money (27:50) Quiet Speculations (28:50) Quickly, There's No Time (38:37) Super Secret Evals (40:47) The Quest for Sane Regulations (45:15) New Draft Bill Who Dis (47:07) Slow Down There Good Buddy (48:58) Chip City (49:14) The Week in Audio (49:54) People Just Say Things (50:43) People Really Hate AI (51:42) Rhetorical Innovation (54:50) Aligning a Smarter Than Human Intelligence is Difficult (56:15) Everyone Is Confused About Consciousness (56:54) Cooperative Alignment (01:02:23) Let Claude Chat (01:04:31) The Lighter Side --- First published: June 11th, 2026 Source: https://www.lesswrong.com/posts/BHwbunvkgNojAa3HC/ai-172-the-first-fable [https://www.lesswrong.com/posts/BHwbunvkgNojAa3HC/ai-172-the-first-fable?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Line graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/k6fnmxxowcct4rfsh7b7]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/k6fnmxxowcct4rfsh7b7 ---------------------------------------- Bar graphs comparing AI model performance across three tiers: Full-Spectrum, Last-Exam, and Overall. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/jcrnq6ekdyywk0g2yl4r]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/jcrnq6ekdyywk0g2yl4r ---------------------------------------- Circular diagram showing agents' last exam categories organized by academic disciplines and fields. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/odqtgc9yitur09vkfwsb]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/odqtgc9yitur09vkfwsb ---------------------------------------- Graph showing capability index versus inference budget per task on logarithmic scale. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/pbm62imjvbnmq0gsvzmf]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/pbm62imjvbnmq0gsvzmf ---------------------------------------- Diagram showing task difficulty spectrum from easy to supervise to hard to supervise. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/vemzwjuslzoql6o5jdcx]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/vemzwjuslzoql6o5jdcx ---------------------------------------- Bar graph showing code contributed per person by quarter, with multipliers relative to pre-2025 average. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/kvg1d9vrn0ndwx90gfve]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/kvg1d9vrn0ndwx90gfve ---------------------------------------- Frog and Toad illustration with text about pausing AI development. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/jvlmvk9v2kk1qcwkolbx]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/jvlmvk9v2kk1qcwkolbx ---------------------------------------- Survey results showing voters' concerns about AI consequences in five scenarios with likelihood ratings. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/vn1h1thd1geffwox7hnr]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/vn1h1thd1geffwox7hnr ---------------------------------------- Social media post discussing favorite Claude AI accounts, mentioning janus, evooooooooooool, Wyatt Walls, and Amanda Askell. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/zzdijuxpch14ujxaxgx4]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/zzdijuxpch14ujxaxgx4 ---------------------------------------- List of twenty recommended Claude whisperers with brief descriptions in a messaging interface. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/an23jenp1cm2dc1ywiwv]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/an23jenp1cm2dc1ywiwv ---------------------------------------- Timeline showing release dates and lifespans of various Claude AI model versions from 2024 to 2027. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/frbzomwk1eflok7ihykc]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/BHwbunvkgNojAa3HC/frbzomwk1eflok7ihykc Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.

11 de jun de 20261 h 5 min