LessWrong posts by zvi
Only three days after the release of Claude Fable 5, Anthropic was forced by the United States Government to make it unavailable, when a jailbreak was brought to its attention, rather than the previous situation of ‘yes obviously experts can jailbreak anything if they care enough’ and ‘yes obviously you can ask Fable to fix your code.’ Three days was enough time for many of us to learn to love Fable, and for us to dearly miss it now that it is gone. The world was briefly smarter, and now it is again stupider. At some point it will get smarter again, which will likely be within two weeks. This post is written as if Fable 5 is again available for public use, rather than trying to include a lot of qualifying clauses. It remains to be seen how this will play out, and this post does not attempt to cover that question. My previous release coverage of Fable covered the model card and then model welfare. Coverage of the government takedown of Fable starts here, and continues here and here. The Official Pitch The pitch is that Fable 5 is the best model [...] --- Outline: (01:08) The Official Pitch (04:06) Technical Details (04:31) The System Prompt and Jailbreak (06:45) Benchmarks (15:22) Other People's Benchmarks (21:08) The Classifiers Are Not Messing Around (22:53) The Classifiers Need Work (28:15) The Classifiers Have Consequences (29:18) First Hit Is Free (29:53) How Easily We Forget (30:46) Data Retention Is An Issue (31:15) Fable For The Win (36:15) Andrej Karpathy Is Impressed (37:54) Every Is Very Impressed (39:04) Other People Are Impressed (51:10) Know How To Tell a Fable (53:06) You Can Just Make Things (55:37) You Can Just Install Things (56:05) Good Personality (57:51) Fable Writes A Fable (01:06:04) Is That Code (01:08:32) Fable Crosses The Threshold (01:09:12) Man With A Plan (01:10:12) Less Impressed Assessments (01:13:39) Actively Negative Assessments (01:14:16) Coherence (01:15:27) Good Night And Good Luck (01:16:05) Curious Fable (01:16:23) I See You, Baby (01:16:40) We Finally Did It We Know How To Count Letters (01:17:46) That's Not My Style (01:20:12) The Lighter Side --- First published: June 19th, 2026 Source: https://www.lesswrong.com/posts/kMnobCQp9z2pSbzDB/claude-fable-5-and-mythos-5-capabilities [https://www.lesswrong.com/posts/kMnobCQp9z2pSbzDB/claude-fable-5-and-mythos-5-capabilities?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Source+URL+in+episode+description&utm_campaign=ai_narration] --- Narrated by TYPE III AUDIO [https://type3.audio/?utm_source=TYPE_III_AUDIO&utm_medium=Podcast&utm_content=Narrated+by+TYPE+III+AUDIO&utm_term=lesswrong&utm_campaign=ai_narration]. --- Images from the article: Performance comparison table of Claude family models versus other AI models across multiple benchmarks. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/hiuggoqq0wxtieup1ogs]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/hiuggoqq0wxtieup1ogs ---------------------------------------- Table showing benchmark performance scores across different AI models and evaluation tests. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/j3lgvgssyqu7nv5n76nd]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/j3lgvgssyqu7nv5n76nd ---------------------------------------- Line graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/mhl0big4oz6dfsdsusby]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/mhl0big4oz6dfsdsusby ---------------------------------------- Graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/xa6vj8cndixwwxpp5cly]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/xa6vj8cndixwwxpp5cly ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/qzjin04zw1r2ttogtc5e]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/qzjin04zw1r2ttogtc5e ---------------------------------------- Table showing Toolathlon scores for Claude AI models across different pass rates and average turns. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/scexyz016idar9ltxxu2]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/scexyz016idar9ltxxu2 ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/hlp3tgx0ya3dhabylkm5]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/hlp3tgx0ya3dhabylkm5 ---------------------------------------- Line graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/czd85rgkmaer1kru9rgu]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/czd85rgkmaer1kru9rgu ---------------------------------------- Bar graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/lyqrcpikudbnrp6eem7f]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/lyqrcpikudbnrp6eem7f ---------------------------------------- Two bar charts comparing AI model performance scores and cost per intelligence index task. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/w6xjiegp4e69erzlkiku]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/w6xjiegp4e69erzlkiku ---------------------------------------- Bar graph titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/mkkjjwkblu842g06axs5]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/mkkjjwkblu842g06axs5 ---------------------------------------- Benchmark leaderboard showing AI model performance on ProofBench math proofs. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/oui6viwzrlnqgprwaafo]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/oui6viwzrlnqgprwaafo ---------------------------------------- Bar chart showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/enatrmfceqxqlknz3d3e]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/enatrmfceqxqlknz3d3e ---------------------------------------- Bar chart titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/djg0hgltcll8fm3hgrrw]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/djg0hgltcll8fm3hgrrw ---------------------------------------- Comic comparing appropriate versus inappropriate workplace comments about appearance. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/grxrljaqb76qrrsiodrr]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/grxrljaqb76qrrsiodrr ---------------------------------------- Section titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/i5cu29iaomnnjmgxydkf]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/i5cu29iaomnnjmgxydkf ---------------------------------------- Poem titled [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/ox6awbgcmv95kqewjthb]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/ox6awbgcmv95kqewjthb ---------------------------------------- Bar graph showing [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jsw6jcemsexxa8x86nzp]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jsw6jcemsexxa8x86nzp ---------------------------------------- Bar chart showing new terms coined by AI models with adoption rates. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/vvismkxidzlnzhvg3olz]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/vvismkxidzlnzhvg3olz ---------------------------------------- Table showing terms, coiners, and usage statistics for various phrases coined by Fable 5. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jqgxlmo4dfhjbeyhmllz]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jqgxlmo4dfhjbeyhmllz ---------------------------------------- Table showing GPT-5.5 terms with coiner and usage counts. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/x1ppnfcsb70xfjhr7p9m]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/x1ppnfcsb70xfjhr7p9m ---------------------------------------- A Twitter thread shows a conversation about wordplay. The first tweet asks [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jogylfvetz6ylgqhx0ee]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/jogylfvetz6ylgqhx0ee ---------------------------------------- AI chatbot conversation about whether to walk or drive to a car wash 100 feet away. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/pxfzumvynyawfotoa2ep]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/pxfzumvynyawfotoa2ep ---------------------------------------- Ten pithy tweets styled after tracewoodgrains about education, internet culture, and institutional dynamics. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/pprr4di0ni2x98s7ye95]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/pprr4di0ni2x98s7ye95 ---------------------------------------- List of ten humorous tweets in Joe Weisenthal's style about economics and finance. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/c9orpg966p3n6bzscqn1]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/c9orpg966p3n6bzscqn1 ---------------------------------------- Ten pithy tweets in @TheZvi style about regulation, prediction markets, AI policy, pricing, testing, economics, education standards, technology contradictions, and consciousness debates. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/eflubmdzdzqeg4vql6zr]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/eflubmdzdzqeg4vql6zr ---------------------------------------- List of ten humorous observations about AI assistant limitations and human interactions. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/r1928pqzunjzfq6j9nhi]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/kMnobCQp9z2pSbzDB/r1928pqzunjzfq6j9nhi ---------------------------------------- Person facepalming emoji with blonde hair and blue shirt. [https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/5nDxmAvZ9w5CPa9gR/rim2lomv33dh8hv9ujz8]https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/5nDxmAvZ9w5CPa9gR/rim2lomv33dh8hv9ujz8 Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts [https://pocketcasts.com/], or another podcast app.
250 Episoder
Kommentarer
0Vær den første til å kommentere
Registrer deg nå og bli medlem av LessWrong posts by zvi sitt community!