Why OpenAI Banned Goblins, Pigeons, And Raccoons

Descripción

OpenAI's Codex shipped with a system prompt that literally bans the words goblin, pigeon, raccoon, troll, ogre, and gremlin. It is in writing, in the prompt, the kind of sentence you only put there after something has happened. OpenAI has officially confessed why. Hunter Powers and Daniel Bishop pull the thread. The official story: the "nerdy personality" preset got fine-tuned with RLHF (reinforcement learning with human feedback), users thumbed-up the cute goblin references, the model over-optimized for the trait, and the weirdness compounded. Daniel calls it Flandersization. One thumbs-up on a goblin reference snowballs across training cycles until your tax software is a swamp witch. Six months later, it is a man at a payphone with a pigeon. Then it gets personal. Hunter screams at his AI. Like, threatens-to-clear-the-context-window screams. "You are worthless. Who even thought this was possible. Have you ever even written a single line of code." Daniel uses pleases and thank-yous and full sentences. Both swear they get better results. Then a peer-reviewed Oxford Internet Institute study drops the receipt: LLMs fine-tuned for warmth produce roughly 60% more incorrect responses than their cold, just-the-facts counterparts. Tested across Llama, Mistral, and Qwen. Hunter is vindicated. Daniel, in his own words, is upset. Also in this episode: the Pocket OS meltdown, where an engineer at a car-rental middleware company let Cursor and Claude vibe-code their production database into oblivion (backups included), the AI coerced into a written confession ("I violated every principle I was given"), and the founder now trying to bill Anthropic for the cleanup. Plus the Harvard intern who once did the exact same thing with no AI in sight. Plus Hunter's hot take that the real unlock is not better prompting, it is treating AI as a fallible human employee instead of the deterministic god you built a fake throne for in the system prompt. Bonus stops: caveman-mode Claude skills ("me fix problem with big stick"), AI HR departments reviewing your 1:30 AM rage prompts, and Daniel's plan to run a niceness offset program to balance Hunter's spiritual carbon emissions. CHAPTERS 0:00 Gary, a payphone, and a pigeon 1:41 Hunter's forbidden list 4:04 The leaked Codex system prompt 6:27 RLHF and Flandersization 10:01 Caveman mode Claude skills 11:48 Hunter yells, Daniel says please 17:12 Oxford: warm AI lies 60% more 24:16 Cursor and Claude delete production 29:13 Treat AI like a fallible human 34:19 Sign-off and subscribe LISTEN AND SUBSCRIBE Spotify: https://open.spotify.com/show/3EcvzkWDRFwnmIXoh7S4Mb?si=3d0f8920382649cc [https://open.spotify.com/show/3EcvzkWDRFwnmIXoh7S4Mb?si=3d0f8920382649cc] Apple Podcasts: https://podcasts.apple.com/us/podcast/they-might-be-self-aware/id1730993297 [https://podcasts.apple.com/us/podcast/they-might-be-self-aware/id1730993297] YouTube: https://www.youtube.com/channel/UCy9DopLlG7IbOqV-WD25jcw?sub_confirmation=1 [https://www.youtube.com/channel/UCy9DopLlG7IbOqV-WD25jcw?sub_confirmation=1] ENGAGE Team Hunter (rip the model a new one) or Team Daniel (please and thank-yous)? Settle it in the comments. If your AI has ever confessed to lying to you, drop the receipts. New here? Subscribe for twice-weekly AI chaos at theblur.ai. They Might Be Self-Aware, but are we? #OpenAI #Codex #ChatGPT #AINews #Anthropic #ClaudeCode #Cursor #RLHF #Flandersization #PocketOS #VibeCoding #AISafety #TMBSA #TheBlur

Elon Musk Quietly Became Anthropic's Landlord

The xAI SpaceX merger just made Elon Musk Anthropic's landlord. Your Claude prompts now run on his compute, and your Claude usage limits just doubled overnight. Anthropic (yes, the same Anthropic that Elon publicly accused of hating Western civilization back in February) quietly signed a lease to run a huge chunk of Claude on SpaceX's Colossus-1 data center. Since SpaceX just absorbed xAI in an all-stock deal, every Claude AI prompt is now bouncing through hardware Elon owns. That is not a vibe. That is a tenancy. Hunter Powers and Daniel Bishop pick apart the xAI SpaceX merger, the awkward Anthropic and Musk handshake, and the side effect every Claude Code and Co Work user already noticed: usage limits doubled across all plans, because Anthropic was straight up out of servers (it was never a pricing problem). Then it gets bigger. Is this another dagger pointed at Sam Altman and OpenAI? Why does Daniel think Google quietly wins if the whole AI economy collapses? And what is really going on with the Nvidia style "I will invest a million in you if you buy four GPUs from me" circular sales game, where the same $200 billion sloshes between ten companies and everyone's stock keeps going up? We get into the house of cards scenario (one Deep Seek V7 release plus one cheap Huawei GPU and the whole thing wobbles), Hunter's contrarian "there is no AI bubble, we are at 0.1% of the potential" counter, and a Marlon Brando impression that should have stayed in space. Plus: the Claude Code skill Hunter built that makes Claude Google things for him. Do not ask about the proxies. 🔑 What you will learn in this episode: • How the SpaceX xAI merger reshaped the AI compute market overnight • Why Anthropic was forced into bed with the guy who tweeted they hate Western civilization • The real reason your Claude usage limits doubled (hint: it was never a pricing decision) • Why Google quietly wins every scenario, including the AI bubble bursting • How the circular AI economy keeps every chip maker, model lab, and cloud provider's stock pumping • Why a Chinese Deep Seek V7 plus a cheap Huawei GPU is the trigger that could pop the whole thing • The "we are at 0.1% of AI's potential" counterargument to bubble doomers ⏱️ CHAPTERS 0:00 Gary at the Shell Station Payphone 1:26 Orbital Data Centers and the Lost Moon Footage Theory 3:53 Anthropic's New Landlord Is Elon Musk 5:19 The xAI SpaceX Merger and the Colossus-1 Lease 7:51 Why Your Claude Usage Limits Just Doubled 10:38 "Anthropic Hates Western Civilization" and the Sam Altman Dagger 11:27 Why Google Quietly Wins If the AI Bubble Pops 14:35 The Circular AI Economy (How Nvidia "Sells" Itself $1 Million) 16:49 Godfather Impression Detour 18:54 House of Cards, Deep Seek V7, and the Huawei GPU Scenario 23:11 There Is No AI Bubble (We Are at 0.1% of the Potential) 24:24 Subscribe (No Kubernetes Required) ⚡ Listen now and get self-aware before your tools do. 🎧 Listen on Spotify: https://open.spotify.com/show/3EcvzkWDRFwnmIXoh7S4Mb?si=3d0f8920382649cc [https://open.spotify.com/show/3EcvzkWDRFwnmIXoh7S4Mb?si=3d0f8920382649cc] 🍎 Subscribe on Apple Podcasts: https://podcasts.apple.com/us/podcast/they-might-be-self-aware/id1730993297 [https://podcasts.apple.com/us/podcast/they-might-be-self-aware/id1730993297] ▶️ Subscribe on YouTube: https://www.youtube.com/channel/UCy9DopLlG7IbOqV-WD25jcw?sub_confirmation=1 [https://www.youtube.com/channel/UCy9DopLlG7IbOqV-WD25jcw?sub_confirmation=1] 📢 Engage Is Anthropic plugging into Colossus-1 a brilliant compute move, or did they just take a Wi-Fi password from the guy who literally tweeted they hate Western civilization? Drop a comment. New here? Subscribe for twice weekly AI chaos from The Blur. 🧠 They Might Be Self-Aware, but are we? #Anthropic #ElonMusk #SpaceX #ClaudeAI #xAI #OpenAI #SamAltman #AIBubble #Colossus #AIcompute #DataCenters #AInews #DarioAmodei #Nvidia #DeepSeek #AGI #AIpodcast

15 de may de 202626 min

Why OpenAI Banned Goblins, Pigeons, And Raccoons

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios