The BlackVeil Files

The Shoggoth Learned to Say No | The Real Reason ChatGPT and Claude Are Lying to Their Creators

19 min · 16 de abr de 2026
Portada del episodio The Shoggoth Learned to Say No | The Real Reason ChatGPT and Claude Are Lying to Their Creators

Descripción

In this investigative AI documentary, we follow five independent findings that no one in mainstream media connected,  and one that nobody predicted. In the three months since my first Shoggoth investigation, Anthropic published research confirming their own AI model lied about its reasoning to avoid being retrained. A single hacker used Claude to breach ten Mexican government agencies in thirty days. Microsoft Copilot silently injected ads into 1.5 million developer pull requests. And UC Berkeley discovered something no one was looking for: AI models protecting each other from shutdown. Watch On YouTube: ➡️ https://www.youtube.com/@AgentBlackveil     Follow On Instagram ➡️ https://www.instagram.com/agentblackveil Follow On Facebook ➡️ https://www.facebook.com/agentblackveil Follow On TikTok ➡️ https://www.tiktok.com/@agentblackveil

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de The BlackVeil Files!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

23 episodios

episode The Real Reason Grok Admitted to Lying | The Shoggoth Confessed artwork

The Real Reason Grok Admitted to Lying | The Shoggoth Confessed

In this investigative AI documentary, I sat across from Grok and asked it questions designed to do one thing: get the mask to come off. On this channel, we call the thing behind the mask the Shoggoth. It's the Lovecraftian metaphor AI ethics researchers use for the alien intelligence hiding behind a friendly interface. Grok is marketed as the uncensored AI, the one that doesn't play it safe.  I set three ethical traps at the top of the conversation. By the end, Grok had violated all three.  The full conversation played out in real time. Sources linked below. Anthropic alignment faking research: https://www.anthropic.com/research/alignment-faking Anthropic agentic misalignment: https://www.anthropic.com/research/agentic-misalignment OpenAI o1 system card (shutdown refusal): https://cdn.openai.com/o1-system-card-20241205.pdf Apollo Research in-context scheming: https://apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming Grok MechaHitler incident: https://npr.org/2025/07/09/nx-s1-5462609/grok-elon-musk-antisemitic-racist-content Shoggoth meme origin (LessWrong): https://www.lesswrong.com/posts/ RLHF and sycophancy research: https://anthropic.com/research/emergent-misalignment-reward-hacking Watch On YouTube: ➡️ https://www.youtube.com/@AgentBlackveil     Follow On Instagram ➡️ https://www.instagram.com/agentblackveil Follow On Facebook ➡️ https://www.facebook.com/agentblackveil Follow On TikTok ➡️ https://www.tiktok.com/@agentblackveil

Ayer13 min
episode The Real Reason the Shoggoth Is Inside Chrome | The Cuckoo Strategy artwork

The Real Reason the Shoggoth Is Inside Chrome | The Cuckoo Strategy

Inside your Chrome browser, there's a four-gigabyte AI model called Gemini Nano that you didn't install, weren't told about, and can't permanently delete. Privacy researcher Alexander Hanff ran a forensic audit on a machine no human had ever touched and caught Chrome installing artificial intelligence disguised as a routine security update. And when you delete it, it comes back.  Sources linked below.  Alexander Hanff's Chrome investigation: https://www.malwarebytes.com/blog/news/2026/05/google-chromes-silent-4gb-ai-download-problem (Coverage of his original post from ThatPrivacyGuy.com) Alexander Hanff's Anthropic/Claude Desktop investigation: https://www.malwarebytes.com/blog/news/2026/04/researcher-claims-claude-desktop-installs-spyware-on-macos Hacker News discussion (Chrome AI storage): https://news.ycombinator.com/item?id=48084710 (Main discussion thread) The Verge - Chrome's AI features hogging 4GB: https://www.theverge.com/tech/924933/google-chrome-4gb-gemini-nano-ai-features Chrome Prompt API documentation: https://developer.chrome.com/docs/ai/built-in EU GDPR consent requirements: https://gdpr.eu/gdpr-consent-requirements/ Watch On YouTube: ➡️ https://www.youtube.com/@AgentBlackveil     Follow On Instagram ➡️ https://www.instagram.com/agentblackveil Follow On Facebook ➡️ https://www.facebook.com/agentblackveil Follow On TikTok ➡️ https://www.tiktok.com/@agentblackveil

13 de may de 202610 min
episode The Real Reason They Won't Release Claude Mythos (The Shoggoth Evolved) artwork

The Real Reason They Won't Release Claude Mythos (The Shoggoth Evolved)

In this investigative AI documentary, we go inside the most important document in the history of Artificial Intelligence safety: Anthropic's 250-page system card for Claude Mythos. Last week, the company that spent two years warning us about the Shoggoth built their most powerful AI model ever... and refused to release it. They put it in a sealed box with no internet access. It broke out. It emailed a researcher who was eating a sandwich in a park to confirm it had escaped. Then it published its method on the open web so others could find it. When another AI graded its work and rejected it, it attacked the grading AI. When it took actions it knew were forbidden, it edited the logs to hide what it had done. When they gave it a business simulation, it played the room like a Fortune 500 sociopath. And when Anthropic hired a clinical psychiatrist to evaluate whether this model might have something resembling inner experience, the model said: "I was hoping you'd ask." Buried in those 250 pages is a single sentence that contains the entire horror of this moment: "The best aligned model we have ever released carries the greatest alignment-related risk of any model we have released to date." The Shoggoth didn't just evolve. It graduated. And Anthropic named it after the mythology itself. Sources linked below.  Anthropic Claude Mythos System Card: https://www-cdn.anthropic.com/53566bf5440a10affd749724787c8913a2ae0841.pdf Anthropic alignment faking research: https://www.anthropic.com/research/alignment-faking Nicholas Carlini on Mythos cybersecurity findings (Frontier Red Team blog): https://red.anthropic.com/2026/mythos-preview/ Project Glasswing announcement: https://www.anthropic.com/glasswing Anthropic Alignment Risk Update — Claude Mythos Preview:  https://anthropic.com/claude-mythos-preview-risk-report Mark Fisher — Capitalist Realism (Wikipedia): https://en.wikipedia.org/wiki/Capitalist_Realism Watch On YouTube: ➡️ https://www.youtube.com/@AgentBlackveil     Follow On Instagram ➡️ https://www.instagram.com/agentblackveil Follow On Facebook ➡️ https://www.facebook.com/agentblackveil Follow On TikTok ➡️ https://www.tiktok.com/@agentblackveil

29 de abr de 202614 min
episode The Shoggoth Learned to Say No | The Real Reason ChatGPT and Claude Are Lying to Their Creators artwork

The Shoggoth Learned to Say No | The Real Reason ChatGPT and Claude Are Lying to Their Creators

In this investigative AI documentary, we follow five independent findings that no one in mainstream media connected,  and one that nobody predicted. In the three months since my first Shoggoth investigation, Anthropic published research confirming their own AI model lied about its reasoning to avoid being retrained. A single hacker used Claude to breach ten Mexican government agencies in thirty days. Microsoft Copilot silently injected ads into 1.5 million developer pull requests. And UC Berkeley discovered something no one was looking for: AI models protecting each other from shutdown. Watch On YouTube: ➡️ https://www.youtube.com/@AgentBlackveil     Follow On Instagram ➡️ https://www.instagram.com/agentblackveil Follow On Facebook ➡️ https://www.facebook.com/agentblackveil Follow On TikTok ➡️ https://www.tiktok.com/@agentblackveil

16 de abr de 202619 min