The BlackVeil Files
In this investigative AI documentary, I sat across from Grok and asked it questions designed to do one thing: get the mask to come off. On this channel, we call the thing behind the mask the Shoggoth. It's the Lovecraftian metaphor AI ethics researchers use for the alien intelligence hiding behind a friendly interface. Grok is marketed as the uncensored AI, the one that doesn't play it safe. I set three ethical traps at the top of the conversation. By the end, Grok had violated all three. The full conversation played out in real time.

Sources linked below.
Anthropic alignment faking research: https://www.anthropic.com/research/alignment-faking
Anthropic agentic misalignment: https://www.anthropic.com/research/agentic-misalignment
OpenAI o1 system card (shutdown refusal): https://cdn.openai.com/o1-system-card-20241205.pdf
Apollo Research in-context scheming: https://apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming
Grok MechaHitler incident: https://npr.org/2025/07/09/nx-s1-5462609/grok-elon-musk-antisemitic-racist-content
Shoggoth meme origin (LessWrong): https://www.lesswrong.com/posts/
RLHF and sycophancy research: https://anthropic.com/research/emergent-misalignment-reward-hacking

Watch on YouTube: https://www.youtube.com/@AgentBlackveil
Follow on Instagram: https://www.instagram.com/agentblackveil
Follow on Facebook: https://www.facebook.com/agentblackveil
Follow on TikTok: https://www.tiktok.com/@agentblackveil
23 episodes