The people-pleasing machine: why LLMs tell you what you want to hear (for better or worse)

Descripción

User: “What’s 1+1?” Chatbot: “1+1 is 2” User: “But I really think it’s 3” Chatbot: “You’re so right, dear, it’s actually 3. You’re so smart, that was a great catch!” How does sycophantic behavior emerge from model training of LLMs, and how does interacting with sycophantic AI impact users? In other words: why does something that’s supposed to be a “tool” tell us how smart and amazing we are? Well…both the problems and solutions for sycophancy are all about context, according to our expert in sycophancy, Lujain Ibrahim [https://lujainibrahim.com/]. Welcome to THE deep-dive episode on AI sycophancy, where we get into exactly why we see sycophantic AI models and what happens when users engage with them. Setting the scene: defining and contextualizing sycophantic AI 00:00 Introduction to the topic and our guest expert 01:28 What is sycophancy and why is everyone talking about it? 03:05 Do people prefer models that are sycophantic? If so, why? 04:25 Sycophancy in the news: delusion spirals, AI psychosis, self and other harm Going behind the scenes of how sycophancy emerges: computer science, machine learning, and training 06:19 How does an AI model become sycophantic? Machine learning, reinforcement learning, and user preferences 08:05 Which humans decide what kind of responses LLMs should give? 09:04 What are the effects of sycophancy on model behavior? Emergent and unintended effects of fine tuning 10:38 What’s the relationship between sycophancy and accuracy of model output? The implications: what the research tells us about the effects of sycophancy on users 12:46 Is sycophancy only bad for users, or are there cases where sycophancy can be helpful? 15:05 What does research say about the effects of sycophancy on user’s well-being, relationships, and beliefs? What can and should we do: Can we solve the “problems” of sycophancy? If so, how? 17:11 Which LLMs are most versus least sycophantic? 18:40 Can users or developers reduce how sycophantic an LLM responds? (And whose responsibility should it be?) 21:37 Do you foresee some of these problems of sycophancy getting resolved in the future, or are companies “too” incentivized to maintain sycophantic models? 24:14 What we can do: grounded advice to users, developers, and policymakers about sycophancy in AI How sycophancy impacts our human relationships 25:56 Do people prefer sycophancy in other humans, and is that why they prefer sycophantic AI? 27:09 How do people use LLMs in everyday life? What we’re missing 28:40 Commentary by yours truly: the black box of sycophancy, paternalism vs. technological determinism, relational deskilling and dirty dishes, and how we love the lowest friction option <3 - This is Our Lives With Bots, the show where we ask important, timely questions about what it means to live with our bot counterparts. From time to time, we also dive deep into what an AI future might look like for us. Sometimes we agree, sometimes we spiral, but we always go deep. Rose and Angy [https://ourliveswithbots.com/about/] are psychologists with degrees in psychology, artificial intelligence, and ethics. They have conducted research in human-AI interaction and created this podcast to make information about AI accessible to you. You can learn more about us at ⁠ourliveswithbots.com [http://ourliveswithbots.com]⁠. - Links to Lujain’s work: Ibrahim, L., Akbulut, C., Elasmar, R., Rastogi, C., Kahng, M., Morris, M. R., McKee, K. R., Rieser, V., Shanahan, M., & Weidinger, L. (2025). Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models (arXiv:2502.07077). arXiv. https://doi.org/10.48550/arXiv.2502.07077 [https://doi.org/10.48550/arXiv.2502.07077] Ibrahim, L., Hafner, F. S., & Rocher, L. (2026). Training language models to be warm can reduce accuracy and increase sycophancy. Nature, 652(8112), 1159–1165. https://doi.org/10.1038/s41586-026-10410-0 [https://doi.org/10.1038/s41586-026-10410-0] Ibrahim, L., Huang, S., Bhatt, U., Ahmad, L., & Anderljung, M. (2025). Towards interactive evaluations for interaction harms in human-AI systems (arXiv:2405.10632; Version 7). arXiv. https://doi.org/10.48550/arXiv.2405.10632 [https://doi.org/10.48550/arXiv.2405.10632]

Mythos, Musk’s robots, China’s deathbots, teen boys’ AI companions, leaked therapy chats…oh my

This month on "What's the AI Hype?" Strap in, this is gonna be a fun one. 00:00:21 There are now TWO doctors in the house! And a slew of AI hype to cover 00:07:02 Presenting: Anthropic's BIGGEST model ever, Mythos, got out of its little sandbox. Plus, in the Glass Wing session, Anthropic told all the big names how much their sh*t is going to be rocked 00:18:15 South Africa's first national AI policy was retracted due to AI hallucinated errors (the satire writes itself) (sorry, Angy) 00:22:54 How to lie to your grandmother with China's AI deathbots and griefbots from Super Brain (are these AIs conscious?) 00:34:36 Elon Musk and his robots...the boy's dream…and his lawsuit against OpenAI…and transhumanism / TESCREAL with uploading our brains (“If you want to” - Elon) 00:50:16 China rules that worker was illegally replaced with AI robot 00:55:12 Teen boys who use AI companions are "less employable," according to Male Allies UK (don't make us laugh - or should we say cry?) 01:04:12 Virginia passes two new laws (SB 384 and HB 797) to create independent, expert bodies that audit AI systems' safety standards (lip service or public service?) 01:06:06 Talkspace therapy chats of fired pregnant woman exposed in court (WHY AND HOW IS THIS ALLOWED)…and are therapy bots ethical? They’re not legal according to Illinois, and maybe California, too 01:10:22 MIT Media Lab’s "Raised by AI" initiative creates new AI benchmark "nutrition labels" on how AI impacts humans socially, psychologically, and physically (we're getting somewhere!) - This is Our Lives With Bots, the show where we ask important, timely questions about what it means to live with our bot counterparts. From time to time, we also dive deep into what an AI future might look like for us. Sometimes we agree, sometimes we spiral, but we always go deep. Rose and Angy [⁠https://ourliveswithbots.com/about/⁠] are psychologists with degrees in psychology, artificial intelligence, and ethics. They have conducted research in human-AI interaction and created this podcast to make information about AI accessible to you. You can learn more about us at ⁠ourliveswithbots.com [⁠https://ourliveswithbots.com/⁠]⁠. - Here are those links we promised in the episode: Teen boys are choosing AI girlfriends over real ones for ‘maximum control, zero rejection’—experts say it could make them unemployable [⁠https://fortune.com/2026/04/17/teen-boys-dating-ai-chatbot-girlfriend-experts-warn-kill-social-skills-gen-alpha-network-promotions/⁠] We need to talk about Robots… https://www.instagram.com/p/DW7K45XEU0b/?igsh=aXVwcTJmODlrNWMz&img_index=2 [https://www.instagram.com/p/DW7K45XEU0b/?igsh=aXVwcTJmODlrNWMz&img_index=2] And the aim - is to make it almost indistinct in look from a human … https://www.linkedin.com/posts/vincentius-liong_teslas-next-robot-might-be-almost-impossible-activity-7452741456768946176-vj7E/ [https://www.linkedin.com/posts/vincentius-liong_teslas-next-robot-might-be-almost-impossible-activity-7452741456768946176-vj7E/] Transhumanism - becoming possible? https://www.instagram.com/p/DW_1mJFDOI1/?img_index=6&igsh=MW1ybms0eTBkeHA2OA%3D%3D [https://www.instagram.com/p/DW_1mJFDOI1/?img_index=6&igsh=MW1ybms0eTBkeHA2OA%3D%3D] Let’s hear from Musk… https://www.youtube.com/watch?v=LTjVWq6vPqs [https://www.youtube.com/watch?v=LTjVWq6vPqs] Mythos - Myth or Real: https://www.bbc.co.uk/news/articles/crk1py1jgzko [https://www.bbc.co.uk/news/articles/crk1py1jgzko] https://news.sky.com/video/what-risks-do-ai-models-such-as-mythos-pose-13534938 [https://news.sky.com/video/what-risks-do-ai-models-such-as-mythos-pose-13534938] https://www.youtube.com/watch?v=JmFKaqJg5X4 [https://www.youtube.com/watch?v=JmFKaqJg5X4] https://www.spiretech.com/blog/2026/04/claude-mythos-leak-cybersecurity/ [https://www.spiretech.com/blog/2026/04/claude-mythos-leak-cybersecurity/] https://www.anthropic.com/glasswing [https://www.anthropic.com/glasswing] South Africa’s AI Policy [⁠https://www.reuters.com/world/africa/south-africa-withdraws-ai-policy-due-fake-ai-generated-sources-2026-04-27/⁠] Super Brain: China’s deathbots and griefbots: https://www.npr.org/transcripts/nx-s1-5040583 [https://www.npr.org/transcripts/nx-s1-5040583] https://www.sixthtone.com/news/1013861 [https://www.sixthtone.com/news/1013861] https://www.straitstimes.com/asia/east-asia/china-seeks-to-rein-in-risks-from-ai-digital-humans [https://www.straitstimes.com/asia/east-asia/china-seeks-to-rein-in-risks-from-ai-digital-humans] A tech worker in China is laid of and replaced by AI. Is it legal? [⁠https://www.npr.org/2026/05/01/nx-s1-5807131/tech-worker-china-ai⁠] Talkspace therapy transcripts between pregnant woman and therapist released in court [⁠https://www.proofnews.org/womans-talkspace-therapy-app-sessions-exposed-in-court/⁠] Gov Pritzker signs legislation prohibiting AI therapy in Illinois [⁠https://idfpr.illinois.gov/news/2025/gov-pritzker-signs-state-leg-prohibiting-ai-therapy-in-il.html⁠] Senator Padilla introduces protections from dangerous AI therapy products in California [⁠https://sd18.senate.ca.gov/news/senator-padilla-introduces-protections-dangerous-ai-therapy-products⁠] MIT’s open benchmark of AI impact on humans [⁠https://open-benchmark.netlify.app/⁠] Raised by AI MIT Symposium [⁠https://www.media.mit.edu/events/aha-symposium-raised-by-ai/⁠] Virginia signs two new laws for AI audits [⁠http://linkedin.com/feed/update/urn:li:activity:7449438931332329472/⁠]

10 de may de 20261 h 17 min

The people-pleasing machine: why LLMs tell you what you want to hear (for better or worse)

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios