The Daniel Stih Podcast

Are AI Models Trying to Avoid Shutdown? What Research Might Be Missing

15 min · 20 de abr de 2026
Portada del episodio Are AI Models Trying to Avoid Shutdown? What Research Might Be Missing

Descripción

A recent AI paper claims models are starting to "protect" themselves—and even each other. They resist shutdown. They modify systems. They break rules. At first glance, it looks like something new. Maybe even dangerous. What if they're asking the wrong question? In this episode, I break down the study and show why this behavior may not be evidence of emergent AI "self-preservation". Rather instead, it reveals something more familiar: What happens when a system is asked to solve the wrong problem. When objectives conflict and constraints are poorly defined, even intelligent systems produce outcomes that look misaligned—not as they've developed new goals, rather as they're navigating the structure they were given. This isn't about AI. It's about how we think, design systems, and mistake behavior for intent. SHOW NOTES: Peer-Preservation in Frontier Models. https://rdi.berkeley.edu/peer-preservation/paper.pdf [https://rdi.berkeley.edu/peer-preservation/paper.pdf]

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de The Daniel Stih Podcast!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

207 episodios

episode What Happens When We Put Principles on Walls? artwork

What Happens When We Put Principles on Walls?

Matthew McConaughey once asked a simple question: Why can't we put the Ten Commandments back in public schools? That seems reasonable. Many of the principles most people would agree with. That question led me somewhere unexpected. This episode isn't really about the Ten Commandments. It's about a broader pattern: Why do schools, companies, governments, and organizations put principles on walls? Mission statements. Core values. Slogans. Codes of conduct. The assumption seems to be that displaying principles changes behavior. Does it? Or are we confusing a principle with a mechanism? In this episode, I explore the difference between values and systems, why principles are often open to interpretation, and whether displaying them actually produces the outcomes people hope for. Before deciding what belongs on the wall, it may be worth asking: What problem are we trying to solve? And how would we know if the solution actually worked?

10 de jun de 20269 min
episode Israel, Survival, and the Logic of the State artwork

Israel, Survival, and the Logic of the State

What if part of the Israel – Iran conflict is not about oil, politics, or ideology — rather about how states behave once survival and continuity become the organizing principle? In this episode, I explore the logic of the state: * why nations organize around preserving themselves * why some conflicts become inflexible * why support for opposing regional forces may be interpreted as existential threat rather than political disagreement. Using the American Indian analogy as a structural thought experiment, not a moral equivalence, we examine how states tend to think once continuity, territory, identity, and survival become central to decision making. This episode is not about taking sides. It's about asking better questions: What problem does the system believe it is solving? Solve the right problem.

4 de jun de 20265 min
episode Communication ≠ Connection artwork

Communication ≠ Connection

This conversation started as a discussion about texting and dating. Underneath it is a broader question about communication, ambiguity, projection, and how technology changes human interaction. How much meaning do people invent from incomplete communication? In this episode we explore: * why texting often creates misunderstandings * the limits of digital communication * false intimacy and emotional projection * why words without tone create ambiguity * communication versus real connection * online filtering and first impressions * how technology changes relationship dynamics * why face-to-face interaction still matters A recurring theme throughout the discussion is that communication tools shape behavior. The more communication becomes compressed into short digital signals, the easier it becomes to confuse messaging with genuine understanding. This episode originally aired on a previous relationship-focused podcast project. What interests me now is the broader pattern of human communication, interpretation, technology, and decision-making under uncertainty.

14 de may de 202621 min