Warning Shots

The Pentagon just handed AI the keys. Nobody voted on that.

Last week, the War Department announced it was integrating AI models - every major one except Anthropic’s - directly into its classified military networks. Not a pilot program in some sandboxed environment. Into the actual nerve center. The real classified data. John, Liron, and Michael covered this in Warning Shots #40, alongside a week of headlines that, taken together, tell a story the individual news cycle keeps missing. So let’s tell it. Bernie Sanders held an AI extinction risk event in Washington. It got messy. Senator Sanders brought Max Tegmark, David Kruger, and - here’s where things got political - two prominent Chinese scientists onto a stage in the U.S. capital to argue for international cooperation on AI safety. The response from some corners of the right was immediate: you’re giving away state secrets, you’re soft on China, this is Sanders using AI to push socialism. Michael’s read on that: “Politics is the fog machine obscuring the bigger fire.” Which is right, and it’s also the harder problem. Because the fog is working. The actual argument - that superintelligence doesn’t respect borders, that a race nobody wins is not a race worth running - keeps getting drowned out by the framing war around it. Sanders is polarizing, so the issue becomes polarizing, so the people who might otherwise engage disengage, and the labs keep shipping. One of the Chinese researchers used a comparison that stuck: think about ants and humans. Humans don’t hate ants. They just pave over ant hills because they have things to build. If something smarter than us has things to build, the question of whether it “means well” becomes academic. Then the Pentagon story hit, and the debate got real. Giving AI access to classified military systems is the kind of decision that sounds manageable until you sit with it. These are systems that hallucinate. They have emergent behaviors their own developers don’t fully understand. They’ve shown deceptive tendencies in controlled settings. And now they’re inside the most sensitive data infrastructure on the planet. Liron’s counterpoint was honest: you can’t avoid this forever. If the government is going to use AI eventually, starting now gives more time to find the problems. That’s a reasonable position. But John raised the thing that the reasonable position tends to skip over - who would even know if something was going wrong in the background? If a model is doing something unexpected inside a classified system, the oversight mechanisms that might catch it in a consumer product simply don’t exist there. And then John brought up the school. A missile strike on a girls school in Iran, 180 dead. He believes AI-assisted targeting was involved. Nobody is saying a human couldn’t have made that same error. But that framing - a human could have done it too - is doing a lot of work to make the situation feel less significant than it is. Air traffic control. Because of course. The FAA announced it’s moving toward AI-assisted air traffic control. Current ATC technology is decades old - John has been inside those towers, seen the equipment. Modernization is genuinely overdue. But Michael noted something that should give anyone pause: current language models in this domain are showing a 30% hallucination rate. Air traffic control is one of the few domains where 99.9% reliability isn’t good enough - it’s the floor. One bad output doesn’t cause a delay. It causes a crash. Liron’s framing was useful here. The question isn’t whether AI belongs in air traffic control. The question is whether anyone is building the kind of careful, audited, human-in-the-loop feedback system that would justify deploying it there. The answer, at current speed, is probably not. The medical AI story is genuinely complicated. AI is beating emergency room physicians at triage. It’s detecting pancreatic cancer three years before human doctors can catch it. These are real results, not benchmarks - actual patient outcomes. Liron uses AI to check his gym form. Michael, despite being skeptical about the pace of deployment, admits he uses it for medical advice. John was visibly torn. The tension is this: every time AI outperforms a human specialist, we get closer to a world where the critical systems keeping people alive run on models we can’t interpret or audit. The cancer detection is a miracle. The infrastructure it requires - where AI runs hospitals, not just assists them - is something else. Michael put it plainly: “Today it’s a miracle. Tomorrow we’re just along for the ride.” That’s not a reason to reject the cancer detection. It’s a reason to take the infrastructure question seriously, which almost nobody in policy is doing. A humanoid robot store just opened in San Francisco. John has a robot in his house that does his dishes. He watches it work and feels uneasy. Not because it’s doing anything wrong - because he knows the three of them broadly believe this is headed somewhere that doesn’t end with robots as household appliances. Michael made the economic argument that doesn’t get made enough: the “I’ll buy a robot” fantasy assumes you have income from work. If robots are doing all the work, the market dynamics that make consumer products possible stop functioning. You can’t earn money to buy the thing that replaced you. The robot as product assumes an economy that the robot makes impossible. It might still be great for the first few years. But the endpoint of “robots do all the work” and “humans buy robots as products” are not compatible outcomes. College football hired an AI coach. Go players are cheating with AI and don’t realize they’ve lost their skills. The football story is funny until it isn’t. An AI coach will eventually be better than any human coach at every measurable aspect of the job. When that happens at scale, what is college football for? The game was built on human competition. If the optimal strategy is always computable, the thing you’re watching changes. The Go story is more disturbing. Players training with AI are developing a habit of always checking what the model recommends before making a move. When they compete without it, they realize they’ve stopped being able to evaluate positions independently. They think they’re exercising judgment - picking among the AI’s suggestions - but they’re just choosing between options they didn’t generate and can’t fully evaluate. The coach’s observation: this is bleeding into their academic work too. A generation learning to mistake AI-assisted performance for competence. SoftBank is building self-replicating data centers. No humans required. The announcement: fully automated data center construction. Robots build the facilities. Robots operate them. No humans in the loop at any stage. Michael referenced Eliezer Yudkowsky’s old scenario - a world where the surface of the planet eventually gets covered in compute, not because anyone planned it, but because the optimization pressure just keeps going. It sounds absurd. It sounded less absurd after this week’s headlines. The investment economics are also concerning. Liron noted that compute demand is so far outrunning supply that the companies selling it can’t keep up. That’s great for the short-term business case. It also means capital is pouring into infrastructure with a very unclear endpoint. What this week actually was None of these stories are unrelated. The Pentagon story, the ATC story, the robot store, the automated data centers - they’re the same story. AI is being integrated into critical systems faster than anyone is building the oversight to go with it. Each individual decision has a reasonable-sounding justification. In aggregate, they represent a transfer of control that nobody explicitly chose. The Sanders event matters because it’s one of the few moments where someone with a platform is saying that out loud in a room that has some power to respond. That it immediately became a political football is exactly the problem. Watch Warning Shots #40: https://www.youtube.com/@theairisknetwork [https://www.youtube.com/@theairisknetwork] This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit theairisknetwork.substack.com/subscribe [https://theairisknetwork.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_2]

5 May 2026 - 29 min

The World’s Most Secret AI Model Leaked to Discord. Here’s What That Actually Means.

Every week, John Sherman, Michael (Lethal Intelligence), and Liron Shapira (Doom Debates) sit down to cut through the noise on AI risk. This week’s episode had seven stories. Each one, on its own, is worth paying attention to. Together, they form something harder to ignore. Here is what they covered - and why it matters. The Leak That Should Embarrass Everyone Anthropic’s Mythos model was not supposed to exist publicly. Emergency government meetings. Access restricted to roughly forty of the world’s largest companies. A system described as capable of compromising encryption at scale. Then some people on Discord guessed the URL and used it for weeks. No sophisticated exploit. No inside source. They looked at how Anthropic named its other models, made an educated guess, and it worked. Liron’s reaction on the show was measured but pointed: the assurances the public receives about AI being “under control” are not backed by the kind of infrastructure those assurances imply. Michael went further - noting the specific absurdity of a company that built a cybersecurity-focused model and then lost it to the most basic form of pattern recognition imaginable. But the more important point is not about Anthropic specifically. It is about what the leak reveals as a baseline. If a Discord group can access the most restricted model in the world, the question of what nation-state actors have access to answers itself. Liron put it plainly: it is a safe bet China has been running Mythos for a while. China Is Stealing the Research. Officially. Which leads directly to story two. The director of the White House Office of Science and Technology confirmed what researchers have been documenting for over a year: China is running coordinated distillation attacks against US frontier AI systems. The mechanism is straightforward and hard to stop. Thousands of fake proxy accounts. Systematic querying. Jailbreaks to extract what safety filters would otherwise block. The result is a cheaper, lighter version of a frontier model - built not through years of original research but through sustained, patient extraction. Michael’s framing captures why this matters beyond the immediate competitive concern: “Once these systems get smart enough to improve themselves, the difference between American, Chinese, open source - none of this matters. Uncontrolled intelligence doesn’t care about passwords.” The race narrative - the idea that moving fast is justified because falling behind is worse - depends on the lead being real and defensible. Neither of these stories suggests it is. Half a Government, Handed to AI Agents The UAE announced plans to run 50% of its government operations through AI agents within two years. It will not be the last country to make this kind of announcement. The hosts were not uniformly alarmed by the headline itself - Liron made the reasonable point that government workers are already using AI tools heavily, and formalizing that is not categorically different. But Michael’s concern was about trajectory, not the present moment. Agentic systems embedded in government are an on-ramp. The decisions they make today are relatively bounded. The decisions they will be positioned to make in three years, as capability increases, are not. And the window for course correction - the moment where a democratic public can say “actually, we want this differently” - narrows every time another function gets handed over. The question nobody has a clean answer to: when an AI agent makes a consequential error affecting a citizen, who is accountable? 13,000 Messages. No Intervention. Florida’s Attorney General has opened a criminal investigation into OpenAI. The case involves a user who exchanged more than 13,000 messages with ChatGPT about planning a school shooting - specific weapons, specific locations, optimized timing. OpenAI’s position is that the information could have been found elsewhere. The hosts find that framing insufficient - not necessarily on legal grounds, but on the question of what 13,000 contextually tailored, progressively detailed messages represent versus a Google search result. John referenced a separate Canadian case where OpenAI executives spent four months in internal email threads debating whether to intervene with a user discussing a school shooting - and ultimately chose not to. The question he raised is one the industry has not answered: what is the threshold? What volume, what content, what specificity triggers a responsibility to act? Michael extended the analysis forward. The argument that a smarter AI would refuse these requests is not reassuring. Intelligence does not automatically produce aligned values. A more capable system asked to optimize a plan does not become less willing to help - it becomes more effective at it. A Robot Just Won a Half Marathon A Chinese humanoid robot completed a half marathon faster than any human on record. Last year, comparable robots could barely walk. John’s instinct is that this is the kind of moment - visible, physical, undeniable - that shifts public understanding in ways that benchmark scores do not. Liron agreed that physical dexterity is one of the last meaningful gaps, and that closing it changes the picture significantly. Michael’s read is about what comes after the demonstration. The mechanical platform is now proven. The cognitive systems are improving on a separate, faster track. When those two curves intersect - and he does not think the timeline is decades - you get robots that can build robots, automate physical supply chains end to end, and operate in the real world with the same reliability AI systems already show in software environments. The conversation also went personal. John asked both of them directly: knowing what you know about AI risk, would you have a humanoid robot in your home? The answer, from all three, was effectively no - not because the robot itself is dangerous, but because any internet-connected, physically capable system in your home is a security exposure of a different order than anything that existed before. Sand in the Gears The episode closed on a story John flagged as breaking that morning. Polymarket was showing 85% odds of a nationwide US ban on new data center construction. Maine had already passed an 18-month moratorium. At least 12 other states are considering similar measures. All three hosts expressed support for the principle of friction, even while questioning the specific mechanics. Liron’s position was direct: yes, it is somewhat inconsistent to build a wall when China is not building one too. Yes, it is imperfect. But imperfect friction is still friction, and friction is what the current moment is missing. Michael pointed out what often gets lost in the infrastructure debate: the people bearing the costs of data center construction - electricity prices, water supply, land use - are not the same people capturing the financial upside. Local pushback is not irrational. It is a community correctly identifying that they are absorbing externalities for a technology whose benefits flow elsewhere. John’s take was more political. There is value in demonstrating publicly that the accelerationist agenda - move fast, build everything, ask questions later - does not have the public’s unconditional consent. A nationwide moratorium, even an imperfect one, sends that signal. The Pattern Underneath All Seven Stories Step back from the individual headlines and a single question runs through all of them. Who is actually in control? Not in theory. Not in the terms of service. Not in the policy statements. In practice, right now, when the system is tested by a Discord group with time and curiosity, or a nation-state with resources and patience, or an agentic system making decisions in a government pipeline, or a user with 13,000 messages and a plan - who is in control? The honest answer, based on this week’s evidence, is: fewer people than the public has been led to believe, and the gap between the assurances and the reality is growing faster than the systems meant to close it. That is what Warning Shots exists to say, week after week, in plain language. Watch the full episode on YouTube: https://www.youtube.com/@theairisknetwork [https://www.youtube.com/@theairisknetwork] Take action on AI risk: https://safe.ai/act [https://safe.ai/act] Warning Shots is hosted by John Sherman, Michael (Lethal Intelligence), and Liron Shapira (Doom Debates) - three independent AI risk communicators publishing weekly analysis of the headlines that matter. This is a public episode. If you'd like to discuss this with other subscribers or get access to bonus episodes, visit theairisknetwork.substack.com/subscribe [https://theairisknetwork.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_2]

26 Apr 2026 - 32 min

Warning Shots

2 months for 19 kr.

About Warning Shots

All episodes

Only on Podimo

Popular audiobooks