The Human in the Loop
32 steps. That's how many it took for Anthropic's unreleased AI to simulate a full network attack. They buried that number in a release note. The model is called Mythos. The UK AI Security Institute tested it. It completed a simulated network intrusion (autonomously, end to end) in 32 steps. Anthropic decided not to ship it. That decision matters. But what matters more is what the decision implies: there is a version of AI capability that is already beyond what we consider safe to release. It exists now. In a lab. Tested by a government body. Most AI conversations are still about benchmarks. MMLU scores. Reasoning tests. Coding evals. Those measure what AI can do on curated problems. They don't measure what a motivated system can do on an uncurated one. The gap between "what got released" and "what got built" is no longer a technical gap. It's a policy gap. And that's a completely different kind of problem. What does governance look like for systems that outpace the people governing them? I don't have a clean answer. But I think Anthropic's call this week is the right one. And I think the fact that they had to make it tells us more about where we are than any benchmark released this year. What would it take for your organization to make the same call? #AI #CyberSecurity #TheHumanInTheLoop
29 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Human in the Loop!