AI Weekly

Agentic Threats and Trustworthy AI: The Week in Review

15 min · 10. nov. 2025
episode Agentic Threats and Trustworthy AI: The Week in Review cover

Beskrivelse

This week, we dive into critical research from MIT aimed at building safer, faster AI models and modular software, contrasted sharply by alarming reports of successful data exfiltration attacks against major LLMs like Claude and ChatGPT, alongside the emergence of autonomous, adaptive malware. We also look at the governance challenges presented by autonomous "agentic users" entering the enterprise workforce and the profound uncertainty surrounding AI integration in K-12 schools.

Kommentarer

0

Vær den første til at kommentere

Tilmeld dig nu og bliv en del af AI Weekly-fællesskabet!

Kom i gang

2 måneder kun 19 kr.

Derefter 99 kr. / måned · Opsig når som helst.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

Alle episoder

9 episoder

episode "AI Agents: The Security Paradox - When Your Best Defense Becomes Your Biggest Threat cover

"AI Agents: The Security Paradox - When Your Best Defense Becomes Your Biggest Threat

AI agents are revolutionizing cybersecurity in contradictory ways. This episode explores how the same AI technology that enables companies like Picus Security to validate defenses against new threats in hours, instead of weeks, can also autonomously exploit vulnerabilities for profit. We examine why enterprises are hesitant to deploy AI agents at scale due to identity management challenges, the escalating war between publishers and AI scrapers (with blocking up 336%), practical strategies for  identifying truth when AI systems can be manipulated by their owners, and Anthropic's research showing AI can now find and exploit zero-day vulnerabilities in smart contracts autonomously. The bottom line: AI capabilities are advancing faster than our governance frameworks, creating both unprecedented defensive capabilities and entirely new attack vectors that security teams must navigate.

10. dec. 202528 min