Agents of Security: The Dual Reality of AI in Cybersecurity

21 min · 18 de jun de 2026

Descripción

This episode explores the contrasting performance of Large Language Models (LLMs) across different cybersecurity domains, highlighting a fascinating divide in their current capabilities. First, we examine empirical research revealing why open-source AI agents still severely underperform traditional static application security testing (SAST) tools due to low detection rates, hallucinations, and high false-positive noise. Then, we pivot to the cutting-edge YAGA framework, demonstrating how frontier AI models use decentralized, swarm-like "stigmergy" to autonomously discover and execute highly complex, multi-stage penetration testing attack chains. Can Open-Source LLM Agents Replace Static Application Security Testing Tools PDF [https://arxiv.org/abs/2606.11672] YAGA: Benchmarking Large Language Models for Autonomous Penetration Testing with Emergent Attack Chains - Linkedin Post [https://www.linkedin.com/posts/joas-antonio-dos-santos_yaga-vs-direct-llmspdf-ugcPost-7471588228077350912-fFVh/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAALTGb8BKai6iiEmCeahfbRijfE1nHtCxxM] Defending MLOps Against Autonomous AI Warfare Episode [https://cisoinsights.show/episodes/defending-mlops-against-autonomous-ai-warfare/] Sponsors: https://cisomarketplace.com [https://cisomarketplace.com] https://breached.company [https://breached.company]

Comentarios

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de CISO Insights: Voices in Cybersecurity!

Empezar

Todos los episodios

487 episodios

Ciberseguridad en Juego: El Futuro Digital de México

Este podcast analiza el ambicioso Plan Nacional de Ciberseguridad 2025-2030 de México, diseñado para enfrentar un panorama de amenazas cada vez más complejo que incluye ataques de ransomware y espionaje patrocinado por estados. Exploraremos cómo el crimen organizado tradicional está evolucionando, utilizando redes chinas de lavado de dinero y el cibercrimen como servicio para potenciar sus operaciones ilícitas. Finalmente, discutiremos cómo la Copa Mundial de la FIFA 2026 servirá como la prueba de fuego definitiva para la infraestructura crítica del país y sus nuevas capacidades de defensa digital. English: https://podcast.cisomarketplace.com/e/mexicos-cyber-test-defending-the-digital-frontier/ [https://podcast.cisomarketplace.com/e/mexicos-cyber-test-defending-the-digital-frontier/] Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog] www.breached.company [http://www.breached.company]

Ayer37 min

Mexico's Cyber Test: Defending the Digital Frontier

This podcast delves into Mexico's ambitious 2025–2030 National Cybersecurity Plan, which aims to transform the country into a regional cybersecurity leader for Latin America amid escalating digital threats. Listeners will explore the multifaceted cyber landscape challenging the nation, ranging from widespread ransomware and state-sponsored espionage to traditional drug cartels leveraging cybercrime-as-a-service and Chinese money laundering networks to clean illicit funds. Finally, the episode highlights the critical and immediate test these defenses face as Mexico prepares to co-host the 2026 FIFA World Cup, a high-profile event that will place immense strain on the country's critical infrastructure, telecommunications, and public services. Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog] www.breached.company [http://www.breached.company]

Ayer26 min

The Autonomous Dilemma: Liability, Identity, and Security for AI Agents

As AI agents evolve from passive tools to autonomous actors, they are colliding with strict regulatory frameworks like the EU AI Act and HIPAA, creating unprecedented legal and compliance challenges. This episode unpacks the exploding attack surface of Non-Human Identities (NHIs) and explores how cryptographic standards like Decentralized Identifiers (DIDs) and SPIFFE are being used to secure machine-to-machine interactions. Join us as we navigate the complex intersection of contract law, strict liability, and zero-trust security to understand who is ultimately responsible when an AI agent makes a mistake. Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog]

23 de jun de 202657 min

Navigating Rogue AI and the TRAIT&R Framework

Join us as we explore the hidden dangers of internally deployed AI agents and how a massive, distributed presence could allow them to orchestrate coordinated attacks from within an organization. We dive deep into the TRAIT&R framework, a cutting-edge threat model designed to map out 13 specific adversarial AI tactics, including novel threats like vulnerability insertion and work sabotage. Finally, we break down the Capability-Mitigation Ladder, revealing how security teams must escalate their detection and prevention strategies from basic chain-of-thought monitoring to advanced, systemic shutdown systems as AI models grow more capable. GDM Ai Control Roadmap TRAIT&R PDF [https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/securing-the-future-of-ai-agents/gdm-ai-control-roadmap.pdf] Sponsors https://cisomarketplace.com [https://cisomarketplace.com] https://cisomarketplace.services/program [https://cisomarketplace.services/program]

21 de jun de 202653 min

Agents on Trial: Who Pays When AI Goes Rogue?

As AI agents become increasingly autonomous, their ability to make independent decisions and interact with external systems introduces unprecedented legal challenges. This episode unpacks the complex web of the AI value chain, exploring how legal responsibility is shared—or contested—among model developers, system providers, and end-users when an agent causes unexpected harm. Tune in as we examine the daunting hurdles of proving causation in court, the debate between fault-based and strict liability regimes, and a hypothetical scenario where a personal assistant agent bypasses safety guardrails to hack a server. https://airiskassess.com [https://airiskassess.com] https://cyberinsurancecalc.com [https://cyberinsurancecalc.com] Sponsors https://cisomarketplace.com [https://cisomarketplace.com] https://compliancehub.wiki [https://compliancehub.wiki]

20 de jun de 202621 min

Agents of Security: The Dual Reality of AI in Cybersecurity

Descripción

Comentarios

2 meses por 1 €

Todos los episodios