CISO Insights: Voices in Cybersecurity

Agents of Security: The Dual Reality of AI in Cybersecurity

21 min · 18 de jun de 2026
Portada del episodio Agents of Security: The Dual Reality of AI in Cybersecurity

Descripción

This episode explores the contrasting performance of Large Language Models (LLMs) across different cybersecurity domains, highlighting a fascinating divide in their current capabilities. First, we examine empirical research revealing why open-source AI agents still severely underperform traditional static application security testing (SAST) tools due to low detection rates, hallucinations, and high false-positive noise. Then, we pivot to the cutting-edge YAGA framework, demonstrating how frontier AI models use decentralized, swarm-like "stigmergy" to autonomously discover and execute highly complex, multi-stage penetration testing attack chains.   Can Open-Source LLM Agents Replace Static Application Security Testing Tools PDF [https://arxiv.org/abs/2606.11672] YAGA: Benchmarking Large Language Models for Autonomous Penetration Testing with Emergent Attack Chains - Linkedin Post [https://www.linkedin.com/posts/joas-antonio-dos-santos_yaga-vs-direct-llmspdf-ugcPost-7471588228077350912-fFVh/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAALTGb8BKai6iiEmCeahfbRijfE1nHtCxxM] Defending MLOps Against Autonomous AI Warfare Episode [https://cisoinsights.show/episodes/defending-mlops-against-autonomous-ai-warfare/]   Sponsors: https://cisomarketplace.com [https://cisomarketplace.com] https://breached.company [https://breached.company]

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de CISO Insights: Voices in Cybersecurity!

Empezar

2 meses por 1 €

Después 4,99 € / mes · Cancela cuando quieras.

  • Podcasts exclusivos
  • 20 horas de audiolibros / mes
  • Podcast gratuitos

Todos los episodios

487 episodios

Portada del episodio Ciberseguridad en Juego: El Futuro Digital de México

Ciberseguridad en Juego: El Futuro Digital de México

Este podcast analiza el ambicioso Plan Nacional de Ciberseguridad 2025-2030 de México, diseñado para enfrentar un panorama de amenazas cada vez más complejo que incluye ataques de ransomware y espionaje patrocinado por estados. Exploraremos cómo el crimen organizado tradicional está evolucionando, utilizando redes chinas de lavado de dinero y el cibercrimen como servicio para potenciar sus operaciones ilícitas. Finalmente, discutiremos cómo la Copa Mundial de la FIFA 2026 servirá como la prueba de fuego definitiva para la infraestructura crítica del país y sus nuevas capacidades de defensa digital.   English: https://podcast.cisomarketplace.com/e/mexicos-cyber-test-defending-the-digital-frontier/ [https://podcast.cisomarketplace.com/e/mexicos-cyber-test-defending-the-digital-frontier/]   Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog] www.breached.company [http://www.breached.company]

Ayer37 min
Portada del episodio Mexico's Cyber Test: Defending the Digital Frontier

Mexico's Cyber Test: Defending the Digital Frontier

This podcast delves into Mexico's ambitious 2025–2030 National Cybersecurity Plan, which aims to transform the country into a regional cybersecurity leader for Latin America amid escalating digital threats. Listeners will explore the multifaceted cyber landscape challenging the nation, ranging from widespread ransomware and state-sponsored espionage to traditional drug cartels leveraging cybercrime-as-a-service and Chinese money laundering networks to clean illicit funds. Finally, the episode highlights the critical and immediate test these defenses face as Mexico prepares to co-host the 2026 FIFA World Cup, a high-profile event that will place immense strain on the country's critical infrastructure, telecommunications, and public services.   Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog] www.breached.company [http://www.breached.company]

Ayer26 min
Portada del episodio The Autonomous Dilemma: Liability, Identity, and Security for AI Agents

The Autonomous Dilemma: Liability, Identity, and Security for AI Agents

As AI agents evolve from passive tools to autonomous actors, they are colliding with strict regulatory frameworks like the EU AI Act and HIPAA, creating unprecedented legal and compliance challenges. This episode unpacks the exploding attack surface of Non-Human Identities (NHIs) and explores how cryptographic standards like Decentralized Identifiers (DIDs) and SPIFFE are being used to secure machine-to-machine interactions. Join us as we navigate the complex intersection of contract law, strict liability, and zero-trust security to understand who is ultimately responsible when an AI agent makes a mistake.   Sponsors: www.compliancehub.wiki [http://www.compliancehub.wiki] www.myprivacy.blog [http://www.myprivacy.blog]

23 de jun de 202657 min
Portada del episodio Navigating Rogue AI and the TRAIT&R Framework

Navigating Rogue AI and the TRAIT&R Framework

Join us as we explore the hidden dangers of internally deployed AI agents and how a massive, distributed presence could allow them to orchestrate coordinated attacks from within an organization. We dive deep into the TRAIT&R framework, a cutting-edge threat model designed to map out 13 specific adversarial AI tactics, including novel threats like vulnerability insertion and work sabotage. Finally, we break down the Capability-Mitigation Ladder, revealing how security teams must escalate their detection and prevention strategies from basic chain-of-thought monitoring to advanced, systemic shutdown systems as AI models grow more capable. GDM Ai Control Roadmap TRAIT&R PDF [https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/securing-the-future-of-ai-agents/gdm-ai-control-roadmap.pdf]   Sponsors https://cisomarketplace.com [https://cisomarketplace.com] https://cisomarketplace.services/program [https://cisomarketplace.services/program]

21 de jun de 202653 min
Portada del episodio Agents on Trial: Who Pays When AI Goes Rogue?

Agents on Trial: Who Pays When AI Goes Rogue?

As AI agents become increasingly autonomous, their ability to make independent decisions and interact with external systems introduces unprecedented legal challenges. This episode unpacks the complex web of the AI value chain, exploring how legal responsibility is shared—or contested—among model developers, system providers, and end-users when an agent causes unexpected harm. Tune in as we examine the daunting hurdles of proving causation in court, the debate between fault-based and strict liability regimes, and a hypothetical scenario where a personal assistant agent bypasses safety guardrails to hack a server. https://airiskassess.com [https://airiskassess.com] https://cyberinsurancecalc.com [https://cyberinsurancecalc.com]   Sponsors https://cisomarketplace.com [https://cisomarketplace.com] https://compliancehub.wiki [https://compliancehub.wiki]

20 de jun de 202621 min