CISO Insights: Voices in Cybersecurity

Agents of Security: The Dual Reality of AI in Cybersecurity

21 min · Ayer
Portada del episodio Agents of Security: The Dual Reality of AI in Cybersecurity

Descripción

This episode explores the contrasting performance of Large Language Models (LLMs) across different cybersecurity domains, highlighting a fascinating divide in their current capabilities. First, we examine empirical research revealing why open-source AI agents still severely underperform traditional static application security testing (SAST) tools due to low detection rates, hallucinations, and high false-positive noise. Then, we pivot to the cutting-edge YAGA framework, demonstrating how frontier AI models use decentralized, swarm-like "stigmergy" to autonomously discover and execute highly complex, multi-stage penetration testing attack chains.   Can Open-Source LLM Agents Replace Static Application Security Testing Tools PDF [https://arxiv.org/abs/2606.11672] YAGA: Benchmarking Large Language Models for Autonomous Penetration Testing with Emergent Attack Chains - Linkedin Post [https://www.linkedin.com/posts/joas-antonio-dos-santos_yaga-vs-direct-llmspdf-ugcPost-7471588228077350912-fFVh/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAALTGb8BKai6iiEmCeahfbRijfE1nHtCxxM] Defending MLOps Against Autonomous AI Warfare Episode [https://cisoinsights.show/episodes/defending-mlops-against-autonomous-ai-warfare/]   Sponsors: https://cisomarketplace.com [https://cisomarketplace.com] https://breached.company [https://breached.company]

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de CISO Insights: Voices in Cybersecurity!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

482 episodios

episode Swarm Intelligence: Architecting the Autonomous Security Brain artwork

Swarm Intelligence: Architecting the Autonomous Security Brain

This episode breaks down the architecture required to build a fully autonomous, enterprise-grade penetration testing department using multi-agent swarms. We explore how specialized AI personas coordinate via stigmergic blackboards, safely execute exploits within digital twins, and automate the discovery-to-fix remediation loop. Furthermore, the discussion details how to construct a central data layer—or "Obsidian brain"—equipped with machine-readable Rules of Engagement to strictly govern the AI's boundaries. Agents of Security Podcast [https://podcast.cisomarketplace.com/e/agents-of-security-the-dual-reality-of-ai-in-cybersecurity/] Sponsors: www.cisomarketplace.com [http://www.cisomarketplace.com] https://cisomarketplace.services/program [https://cisomarketplace.services/program]

19 de jun de 202649 min
episode Agents of Security: The Dual Reality of AI in Cybersecurity artwork

Agents of Security: The Dual Reality of AI in Cybersecurity

This episode explores the contrasting performance of Large Language Models (LLMs) across different cybersecurity domains, highlighting a fascinating divide in their current capabilities. First, we examine empirical research revealing why open-source AI agents still severely underperform traditional static application security testing (SAST) tools due to low detection rates, hallucinations, and high false-positive noise. Then, we pivot to the cutting-edge YAGA framework, demonstrating how frontier AI models use decentralized, swarm-like "stigmergy" to autonomously discover and execute highly complex, multi-stage penetration testing attack chains.   Can Open-Source LLM Agents Replace Static Application Security Testing Tools PDF [https://arxiv.org/abs/2606.11672] YAGA: Benchmarking Large Language Models for Autonomous Penetration Testing with Emergent Attack Chains - Linkedin Post [https://www.linkedin.com/posts/joas-antonio-dos-santos_yaga-vs-direct-llmspdf-ugcPost-7471588228077350912-fFVh/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAALTGb8BKai6iiEmCeahfbRijfE1nHtCxxM] Defending MLOps Against Autonomous AI Warfare Episode [https://cisoinsights.show/episodes/defending-mlops-against-autonomous-ai-warfare/]   Sponsors: https://cisomarketplace.com [https://cisomarketplace.com] https://breached.company [https://breached.company]

Ayer21 min
episode Breaking the Union Ceiling: The Path to Cybersecurity SuperIntelligence artwork

Breaking the Union Ceiling: The Path to Cybersecurity SuperIntelligence

Current cybersecurity AI systems typically rely on single-agent scaffolds, yet research demonstrates that no individual orchestration layer is optimally suited for every type of threat. By uniting structurally diverse scaffolds through a shared "blackboard" substrate, different agents can exchange intermediate findings and compress each other's reconnaissance phases. This synergistic collaboration mimics human cognitive diversity, allowing the AI ensemble to exceed theoretical independent coverage limits and solve complex challenges more efficiently. Towards Cyber-security Super-intelligence Whitepaper PDF: [https://media.licdn.com/dms/document/media/v2/D4E1FAQHaLcQ1IR0FZQ/feedshare-document-sanitized-pdf/B4EZ6Bya.fHQA8-/0/1780293940601?e=1782226800&v=beta&t=1pLjKh5i39z51CEfcT66EdVZTWXEovVsFdYs5vLCgHc]   Sponsors: https://cisomarketplace.services/program [https://cisomarketplace.services/program] https://cisomarketplace.services/ai-services [https://cisomarketplace.services/ai-services]

16 de jun de 202656 min
episode Defending MLOps Against Autonomous AI Warfare artwork

Defending MLOps Against Autonomous AI Warfare

In this podcast, we dive into the critical evolution of MLSecOps and how organizations must adapt to defend their dynamic machine learning pipelines against the OWASP ML Top 10 threats, including data poisoning and AI supply chain attacks. We explore actionable insights from DARPA's AI Cyber Challenge, highlighting how autonomous systems like Buttercup use multi-agent architectures and LLMs to revolutionize vulnerability discovery and automated patching. Finally, we map out the essential open-source tools, such as Sigstore and MLRun, alongside the new security personas required to build robust, secure-by-design AI applications from initial data engineering to continuous production monitoring. Visualizing Secure MLOps (MLSecOps): A Practical Guide for Building Robust AI/ML Pipeline Security [https://openssf.org/wp-content/uploads/2025/08/OpenSSF_MLSecOps_Whitepaper.pdf]   Sponsors: https://cisomarketplace.services/program [https://cisomarketplace.services/program] https://cisomarketplace.services/ai-services [https://cisomarketplace.services/ai-services]

15 de jun de 202640 min
episode The AI Accountability Gap: Prioritizing Catastrophic Risks artwork

The AI Accountability Gap: Prioritizing Catastrophic Risks

In this episode, we dive into a landmark Delphi study where 272 international experts prioritize the most severe threats posed by artificial intelligence over the next five years, including AI-enabled cyberattacks, dangerous capabilities, and extreme power centralization. We explore the stark "moral hazard" at the heart of the AI ecosystem, revealing how the general public and critical sectors bear the greatest vulnerabilities while the upstream developers responsible for safeguards face intense competitive pressures to race ahead. Finally, we discuss why implementing pragmatic mitigations is crucial yet insufficient, as structural risks are deeply entrenched in global economic systems and retain a persistent likelihood of causing catastrophic global outcomes. Prioritization of Risks from Artificial Intelligence PDF [https://cdn.prod.website-files.com/669550d38372f33552d2516e/6a172558bd2947234379749f_a8684052fd49a64374c9a9d3e4e5ab59_Prioritizing%20the%20risks%20from%20Artificial%20Intelligence.pdf]   Sponsors: https://airiskassess.com/ [https://airiskassess.com/] https://cisomarketplace.services/program [https://cisomarketplace.services/program]

14 de jun de 202633 min