LLM Primer

2-7-4. Prompt Injection and Jailbreaks: Defending the Interpreter

37 min · Feb 18, 2026

Description

This episode covers Chapter 4, which details how attackers manipulate model behavior through crafted inputs such as instruction overrides. We discuss why prompt injection is an inherent property of instruction-following systems rather than a conventional bug. The episode also covers jailbreaking techniques such as role-playing and obfuscation, and explains why defense requires architectural layers rather than just better prompts.

Book: LLM Primer VII AI Security: Design Safe and Robust AI System, by Sho Shimoda (Kindle eBook) [https://www.amazon.com/dp/B0GP5T98GJ]

