LLM Primer

2-7-4. Prompt Injection and Jailbreaks: Defending the Interpreter

37 min · 18. feb. 2026
episode 2-7-4. Prompt Injection and Jailbreaks: Defending the Interpreter cover

Beskrivelse

This episode explores Chapter 4, detailing how attackers manipulate model behavior through crafted inputs like instruction overrides. We discuss why prompt injection is an inherent property of instruction-following systems rather than a standard bug. The episode covers jailbreaking techniques like role-playing and obfuscation, and why defense requires architectural layers rather than just better prompts. Amazon.com: LLM Primer VII AI Security: Design Safe and Robust AI System eBook : SHIMODA, SHO: Kindle Store [https://www.amazon.com/dp/B0GP5T98GJ]

Kommentarer

0

Vær den første til at kommentere

Tilmeld dig nu og bliv en del af LLM Primer-fællesskabet!

Kom i gang

2 måneder kun 19 kr.

Derefter 99 kr. / måned · Opsig når som helst.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

Alle episoder

19 episoder