Stop Building AI Agents: Build Harnesses Instead

Beskrivelse

Everyone is building AI agents. OpenAI SDKs, Claude Code, Deep Agent systems, custom workflows, and orchestration frameworks all promise more autonomous AI. But as these systems become more capable, they start running into familiar engineering problems: * retries * state management * orchestration * context control * durable execution This week we're joined by Hamza Tahir, CTO and co-founder of ZenML and creator of Kitaru, to discuss what happens when agents stop being simple chat interfaces and start behaving like long-running distributed systems. We explore: * what an agent harness actually is * durable execution and why it matters * orchestration vs business logic * state management for long-running agents * retries, checkpoints, and human-in-the-loop workflows * context management and token costs * open vs closed agent frameworks * why everyone seems to be rebuilding the same layer of infrastructure One of the biggest questions we kept coming back to: What is a meta harness? If you have an answer, let us know in the comments. Kitaru https://github.com/zenml-io/kitaru [https://github.com/zenml-io/kitaru] ZenML https://www.zenml.io [https://www.zenml.io] Hamza Tahir https://www.linkedin.com/in/hamzatahir/ [https://www.linkedin.com/in/hamzatahir/] Pedro Agentware https://github.com/Soypete/pedro-agentware [https://github.com/Soypete/pedro-agentware] OpenAI Agents SDK https://platform.openai.com/docs/guides/agents [https://platform.openai.com/docs/guides/agents] Temporal https://temporal.io [https://temporal.io] DBOS https://www.dbos.dev [https://www.dbos.dev] Apache Airflow https://airflow.apache.org [https://airflow.apache.org] Prefect https://www.prefect.io [https://www.prefect.io] Domesticating AI is a bi-weekly podcast about practical AI for developers. We help you brace the feral open-source AI landscape — so you can tame it instead of getting dragged by it. Subscribe on YouTube, follow on Spotify or Apple Podcasts, and support the show on Patreon. Keep your AI on a leash. Links

Hacking AI: Why Most AI Systems Are Insecure by Default

Hosts: Miriah Peterson, Matt Sharp, Chris Brousseau Recorded: April 2026 Status: Released Most AI systems today are designed to be helpful — not secure. In this episode, we break down how AI systems actually get exploited in production: * a real supply chain attack on a widely used AI dependency * prompt injection and why it still works * image-based (multimodal) exploits * tool and agent abuse If you’re building AI — especially at a startup — you are the security team. A widely used AI dependency was compromised via a malicious .pth file: * executes automatically when Python starts * no import required * targets credentials, SSH keys, and environment variables 👉 Just installing the package was enough. This highlights a critical reality: Your AI system is only as secure as your dependencies. * Models cannot distinguish between instructions and data * External content can override system behavior * Still one of the most common AI vulnerabilities 🔗 https://learnprompting.org/docs/prompt_hacking/injection [https://learnprompting.org/docs/prompt_hacking/injection] * Hidden instructions embedded in images * AI interprets images differently than humans * Expands the attack surface significantly 🔗 https://arxiv.org/abs/2306.11698 [https://arxiv.org/abs/2306.11698] * AI systems can take real-world actions via tools * Prompt injection → API calls, data leaks, unintended execution * Agents amplify risk through autonomy and retries If you’re building AI systems today: * separate instructions from data * limit tool permissions * treat outputs as untrusted * validate everything before execution * AI systems have an internet-sized attack surface * Supply chain attacks bypass all AI safeguards * Prompt injection is a fundamental problem * AI doesn’t fail safely — it fails wherever your system is weakest * LiteLLM incident: https://github.com/BerriAI/litellm/issues/24512 [https://github.com/BerriAI/litellm/issues/24512] * Attack breakdown: https://futuresearch.ai/blog/litellm-pypi-supply-chain-attack/ [https://futuresearch.ai/blog/litellm-pypi-supply-chain-attack/] * LLM attack techniques: https://llm-attacks.org/ [https://llm-attacks.org/] * OWASP LLM Top 10: https://owasp.org/www-project-top-10-for-large-language-model-applications/ [https://owasp.org/www-project-top-10-for-large-language-model-applications/] * Gandalf challenge: https://gandalf.lakera.ai/ [https://gandalf.lakera.ai/] We’ve launched a Patreon for Domesticating AI 🎉 Get: * early access to episodes * behind-the-scenes content * bloopers and uncut moments 👉 https://patreon.com/DomesticatingAIPodcast [https://patreon.com/DomesticatingAIPodcast] * 🎥 YouTube: https://youtu.be/HTTxE7Y1sko [https://youtu.be/HTTxE7Y1sko] What’s the weirdest way an AI system has broken for you? Keep your AI on a leash.

24. apr. 202643 min

Stop Building AI Agents: Build Harnesses Instead | Hamza Tahir (ZenML / Kitaru)

Beskrivelse

Kommentarer

1 måned kun 9 kr.

Alle episoder