Rubber Duck Radio

Why 95% of AI Pilots Fail (And How to Fix It)

1 h 0 min · 5. juni 2026
episode Why 95% of AI Pilots Fail (And How to Fix It) cover

Description

Tim kicks things off with an AWS agent nightmare that couldn't tell dev from prod, sparking a deep dive into where deterministic pipelines end and true LLM reasoning begins. Using a clever flight-tracking case study, the hosts map out when to use frontier models, local open-weight models, or no AI at all—then connect it all to an MIT study showing 95% of generative AI pilots fail to deliver profit, often because companies treat the API bill itself as a success metric. If you're wrestling with agentic vs. scripted workflows, bloated AI spend, or just an editor that can't keep up, this conversation offers a clearer lens for building with intention.

Comments

0

Be the first to comment

Sign up now and become a member of the Rubber Duck Radio community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

18 episodes

episode Fable 5 Banned: The Multi-Model Escape Plan artwork

Fable 5 Banned: The Multi-Model Escape Plan

Anthropic launched Claude Fable 5 with huge expectations, only to see the US government order it pulled globally three days later. Tim and Paul dig into the swirling conspiracy theories: was it retaliation for refusing to arm the Pentagon? Did a competitor exploit a jailbreak report to kneecap a rival? And did Anthropic’s own transparency accidentally hand over the rope? Then the conversation pivots to token anxiety, ballooning API costs, and the open-source models like GLM 5.2 and DeepSeek V4 Pro that now rival proprietary giants at a fraction of the price. The episode’s core insight: a three-stage workflow—planning with a flagship model, implementing with a cheap or local one, and reviewing with a third—lets developers escape single-point-of-failure risks and spiraling bills, and it's already taking shape across the coding community.

19. juni 20261 h 0 min