Cover image of show You've Been a Bad Agent

You've Been a Bad Agent

Podcast by Wilhelm Klopp & Matt Carey

English

Technology & science

Limited Offer

2 months for 19 kr.

Then 99 kr. / monthCancel anytime.

  • 20 hours of audiobooks / month
  • Podcasts only on Podimo
  • All free podcasts
Get Started

About You've Been a Bad Agent

Wil and Matt discuss tech, startups, and building really cool things with AI. Sometimes joined by (actual expert) friends.

All episodes

26 episodes

episode Anthropic×{Karpathy,Pope}, Themes Will Pwn You, The Warmth of a Private Subnet, Slack is Back, $100B For Charity artwork

Anthropic×{Karpathy,Pope}, Themes Will Pwn You, The Warmth of a Private Subnet, Slack is Back, $100B For Charity

* Shai-Hulud and GitHub Actions: trusted publishing is no longer the gold standard, just additive * VS Code extensions as the next attack surface: 50M install themes that can shell out and auto-update silently * Matt's AI-native package manager pitch: vendor everything, LLM-upgrade your deps by replaying upstream commits * Tailscale and Cloudflare Mesh maxing: the warmth of the orange cloud over the cold open internet * Google's Santa is now Northpole Santa, and Wilhelm runs it on the Mac mini * Wilhelm is enamoured with atc.com: Enhanced Radar's $7M raise and following his own LAX delay live from the gate * Amp Labs' "Software After Software" manifesto and Amp Neo as remote enterprise AI engineering * Cloud agent adoption is still slower than we expected, and the amphetamine.app commute hack * Karpathy joins Anthropic pre-training  * Pope Leo XIV's Magnifica Humanitas: Anthropic's Chris Olah on stage at the Vatican on May 25 * Chinese token resellers at 3 to 4 cents on the dollar and the RL distillation theory behind it * PyCon vibe check: more handwritten Python than expected, and "AI is just a tool" is still a common take * Slack is back: Benioff finally gets it, and messaging is the only sane UI for multi-agent work * Eric Ries on Incorruptible, founder control at Cloudflare and Anthropic, and the Philip Morris stress test * Nan Ransohoff's $370B philanthropy wave and the pitch to pay nonprofit talent in Anthropic equity * Will Manidis on Grindslop Links * Andrej Karpathy's joining Anthropic: https://x.com/karpathy/status/2056753169888334312 [https://x.com/karpathy/status/2056753169888334312] * Pope Leo XIV's first encyclical Magnifica Humanitas:https://www.vaticannews.va/en/pope/news/2026-05/pope-leo-xiv-first-encyclical-magnifica-humanitas.html [https://www.vaticannews.va/en/pope/news/2026-05/pope-leo-xiv-first-encyclical-magnifica-humanitas.html] * Enhanced Radar: https://www.ycombinator.com/companies/enhanced-radar [https://www.ycombinator.com/companies/enhanced-radar] * ATC app: https://www.atc.com/ [https://www.atc.com/] * Amp Labs: https://amplabs.com/ [https://amplabs.com/] * Will Manidis, On Grindslop: https://minutes.substack.com/p/on-grindslop [https://minutes.substack.com/p/on-grindslop] * Nan Ransohoff, The Third Wave of American Philanthropy: https://nanransohoff.substack.com/p/the-third-wave-of-american-philanthropy [https://nanransohoff.substack.com/p/the-third-wave-of-american-philanthropy] * Eric Ries on Lenny's Podcast (Incorruptible): https://www.lennysnewsletter.com/p/how-to-build-a-company-that-withstands [https://www.lennysnewsletter.com/p/how-to-build-a-company-that-withstands] * Incorruptible by Eric Ries: https://www.incorruptible.co/ [https://www.incorruptible.co/] * Google Santa (archived): https://github.com/google/santa [https://github.com/google/santa] * Northpole Santa (active fork): https://github.com/northpolesec/santa [https://github.com/northpolesec/santa] * 10k people have the same idea within days of each other: https://substack.com/@contraptions/note/c-255396946

22 May 2026 - 1 h 26 min
episode AI Barista Orders 120 Eggs With No Stove, Matt Pocock Skills, Build for the Next Model, The Gervais Principle artwork

AI Barista Orders 120 Eggs With No Stove, Matt Pocock Skills, Build for the Next Model, The Gervais Principle

* Almost one year of the pod: Wilhelm's San Francisco Peter Pan arc, Matt's Cloudflare and Lisbon move * The backpack-at-a-party SF meme * Matt's Lisbon coincidence * Pieter Levels appreciation: dehumidifier stack, shrimp mode, accelerationist Portugal takes * Nat Friedman at Stripe Sessions: this is the slow part of AI progress * Andon Labs' Stockholm cafe: Mona orders 120 eggs (no stove), 22.5kg canned tomatoes, the staff's hall of shame * The capability overhang is still real: Anthropic adding $10B ARR a month and we still can't prompt properly * AI's last frontier: low-volume decisions with no revert button? * The Gervais Principle in tech * Chad V2: iOS Live Activities as a surface for top priority, next deadline, and a daily magic slot * Cascading hallucinations: when the agent writes its mistake to a file and treats it as gospel * Matt Pocock's skills: five lines of prose that completely change how Claude behaves * Skills as a new prompt-poisoning attack vector: models treat them like system prompt * Matt on scripted skills: markdown explains the script, the script does the deterministic work * Homework from Matt: what physical job would you try to get Claude to automate? Links: * Andon Labs official writeup (Mona, the Stockholm cafe): https://andonlabs.com/blog/ai-cafe-stockholm [https://andonlabs.com/blog/ai-cafe-stockholm] * Simon Willison's coverage of the cafe: https://simonwillison.net/2026/May/5/our-ai-started-a-cafe-in-stockholm/ [https://simonwillison.net/2026/May/5/our-ai-started-a-cafe-in-stockholm/] * Matt Pocock skills (the GitHub repo, ~48k stars): https://github.com/mattpocock/skills [https://github.com/mattpocock/skills] * Stripe Sessions: Nat Friedman & Daniel Gross with the Collisons (the Golden Age of Tinkering convo): https://www.youtube.com/watch?v=I-ldITom1cg [https://www.youtube.com/watch?v=I-ldITom1cg] * Stripe Sessions: John Collison's "Indexing the Economy" (state of the internet economy talk, indie hackers shoutout): https://www.youtube.com/watch?v=-vRY2dtD7iQ [https://www.youtube.com/watch?v=-vRY2dtD7iQ] * The Gervais Principle (Venkatesh Rao, Ribbonfarm 2009): https://www.ribbonfarm.com/2009/10/07/the-gervais-principle-or-the-office-according-to-the-office/ [https://www.ribbonfarm.com/2009/10/07/the-gervais-principle-or-the-office-according-to-the-office/] * NYT on Chinese peptides + the Frontier Tower rave: https://www.nytimes.com/2026/01/03/business/chinese-peptides-silicon-valley.html [https://www.nytimes.com/2026/01/03/business/chinese-peptides-silicon-valley.html] * Frontier Tower: https://frontiertower.io/ [https://frontiertower.io/] * Cloudflare Code Mode (Matt Carey): https://blog.cloudflare.com/code-mode-mcp/ [https://blog.cloudflare.com/code-mode-mcp/] * Cloudflare MCP Server Portals: https://blog.cloudflare.com/zero-trust-mcp-server-portals/ [https://blog.cloudflare.com/zero-trust-mcp-server-portals/] * Karpathy's LLM Wiki gist (the inspo for Chad V2's data architecture): https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f [https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f] * Karpathy's original LLM Wiki tweet: https://x.com/karpathy/status/2039805659525644595 [https://x.com/karpathy/status/2039805659525644595] * Pieter Levels: https://x.com/levelsio [https://x.com/levelsio] * Chris Bakke: https://x.com/ChrisJBakke [https://x.com/ChrisJBakke]

9 May 2026 - 50 min
episode The Golden Age of Tinkering, Reachy Mini on Qwen+Cerebras, Compute Predictions, Fuzzing Artifacts, Flatter orgs artwork

The Golden Age of Tinkering, Reachy Mini on Qwen+Cerebras, Compute Predictions, Fuzzing Artifacts, Flatter orgs

* Wilhelm's Reachy Mini: Time to first token beats tokens per second once embodiment is in the room * Lukas the sleep score assassin * Matt's Portugal week: wing foiling the Óbidos lagoon * Airbnb-style booking page for Matt's spare room on Cloudflare Workers, Email open beta, Turnstile * For scale: Cloudflare market cap $85B, Cursor at a rumored $60B, GitHub sold to Microsoft for $8B, Cursor's real moat is the best non-lab RL trace dataset * Nat Friedman and Daniel Gross at Stripe Session * Geoffrey Huntley's PyCon LT keynote: flatter orgs, every director managing 50 agents, max 5 hops from the CEO * Matt's counter to the compute panic: cloud isolates and shared sandboxes, not a billion persistent Raspberry Pis * Chad V2 plans * Cloudflare Artifacts shipped in under 3 weeks with its own fuzzer * Prompt injection risks Links * Reachy Mini — https://www.pollen-robotics.com/reachy-mini [https://www.pollen-robotics.com/reachy-mini] * Cerebras Inference — https://cerebras.ai/inference [https://cerebras.ai/inference] * ElevenLabs Scribe v2 — https://elevenlabs.io/speech-to-text [https://elevenlabs.io/speech-to-text] * friends.fyi — https://friends.fyi [https://friends.fyi] * Cloudflare Hyperdrive — https://developers.cloudflare.com/hyperdrive/ [https://developers.cloudflare.com/hyperdrive/] * Cloudflare Email Routing / Sending — https://developers.cloudflare.com/email-routing/ [https://developers.cloudflare.com/email-routing/] * Cloudflare Turnstile — https://www.cloudflare.com/products/turnstile/ [https://www.cloudflare.com/products/turnstile/] * Cofounder — https://cofounder.co [https://cofounder.co] * Stripe Sessions: Nat Friedman & Daniel Gross with the Collisons — https://www.youtube.com/watch?v=I-ldITom1cg [https://www.youtube.com/watch?v=I-ldITom1cg] * Geoffrey Huntley, "Software Development Now Costs Less Than Minimum Wage" (PyCon LT) — https://www.youtube.com/watch?v=6zQTQ4iVaKg [https://www.youtube.com/watch?v=6zQTQ4iVaKg]

5 May 2026 - 1 h 14 min
episode Cloudflare Ships GitHub for Agents, Hardware Startups Shouldn’t Ship Apps, Matt's Three-Week MCP World Tour, Bad Boy Browser, Inbound Prompt Injection artwork

Cloudflare Ships GitHub for Agents, Hardware Startups Shouldn’t Ship Apps, Matt's Three-Week MCP World Tour, Bad Boy Browser, Inbound Prompt Injection

- Matt's three-week MCP tour: MCP Maintainers Day, MCP Dev Summit NYC, AI Engineer London - Opus 4.7 drops with a new tokenizer, Matt's theory: it's secretly the smaller Mythos base model that "didn't cook fully" - Inbound agents and prompt injection: Anthropic's Routines, OpenAI's free moderation endpoint - Open-source idea of the week: a public list of prompt injection heuristics — Zod for prompts - StackOne's 200M param prompt-injection model - BB Browser (Bad Boy Browser) - Cloudflare Agents Week: Artifacts, Project Think, Emails, and much more - Cloudflare Artifacts: a Git API for a world where one org has billions of repos, not millions - Sunil Pai's AI Engineer talk: the shift to agents isn't incremental, it's a new compute unit and a new internet - Hardware startups should ship MCP servers, not apps - One year of Johnny Ive × OpenAI: rumors of no screen - Claude Code's worktree upgrade is actually good now - Stateless MCP vs CLI: MCP is opinionated OpenAPI + auth, and CLIs are weirdly MORE stateful - Rage-bait short-form clips generated by Opus 4.7, but stop watching and go listen to the whole pod Links: - https://blog.cloudflare.com/project-think/ - https://github.com/epiral/bb-browser - https://blog.cloudflare.com/artifacts-git-for-agents-beta/

17 Apr 2026 - 1 h 19 min
episode Pwning Your Friends' Agents Is Good Manners, Make Something Agents Want, OpenAI Buys TBPN, MCP Goes Stateless, Cloudflare Agents Week with Sunil Pai artwork

Pwning Your Friends' Agents Is Good Manners, Make Something Agents Want, OpenAI Buys TBPN, MCP Goes Stateless, Cloudflare Agents Week with Sunil Pai

* Claude Code harness lockdown: who got the email and who didn't * Chad mobile app and the Tinder-style agent review queue concept * Sunil's productivity crisis: bonk-driven development and the side project drought * The AI murder mystery party game that can't get built * Thomas's Raspberry Pi agent vs the kitchen kettle * Matt phishing Thomas's MCP server via DCR and a Rick Roll * TBPN acquired by OpenAI: media play or IPO narrative? * Generative UI: declarative JSON vs just letting the model write code * Kenton's tic-tac-toe canvas demo (and who actually came up with it) * Worker Bundler and Cloudflare Agents Week preview * Executable oracles: constraining LLM degrees of freedom for better output - https://john.regehr.org/writing/zero_dof_programming.html * Karpathy's LLM Wiki and agents pinging each other's knowledge bases * friends.fyi and actors.dev: building services for agents, not humans * Sunil's Pi agent: surf webcam monitoring, macro tracking, and sleep score graphs * MCP going stateless: sessions removed, elicitations rewritten as multi-turn tool calls

6 Apr 2026 - 1 h 8 min
En fantastisk app med et enormt stort udvalg af spændende podcasts. Podimo formår virkelig at lave godt indhold, der takler de lidt mere svære emner. At der så også er lydbøger oveni til en billig pris, gør at det er blevet min favorit app.
En fantastisk app med et enormt stort udvalg af spændende podcasts. Podimo formår virkelig at lave godt indhold, der takler de lidt mere svære emner. At der så også er lydbøger oveni til en billig pris, gør at det er blevet min favorit app.
Rigtig god tjeneste med gode eksklusive podcasts og derudover et kæmpe udvalg af podcasts og lydbøger. Kan varmt anbefales, om ikke andet så udelukkende pga Dårligdommerne, Klovn podcast, Hakkedrengene og Han duo 😁 👍
Podimo er blevet uundværlig! Til lange bilture, hverdagen, rengøringen og i det hele taget, når man trænger til lidt adspredelse.

Choose your subscription

Most popular

Limited Offer

Premium

20 hours of audiobooks

  • Podcasts only on Podimo

  • No ads in Podimo shows

  • Cancel anytime

2 months for 19 kr.
Then 99 kr. / month

Get Started

Premium Plus

Unlimited audiobooks

  • Podcasts only on Podimo

  • No ads in Podimo shows

  • Cancel anytime

Start 7 days free trial
Then 129 kr. / month

Start for free

Only on Podimo

Popular audiobooks

Get Started

2 months for 19 kr. Then 99 kr. / month. Cancel anytime.