Nerd Snipe with Theo and Ben

We've been testing GPT-5.5 for a few weeks now...

1 h 44 min · 23. apr. 2026
episode We've been testing GPT-5.5 for a few weeks now... cover

Description

Anthropic month is finally over. We've got a ton to talk about: GPT-5.5 pre-release impressions, the Vercel hack, Cursor + xAI, Qwen models, Kimi k2.6, and so much more...Thank you to today's Sponsors! Depot, truly modern CI: nerdsnipe.link/depot [https://nerdsnipe.link/depot] Coderabbit, the ultimate AI code reviewer: nerdsnipe.link/coderabbit [https://nerdsnipe.link/coderabbit] Clerk, the auth platform with the best DX: nerdsnipe.link/clerk [https://nerdsnipe.link/clerk] TIMESTAMPS00:00:00 - Intro, vacation chaos, and episode setup 00:01:34 - Vercel hack explained and security fallout 00:05:38 - Kimi K2.6 and the rise of strong open-weight models 00:14:29 - Cursor + xAI/SpaceX partnership and acquisition option 00:37:24 - GPT Image 2 impressions, strengths, and flaws 00:43:53 - Anthropic Cloud Code Pro pricing controversy 00:49:30 - GPT-5.5 first impressions: split opinions 01:02:25 - Critiques of GPT-5.5 for coding, context, and prompting 01:22:02 - GPT-5.5 Pro

Comments

0

Be the first to comment

Sign up now and become a member of the Nerd Snipe with Theo and Ben community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

12 episodes

episode Our impressions of Claude Fable/Mythos (we filmed this before the ban) artwork

Our impressions of Claude Fable/Mythos (we filmed this before the ban)

RIP Fable 5. We recorded this before it got taken offline, but it's still worth talking about. The model is incredible. We really miss it. Thank you, Firecrawl, Depot, and Clerk for sponsoring! * Firecrawl, the best api for searching and crawling the web: nerdsnipe.link/firecrawl [https://nerdsnipe.link/firecrawl] * Depot, better CI in every way: nerdsnipe.link/depot [https://nerdsnipe.link/Depot] * Clerk, the best dx in auth: nerdsnipe.link/clerk [https://nerdsnipe.link/Clerk] Sources * https://x.com/thsottiaux/status/2043177597434306699 [https://x.com/thsottiaux/status/2043177597434306699] * https://cognition.ai/blog/frontier-code [https://cognition.ai/blog/frontier-code] * https://x.com/paradite_/status/2064585901351792887 [https://x.com/paradite_/status/2064585901351792887] * https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf ( [⁠https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf⁠ (]page 13 has the “prompt modification” quote) * https://www.anthropic.com/institute/recursive-self-improvement [https://www.anthropic.com/institute/recursive-self-improvement] Timestamps * 0:00 Intro * 3:29 Fable First Impressions * 10:24 Benchmark Drama * 26:42 Claude Code & Workflows * 36:07 Pricing & June 22 Cutoff * 39:54 Data Retention * 45:58 Hidden Safeguards * 1:07:01 The Claude Constitution

15. juni 20261 h 17 min
episode Now even Google's buying GPUs from SpaceX? artwork

Now even Google's buying GPUs from SpaceX?

Cloudflare buys Void0, Google buying up compute from xAI, and Claude seems to be getting more anxious so we're here to break down everything this week on another episode of Nerd Snipe! Thank you to Composio for sponsoring today's episode! * Composio, connect your agents to everything: https://nerdsnipe.link/composio [https://nerdsnipe.link/composio] Sources: * https://x.com/vite_js/status/2062525206158078047 [https://x.com/vite_js/status/2062525206158078047] * https://x.com/EdLudlow/status/2062970770612199542 [https://x.com/EdLudlow/status/2062970770612199542] * https://x.com/nrehiew_/status/2063099050719846719 [https://x.com/nrehiew_/status/2063099050719846719] * https://www.reddit.com/r/EconomyCharts/comments/1lp34n4/china_vs_us_energy/ [https://www.reddit.com/r/EconomyCharts/comments/1lp34n4/china_vs_us_energy/] * https://x.com/elonmusk/status/1963443919150330139 [https://x.com/elonmusk/status/1963443919150330139] * https://x.com/AnthropicAI/status/2062568862479208923 [https://x.com/AnthropicAI/status/2062568862479208923] * https://x.com/HSVSphere/status/2060396271756595666 [https://x.com/HSVSphere/status/2060396271756595666] * https://x.com/theo/status/2061018426152530232 [https://x.com/theo/status/2061018426152530232] * https://x.com/Teknium/status/2062522290504613944 [https://x.com/Teknium/status/2062522290504613944] 01:58 Cloudflare vs Vercel 13:03 Convex angle 19:59 SpaceX compute 35:49 AI self-improvement 44:45 Article reactions 50:28 Claude anxiety 57:42 Guardrails 01:08:21 Cursed image gen 01:19:18 Hermes agents

10. juni 20261 h 35 min
episode We (mostly) like Claude Opus 4.8 artwork

We (mostly) like Claude Opus 4.8

Opus 4.8 + a ton of new stuff in Claude Code dropped this week, and we actually kinda like it. There's also a new benchmark that's actually good, and we have a lot of thoughts about the future of these AI labs... Thank you PostHog and Clerk for sponsoring! * PostHog, the all in one suite of product tools: nerdsnipe.link/posthog [https://nerdsnipe.link/posthog] * Clerk, the best dx in auth: nerdsnipe.link/clerk [https://nerdsnipe.link/clerk] SOURCES * https://www.anthropic.com/news/claude-opus-4-8 [https://www.anthropic.com/news/claude-opus-4-8] * https://x.com/theo/status/2060120708815139241 [https://x.com/theo/status/2060120708815139241] * https://x.com/Baconbrix/status/2060065875911422343 [https://x.com/Baconbrix/status/2060065875911422343] * ⁠https://x.com/zoink/status/2060769829133721974 [https://x.com/Baconbrix/status/2060065875911422343] * https://x.com/maria_rcks/status/2060937270824153251 [https://x.com/Baconbrix/status/2060065875911422343] * https://x.com/_catwu/status/2060054180379689074 [https://x.com/_catwu/status/2060054180379689074] * https://x.com/datacurve/status/2060834005998793199 [https://x.com/datacurve/status/2060834005998793199] * https://deepswe.datacurve.ai/ [https://deepswe.datacurve.ai/] * https://deepswe.datacurve.ai/blog#results [https://deepswe.datacurve.ai/blog#results] * https://github.com/scaleapi/SWE-agent/blob/402a7b8fdac8193f3f255bb53859ba274234f596/config/benchmarks/anthropic_filemap_multilingual.yaml [https://github.com/scaleapi/SWE-agent/blob/402a7b8fdac8193f3f255bb53859ba274234f596/config/benchmarks/anthropic_filemap_multilingual.yaml] * https://deepswe.datacurve.ai/ [https://deepswe.datacurve.ai/] * https://x.com/AnthropicAI/status/2060061347522433422 [https://x.com/AnthropicAI/status/2060061347522433422] * https://x.com/theo/status/2060199299632472494 [https://x.com/theo/status/2060199299632472494] * https://x.com/theo/status/2060901326058561795 [https://x.com/theo/status/2060901326058561795] * https://x.com/38twelveDaily/status/2060408696945975631 [https://x.com/38twelveDaily/status/2060408696945975631] 00:00 - Shower Thoughts 02:44 - Deep SWE Benchmark 10:45 - Opus vs GPT-5.5 19:57 - Anthropic’s Huge Raise 25:39 - Token Maxing 40:02 - AI Slot Machine 43:49 - Claude Code Friction 50:01 - Opus, Mythos, and Safety

3. juni 20261 h 15 min
episode Google is Not a Serious Company artwork

Google is Not a Serious Company

Not only did Google accidentally ban Railway's account, but their new flagship model Gemini 3.5 Flash is absurdly bad. Oh and apparently Theo's building his own cloud... Thank you Macroscope and GT for sponsoring today's episode! * Macroscope: ⁠nerdsnipe.link/macroscope [https://nerdsnipe.link/macroscope] * General Translation: nerdsnipe.link/gt [https://nerdsnipe.link/gt] SOURCES ⁠https://x.com/theo/status/2057359424378097823⁠ [https://x.com/theo/status/2057359424378097823] ⁠https://x.com/JustJake/status/2056881510939283776⁠ [https://x.com/JustJake/status/2056881510939283776] ⁠https://x.com/KorduGG/status/2059141337895604626⁠ [https://x.com/KorduGG/status/2059141337895604626] ⁠https://x.com/unboringtech/status/2059144145273491610⁠ [https://x.com/unboringtech/status/2059144145273491610] TIMESTAMPS 00:00 - Gemini fallout 05:21 - Video gen 10:10 - Google Cloud 14:04 - Windsurf/Cursor 25:10 - Manus/Meta 30:00 - China lock-in 35:00 - Cursor/xAI 45:00 - Cloud workflows 50:32 - Lakebed 65:22 - Hermes/security

28. maj 20261 h 22 min