Ep. 010 - How Much Do GPUs Really Cost, and Where Does the Value Go? (AI Cloud TCO) | Jordan Nanos, Dan Nishball, Kang Wen Cheang, Zane Fong

Description

This episode features Jordan Nanos (@JordanNanos) and Daniel Nishball (@dnishball) breaking down the economics of GPU clusters through real-world data and experience. Joined by Kang Wen Cheang and Zane Fong, they move beyond theoretical TCO models and examine how reliability differences between top-tier and lower-tier providers create significant cost disparities that aren't captured in simple per-GPU pricing. The discussion introduces practical frameworks for measuring goodput and understanding how system failures cascade through entire training jobs. Nanos walks through the mechanics of fault-tolerant frameworks, including AWS's Checkpointless Training, and explains why a single GPU failure can halt progress across hundreds of nodes. The conversation reveals how hyperscalers and NeoClouds price their services and why paying premium rates for reliable infrastructure often delivers better value than chasing the lowest per-hour costs. Subscribe to SemiAnalysis for in-depth analysis of AI hardware economics and infrastructure trends that impact the entire semiconductor ecosystem.

Ep. 011 - GPT 5.5 vs Claude 4.7: OpenAI's Comeback From the Brink (Tokenomics) | Jordan Nanos, Dylan Patel, Doug O'Laughlin, Max Kan

OpenAI was in serious trouble at the beginning of this year. Anthropic's Claude Opus 4.5 release had prompted a wave of developers to adopt Claude Code, pushing Anthropic's revenue past OpenAI's on a like-for-like basis by April. OpenAI's GPT 5.4 response was such an embarrassment that they didn't even compare it to Claude in their model release card. Then came GPT 5.5 - finally back on the frontier, but is it enough to reclaim the crown?

Jordan Nanos (@JordanNanos), Dylan Patel (@Dylan522p), Doug O'Laughlin (@FabricatedKnowledge), and Max Kan (@maxkan_) break down the latest AI model wars, from Claude 4.7's coding dominance to DeepSeek's long-delayed v4 release and what it reveals about China's AI capabilities. They analyze token efficiency, benchmark gaming, and why fast mode might be fake news. Subscribe for weekly deep dives into the semiconductor and AI infrastructure powering the future.

The Coding Assistant Breakdown [https://newsletter.semianalysis.com/p/the-coding-assistant-breakdown-more?_gl=1*1kdxhrs*_ga*MTY1NDExMjk2Ny4xNzc2MTIzOTQ1*_ga_FKWNM9FBZ3*czE3NzgwMjUzODEkbzI2JGcwJHQxNzc4MDI1MzgxJGo2MCRsMCRoMjEyNzMwMzMyNw..]
AI Value Capture [https://newsletter.semianalysis.com/p/ai-value-capture-the-shift-to-model?_gl=1*1x68y7d*_ga*MTY1NDExMjk2Ny4xNzc2MTIzOTQ1*_ga_FKWNM9FBZ3*czE3NzgwMjUzODEkbzI2JGcwJHQxNzc4MDI1MzgxJGo2MCRsMCRoMjEyNzMwMzMyNw..]

Timestamps:
00:00 OpenAI's Comeback and the Latest AI Model Wars
04:05 The High Cost of AI Models and Fast Mode Effectiveness
08:16 When AI Tokens Become Too Expensive for Tasks
13:11 Why AI Model Quality Degrades and Benchmarks Fail
18:42 Deep Dive into Claude 4.7 Features and Tokenizer Changes
25:29 DeepSeek's Release and China's AI Compute Constraints
28:20 The Future of Context Windows and Agent Orchestration
30:47 The Great Debate: CLI vs. App for AI Interaction
36:33 Debunking AI Fake News and Context Window Limitations
40:51 The AI Race: China, Meta, and the Neo Cloud Vision
43:46 Final Thoughts and Listener Feedback Request

May 6, 2026 · 45 min
