OpenAI’s Chip Rebellion: Why It’s Breaking Free From Nvidia’s Grip

Descripción

Here’s the thing about OpenAI’s new deal with Broadcom that actually matters: they’re not just buying different chips (boring), they’re fundamentally reshaping how AI companies think about hardware control. And honestly? This feels like the moment the industry collectively realized it was too dependent on one company’s supply chain. Let me back up. OpenAI just struck a deal to develop custom AI chips optimized specifically for inference—that’s the moment when ChatGPT actually runs and talks to you, as opposed to the brutal training phase where models learn from mountains of data. This isn’t exactly shocking news (companies diversify suppliers, tale as old as time), but the timing and the target reveal something genuinely strategic happening. Here’s the framework for understanding why this matters: Think of AI infrastructure in two phases. Training is the heavyweight championship—it demands absolute raw computational power, which is why Nvidia’s H100 GPUs have basically owned this space. Nvidia crushed it here because of CUDA, their software ecosystem, and years of engineering prowess. OpenAI isn’t ditching Nvidia for training (that would be insane). But inference Broadcom gets this. They’ve built custom silicon for hyperscalers before (Google’s TPUs ring a bell?), and they know how to engineer chips that do one job really, really well. The application-specific integrated circuits (ASICs) they’re building for OpenAI will be optimized to the point that they’re probably more efficient than general-purpose GPUs for inference workloads. Translation: same performance, lower power consumption, massive cost savings when you’re running ChatGPT for millions of users simultaneously. What’s wild is that Sam Altman has been publicly signaling this move for months. He’s talked openly about the need for “more hardware options” and “multiple chip architectures.” This wasn’t a secret—it was a warning to Nvidia that the moat was eroding. And look, I’m not here to bury Nvidia (their training dominance is still absurd), but the inference market is massive, and spreading that load across custom silicon? That’s rational infrastructure thinking. The broader context here: We’re watching the AI industry mature past its “just buy whatever Nvidia has” phase. Supply constraints have been real (remember when everyone was fighting over H100s?), costs are astronomical, and companies with billions at stake need redundancy. OpenAI, Google, Meta—they’re all building custom silicon because relying on one vendor during an arms race is actually reckless. No timeline or financials have been disclosed (because of course not), but this move signals something important: the race isn’t just about who builds the best models anymore. It’s about who controls the entire stack—models, software, and now hardware. That’s where the real competitive moat gets built. Watch this space. When custom inference chips start delivering real cost advantages, we’ll see more of this. And suddenly Nvidia’s dominance looks a little less inevitable. Source: The Wall Street Journal Want more than just the daily AI chaos roundup? I write deeper dives and hot takes on my Substack (because apparently I have Thoughts about where this is all heading): limitededitionjonathan on Substack

Gemini Robotics 1.5: Google DeepMind Just Cracked the Code on Agentic Robots

Look, I know another AI model announcement sounds boring (trust me, I’ve written about 47 of them this month), but Google DeepMind just dropped something that actually made me sit up and pay attention. Their new Gemini Robotics 1.5 isn’t just another incremental upgrade—it’s a completely different approach to making robots that can think, plan, and adapt like actual agents in the real world. Here’s what’s wild: instead of trying to cram everything into one massive model (which, let’s be honest, has been the industry’s default approach), DeepMind split embodied intelligence into two specialized models. The ERVLA stack pairs Gemini Robotics-ER 1.5 for high-level reasoning with Gemini Robotics 1.5 for low-level motor control. Think of it like giving a robot both a strategic brain and muscle memory that can actually talk to each other. The “embodied reasoning” model (ER) handles the big picture stuff—spatial understanding, planning multiple steps ahead, figuring out if a task is actually working or failing, and even tool use. Meanwhile, the visuomotor learning agent (VLA) manages the precise hand-eye coordination needed to actually manipulate objects. The genius part? They can transfer skills between completely different robot platforms without starting from scratch. What does this look like in practice? These robots can now receive a high-level instruction like “prepare this workspace for the next task” and break it down into concrete steps: assess what’s currently there, determine what needs to move where, grab the right tools, and execute the plan while monitoring progress. If something goes wrong (like a tool slips or an object isn’t where expected), the reasoning model can replan on the fly. The technical breakthrough here is in the bidirectional communication between the two models. Previous approaches either had rigid, pre-programmed behaviors or tried to learn everything end-to-end (which works great in simulation but falls apart when you meet real-world complexity). This stack lets robots maintain both flexible high-level reasoning and precise low-level control. Here’s the framework for understanding why this matters: we’re moving from “task-specific robots” to “contextually intelligent agents.” Instead of programming a robot to do one thing really well, you can give it general capabilities and let it figure out how to apply them to novel situations. That’s the difference between a really good assembly line worker and someone who can walk into any workspace and immediately start being useful. The implications are pretty staggering when you think about it. Manufacturing environments that need flexible reconfiguration, household robots that can adapt to different homes and tasks, research assistants in labs that can understand experimental protocols—we’re talking about robots that can actually collaborate with humans rather than just following pre-written scripts. DeepMind demonstrated the system working across different robot embodiments, which solves one of the biggest practical problems in robotics: the fact that every robot design requires starting over with training. Now you can develop skills on one platform and transfer them to others, which could dramatically accelerate deployment timelines. This feels like one of those moments where we look back and say “that’s when robots stopped being fancy automation and started being actual agents.” The combination of spatial reasoning, dynamic planning, and transferable skills wrapped in a system that can actually explain what it’s doing? That’s not just an incremental improvement—that’s a fundamental shift in what’s possible. Read more from MarkTechPost [https://www.marktechpost.com/2025/09/28/gemini-robotics-1-5-deepminds-er%e2%86%94vla-stack-brings-agentic-robots-to-the-real-world/] Want more than just the daily AI chaos roundup? I write deeper dives and hot takes on my Substack (because apparently I have Thoughts about where this is all heading): https://substack.com/@limitededitionjonathan

28 de sep de 20250

OpenAI’s Chip Rebellion: Why It’s Breaking Free From Nvidia’s Grip

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios