Inside Nemotron: NVIDIA’s Kari Briski on the Architecture Reshaping Enterprise AI

19 min · Apr 14, 2026

Description

NVIDIA’s Kari Briski joins Kim Isenberg live from GTC 2026 to break down Nemotron 3 Super — a 120B parameter model with a hybrid Mamba-2/Transformer/MoE architecture, 1M token context, and 5x throughput gains. They go deep on what makes it different, why NVIDIA released the full training recipe, and what the new Nemotron Coalition signals about where enterprise AI is heading.

All episodes

7 episodes

NVIDIA’s Quantum Computing Strategy with Sam Stanwyck (NVIDIA)

In this episode, Kim Isenberg sits down with NVIDIA’s Sam Stanwyck at ISC to discuss one of the most misunderstood frontiers in technology: quantum computing. Sam leads NVIDIA’s quantum computing product team, where he focuses on how accelerated computing, GPUs, AI, and software tools like CUDA-Q can help move quantum computing from research toward practical applications. The conversation explores why NVIDIA is not building its own quantum computer, but instead working on the infrastructure around quantum systems: simulation, control, error correction, hybrid CPU/GPU/QPU workflows, and the software stack needed to make quantum computing useful. They also discuss where quantum computing stands today, what real scientific and product value could emerge first, and why areas like chemistry, materials science, energy, optimization, and fundamental research are central to the long-term promise of the field. A grounded conversation about quantum computing beyond the hype — and how NVIDIA sees its role in building the next generation of accelerated computing.

Jul 9, 2026 · 19 min

Notion’s Co-Founder on the Rise of AI Agent Workspaces

In this exclusive Superintelligence interview, Kim Isenberg and Peter Thum sit down with Akshay Kothari, Co-Founder of Notion, to discuss how Notion is evolving from a notes and productivity app into an agent-first workspace. The conversation explores how humans, custom code, and AI agents could soon collaborate side by side inside the same operating layer for work. Akshay explains why Notion’s template ecosystem became such a powerful unlock, how AI agents can automate busy work without replacing human judgment, and why the future of work may be less about headcount and more about outcomes. We also discuss Notion Workers, internal AI agents like “Smilers,” self-improving knowledge bases, model optionality, and how specialized expertise could spread across entire organizations through shareable custom agents. A conversation about the next phase of software, the future of productivity, and what work looks like when AI becomes part of the team.

Jun 22, 2026 · 54 min

Google DeepMind on Local Models, Open Source & the Future of AI Competition

In this exclusive interview from Google I/O, I speak with Omar Sanseviero and Paige Bailey from Google DeepMind about the rapidly evolving AI landscape. We discuss the rise of local models, the growing importance of open source and open models, the role of developer communities, and how global competition — especially from China — is shaping the next phase of artificial intelligence. A conversation about where AI is heading next: from frontier labs to local inference, from closed systems to open ecosystems, and from model releases to real-world developer adoption.

Jun 9, 2026 · 27 min

LTX CEO Zeev Farbman on Open AI Video Models, Local Inference, and the Future of Creative AI

In this episode, Superintelligence Editor-in-Chief Kim Isenberg speaks with Zeev Farbman, CEO and co-founder of Lightricks/LTX, about the future of AI video, open foundation models, and local creative workflows. Farbman explains why LTX is betting on open weights, local inference, and efficient models optimized for NVIDIA GPUs, and why closed API models may be a long-term problem for developers, studios, and enterprises. The conversation also covers Lightricks’ strategic restructuring, competition with Big Tech, the current AI hype cycle, upcoming LTX updates, and the vision that AI models could eventually replace traditional rendering engines. A conversation about open AI infrastructure, multimodality, creative production, and the next stage of generative video AI.

May 28, 2026 · 42 min

"Your Calendar Is Leaking Revenue" — How SkipUp's AI Agent Kills Scheduling Forever

Every company has a scheduling problem. Most just don't know how expensive it is. The coordination tax on mid-market companies runs up to $4,500 per employee per year, and up to 70% of inbound leads never even make it to a booked meeting. In this episode, Superintelligence Editor-in-Chief Kim sits down with SkipUp co-founders Dheer and Sasha to unpack why scheduling is still fundamentally broken in 2026 and how their AI agent is replacing the entire back-and-forth. SkipUp doesn't send a booking link and wait. It lives inside your email thread, reads context, proposes times across calendars and time zones, follows up autonomously, and books the meeting. No forms, no friction, no lost deals. We talk about why traditional scheduling tools like Calendly hit a ceiling, how a two-person team built an email-native AI agent in months, the hidden revenue impact of every meeting that doesn't happen, and what work looks like when the coordination layer is fully automated.

May 14, 2026 · 43 min
