The Superintelligence Podcast
NVIDIA’s Kari Briski joins Kim Isenberg live from GTC 2026 to break down Nemotron 3 Super — a 120B parameter model with a hybrid Mamba-2/Transformer/MoE architecture, 1M token context, and 5x throughput gains. They go deep on what makes it different, why NVIDIA released the full training recipe, and what the new Nemotron Coalition signals about where enterprise AI is heading.
5 episodes
Comments
0Be the first to comment
Sign up now and become a member of the The Superintelligence Podcast community!