The Superintelligence Podcast
NVIDIA’s Kari Briski joins Kim Isenberg live from GTC 2026 to break down Nemotron 3 Super — a 120B parameter model with a hybrid Mamba-2/Transformer/MoE architecture, 1M token context, and 5x throughput gains. They go deep on what makes it different, why NVIDIA released the full training recipe, and what the new Nemotron Coalition signals about where enterprise AI is heading.
6 episoder
Kommentarer
0Vær den første til at kommentere
Tilmeld dig nu og bliv en del af The Superintelligence Podcast-fællesskabet!