The Superintelligence Podcast
NVIDIA’s Kari Briski joins Kim Isenberg live from GTC 2026 to break down Nemotron 3 Super — a 120B parameter model with a hybrid Mamba-2/Transformer/MoE architecture, 1M token context, and 5x throughput gains. They go deep on what makes it different, why NVIDIA released the full training recipe, and what the new Nemotron Coalition signals about where enterprise AI is heading.
5 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Superintelligence Podcast!