The Second Brain AI Podcast ✨🧠
Send us a text [https://www.buzzsprout.com/twilio/text_messages/2507380/open_sms] Why do LLMs still give different answers even with temperature set to zero? In this episode of The Second Brain AI Podcast, we unpack new research from Thinking Machines Lab on defeating nondeterminism in LLM inference. We cover the surprising role of floating-point math, the real system-level culprit, lack of batch invariance, and how redesigned kernels can finally deliver bit-identical outputs. We also explore the trade-offs, real-world implications for testing and reliability, and how this breakthrough enables reproducible research and true on-policy reinforcement learning. Sources: * Defeating Nondeterminism in LLM Inference [https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/] * Non-Determinism of “Deterministic” LLM Settings [http://arxiv.org/html/2408.04667v4]
10 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Second Brain AI Podcast ✨🧠!