Imagen de portada del programa Decode AI

Decode AI

Podcast de wendy zhang

inglés

Tecnología y ciencia

Empieza 7 días de prueba

$99 / mes después de la prueba.Cancela cuando quieras.

  • 20 horas de audiolibros al mes
  • Podcasts solo en Podimo
  • Podcast gratuitos
Prueba gratis

Acerca de Decode AI

Decode AI is a podcast that demystifies the rapidly evolving world of artificial intelligence. Join us to explore the latest AI tools, technologies, and trends.From breaking down new models and frameworks, to reviewing cutting-edge AI products, to candid conversations with founders and researchers at the frontier of innovation—Decode AI delivers insight, clarity, and inspiration for anyone building or curious about intelligent systems.Whether you’re an engineer, entrepreneur, or simply AI-curious, tune in to stay ahead of the curve.

Todos los episodios

1 episodios

episode When AI doesn't know: Decode the uncertainty behind LLM artwork

When AI doesn't know: Decode the uncertainty behind LLM

In today's rapidly evolving AI landscape, Large Language Models (LLMs) are revolutionising natural language generation (NLG) tasks, from answering complex questions to summarising vast amounts of information. But a crucial question remains: how can we truly trust the outputs of these powerful foundation models? This episode delves into the unique challenges of measuring uncertainty in free-form natural language generation. We'll explore the concept of 'semantic equivalence' – where different sentences can express the exact same meaning (e.g., "France's capital is Paris" vs. "Paris is France's capital"). Existing methods often fall short because they focus on token-level confidence, ignoring this critical linguistic nuance. Discover Semantic Entropy, a groundbreaking, unsupervised method designed to overcome these challenges. This innovative approach measures uncertainty in the "meaning-space", rather than just the sequence of words. We'll explain how it works by: • Sampling diverse answers from the LLM. • Clustering these answers based on shared meaning using a novel bi-directional entailment algorithm. This algorithm determines if sentences logically imply each other within the given context. • Estimating uncertainty over these distinct meanings. Learn why Semantic Entropy offers better prediction of model accuracy on high-stakes, free-form question answering datasets like TriviaQA and CoQA, outperforming comparable baselines. A key advantage is its "out-of-the-box" compatibility with existing LLMs like OPT, requiring no additional training or modifications, making it highly reproducible and accessible for researchers. This research is vital for building safer AI systems, helping users understand the reliability of AI-generated content and mitigating potential harms such as the propagation of false or misleading information. Tune in to grasp the future of AI trustworthiness and the linguistic insights driving it.

14 de jul de 2025 - 15 min
Regístrate para escuchar
Muy buenos Podcasts , entretenido y con historias educativas y divertidas depende de lo que cada uno busque. Yo lo suelo usar en el trabajo ya que estoy muchas horas y necesito cancelar el ruido de al rededor , Auriculares y a disfrutar ..!!
Muy buenos Podcasts , entretenido y con historias educativas y divertidas depende de lo que cada uno busque. Yo lo suelo usar en el trabajo ya que estoy muchas horas y necesito cancelar el ruido de al rededor , Auriculares y a disfrutar ..!!
Fantástica aplicación. Yo solo uso los podcast. Por un precio módico los tienes variados y cada vez más.
Me encanta la app, concentra los mejores podcast y bueno ya era ora de pagarles a todos estos creadores de contenido

Elige tu suscripción

Más populares

Premium

20 horas de audiolibros

  • Podcasts solo en Podimo

  • Disfruta los shows de Podimo sin anuncios

  • Cancela cuando quieras

Empieza 7 días de prueba
Después $99 / mes

Prueba gratis

Sólo en Podimo

Audiolibros populares

Prueba gratis

Empieza 7 días de prueba. $99 / mes después de la prueba. Cancela cuando quieras.