My Weird Prompts

Embedding Models vs LLMs: What Actually Connects?

27 min · Ayer
Portada del episodio Embedding Models vs LLMs: What Actually Connects?

Descripción

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

Comentarios

0

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de My Weird Prompts!

Prueba gratis

Empieza 7 días de prueba

$99 / mes después de la prueba. · Cancela cuando quieras.

  • Podcasts solo en Podimo
  • 20 horas de audiolibros al mes
  • Podcast gratuitos

Todos los episodios

300 episodios

episode Embedding Models vs LLMs: What Actually Connects? artwork

Embedding Models vs LLMs: What Actually Connects?

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

Ayer27 min