My Weird Prompts

Embedding Models vs LLMs: What Actually Connects?

27 min · Gestern
Episode Embedding Models vs LLMs: What Actually Connects? Cover

Beschreibung

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

Kommentare

0

Sei die erste Person, die kommentiert

Melde dich jetzt an und werde Teil der My Weird Prompts-Community!

Loslegen

2 Monate für 1 €

Dann 4,99 € / Monat · Jederzeit kündbar.

  • Podcasts nur bei Podimo
  • 20 Stunden Hörbücher / Monat
  • Alle kostenlosen Podcasts

Alle Folgen

300 Folgen

Episode Embedding Models vs LLMs: What Actually Connects? Cover

Embedding Models vs LLMs: What Actually Connects?

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

Gestern27 min