My Weird Prompts

Embedding Models vs LLMs: What Actually Connects?

27 min · 4. juli 2026
episode Embedding Models vs LLMs: What Actually Connects? cover

Description

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

Comments

0

Be the first to comment

Sign up now and become a member of the My Weird Prompts community!

Get Started

1 month for 9 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

300 episodes

episode Embedding Models vs LLMs: What Actually Connects? artwork

Embedding Models vs LLMs: What Actually Connects?

Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still releasing if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs — they're different neural networks trained for different objectives, and any embedding works with any LLM. But that clean answer makes the second question more urgent: the real innovation in embedding models isn't about general benchmarks — it's about fixing specific failure modes like domain specialization, multilingual alignment, and silent drift that only show up at scale. We also unpack the "silent drift" problem where an auto-embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.

4. juli 202627 min