My Weird Prompts
Daniel asks two sharp questions about RAG pipelines: does your embedding model constrain which LLM you can use, and why are new embedding models still being released if embeddings feel like a solved problem? We break down the architectural decoupling between embedding models and LLMs: they are different neural networks trained for different objectives, and because the retriever hands the LLM plain text rather than vectors, any embedding model works with any LLM. But that clean answer makes the second question more urgent. The real innovation in embedding models isn't about general benchmarks; it's about fixing specific failure modes, like weak domain specialization and poor multilingual alignment, that only show up at scale. We also unpack the "silent drift" problem, where an automatic embedding model upgrade can quietly break retrieval without anyone noticing until support tickets spike.
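For readers who want to see the decoupling and the drift problem in concrete terms, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: `toy_embed` is a hypothetical hashing stand-in for a real embedding model, and `VectorIndex`, `build_prompt`, and the model ids `embed-v1` and `embed-v2` are invented names, not anything from the episode or a real library. The point it tries to show is that the retriever hands back plain text, so any LLM can consume the prompt, while the index itself stays tied to the one embedding model that built it.

```python
import hashlib
import math
from dataclasses import dataclass, field


def toy_embed(text: str, model_id: str, dim: int = 4096) -> list[float]:
    """Hypothetical stand-in for an embedding model (not a real API).

    Each word is hashed together with model_id, so two different model ids
    place the same text in unrelated vector spaces.
    """
    vec = [0.0] * dim
    for word in text.lower().split():
        digest = hashlib.sha256(f"{model_id}:{word}".encode()).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


@dataclass
class VectorIndex:
    """Stores (vector, text) rows plus the id of the model that embedded them."""

    model_id: str
    rows: list = field(default_factory=list)

    def add(self, text: str) -> None:
        self.rows.append((toy_embed(text, self.model_id), text))

    def search(self, query: str, query_model_id: str, k: int = 1) -> list[str]:
        # Guard against silent drift: refuse to compare vectors that came
        # from a different embedding model than the one that built the index.
        if query_model_id != self.model_id:
            raise ValueError(
                f"index built with {self.model_id!r}, query embedded with "
                f"{query_model_id!r}; re-embed the corpus before querying"
            )
        q = toy_embed(query, query_model_id)
        ranked = sorted(
            self.rows,
            key=lambda row: -sum(a * b for a, b in zip(q, row[0])),
        )
        return [text for _, text in ranked[:k]]


def build_prompt(question: str, passages: list[str]) -> str:
    """The LLM only ever sees text, so any LLM can take this prompt."""
    context = "\n".join(passages)
    return f"Context:\n{context}\n\nQuestion: {question}"


if __name__ == "__main__":
    index = VectorIndex(model_id="embed-v1")
    index.add("reset your password from the account settings page")
    index.add("invoices are emailed on the first of the month")
    question = "how do I reset my password"
    passages = index.search(question, query_model_id="embed-v1")
    print(build_prompt(question, passages))
```

Storing the model id next to the vectors is one simple design choice for turning silent drift into a loud error: if the query side is upgraded to `embed-v2` without re-embedding the corpus, `search` raises instead of returning quietly worse results.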
300 episodes