Snacks Weekly on Data Science

Hybrid Search for Improved Content Discovery [OLX]

7 min · 11. mai 2026
episode Hybrid Search for Improved Content Discovery [OLX] cover

Beskrivelse

In this episode, we explore how OLX improved discovery by combining keyword search and vector search instead of forcing a choice between the two. Keyword systems remain excellent for precision, while vector systems add semantic understanding. Together, they create a smarter and more user-friendly marketplace experience. For more details, you can refer to their published tech blog, linked here for your reference: https://tech.olx.com/hybrid-search-where-keywords-meet-vectors-enabling-classifieds-discovery-b7c383fe4fc4 [https://tech.olx.com/hybrid-search-where-keywords-meet-vectors-enabling-classifieds-discovery-b7c383fe4fc4]

Kommentarer

0

Vær den første til å kommentere

Registrer deg nå og bli medlem av Snacks Weekly on Data Science sitt community!

Kom i gang

2 Måneder for 19 kr

Deretter 99 kr / Måned · Avslutt når som helst.

  • Eksklusive podkaster
  • 20 timer lydbøker i måneden
  • Gratis podkaster

Alle episoder

141 Episoder

episode Accelerating Experimentation Velocity with Interleaving [Expedia] cover

Accelerating Experimentation Velocity with Interleaving [Expedia]

In this episode, we discuss Expedia’s need to evaluate many ranking ideas quickly so that A/B testing would not become a bottleneck. We explore their three-stage experimentation funnel: backtesting to remove weak candidates, interleaving to rapidly compare promising ones, and A/B testing to validate final business impact. What made this design approach effective was using the right evaluation tool at the right step instead of forcing A/B testing to do everything. For more details, you can refer to their published tech blog, linked here for your reference: https://medium.com/expedia-group-tech/interleaving-for-accelerated-testing-75adc644027b [https://medium.com/expedia-group-tech/interleaving-for-accelerated-testing-75adc644027b]

I går9 min
episode Building Taxonomies with Large Language Models [Microsoft] cover

Building Taxonomies with Large Language Models [Microsoft]

In this episode, we look at how companies deal with large volumes of unstructured text and why traditional clustering methods often fall short at scale. We explore two LLM-powered approaches shared by data scientists from Microsoft: a bottom-up pipeline that builds structure from data using embeddings and clustering, and a top-down pipeline that starts with LLM-generated categories and refines them recursively into a hierarchy. For more details, you can refer to their published tech blog, linked here for your reference: https://medium.com/data-science-at-microsoft/from-chaos-to-clarity-building-taxonomies-from-unstructured-text-using-large-language-models-c1303db3adb1 [https://medium.com/data-science-at-microsoft/from-chaos-to-clarity-building-taxonomies-from-unstructured-text-using-large-language-models-c1303db3adb1]

25. mai 20268 min
episode Fraud Detection with Multi-Agent AI Architecture [Razorpay] cover

Fraud Detection with Multi-Agent AI Architecture [Razorpay]

In this episode, we discuss a classic scaling problem in fraud and risk operations: too much manual review, inconsistent judgments, and growing complexity. We explore the team’s solution, Bumblebee, a multi-agent AI architecture that separates planning, evidence gathering, and analysis into specialized roles, enabling a robust and scalable system to solve the problem. For more details, you can refer to their published tech blog, linked here for your reference: https://engineering.razorpay.com/meet-bumblebee-the-multi-agent-ai-architecture-that-changed-fraud-detection-at-razorpay-c2b6d5704f51 [https://engineering.razorpay.com/meet-bumblebee-the-multi-agent-ai-architecture-that-changed-fraud-detection-at-razorpay-c2b6d5704f51]

18. mai 20267 min
episode Localization-Led Generative AI Product [Udemy] cover

Localization-Led Generative AI Product [Udemy]

In this episode, we explore how Udemy built a multilingual AI platform to bring its generative AI features to learners around the world. The team approached localization across three levels: a translation-first approach for broad and fast coverage, a fully native multilingual system for markets where fluency and cultural precision are essential, and a hybrid solution in between that intelligently routes between the two depending on the situation For more details, you can refer to their published tech blog, linked here for your reference: https://medium.com/udemy-engineering/from-zero-to-hero-localization-led-generative-ai-at-udemy-a422e4f968d4 [https://medium.com/udemy-engineering/from-zero-to-hero-localization-led-generative-ai-at-udemy-a422e4f968d4]

4. mai 20268 min