Talking Search at Github with David Tippett

1 h 10 min · 3 de abr de 2025

Descripción

In this episode, I sit down with David Tippett, a search engineer at Github. We chat about his background and journey into the world of search, focusing on his role at GitHub and how he is improving their search experiences. Our conversation explores the complexities of building effective search, including understanding user intent, the importance of data and indexing, and the challenges of measuring search relevance. We also touch upon the evolving landscape of search with the emergence of AI and multimodal approaches. To close this episode, we talk about the difficulties in justifying investment in search and the critical role it plays across various functions within a platform like GitHub. I hope you enjoy this one as much as I did

Comentarios

Sé la primera persona en comentar

¡Regístrate ahora y únete a la comunidad de You know, for search, an Elastic podcast!

Prueba gratis

Todos los episodios

7 episodios

Elastics new High Performance Serverless cloud hosting product

In this episode of You Know for Search, host Steve Mazak is joined by Uri Cohen, head of Elastic’s platform product team, to discuss Elastic’s new serverless deployment option. They cover how serverless eliminates the need to manage nodes, sizing, upgrades, or sharding by using a stateless architecture that separates storage from compute. This re-architecture boosts performance—nearly doubling indexing throughput—and introduces features like adjustable “search power” for cost vs. speed tradeoffs, automatic weekly upgrades, and instant access to the latest capabilities like Lucene 10. Designed for simplicity and scalability, serverless is positioned as the preferred way for most users to run Elastic in the cloud.

30 de sep de 202535 min

Talking Search at Github with David Tippett

3 de abr de 20251 h 10 min

Quantization: The Important Bits

Hey everyone, in this episode I speak to Ben Trent, one of Elastics Sr Principal engineers, about Quantization. We recorded this episode a while ago and since then, launched our latest quantization feature, BBQ [https://www.elastic.co/blog/whats-new-elastic-search-8-16-0]. So this will be a good primer in prep for leveraging that feature which we touch on briefly at the end. I plan to record another episode covering BBQ specifically so hopefully you stay curious! Show Notes: * Fun words - Discretize [https://en.wikipedia.org/wiki/Discretization] * Centroid - https://en.wikipedia.org/wiki/Centroid [https://en.wikipedia.org/wiki/Centroid] * K-means - https://en.wikipedia.org/wiki/K-means_clustering [https://en.wikipedia.org/wiki/K-means_clustering] * NDGC - https://en.wikipedia.org/wiki/Discounted_cumulative_gain#:~:text=NDCG%20is%20often%20used%20to,position%20in%20the%20result%20list [https://en.wikipedia.org/wiki/Discounted_cumulative_gain#:~:text=NDCG%20is%20often%20used%20to,position%20in%20the%20result%20list]. * Binning - https://en.wikipedia.org/wiki/Data_binning [https://en.wikipedia.org/wiki/Data_binning] * RabitQ - https://arxiv.org/abs/2405.12497 [https://arxiv.org/abs/2405.12497]

11 de dic de 202455 min

Function calling with RAG, more than meets the eye

In this episode, we learn how you can do function calling as part of your RAG application built on Elasticsearch with Ashish Tiwari, our Developer Evangelist in India!

11 de nov de 202439 min

HNSW / KNN Deep dive with Mayya Sharipova

I really enjoyed talking to Mayya Sharipova, an Elastic Engineer who been with the company for 7 years! She has worked on many parts of Lucene and Elasticsearch and in this episode we discuss how our HNSW implementation came to be, how KNN works and when you should use it vs Brute force and more talk about Speed, because it's so important to drive engagement with the apps our community and customers build on top of the Elasticsearch stack. Show Notes: * https://www.elastic.co/search-labs/blog/elasticsearch-lucene-vector-database-gains [https://www.elastic.co/search-labs/blog/elasticsearch-lucene-vector-database-gains] * https://www.elastic.co/search-labs/blog/multi-graph-vector-search [https://www.elastic.co/search-labs/blog/multi-graph-vector-search] * https://www.elastic.co/search-labs/blog/how-to-deploy-nlp-text-embeddings-and-vector-search [https://www.elastic.co/search-labs/blog/how-to-deploy-nlp-text-embeddings-and-vector-search] * https://www.elastic.co/guide/en/elasticsearch/reference/current/tune-knn-search.html

5 de sep de 202439 min

Talking Search at Github with David Tippett

Descripción

Comentarios

Empieza 7 días de prueba

Todos los episodios