You know, for search, an Elastic podcast

Talking Search at Github with David Tippett

1 h 10 min · Apr 3, 2025

Description

In this episode, I sit down with David Tippett, a search engineer at GitHub. We chat about his background and journey into the world of search, focusing on his role at GitHub and how he is improving their search experiences. Our conversation explores the complexities of building effective search, including understanding user intent, the importance of data and indexing, and the challenges of measuring search relevance. We also touch on the evolving landscape of search with the emergence of AI and multimodal approaches. To close this episode, we talk about the difficulties in justifying investment in search and the critical role it plays across various functions within a platform like GitHub. I hope you enjoy this one as much as I did.

All episodes

7 episodes

Elastics new High Performance Serverless cloud hosting product

In this episode of You Know for Search, host Steve Mazak is joined by Uri Cohen, head of Elastic’s platform product team, to discuss Elastic’s new serverless deployment option. They cover how serverless eliminates the need to manage nodes, sizing, upgrades, or sharding by using a stateless architecture that separates storage from compute. This re-architecture boosts performance—nearly doubling indexing throughput—and introduces features like adjustable “search power” for cost vs. speed tradeoffs, automatic weekly upgrades, and instant access to the latest capabilities like Lucene 10. Designed for simplicity and scalability, serverless is positioned as the preferred way for most users to run Elastic in the cloud.

Sep 30, 2025 · 35 min

Talking Search at Github with David Tippett

In this episode, I sit down with David Tippett, a search engineer at GitHub. We chat about his background and journey into the world of search, focusing on his role at GitHub and how he is improving their search experiences. Our conversation explores the complexities of building effective search, including understanding user intent, the importance of data and indexing, and the challenges of measuring search relevance. We also touch on the evolving landscape of search with the emergence of AI and multimodal approaches. To close this episode, we talk about the difficulties in justifying investment in search and the critical role it plays across various functions within a platform like GitHub. I hope you enjoy this one as much as I did.

Apr 3, 2025 · 1 h 10 min

Quantization: The Important Bits

Hey everyone, in this episode I speak to Ben Trent, one of Elastic's Sr. Principal Engineers, about quantization. We recorded this episode a while ago and have since launched our latest quantization feature, BBQ [https://www.elastic.co/blog/whats-new-elastic-search-8-16-0]. So this will be a good primer in preparation for using that feature, which we touch on briefly at the end. I plan to record another episode covering BBQ specifically, so hopefully you stay curious!

Show Notes:
* Fun words - Discretize [https://en.wikipedia.org/wiki/Discretization]
* Centroid - https://en.wikipedia.org/wiki/Centroid
* K-means - https://en.wikipedia.org/wiki/K-means_clustering
* NDCG - https://en.wikipedia.org/wiki/Discounted_cumulative_gain#:~:text=NDCG%20is%20often%20used%20to,position%20in%20the%20result%20list
* Binning - https://en.wikipedia.org/wiki/Data_binning
* RabitQ - https://arxiv.org/abs/2405.12497

Dec 11, 2024 · 55 min

HNSW / KNN Deep dive with Mayya Sharipova

I really enjoyed talking to Mayya Sharipova, an Elastic engineer who has been with the company for 7 years! She has worked on many parts of Lucene and Elasticsearch. In this episode we discuss how our HNSW implementation came to be, how KNN works, and when you should use it instead of brute force. We also talk about speed, because it is so important for driving engagement with the apps our community and customers build on top of the Elasticsearch stack.

Show Notes:
* https://www.elastic.co/search-labs/blog/elasticsearch-lucene-vector-database-gains
* https://www.elastic.co/search-labs/blog/multi-graph-vector-search
* https://www.elastic.co/search-labs/blog/how-to-deploy-nlp-text-embeddings-and-vector-search
* https://www.elastic.co/guide/en/elasticsearch/reference/current/tune-knn-search.html

Sep 5, 2024 · 39 min