The Data Frontier Podcast
What does it actually take to build AI systems inside a major cancer center? In this episode, Dr. Anyi Li, Attending Physicist and Chief of Computer Service at Memorial Sloan Kettering Cancer Center, pulls back the curtain on the messy, complex reality of working with clinical data and deploying AI where it matters most. We dig into Woollie, an LLM trained on nearly 40,000 radiology notes to detect cancer progression, and what it unexpectedly revealed about disease pathways. The conversation covers why EHR data are far messier than most people assume, how duplicated clinical notes confuse language models, and why Dr. Li believes the biggest near-term opportunity for AI in healthcare is not diagnosis but reducing the crushing administrative burden on clinicians. Along the way, we unpack the tension between open-source and closed-source models when patient privacy is at stake, whether patient-facing chatbots should be regulated as medical devices (the answer may surprise you), and when fine-tuning your own model may no longer be worth the GPU burn when general-purpose models plus RAG and agents can get you most of the way there. Whether you work in healthcare AI, clinical operations, or data infrastructure, this conversation is packed with hard-won lessons from someone building at the intersection of all three. In this episode, we cover: * How Woollie was trained on radiology impressions to classify cancer progression at 91-92% accuracy * Why clinical notes are full of duplications that break LLM performance * The case for multi-agent architectures to clean and deduplicate patient records * Why ambient listening data and physician-to-physician communications could unlock new clinical insights * The argument that administrative AI is the fastest path to real clinical impact * How to think about FDA regulation for patient-facing vs. physician-facing AI tools * Why general-purpose models with RAG and agents may beat fine-tuned domain models on cost and speed, and vice versa Bio Dr. Anyi Li is an Attending Physicist and Chief of Computer Service in the Department of Medical Physics at Memorial Sloan Kettering Cancer Center. He leads enterprise digital transformation across radiation oncology and medical physics—building the platforms, operating model, and governance required to deploy AI and LLMs safely at scale. Dr. Li directs a multidisciplinary team spanning engineering, data science, physics, and operations, and oversees a portfolio of ~150 clinical systems used by clinicians and staff. His programs reduce administrative friction, improve coordination and throughput, and strengthen patient safety through AI-enabled decision support and workflow intelligence, including timeline analytics and workflow knowledge graphs. Prior to MSK, he spent many years in healthcare IT at IBM Watson Health and Explorys, following early training in theoretical nuclear physics. Relevant Links https://www.mskcc.org/profile/anyi-li [https://www.mskcc.org/profile/anyi-li] https://www.linkedin.com/in/anyili/ [https://www.linkedin.com/in/anyili/] https://pubmed.ncbi.nlm.nih.gov/40604229/
10 episodios
Comentarios
0Sé la primera persona en comentar
¡Regístrate ahora y únete a la comunidad de The Data Frontier Podcast!