Normal Curves: Sexy Science, Serious Statistics
How do you decide whether a clinical trial “worked”? In Part 2 of our Galleri series, we examine the landmark randomized trial of a blood test designed to detect more than 50 cancers. We explore why different outcome measures led to dramatically different headlines, discuss primary versus secondary outcomes, pre-registration, hierarchical testing, and post hoc analyses, and explain why mortality remains the outcome everyone is waiting for. Along the way, we uncover a statistical mystery involving dozens of missing cancers and discover how a little arithmetic can sometimes reveal more than a press release. Statistical topics * cancer screening * exploratory analyses * hierarchical testing * missing data * multiple testing * outcome measures * post hoc analyses * pre-registration * primary and secondary outcomes * randomized clinical trials * screening tests Methodologic Morals * “When the simple numbers don't add up, pay attention. The arithmetic may be trying to tell you something.” * “The first question should not be, did it work? It should be, what counts as success?” References * Giridhar KV, et al. Safety and performance results from PATHFINDER 2, a registrational study of a multi-cancer early detection test in an intended-use population [https://grail.com/wp-content/uploads/2026/05/Giridhar.ASCO-2026.PF2-Primary.ORAL_FINAL-For-GRAIL-Website.pdf]. Presented at the 2026 American Society of Clinical Oncology (ASCO) Annual Meeting. May 2026. * Hubbell E, Clarke CA, Aravanis AM, Berg CD. Modeled Reductions in Late-stage Cancer with a Multi-Cancer Early Detection Test. [https://pubmed.ncbi.nlm.nih.gov/33328254/]Cancer Epidemiol Biomarkers Prev. 2021;30(3):460-468. doi:10.1158/1055-9965.EPI-20-1134 * Neal RD, Johnson P, Clarke CA, et al. Cell-Free DNA-Based Multi-Cancer Early Detection Test in an Asymptomatic Screening Population (NHS-Galleri): Design of a Pragmatic, Prospective Randomised Controlled Trial. [https://pmc.ncbi.nlm.nih.gov/articles/PMC9564213/] Cancers (Basel). 2022;14(19):4818. Published 2022 Oct 1. doi:10.3390/cancers14194818 * ASCO slides: https://grail.com/wp-content/uploads/2026/05/Swanton_ASCO-2026_NHS-Galleri_FINAL-Slides-05.26.2026.pdf [https://grail.com/wp-content/uploads/2026/05/Swanton_ASCO-2026_NHS-Galleri_FINAL-Slides-05.26.2026.pdf] * UK registry protocol: https://www.isrctn.com/ISRCTN91431511 [https://www.isrctn.com/ISRCTN91431511] * Clinicaltrials.gov [http://clinicaltrials.gov] protocol: https://clinicaltrials.gov/study/NCT05611632 [https://clinicaltrials.gov/study/NCT05611632] Common biases in cancer screening studies Cancer screening studies are subject to several well-known biases that can make a screening test appear more effective than it actually is. Three of the most important are: Lead-time bias: Screening advances the time of diagnosis, making survival from diagnosis appear longer even if the patient's lifespan is unchanged. For example, if a screening test detects a Stage II cancer at age 60 that otherwise would have been diagnosed because of symptoms at age 62, but the patient dies at age 68 regardless, survival from diagnosis appears to increase from 6 years to 8 years even though the patient did not live any longer. Length bias: Screening preferentially detects slower-growing, less aggressive cancers because they remain detectable for longer than fast-growing cancers. For example, a slow-growing cancer that remains in Stage I for 5 years is much more likely to be found by screening than an aggressive cancer that progresses to symptoms within months. This can make screened patients appear to have better survival simply because screening preferentially found the less aggressive cancers. Overdiagnosis: Screening detects cancers that would never have caused symptoms or death during a person's lifetime, leading to unnecessary diagnosis and treatment. For example, a screening test may detect a very slow-growing prostate or thyroid cancer in an older adult that would never have become clinically important if it had remained undiscovered. Kristin and Regina’s online courses: Demystifying Data: A Modern Approach to Statistical Understanding [https://online.stanford.edu/courses/som-xche0033-demystifying-data-modern-approach-statistical-understanding] Clinical Trials: Design, Strategy, and Analysis [https://online.stanford.edu/courses/som-xche0030-clinical-trials-design-strategy-and-analysis] Medical Statistics Certificate Program [https://online.stanford.edu/programs/medical-statistics-program] Writing in the Sciences [https://www.coursera.org/learn/sciwrite] Epidemiology and Clinical Research Graduate Certificate Program [https://online.stanford.edu/programs/epidemiology-and-clinical-research-graduate-certificate] Programs that we teach in: Epidemiology and Clinical Research Graduate Certificate Program [https://online.stanford.edu/programs/epidemiology-and-clinical-research-graduate-certificate] Find us on: Kristin - LinkedIn [https://www.linkedin.com/in/kristin-sainani-642b5914/] & Twitter/X [https://x.com/KristinSainani] Regina - LinkedIn [https://www.linkedin.com/in/reginanuzzo/] & https://www.reginanuzzo.com/ReginaNuzzo.com [http://reginanuzzo.com] * (00:00) - Intro * (03:39) - The Claim: Not Ready for Primetime * (03:58) - Trial Design: 142,000 Participants * (07:50) - The Primary Outcome Problem * (20:29) - The Primary Endpoint: Complete Miss * (22:14) - Three Arguments for the Defense * (28:29) - - Statistical Sleuthing: Missing Cancers * (41:14) - - The Stage Shift Argument * (50:30) - - Rating the Claim
38 episodes
Comments
0Be the first to comment
Sign up now and become a member of the Normal Curves: Sexy Science, Serious Statistics community!