Statistical Methods & Thinking

Episode 12 | Clustering and Classification: Finding Structure in Data

38 min · 3 feb 2026
aflevering Episode 12 | Clustering and Classification: Finding Structure in Data cover

Beschrijving

In this episode, we step into multivariate thinking and ask a practical question: when do data points naturally form “groups,” and how can we use those groups to make decisions? We walk through how grouping methods decide what’s “close” or “similar,” then compare two main approaches—building clusters step by step versus forming clusters all at once. You’ll also hear how tree-like visual summaries help us see structure in messy data, and how the same multivariate ideas can be flipped into classification, where the goal is to assign a new case to the most likely group.

Reacties

0

Wees de eerste die een reactie plaatst

Meld je nu aan en word lid van de Statistical Methods & Thinking community!

Begin hier

2 maanden voor € 1

Daarna € 9,99 / maand · Elk moment opzegbaar.

  • Podcasts die je alleen op Podimo hoort
  • 20 uur luisterboeken / maand
  • Gratis podcasts

Alle afleveringen

13 afleveringen

aflevering Episode 11 | Finding Structure in Multivariate Data artwork

Episode 11 | Finding Structure in Multivariate Data

This episode is about what to do when your data has many variables at once. We start with the basic idea of how variables “move together” (correlation and covariance), and why that matters for understanding patterns in real datasets. Then we introduce dimension reduction—ways to compress lots of information into a few summary features, so you can see the main structure without getting lost in details. We explain how these methods find the directions where the data varies most, and how a simple “rotation” can make the results easier to interpret. We wrap up with practical rules of thumb for deciding how many components to keep, and a quick preview of how these ideas connect to grouping similar observations and classifying new cases.

2 feb 202644 min