Statistical Methods & Thinking

Episode 12 | Clustering and Classification: Finding Structure in Data

38 min · 3. feb. 2026
episode Episode 12 | Clustering and Classification: Finding Structure in Data cover

Beskrivelse

In this episode, we step into multivariate thinking and ask a practical question: when do data points naturally form “groups,” and how can we use those groups to make decisions? We walk through how grouping methods decide what’s “close” or “similar,” then compare two main approaches—building clusters step by step versus forming clusters all at once. You’ll also hear how tree-like visual summaries help us see structure in messy data, and how the same multivariate ideas can be flipped into classification, where the goal is to assign a new case to the most likely group.

Kommentarer

0

Vær den første til at kommentere

Tilmeld dig nu og bliv en del af Statistical Methods & Thinking-fællesskabet!

Kom i gang

2 måneder kun 19 kr.

Derefter 99 kr. / måned · Opsig når som helst.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

Alle episoder

13 episoder

episode Episode 11 | Finding Structure in Multivariate Data cover

Episode 11 | Finding Structure in Multivariate Data

This episode is about what to do when your data has many variables at once. We start with the basic idea of how variables “move together” (correlation and covariance), and why that matters for understanding patterns in real datasets. Then we introduce dimension reduction—ways to compress lots of information into a few summary features, so you can see the main structure without getting lost in details. We explain how these methods find the directions where the data varies most, and how a simple “rotation” can make the results easier to interpret. We wrap up with practical rules of thumb for deciding how many components to keep, and a quick preview of how these ideas connect to grouping similar observations and classifying new cases.

2. feb. 202644 min