Statistical Methods & Thinking

Episode 12 | Clustering and Classification: Finding Structure in Data

38 min · 3. feb. 2026
episode Episode 12 | Clustering and Classification: Finding Structure in Data cover

Description

In this episode, we step into multivariate thinking and ask a practical question: when do data points naturally form “groups,” and how can we use those groups to make decisions? We walk through how grouping methods decide what’s “close” or “similar,” then compare two main approaches—building clusters step by step versus forming clusters all at once. You’ll also hear how tree-like visual summaries help us see structure in messy data, and how the same multivariate ideas can be flipped into classification, where the goal is to assign a new case to the most likely group.

Comments

0

Be the first to comment

Sign up now and become a member of the Statistical Methods & Thinking community!

Get Started

2 months for 19 kr.

Then 99 kr. / month · Cancel anytime.

  • Podcasts kun på Podimo
  • 20 lydbogstimer pr. måned
  • Gratis podcasts

All episodes

13 episodes

episode Episode 11 | Finding Structure in Multivariate Data artwork

Episode 11 | Finding Structure in Multivariate Data

This episode is about what to do when your data has many variables at once. We start with the basic idea of how variables “move together” (correlation and covariance), and why that matters for understanding patterns in real datasets. Then we introduce dimension reduction—ways to compress lots of information into a few summary features, so you can see the main structure without getting lost in details. We explain how these methods find the directions where the data varies most, and how a simple “rotation” can make the results easier to interpret. We wrap up with practical rules of thumb for deciding how many components to keep, and a quick preview of how these ideas connect to grouping similar observations and classifying new cases.

2. feb. 202644 min