Courses
Log inSign up

Clustering & Classification

Detect and predict groups in your data to understand differences and make predictions.

13 Lessons97 Exercises

Similarity

Forming Clusters

K-Means Algorithm

Interpreting Clusters

Initial Guesses

Inertia

Total Inertia

Optimization

Number of Clusters

More Than Two Variables

Magnitudes

Min-Max Scaling

Outliers


Course description

This course introduces k-means clustering and logistic regression, which are used to uncover natural groupings in data and to model the probability of membership in a set of classes, respectively. It explores two applications of these methods, using k-means clustering to segment observations into meaningful clusters and logistic regression to classify instances and predict the likelihood of specific categories. Lessons cover choosing the optimal number of clusters, comparing the inertia of different clusterings, and minimizing likelihood with stochastic gradient descent. By the end, you'll understand how and when to deploy these methods and how to derive insights from their results.


Topics covered

  • k-means clustering
  • scatterplots
  • inertia
  • climate data
  • min-max scaling
  • outliers
  • logistic regression
  • classification
  • likelihood
  • decision boundary
  • stochastic gradient descent
  • binary random variables
  • probability