Topic 7. Unsupervised Learning: Principal Component Analysis and Clustering

Topic 7. Unsupervised Learning: Principal Component Analysis and Clustering#

../../_images/topic7-teaser.jpg

Here we turn to the vast topic of unsupervised learning; it’s about the cases when we have data but it is unlabeled with no target feature to predict like in classification/regression tasks. Most of the data out there is unlabeled, and we need to be able to make use of it. We discuss only 2 types of unsupervised learning tasks – clustering and dimensionality reduction.

Steps in this block#

1. Read the article (same in a form of a Kaggle Notebook);

2. Watch a video lecture series coming in 2 parts:

3. Complete demo assignment 7 (same as a Kaggle Notebook) where you analyze data coming from mobile phone accelerometers and gyroscopes to cluster people into different types of physical activities;

4. Check out the solution (same as a Kaggle Notebook) to the demo assignment (optional);

5. Complete Bonus Assignment 7, where we walk you through Sklearn built-in implementations of dimensionality reduction and clustering methods and apply these techniques to the popular “faces” dataset (optional, available under Patreon “Bonus Assignments” tier).