Skip to content

Latest commit

 

History

History
 
 

k-means-101

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

###Understanding the K-means algorithm

In this tutorial, we build up the k-means algorithm step-by-step. This tutorial uses only standard python.

The steps are designed to logically build up the pieces from the perspective of asking and answering simple questions about a data distribution.

To solve any real problems, use scikit-learn.


The capability of the full sklearn package is pretty mind-blowing; this Notebook aims for the lowest hanging fruit, because the same framework is used for the advanced use-cases. This is certainly one of the strengths of sklearn. Note that these materials do not go into explaining what the various estimators are doing or how the algorithm works. For those discussions, definitely see the other materials in this repository and the official documentation.

If you want to explore the IPython Notebook without running Python on your own machine, you can also view it at nbviewer.

Enjoy!