🟣 ML + AI · Lesson 57

K-Means Clustering

What is K-Means Clustering?

K-Means Clustering means k-Means is an unsupervised algorithm that groups similar data points into clusters.

In real programs, this topic helps in grouping students or customers. Learn the idea first, then type the program yourself and compare the output.

💡 At a Glance

Point	Details
Course Area	Machine Learning + AI Concepts used for prediction, classification, clustering and AI-based projects.
Main Use	grouping students or customers
Example File	`kmeans-clustering.py`
Practice Focus	Run, change values, and explain the output line by line.

Why should you learn this?

It is useful for grouping students or customers.
It connects with finding natural clusters.
It improves your ability to read, write and debug Python programs.

Important Terms

These terms are used directly in this lesson. Understand them before memorising the code.

Term	Meaning
unsupervised learning	Finding patterns in data without given labels.
cluster	Group of similar data points.
centroid	Center point of a cluster.
k value	Number of neighbors checked by KNN.
similarity	How close or alike two data points are.

Syntax / Basic Pattern

The simple pattern is: prepare data, apply the concept, then show the result.

Basic Pattern

from sklearn.cluster import KMeans
X = [[1, 2], [1, 4], [10, 12], [11, 13]]
model = KMeans(n_clusters=2, random_state=0, n_init=10)
model.fit(X)
print(model.labels_)

Complete Example Program

Python – kmeans-clustering.py

from sklearn.cluster import KMeans

X = [[1, 2], [1, 4], [10, 12], [11, 13]]

model = KMeans(n_clusters=2, random_state=0, n_init=10)
model.fit(X)
print(model.labels_)

Expected Output

[1 1 0 0]

Program Explanation

from sklearn.cluster import KMeans imports ready-made features from a module/library.
X = [[1, 2], [1, 4], [10, 12], [11, 13]] stores a value in X.
model = KMeans(n_clusters=2, random_state=0, n_init=10) stores a value in model.
model.fit(X) performs the next step of the program logic.
print(model.labels_) displays information or calculated result on the screen.

Where will you use it?

Grouping students or customers.
Finding natural clusters.
Unsupervised pattern discovery.

Common Mistakes

Training and testing the model on the same data.
Using an algorithm without understanding the input features.
Reporting only accuracy without checking actual mistakes and limitations.

Practice Tasks

Type the program in kmeans-clustering.py and run it.
Change input values or sample data and observe the new output.
Create one example related to grouping students or customers.
Write 5 lines explaining the logic in your own words.

Summary

K-Means Clustering is not a theory-only topic. You should be able to explain the meaning, write the example, run it successfully, and use it in a small practical program.