🟣 ML + AI · Lesson 58

Principal Component Analysis (PCA)

What is Principal Component Analysis?

Principal Component Analysis means pCA reduces the number of features while trying to keep the most important information.

In real programs, this topic helps in reducing many features. Learn the idea first, then type the program yourself and compare the output.

💡 At a Glance

Point	Details
Course Area	Machine Learning + AI Concepts used for prediction, classification, clustering and AI-based projects.
Main Use	reducing many features
Example File	`pca.py`
Practice Focus	Run, change values, and explain the output line by line.

Why should you learn this?

It is useful for reducing many features.
It connects with visualising high-dimensional data.
It improves your ability to read, write and debug Python programs.

Important Terms

These terms are used directly in this lesson. Understand them before memorising the code.

Term	Meaning
dimensionality reduction	Reducing number of features while preserving useful information.
variance	Spread of data values.
components	New combined directions created by PCA.
features	features is an important term in this topic.
visualization	Presenting data through charts or graphs.

Syntax / Basic Pattern

The simple pattern is: prepare data, apply the concept, then show the result.

Basic Pattern

from sklearn.decomposition import PCA
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]
pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Complete Example Program

Python – pca.py

from sklearn.decomposition import PCA

X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]

pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Expected Output

A one-column transformed array will be displayed.

Program Explanation

from sklearn.decomposition import PCA imports ready-made features from a module/library.
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]] stores a value in X.
pca = PCA(n_components=1) stores a value in pca.
X_new = pca.fit_transform(X) stores a value in X_new.
print(X_new) displays information or calculated result on the screen.

Where will you use it?

Reducing many features.
Visualising high-dimensional data.
Removing redundant information.

Common Mistakes

Training and testing the model on the same data.
Using an algorithm without understanding the input features.
Reporting only accuracy without checking actual mistakes and limitations.

Practice Tasks

Type the program in pca.py and run it.
Change input values or sample data and observe the new output.
Create one example related to reducing many features.
Write 5 lines explaining the logic in your own words.

Summary

Principal Component Analysis is not a theory-only topic. You should be able to explain the meaning, write the example, run it successfully, and use it in a small practical program.

PCA क्या है?

PCA ka matlab hai: PCA reduces the number of features while trying to keep the most important information. Simple words me, ye topic practical Python programs likhne me direct use hota hai.

Is topic ko sirf definition ke liye nahi, balki reducing many features jaise real examples ke liye practice karein.

यह क्यों सीखना जरूरी है?

Ye reducing many features me kaam aata hai.
Ye visualising high-dimensional data se bhi connected hai.
Isse aap code ka output aur errors better samajh paate hain.

Important Terms

Term	Meaning
dimensionality reduction	Reducing number of features while preserving useful information.
variance	Spread of data values.
components	New combined directions created by PCA.
features	features is an important term in this topic.
visualization	Presenting data through charts or graphs.

Syntax / Basic Pattern

Basic idea: pehle data तैयार करें, phir Python logic apply करें, aur finally result display करें.

Basic Pattern

from sklearn.decomposition import PCA
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]
pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Complete Example Program

Python – pca.py

from sklearn.decomposition import PCA

X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]

pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Expected Output

A one-column transformed array will be displayed.

Program Explanation

from sklearn.decomposition import PCA imports ready-made features from a module/library.
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]] stores a value in X.
pca = PCA(n_components=1) stores a value in pca.
X_new = pca.fit_transform(X) stores a value in X_new.
print(X_new) displays information or calculated result on the screen.

Practical Uses

Reducing many features.
Visualising high-dimensional data.
Removing redundant information.

Common Mistakes

Training and testing the model on the same data.
Using an algorithm without understanding the input features.
Reporting only accuracy without checking actual mistakes and limitations.

Practice Tasks

Program ko pca.py file me type karke run karein.
Values change karke output compare karein.
reducing many features par ek छोटा example banayen.
Logic ko apne words me 5 lines me likhein.

सारांश

Principal Component Analysis ko tab complete maanenge jab aap iska meaning, example, output aur practical use clearly explain kar saken.

← Back to Python Tutorial