🟣 ML + AI  ·  Lesson 58

Principal Component Analysis (PCA)

Principal Component Analysis (PCA)

What is Principal Component Analysis?

Principal Component Analysis means pCA reduces the number of features while trying to keep the most important information.

In real programs, this topic helps in reducing many features. Learn the idea first, then type the program yourself and compare the output.

💡 At a Glance
PointDetails
Course AreaMachine Learning + AI
Concepts used for prediction, classification, clustering and AI-based projects.
Main Usereducing many features
Example Filepca.py
Practice FocusRun, change values, and explain the output line by line.

Why should you learn this?

  • It is useful for reducing many features.
  • It connects with visualising high-dimensional data.
  • It improves your ability to read, write and debug Python programs.

Important Terms

These terms are used directly in this lesson. Understand them before memorising the code.

TermMeaning
dimensionality reductionReducing number of features while preserving useful information.
varianceSpread of data values.
componentsNew combined directions created by PCA.
featuresfeatures is an important term in this topic.
visualizationPresenting data through charts or graphs.

Syntax / Basic Pattern

The simple pattern is: prepare data, apply the concept, then show the result.

Basic Pattern
from sklearn.decomposition import PCA
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]
pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Complete Example Program

Python – pca.py
from sklearn.decomposition import PCA

X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]

pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Expected Output

A one-column transformed array will be displayed.

Program Explanation

  • from sklearn.decomposition import PCA imports ready-made features from a module/library.
  • X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]] stores a value in X.
  • pca = PCA(n_components=1) stores a value in pca.
  • X_new = pca.fit_transform(X) stores a value in X_new.
  • print(X_new) displays information or calculated result on the screen.

Where will you use it?

  • Reducing many features.
  • Visualising high-dimensional data.
  • Removing redundant information.

Common Mistakes

  • Training and testing the model on the same data.
  • Using an algorithm without understanding the input features.
  • Reporting only accuracy without checking actual mistakes and limitations.

Practice Tasks

  1. Type the program in pca.py and run it.
  2. Change input values or sample data and observe the new output.
  3. Create one example related to reducing many features.
  4. Write 5 lines explaining the logic in your own words.

Summary

Principal Component Analysis is not a theory-only topic. You should be able to explain the meaning, write the example, run it successfully, and use it in a small practical program.

PCA क्या है?

PCA ka matlab hai: PCA reduces the number of features while trying to keep the most important information. Simple words me, ye topic practical Python programs likhne me direct use hota hai.

Is topic ko sirf definition ke liye nahi, balki reducing many features jaise real examples ke liye practice karein.

यह क्यों सीखना जरूरी है?

  • Ye reducing many features me kaam aata hai.
  • Ye visualising high-dimensional data se bhi connected hai.
  • Isse aap code ka output aur errors better samajh paate hain.

Important Terms

TermMeaning
dimensionality reductionReducing number of features while preserving useful information.
varianceSpread of data values.
componentsNew combined directions created by PCA.
featuresfeatures is an important term in this topic.
visualizationPresenting data through charts or graphs.

Syntax / Basic Pattern

Basic idea: pehle data तैयार करें, phir Python logic apply करें, aur finally result display करें.

Basic Pattern
from sklearn.decomposition import PCA
X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]
pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Complete Example Program

Python – pca.py
from sklearn.decomposition import PCA

X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]]

pca = PCA(n_components=1)
X_new = pca.fit_transform(X)
print(X_new)

Expected Output

A one-column transformed array will be displayed.

Program Explanation

  • from sklearn.decomposition import PCA imports ready-made features from a module/library.
  • X = [[2, 4, 6], [3, 6, 9], [4, 8, 12], [5, 10, 15]] stores a value in X.
  • pca = PCA(n_components=1) stores a value in pca.
  • X_new = pca.fit_transform(X) stores a value in X_new.
  • print(X_new) displays information or calculated result on the screen.

Practical Uses

  • Reducing many features.
  • Visualising high-dimensional data.
  • Removing redundant information.

Common Mistakes

  • Training and testing the model on the same data.
  • Using an algorithm without understanding the input features.
  • Reporting only accuracy without checking actual mistakes and limitations.

Practice Tasks

  1. Program ko pca.py file me type karke run karein.
  2. Values change karke output compare karein.
  3. reducing many features par ek छोटा example banayen.
  4. Logic ko apne words me 5 lines me likhein.

सारांश

Principal Component Analysis ko tab complete maanenge jab aap iska meaning, example, output aur practical use clearly explain kar saken.

← Back to Python Tutorial