Lecture 05 — Types of Machine Learning
~3 hours (core AI taxonomy lecture)
🧠 Big Picture (Read This First)
Machine Learning = systems that improve performance by learning from experience.
But here is the secret:
Not all experience teaches the same way.
Different problems → different learning paradigms.
If you choose the wrong type, even the best model will fail.
🧭 A Simple Mental Map
Think of learning like school:
| Learning Type | Analogy |
|---|---|
| Supervised | Teacher gives answers |
| Unsupervised | Student explores alone |
| Semi-supervised | Few answers, many questions |
| Self-supervised | Student creates own homework |
| Reinforcement | Learning by reward & punishment |
1️⃣ Supervised Learning
“Learning with a teacher”
📘 Definition
Learning from labeled data:
(input → correct output)
Examples:
- image → label (cat / dog)
- email → spam / not spam
- house features → price
🧠 Intuition (Human Version)
A teacher says:
“This is a cat.”
“This is NOT a cat.”
The student adjusts until mistakes are small.
🧪 Classic Problems
🔹 Classification
- spam detection
- medical diagnosis
- fraud detection
🔹 Regression
- house price prediction
- temperature forecasting
💻 Mini Project (Beginner-Friendly)
🎯 Project: Email Spam Classifier
Steps:
- Collect labeled emails
- Convert text → numbers
- Train a classifier
- Evaluate accuracy
from sklearn.linear_model import LogisticRegression
from sklearn.feature_extraction.text import CountVectorizer
texts = ["free money", "meeting tomorrow"]
labels = [1, 0] # 1=spam, 0=not spam
vec = CountVectorizer()
X = vec.fit_transform(texts)
model = LogisticRegression()
model.fit(X, labels)
👉 This is real AI, not magic.
❓ Quiz
Why does supervised learning need labels?
Because the loss function compares predictions with ground truth.
2️⃣ Unsupervised Learning
“Learning without answers”
📘 Definition
Learning patterns without labels.
The system asks:
“What structure exists here?”
😄 Funny Example
You walk into a party with strangers.
No name tags.
Your brain automatically groups people:
- similar clothes
- similar behavior
- similar interests
That’s clustering.
🧪 Common Techniques
- K-means clustering
- PCA (dimensionality reduction)
- Topic modeling
💻 Mini Project
🎯 Project: Customer Segmentation
from sklearn.cluster import KMeans
X = [[20, 500], [25, 700], [45, 2000]] # age, spending
kmeans = KMeans(n_clusters=2)
kmeans.fit(X)
print(kmeans.labels_)
Used in:
- marketing
- recommendation systems
- anomaly detection
❓ Quiz
Is unsupervised learning useful without labels?
Yes — it discovers structure and representations.
3️⃣ Semi-Supervised Learning
“A few answers, many questions”
📘 Definition
Small labeled dataset + Large unlabeled dataset
🧠 Why This Exists
Labeling costs:
- money 💰
- time ⏳
- experts 🧠
But raw data is everywhere.
🧪 Real-World Uses
- medical images
- legal documents
- speech recognition
💻 Mini Project Idea
Label:
- 100 images by hand
Let the model:
- infer the rest
This is how real-world AI systems survive.
❓ Quiz
Why is semi-supervised learning practical?
Labels are expensive; data is cheap.
4️⃣ Self-Supervised Learning
“The AI creates its own teacher”
📘 Definition
Labels are automatically generated from data itself.
🤯 Why This Changed AI Forever
Instead of humans saying:
“This is the answer”
The system asks:
“What part of the data is missing?”
🧪 Famous Examples
🔹 NLP
- Masked word prediction (BERT)
🔹 Vision
- Predict image rotation
- Contrastive learning (CLIP)
💻 Mini Project (Conceptual)
Input: "AI is ____"
Target: "powerful"
This simple idea trained LLMs.
❓ Quiz
Why is self-supervised learning powerful?
It scales with data, not human labeling.
5️⃣ Reinforcement Learning (RL)
“Learning by doing”
📘 Definition
Learning from reward signals, not correct answers.
🕹️ Intuition
You train a dog 🐕:
- Good behavior → treat
- Bad behavior → no treat
No explicit instructions.
🧪 Key Components
| Term | Meaning |
|---|---|
| Agent | Learner |
| Environment | World |
| Action | Choice |
| Reward | Feedback |
💻 Tiny RL Example (Concept)
if reward > 0:
increase_probability(action)
else:
decrease_probability(action)
Used in:
- AlphaGo
- robotics
- recommendation systems
- game AI
❓ Quiz
What is the exploration–exploitation tradeoff?
Balancing trying new actions vs using known good ones.
🧠 How These Power Modern AI
| System | Learning Type |
|---|---|
| ChatGPT | Self-supervised + RLHF |
| Image AI | Self-supervised + supervised |
| Robots | Reinforcement learning |
| Recommenders | Supervised + RL |
🎓 Final Big Insight
There is no “best” learning type. There is only the right tool for the problem.
Great AI engineers choose wisely.
🌱 Final Reflection
If you had unlimited data but no labels, which learning paradigm would you choose?
Self-supervised learning.