ML.1-Overview of ML (Week 1)
ML.1-Overview of ML (Week 1)
Chapter 1
• Introduction to ML
• Types of ML Systems
• Challenges of ML
Machine Learning 2
CONTENTs
•Introduction
• Types of ML Systems
• Challenges of ML
Machine Learning 3
Introduction
• What is ML?
• ML is the science (& art) of programming computers so they can learn from data.
• “ML is the field of study that gives computers the ability to learn without being
explicitly programmed” - [Arthur Samuel, 1959]
• “A computer program is said to learn from experience E with respect to some
task T and some performance measure P, if its performance on T, as measured
by P, improves with experience E” – [Tom Mitchell, 1997]
e.g: spam filter
• task T flag spam for new emails
• experience E training data
• performance measure P ratio of correctly classified emails (accuracy)
Machine Learning 4
Introduction
Machine Learning 5
Introduction
Machine Learning 6
Introduction
Machine Learning 7
Introduction
Machine Learning 8
Introduction
Machine Learning 9
Introduction
• Applications of ML
• Image Recognition
• Speech Recognition
• Sentiment Analysis
• Traffic prediction
• Product recommendations
• Self-driving cars
• Email Spam and Malware Filtering
• Medical Diagnosis
• Automatic Language Translation
• …
Machine Learning 10
CONTENTs
• Introduction
•Types of ML Systems
• Challenges of ML
Machine Learning 11
Types of ML Systems
Machine Learning 12
Types of ML Systems
• Supervised >< Unsupervised Learning
• ML systems can be classified according to the amount and type of
supervision they get during training.
Machine Learning 13
Types of ML Systems
• Supervised
• Supervised learning: the training data you feed to the algorithm includes the
desired solutions, called labels
• 2 types of supervised learning:
• Regression
• Classification
• Important supervised learning algorithms:
• k-Nearest Neighbors
• Linear Regression
• Logistic Regression
• Support Vector Machine (SVM)
• Decision Trees and Random Forests
• Neural networks
Machine Learning 14
Types of ML Systems
• Unsupervised Learning
• Unsupervised learning: the training data is unlabeled (the system tries to learn
without a teacher).
• Important Unsupervised learning algorithms:
• Clustering: K-Means, DBSCAN, Hierarchical Cluster Analysis (HCA)
• Anomaly detection and novelty detection: One-class SVM, Isolation Forest
• Visualization and dimensionality reduction: Principal Component Analysis
(PCA), Kernel PCA, Locally-Linear Embedding (LLE), t-distributed Stochastic
Neighbor Embedding (t-SNE)
• Association rule learning: Apriori, Eclat.
Machine Learning 15
Types of ML Systems
• Unsupervised Learning
Clustering
Anomaly detection
Machine Learning 16
Types of ML Systems
• Semisupervised learning
• Semisupervised learning: deal with partially labeled training data (usually a lot of
unlabeled data & a little bit of labeled data)
Machine Learning 17
Types of ML Systems
• Reinforcement Learning
• Reinforcement learning:
• can observe the
environment, select and
perform actions, and get
rewards in return
(or penalties in the form
of negative rewards)
• then, system must learn
by itself what is the best
strategy to get the most
reward over time.
Machine Learning 18
CONTENTs
• Introduction
• Types of ML Systems
•Challenges of ML
• Testing & Validating
Machine Learning 19
Challenges of ML
• Introduction
• Types of ML Systems
• Challenges of ML
Machine Learning 21
Testing & Validating
• Testing
• Split data into 2 sets:
• training set (for train model )
• test set (for testing model)
• Evaluating model on the test set
estimate of generalization error.
• If the training error is low but the generalization error is high
model is overfitting the training data.
• Validating
• Hold out part of the training set to evaluate several candidate models and select
the best one. The new heldout set is called the validation set
Machine Learning 22
SUMMARY
• Introduction
• Types of ML Systems
• Challenges of ML
Machine Learning 23
Nhân bản – Phụng sự – Khai phóng
Machine Learning 24