Chapter4 Machine Learning Part1

This document provides an overview of machine learning. It discusses why machine learning is important due to the large amount of data being created every day. It defines machine learning as a field of artificial intelligence that allows machines to learn from experience to improve their performance without being explicitly programmed. The document outlines the typical machine learning process of defining the objective, gathering and preparing data, exploring the data, building a model, evaluating the model, and making predictions. It also describes the main types of machine learning as supervised learning, unsupervised learning, and reinforcement learning.

Uploaded by

Max Sun

Chapter 4 Machine Learning

COMP 472 Artificial Intelligence

Russell & Norvig – Section 18.1 & 18.2

2 Why Machine Learning?

Over 2.5 quintillion bytes of data are created every single day, and it is only
going to grow from there. It is estimated that 1.7MB of data will be created
every second for every person on earth.
3 Why Machine Learning?
4 What is Machine Learning

´ In 1959, Arthur Samuel first proposed the concept of Machine Learning.
´ Machine Learning is a subset of Artificial Intelligence which gives machines the ability to learn automatically and improve from experience without being explicitly programmed.
5 What is Machine Learning

´ “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.” (Tom Mitchell, 1997)
6 Definitions

´ Algorithm: A set of rules and statistical techniques used to learn patterns from data.
´ Model: What is produced by training a Machine Learning algorithm on data.
´ Predictor Variable: A feature (or features) of the data that can be used to predict the output.
´ Response Variable: The feature or output variable that needs to be predicted by using the predictor variable(s).
´ Training Data: The data used to build the Machine Learning model.
´ Testing Data: The data used to evaluate the Machine Learning model.
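
A minimal sketch of how these terms fit together, assuming a made-up weather dataset (the rows, the 30% test fraction and the helper name `train_test_split` are illustrative, not from the slides). It separates predictor variables from the response variable and splits the rows into training and testing data:

```python
import random

# Hypothetical toy dataset: each row is (humidity %, pressure hPa, rained?).
# The first two columns are predictor variables; the last is the response variable.
rows = [
    (85, 1002, 1), (40, 1021, 0), (78, 1005, 1), (30, 1025, 0),
    (90, 998, 1), (55, 1015, 0), (70, 1008, 1), (45, 1019, 0),
    (88, 1000, 1), (35, 1023, 0),
]

def train_test_split(data, test_fraction=0.3, seed=0):
    """Shuffle a copy of the data and split it into training and testing parts."""
    shuffled = list(data)
    random.Random(seed).shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]

train, test = train_test_split(rows)

# Predictors (X) and response (y) for the training data.
X_train = [row[:2] for row in train]
y_train = [row[2] for row in train]
print(len(train), "training rows,", len(test), "testing rows")
```

The fixed seed only makes the split reproducible; any seed gives a valid split.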
7 Machine Learning Process

´ The Machine Learning Process involves building a Predictive model that can be used to find a solution for a Problem Statement.
´ The steps, in order:
Define Objective → Data Gathering → Preparing Data → Data Exploration → Building a Model → Model Evaluation → Predictions

8 Machine Learning Process

´ Step 1: Define the objective of the problem
To predict the possibility of rain by studying the weather conditions.
Example: Weather Forecast using Machine Learning
9 Machine Learning Process

´ What are we trying to predict?
´ What are the target features?
´ What is the input data?
´ What kind of problem are we facing? Binary classification? Clustering?
10 Machine Learning Process

´ Step 2: Data Gathering
Data such as weather conditions, humidity level, temperature, pressure, etc. are either collected manually or scraped from the web.
11 Machine Learning Process

´ Open Data Sources
´ Google Public Data Explorer
https://www.google.com/publicdata/directory
´ Registry of Open Data on AWS (RODA)
https://registry.opendata.aws/
´ Kaggle
https://www.kaggle.com/datasets
´ DBpedia
https://wiki.dbpedia.org/
12 Machine Learning Process

´ Step 3: Preparing Data
Data Cleaning involves getting rid of inconsistencies in data such as missing values or redundant variables.
´ Transform data into the desired format
´ Data Cleaning
  ´ Missing values
  ´ Corrupted data
  ´ Remove unnecessary data
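
The cleaning steps above can be sketched as follows. The records, the `station_note` column and the 0 to 100 validity range for humidity are assumptions made for illustration only:

```python
# Hypothetical raw weather records; None marks a missing value and
# "station_note" is a column assumed to be unnecessary for prediction.
raw_records = [
    {"humidity": 85, "temperature": 18.5, "pressure": 1002, "station_note": "ok"},
    {"humidity": None, "temperature": 21.0, "pressure": 1015, "station_note": "ok"},
    {"humidity": 40, "temperature": 25.2, "pressure": 1021, "station_note": "n/a"},
    {"humidity": 300, "temperature": 19.0, "pressure": 1008, "station_note": "ok"},  # corrupted
    {"humidity": 40, "temperature": 25.2, "pressure": 1021, "station_note": "n/a"},  # duplicate
]

KEEP = ("humidity", "temperature", "pressure")

def clean(records):
    """Return records without unneeded columns, missing values, bad readings or duplicates."""
    cleaned, seen = [], set()
    for rec in records:
        row = tuple(rec[k] for k in KEEP)      # remove unnecessary columns
        if any(v is None for v in row):        # drop rows with missing values
            continue
        if not 0 <= rec["humidity"] <= 100:    # drop corrupted readings
            continue
        if row in seen:                        # drop exact duplicates
            continue
        seen.add(row)
        cleaned.append(dict(zip(KEEP, row)))
    return cleaned

cleaned_records = clean(raw_records)
print(cleaned_records)
```

Dropping rows is only one option; filling missing values (for example with the column mean) is another common choice when data is scarce.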
13 Machine Learning Process

´ Step 4: Exploratory Data Analysis (EDA)

Data Exploration involves understanding the patterns and trends
in the data. At this stage all the useful insights are drawn and
correlations between the variables are understood.
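
As a small illustration of this step, the sketch below computes summary statistics and the Pearson correlation between two variables. The humidity and rainfall values are invented for the example:

```python
from math import sqrt
from statistics import mean

# Hypothetical observations: humidity (%) and rainfall (mm) on six days.
humidity = [85, 40, 78, 30, 90, 55]
rain_mm = [12.0, 0.0, 8.5, 0.0, 15.0, 1.0]

def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

summary = {"min": min(humidity), "max": max(humidity), "mean": mean(humidity)}
r = pearson(humidity, rain_mm)
print(summary, round(r, 3))
```

A coefficient close to 1 suggests that humidity could be a useful predictor of rainfall in this toy data; it says nothing about causation.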
14 Machine Learning Process

´ Step 5: Building a Machine Learning Model
At this stage a Predictive Model is built by using Machine Learning Algorithms such as Linear Regression, Decision Tree, etc.
´ The Machine Learning model is built by using the training data set.
´ The model is the Machine Learning algorithm that predicts the output by using the data fed to it.

Training Data → Machine Learning Model
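
To make "building a model from training data" concrete, here is a sketch of one of the simplest possible models: a one-level decision tree (a "decision stump") on a single feature. The data and the function names `fit_stump` and `predict` are hypothetical:

```python
# Hypothetical training data: (humidity %, rained? 1/0)
training_data = [(85, 1), (40, 0), (78, 1), (30, 0), (90, 1), (55, 0), (70, 1)]

def fit_stump(data):
    """Pick the humidity threshold that misclassifies the fewest training rows."""
    values = sorted({x for x, _ in data})
    # Candidate thresholds: midpoints between consecutive distinct values.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]

    def errors(t):
        return sum((x > t) != bool(y) for x, y in data)

    return min(candidates, key=errors)

def predict(threshold, humidity):
    """Predict rain (1) when humidity is above the learned threshold."""
    return 1 if humidity > threshold else 0

threshold = fit_stump(training_data)
print("learned threshold:", threshold)
```

The learned threshold is the "model" here: it is the only thing kept after training and the only thing needed to make predictions.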

15 Machine Learning Process

´ Step 6: Model Evaluation & Optimization
The efficiency of the model is evaluated and any further improvements to the model are implemented.
´ The Machine Learning model is evaluated by using the testing data set.
´ The accuracy of the model is calculated.
´ Further improvements to the model are made by using techniques like parameter tuning.
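
A sketch of accuracy calculation and parameter tuning, under assumptions that go beyond the slides: the model is a simple humidity threshold, the data is invented, and the tuning uses a separate validation set so that the final testing accuracy stays an honest estimate.

```python
# Hypothetical held-out data: (humidity %, rained? 1/0)
validation_data = [(82, 1), (45, 0), (66, 1), (58, 0), (61, 1)]
testing_data = [(88, 1), (35, 0), (72, 1), (52, 0), (59, 1)]

def predict(threshold, humidity):
    """Predict rain (1) when humidity is above the threshold."""
    return 1 if humidity > threshold else 0

def accuracy(threshold, data):
    """Fraction of rows for which the prediction matches the true label."""
    correct = sum(predict(threshold, x) == y for x, y in data)
    return correct / len(data)

# Parameter tuning: try a few candidate thresholds on the validation set
# and keep the most accurate one.
candidates = [50.0, 55.0, 60.0, 65.0, 70.0]
scores = {t: accuracy(t, validation_data) for t in candidates}
best_threshold = max(scores, key=scores.get)

# Final evaluation on data that played no part in training or tuning.
test_accuracy = accuracy(best_threshold, testing_data)
print(best_threshold, test_accuracy)
```

Accuracy is only one possible measure; for imbalanced classes, precision and recall are often more informative.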

16 Machine Learning Process

´ Step 7: Predictions
The final outcome is predicted after performing parameter
tuning and improving the accuracy of the model.
17 Types of Machine Learning

´ Supervised Learning is a technique in which we teach or train the machine using data which is well labelled.
18 Types of Machine Learning

´ Unsupervised Learning is the training of a machine using information that is unlabeled, allowing the algorithm to act on that information without guidance.
19 Types of Machine Learning

´ Reinforcement Learning is a part of Machine Learning where an agent is put in an environment and learns to behave in this environment by performing certain actions and observing the rewards which it gets from those actions.
´ e.g., self-driving cars, AlphaGo
20 Types of Machine Learning

Machine Learning
´ Supervised Learning (task-driven): Classification, Regression
´ Unsupervised Learning (data analytics): Association, Clustering
´ Reinforcement Learning (learns by reacting to environment): Reward Based

21 Types of Machine Learning

´ In Supervised learning
´ We are given a training set of (X, f(X)) pairs

nose  | teeth | eyes  | moustache    | f(X)
big   | big   | big   | no moustache | not person
small | small | small | no moustache | person
small | big   | small | moustache    | ?
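
One way to fill in the "?" is a 1-nearest-neighbour rule: copy the label of the most similar training example. This is only a sketch; the slides do not say which algorithm is intended, and with two training examples the answer is not reliable.

```python
# Each example is (nose, teeth, eyes, moustache) -> f(X), taken from the table.
training_set = [
    (("big", "big", "big", "no moustache"), "not person"),
    (("small", "small", "small", "no moustache"), "person"),
]

def hamming(a, b):
    """Number of features on which two examples differ."""
    return sum(u != v for u, v in zip(a, b))

def predict(x):
    """1-nearest-neighbour: return the label of the closest training example."""
    _, label = min(training_set, key=lambda pair: hamming(pair[0], x))
    return label

query = ("small", "big", "small", "moustache")
guess = predict(query)
print(guess)
```

The query differs from the first example on three features and from the second on two, so the rule copies the second example's label.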

22 Types of Machine Learning

´ In Unsupervised learning
´ We are only given the Xs - not the corresponding f(X)

nose  | teeth | eyes  | moustache    | f(X)
big   | big   | big   | no moustache | not given
small | small | small | no moustache | not given
small | big   | small | moustache    | ?

´ No teacher involved
´ Goal: find regularities among the Xs (clustering)
´ Data mining
23 Note on Data Mining

´ Other names:
´ Unsupervised Machine Learning
´ Clustering
´ Knowledge Discovery
´ Example: predict if a customer is likely to purchase certain
goods according to history of shopping activities.
24 Types of Machine Learning

´ In Reinforcement learning
´ We are not given the (X, f(X)) pairs

nose  | teeth | eyes  | moustache | f(X)
small | big   | small | moustache | ?

´ But somehow we are told whether our learned f(X) is right or wrong
´ Goal: maximize the number of right answers
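
A minimal sketch of learning from right/wrong feedback alone. The environment, its hidden labels and the greedy update rule are all assumptions for illustration; real reinforcement learning methods such as Q-learning also handle exploration and delayed rewards.

```python
# Hypothetical environment: it knows the true label but only reveals
# whether a guess was right (+1) or wrong (-1), never the label itself.
TRUE_LABELS = {
    ("small", "big", "small", "moustache"): "person",
    ("big", "big", "big", "no moustache"): "not person",
}
ACTIONS = ("person", "not person")

def reward(x, guess):
    """Feedback signal from the environment for one guess."""
    return 1 if TRUE_LABELS[x] == guess else -1

def learn(xs, episodes=3):
    """Trial and error: accumulate rewards per (example, action) and act greedily."""
    value = {(x, a): 0 for x in xs for a in ACTIONS}
    for _ in range(episodes):
        for x in xs:
            # Greedy choice; ties are broken by the order of ACTIONS.
            guess = max(ACTIONS, key=lambda a: value[(x, a)])
            value[(x, guess)] += reward(x, guess)
    return {x: max(ACTIONS, key=lambda a: value[(x, a)]) for x in xs}

policy = learn(list(TRUE_LABELS))
print(policy)
```

The learner never sees `TRUE_LABELS` directly; it only calls `reward`, which is what distinguishes this setting from supervised learning.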
25 Types of Machine Learning
Aspect | Supervised Learning | Unsupervised Learning | Reinforcement Learning
Definition | The machine learns by using labelled data | The machine is trained on unlabelled data without any guidance | An agent interacts with its environment by producing actions & discovers errors and rewards
Types of problems | Regression & Classification | Association & Clustering | Reward based
Type of data | Labelled data | Unlabelled data | No pre-defined data
Training | External supervision | No supervision | No supervision
Approach | Map labelled input to known output | Understand patterns and discover output | Follow trial and error method
Popular Algorithms | Linear Regression, Logistic Regression, KNN, etc | K-means, C-means, etc | Q-learning, etc
26 Types of Problems
27 Example 0

Real ML applications typically require hundreds, thousands or millions of examples

28 Example 1

´ Problem Statement: To study the House Sales dataset and build a Machine Learning model that predicts the house pricing index.

Algorithm: Linear Regression
Output: Predict the house pricing index
Problem type: Regression
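
A sketch of this kind of regression with a single predictor, fitted by ordinary least squares. The House Sales dataset itself is not available here, so the area and price index values below are invented:

```python
from statistics import mean

# Hypothetical data: living area in square metres vs. a price index.
area = [50, 70, 90, 110, 130]
price_index = [150, 190, 230, 270, 310]

def fit_line(xs, ys):
    """Least-squares intercept and slope for a single predictor."""
    mx, my = mean(xs), mean(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    return intercept, slope

intercept, slope = fit_line(area, price_index)
predicted = intercept + slope * 100  # prediction for a 100 square metre house
print(intercept, slope, predicted)
```

The toy data lies exactly on a line, so the fit is perfect; real housing data would leave residual error.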
29 Example 2

´ Problem Statement: To study a bank credit dataset and make a decision about whether to approve the loan of an applicant based on their profile.

Algorithm: KNN
Output: Approve / Reject
Problem type: Classification
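
A sketch of a k-nearest-neighbours decision on invented applicant profiles. The two features (income and debt), the value k = 3 and the function name `knn_predict` are assumptions:

```python
from collections import Counter
from math import dist

# Hypothetical applicants: (annual income in k$, existing debt in k$) -> decision
applicants = [
    ((80, 10), "Approve"), ((95, 20), "Approve"), ((70, 5), "Approve"),
    ((30, 25), "Reject"), ((25, 40), "Reject"), ((40, 35), "Reject"),
]

def knn_predict(train, query, k=3):
    """Majority vote among the k training points closest to the query."""
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

decision = knn_predict(applicants, (75, 12))
print(decision)
```

Both features happen to be on a similar scale here. With features on very different scales, they would normally be standardised first so that one does not dominate the distance.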
30 Example 3

´ Problem Statement: To cluster a set of movies as either good or average based on their social media outreach.

Algorithm: K-means
Output: Popular / Unpopular
Problem type: Clustering
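
A sketch of K-means on a single feature with k = 2. The outreach scores are invented, and the deterministic choice of starting centroids is an assumption made to keep the example reproducible (K-means is usually started from random points):

```python
from statistics import mean

# Hypothetical social media outreach scores (e.g. thousands of mentions).
outreach = [2, 3, 5, 4, 40, 45, 50, 38]

def kmeans_1d(values, k=2, iterations=10):
    """Plain K-means on one-dimensional data; requires k >= 2."""
    # Deterministic initialisation: spread the starting centroids
    # evenly over the sorted values.
    ordered = sorted(values)
    centroids = [ordered[i * (len(ordered) - 1) // (k - 1)] for i in range(k)]
    for _ in range(iterations):
        # Assignment step: each value joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Update step: move each centroid to the mean of its cluster
        # (keep the old centroid if a cluster happens to be empty).
        centroids = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
    return centroids, clusters

centroids, clusters = kmeans_1d(outreach)
print(centroids, clusters)
```

The algorithm only finds two groups; naming them "Popular" and "Unpopular" is an interpretation added afterwards by looking at the centroids.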
31 Supervised Learning Algorithms

´ Linear Regression
´ Logistic Regression
´ Naïve Bayes Classifier
´ Decision Tree
´ Random Forest
32 Linear Regression
´ Linear Regression is a method to predict a dependent variable (Y) based on the values of independent variables (X). It can be used in cases where we want to predict some continuous quantity.
´ Dependent variable (Y): the response variable whose value needs to be predicted.
´ Independent variable (X): the predictor variable used to predict the response variable.
´ The following equation is used to represent a linear regression model:
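
Assuming the standard single-predictor form (the symbols \beta_0, \beta_1 and \varepsilon are the conventional ones, not taken from the slide):

```latex
Y = \beta_0 + \beta_1 X + \varepsilon
```

Here \beta_0 is the intercept, \beta_1 is the slope (the change in Y for a one-unit change in X) and \varepsilon is the error term.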
33 Linear Regression
34 Supervised Learning Algorithms

´ Linear Regression
´ Logistic Regression
´ Decision Tree
´ Random Forest
´ Naïve Bayes Classifier
35 Logistic Regression

´ Spam Detection: Predicting if an email is spam or not
´ Credit Card Fraud: Predicting if a given credit card transaction is fraudulent or not
´ Health: Predicting if a given mass of tissue is benign or malignant
´ Marketing: Predicting if a given user will buy an insurance product or not
´ Banking: Predicting if a customer will default on a loan
36 Logistic Regression

´ Logistic Regression is a method used to predict a dependent variable, given a set of independent variables, such that the dependent variable is categorical.
´ Logistic Regression is used for classification.
37 Logistic Regression

´ Linear Regression equation:

How can we represent the relationship between p(X) = P(Y=1|X) and X?
´ First, take the exponential of the equation, since the exponential of any value is a positive number.
´ Secondly, a positive number divided by itself + 1 will always be less than 1.
Hence, the formula:
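
Assuming the linear regression equation has the usual single-predictor form \beta_0 + \beta_1 X (the symbols are conventional, not taken from the slide), the two steps can be written out as:

```latex
\beta_0 + \beta_1 X \in (-\infty, \infty)
\;\Rightarrow\; e^{\beta_0 + \beta_1 X} > 0
\;\Rightarrow\; p(X) = \frac{e^{\beta_0 + \beta_1 X}}{e^{\beta_0 + \beta_1 X} + 1} \in (0, 1)
```

The result always lies strictly between 0 and 1, so it can be read as the probability P(Y=1|X).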
38 Logistic Regression
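
A sketch of how such a formula turns a linear score into a class label. The coefficients `beta0` and `beta1` and the 0.5 cutoff are hypothetical; in practice the coefficients are estimated from training data.

```python
from math import exp

def logistic(z):
    """Map any real number z to a probability strictly between 0 and 1."""
    return exp(z) / (exp(z) + 1)

def predict_proba(x, beta0, beta1):
    """Estimated P(Y=1 | X=x) for a single predictor."""
    return logistic(beta0 + beta1 * x)

def classify(x, beta0, beta1, cutoff=0.5):
    """Assign class 1 when the estimated probability reaches the cutoff."""
    return 1 if predict_proba(x, beta0, beta1) >= cutoff else 0

# Hypothetical coefficients for illustration only.
beta0, beta1 = -4.0, 0.1

for x in (10, 40, 80):
    print(x, round(predict_proba(x, beta0, beta1), 3), classify(x, beta0, beta1))
```

For very large positive scores `exp(z)` overflows; production code typically uses a numerically stable variant of the same function.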
39 The End
