Course Overview and Introduction
CE-717: Machine Learning
Sharif University of Technology
M. Soleymani
Fall 2016
Course Info
Instructor: Mahdieh Soleymani
Email: soleymani@sharif.edu
Lectures: Sun-Tue (13:30-15:00)
Website: http://ce.sharif.edu/cources/95-96/1/ce717-2
Textbooks
Pattern Recognition and Machine Learning, C. Bishop, Springer, 2006.
Machine Learning, T. Mitchell, McGraw-Hill, 1997.
Additional readings will be made available when appropriate.
Other books:
The Elements of Statistical Learning, T. Hastie, R. Tibshirani, J. Friedman, Springer, Second Edition, 2008.
Machine Learning: A Probabilistic Perspective, K. Murphy, MIT Press, 2012.
Marking Scheme
Midterm Exam: 25%
Final Exam: 30%
Project: 5-10%
Homework (written & programming): 20-25%
Mini-exams: 15%
Machine Learning (ML) and Artificial Intelligence (AI)
ML first appeared as a branch of AI
ML is now also a preferred approach to other subareas of AI:
Computer Vision, Speech Recognition, …
Robotics
Natural Language Processing
ML is a strong driver in Computer Vision and NLP
A Definition of ML
Tom Mitchell (1998): Well-posed learning problem
“A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.”
Using the observed data to make better decisions
Generalizing from the observed data
ML Definition: Example
Consider an email program that learns how to filter spam
according to emails you do or do not mark as spam.
T: Classifying emails as spam or not spam.
E: Watching you label emails as spam or not spam.
P: The number (or fraction) of emails correctly classified as
spam/not spam.
The essence of machine learning
A pattern exists
We do not know it mathematically
We have data on it
Example: Home Price
Housing price prediction
[Plot: housing prices, price in $1000's (y-axis) vs. size in feet² (x-axis)]
Figure adopted from slides of Andrew Ng, Machine Learning course, Stanford.
Example: Bank loan
Input: the applicant's form
Output: approving or denying the request
Components of (Supervised) Learning
Unknown target function: $f: \mathcal{X} \rightarrow \mathcal{Y}$
Input space: $\mathcal{X}$
Output space: $\mathcal{Y}$
Training data: $(\boldsymbol{x}_1, y_1), (\boldsymbol{x}_2, y_2), \ldots, (\boldsymbol{x}_N, y_N)$
Pick a formula $g: \mathcal{X} \rightarrow \mathcal{Y}$ that approximates the target function $f$
selected from a set of hypotheses $\mathcal{H}$
Training data: Example
Training data:

 x1   x2    y
0.9  2.3    1
3.5  2.6    1
2.6  3.3    1
2.7  4.1    1
1.8  3.9    1
6.5  6.8   -1
7.2  7.5   -1
7.9  8.3   -1
6.9  8.3   -1
8.8  7.9   -1
9.1  6.2   -1

[Scatter plot: the same points in the $(x_1, x_2)$ plane, one class per marker]
Components of (Supervised) Learning
[Diagram: learning model]
Solution Components
The learning model is composed of:
Learning algorithm
Hypothesis set
Perceptron example
Perceptron classifier
Input: $\boldsymbol{x} = (x_1, \ldots, x_d)$
Classifier:
If $\sum_{i=1}^{d} w_i x_i > \text{threshold}$, output 1; else output $-1$
The linear formula $g \in \mathcal{H}$ can be written:
$g(\boldsymbol{x}) = \operatorname{sign}\left(\left(\sum_{i=1}^{d} w_i x_i\right) + w_0\right)$, where $w_0 = -\text{threshold}$
If we add a coordinate $x_0 = 1$ to the input:
$g(\boldsymbol{x}) = \operatorname{sign}\left(\sum_{i=0}^{d} w_i x_i\right)$
Vector form: $g(\boldsymbol{x}) = \operatorname{sign}(\boldsymbol{w}^T \boldsymbol{x})$
[Figure: a line separating the two classes in the $(x_1, x_2)$ plane]
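A minimal sketch of this classifier in Python (NumPy assumed; the function and variable names are illustrative, not from the slides):

```python
import numpy as np

def predict(w, x):
    """Perceptron output g(x) = sign(w^T x).

    w: weight vector of length d+1, with w[0] playing the role of w0
    x: input vector of length d (the x0 = 1 coordinate is added below)
    """
    x_aug = np.concatenate(([1.0], x))   # prepend the constant coordinate x0 = 1
    return 1 if w @ x_aug > 0 else -1    # ties (exactly zero) mapped to -1 here
```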
Perceptron learning algorithm:
linearly separable data
Given the training data $(\boldsymbol{x}^{(1)}, y^{(1)}), \ldots, (\boldsymbol{x}^{(N)}, y^{(N)})$
A data point $(\boldsymbol{x}^{(n)}, y^{(n)})$ is misclassified when:
$\operatorname{sign}(\boldsymbol{w}^T \boldsymbol{x}^{(n)}) \neq y^{(n)}$
Repeat
Pick a misclassified data point $(\boldsymbol{x}^{(n)}, y^{(n)})$ from the training data and update $\boldsymbol{w}$:
$\boldsymbol{w} \leftarrow \boldsymbol{w} + y^{(n)} \boldsymbol{x}^{(n)}$
Until all training data points are correctly classified by $g$
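A sketch of this loop in Python (assumes the data are linearly separable, otherwise it never terminates; the vectorized misclassification check is an implementation choice, not part of the slide):

```python
import numpy as np

def perceptron_learn(X, y):
    """Perceptron learning algorithm (PLA).

    X: (N, d+1) array of inputs, each row already augmented with x0 = 1
    y: (N,) array of labels in {+1, -1}
    Returns w such that sign(X @ w) matches y on every training point.
    """
    w = np.zeros(X.shape[1])
    while True:
        preds = np.where(X @ w > 0, 1, -1)   # current outputs g(x)
        wrong = np.flatnonzero(preds != y)   # indices of misclassified points
        if wrong.size == 0:                  # all points correct: stop
            return w
        n = wrong[0]                         # pick a misclassified point
        w = w + y[n] * X[n]                  # the update w <- w + y(n) x(n)
```

On the linearly separable table shown earlier, X would be the 11×3 matrix with rows (1, x1, x2) and y the column of ±1 labels.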
Perceptron learning algorithm:
Example of weight update
[Figures: the decision boundary in the $(x_1, x_2)$ plane before and after one weight update]
Experience (E) in ML
Basic premise of learning:
“Using a set of observations to uncover an underlying
process”
The different paradigms of ML differ in the type of observations available and how they are obtained.
Paradigms of ML
Supervised learning (regression, classification)
predicting a target variable for which we get to see examples.
Unsupervised learning
revealing structure in the observed data
Reinforcement learning
partial (indirect) feedback, no explicit guidance
Given rewards for a sequence of moves, it learns a policy and utility functions
Other paradigms: semi-supervised learning, active learning,
online learning, etc.
Supervised Learning:
Regression vs. Classification
Supervised Learning
Regression: predict a continuous target variable
E.g., $y \in [0,1]$
Classification: predict a discrete target variable
E.g., $y \in \{1, 2, \ldots, C\}$
Data in Supervised Learning
Data are usually considered as vectors in a $d$-dimensional space
For now, we make this assumption for illustrative purposes
We will see that it is not necessary
[Table: rows are samples 1…n; columns are features $x_1, x_2, \ldots, x_d$ plus the target $y$]
Columns: features/attributes/dimensions
Rows: data/points/instances/examples/samples
$y$ column: target/outcome/response/label
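A minimal sketch of this layout (the feature and target values below are hypothetical):

```python
import numpy as np

# Feature matrix X: one row per sample, one column per feature (here d = 2,
# invented house sizes in ft^2 and numbers of rooms).
X = np.array([[1000.0, 3],
              [1500.0, 4],
              [2000.0, 5]])

# Target column y: one outcome per sample (price in $1000's, invented values).
y = np.array([200.0, 300.0, 400.0])

assert X.shape[0] == y.shape[0]   # one target per row of X
```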
Regression: Example
Housing price prediction
[Plot: housing prices, price in $1000's (y-axis) vs. size in feet² (x-axis)]
Figure adopted from slides of Andrew Ng
Classification: Example
[Plots: classifying animals as cat (0) or dog (1) from their weight]
Supervised Learning vs. Unsupervised
Learning
Supervised learning
Given: training set
a labeled set of $N$ input-output pairs $D = \{(\boldsymbol{x}^{(i)}, y^{(i)})\}_{i=1}^{N}$
Goal: learn a mapping from $\boldsymbol{x}$ to $y$
Unsupervised learning
Given: training set $\{\boldsymbol{x}^{(i)}\}_{i=1}^{N}$
Goal: find groups or structures in the data (see the sketch below)
Discover the intrinsic structure in the data
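The slides leave the method unspecified here; as one illustrative sketch (not from the slides), k-means clustering finds $k$ groups by alternating point assignment and centroid updates:

```python
import numpy as np

def kmeans(X, k, iters=100, seed=0):
    """A minimal k-means sketch: group the rows of X into k clusters."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]  # init from the data
    for _ in range(iters):
        # Assign each point to its nearest center.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each center to the mean of its assigned points.
        new_centers = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                else centers[j] for j in range(k)])
        if np.allclose(new_centers, centers):   # converged
            break
        centers = new_centers
    return labels, centers
```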
Supervised Learning: Samples
[Scatter plot: labeled samples from two classes in the $(x_1, x_2)$ plane (classification)]
Unsupervised Learning: Samples
[Scatter plot: unlabeled samples in the $(x_1, x_2)$ plane forming three groups, Type I-III (clustering)]
Sample Data in Unsupervised Learning
Unsupervised Learning:
[Table: rows are samples 1…n; columns are features $x_1, x_2, \ldots, x_d$; no target column]
Columns: features/attributes/dimensions
Rows: data/points/instances/examples/samples
Unsupervised Learning: Example
Applications
Clustering documents based on their similarities
Grouping news stories on the Google News site
Market segmentation: group customers into different
market segments given a database of customer data.
Social network analysis
Reinforcement Learning
Provides only an indication as to whether an action is
correct or not
Data in supervised learning:
(input, correct output)
Data in Reinforcement Learning:
(input, some output, a grade of reward for this output)
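A toy sketch of this kind of feedback (a hypothetical two-armed bandit; the reward probabilities below are invented for illustration):

```python
import random

# True (hidden) reward probabilities of two actions; the learner never sees these.
TRUE_P = [0.3, 0.7]

def pull(action):
    """Environment: returns a reward grade (1 or 0), not the correct answer."""
    return 1 if random.random() < TRUE_P[action] else 0

# Epsilon-greedy learner: estimates each action's value from observed rewards.
counts, values = [0, 0], [0.0, 0.0]
for t in range(1000):
    a = random.randrange(2) if random.random() < 0.1 else values.index(max(values))
    r = pull(a)                               # (input, chosen output, reward grade)
    counts[a] += 1
    values[a] += (r - values[a]) / counts[a]  # running average of observed rewards

print(f"estimated values: {values}")          # should approach TRUE_P
```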
Reinforcement Learning
Typically, we need to make a sequence of decisions
It is usually assumed that reward signals refer to the entire sequence
Is learning feasible?
Learning an unknown function is impossible.
The function can assume any value outside the data we have.
However, it is feasible in a probabilistic sense.
Generalization
We don’t intend to memorize data but need to figure out
the pattern.
A core objective of learning is to generalize from the
experience.
Generalization: the ability of a learning algorithm to perform accurately on new, unseen examples after learning from a finite training set.
Components of (Supervised) Learning
[Diagram: learning model]
Main Steps of Learning Tasks
Selection of hypothesis set (or model specification)
Which class of models (mappings) should we use for our data?
Learning: find mapping 𝑓 (from hypothesis set) based on the
training data
Which notion of error should we use? (loss functions)
Optimization of loss function to find mapping 𝑓
Evaluation: how well 𝑓 generalizes to yet unseen examples
How do we ensure that the error on future data is minimized?
(generalization)
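A compact sketch of these steps on toy data (the values are hypothetical; least squares on a line is used only as one example of a hypothesis set and loss):

```python
import numpy as np

# Toy data: size in ft^2 -> price in $1000's (values invented for illustration).
X = np.array([500.0, 1000.0, 1500.0, 2000.0])
y = np.array([110.0, 195.0, 310.0, 405.0])

# Step 1 (hypothesis set): lines g(x) = w1*x + w0, encoded via a design matrix.
A = np.stack([X, np.ones_like(X)], axis=1)

# Step 2 (learning): minimize squared loss on a training split.
train, test = slice(0, 3), slice(3, 4)
w, *_ = np.linalg.lstsq(A[train], y[train], rcond=None)

# Step 3 (evaluation): error on the held-out point estimates generalization.
test_error = (((A[test] @ w) - y[test]) ** 2).item()
print(w, test_error)
```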
Some Learning Applications
Face, speech, handwritten character recognition
Document classification and ranking in web search
engines
Photo tagging
Self-customizing programs (recommender systems)
Database mining (e.g., medical records)
Market prediction (e.g., stock/house prices)
Computational biology (e.g., annotation of biological
sequences)
Autonomous vehicles
ML in Computer Science
Why are ML applications growing?
Improved machine learning algorithms
Availability of data (increased data capture, networking, etc.)
Demand for self-customization to user or environment
Software too complex to write by hand
Handwritten Digit Recognition Example
Data: labeled samples
[Images: sample handwritten digits labeled 0-9]
Example: Input representation
Example: Illustration of features
Example: Classification boundary
Main Topics of the Course
Supervised learning (most of the lectures are on this topic)
Regression
Classification (our main focus)
Learning theory
Unsupervised learning
Reinforcement learning
Some advanced topics & applications
Resource
Yaser S. Abu-Mostafa, Malik Magdon-Ismail, and Hsuan-Tien Lin, “Learning from Data”, AMLBook, 2012.