Fundamentals of Machine Learning
Course 4232: Machine Learning
Dept. of Computer Science
Faculty of Science and Technology
Lecture No: 1 Week No: 1 Semester: Fall 23-24
Instructor: Md Saef Ullah Miah (saef@aiub.edu)
What is learning?
◼ “Learning is any process by which a system improves performance
from experience.” –Herbert Simon
◼ “Learning is constructing or modifying representations of what is
being experienced.”
–Ryszard Michalski
◼ “Learning is making useful changes in our minds.” –Marvin Minsky
What Is Machine Learning (ML)?
Why “Learn”?
◼ Machine learning is programming computers to optimize a
performance criterion using example data or past experience.
◼ There is no need to “learn” to calculate payroll.
◼ Learning is used when:
Human expertise does not exist (navigating on Mars)
Humans are unable to explain their expertise (speech recognition)
The solution changes over time (routing on a computer network)
The solution needs to be adapted to particular cases (user biometrics)
Why learn?
◼ Build software agents that can adapt to their users or to other
software agents or to changing environments
Personalized news or mail filter
Personalized tutoring
Mars robot
◼ Develop systems that are too difficult/expensive to construct
manually because they require specific detailed skills or knowledge
tuned to a specific task
Large, complex AI systems cannot be completely derived by
hand and require dynamic updating to incorporate new
information.
◼ Discover new things or structures that were previously unknown to
humans
Examples: data mining, scientific discovery
Related Disciplines
The following disciplines are closely related:
Artificial Intelligence
◼ Machine learning deals with the learning part of AI
Pattern Recognition
◼ Concentrates more on “tools” than on theory
Data Mining
◼ More specific about discovery
The following are useful to machine learning techniques or may offer insights:
Probability and Statistics
Information theory
Psychology (developmental, cognitive)
Neurobiology
Linguistics
Philosophy
Data Mining
◼ Retail: Market basket analysis, Customer relationship
management (CRM)
◼ Finance: Credit scoring, fraud detection
◼ Manufacturing: Control, robotics, troubleshooting
◼ Medicine: Medical diagnosis
◼ Telecommunications: Spam filters, intrusion detection
◼ Bioinformatics: Motifs, alignment
◼ Web mining: Search engines
◼ ...
History of Machine Learning
◼ 1950s:
Samuel’s checker player
◼ 1960s:
Neural networks: Perceptron
Minsky and Papert prove limitations of the
Perceptron
◼ 1970s:
Expert systems and the knowledge acquisition
bottleneck
Mathematical discovery with AM
Symbolic concept induction
History of Machine Learning (cont.)
◼ 1980s:
Resurgence of neural networks (connectionism,
backpropagation)
Advanced decision tree and rule learning
Learning, planning and problem solving
Utility theory
Analogy
◼ 1990s:
Data mining
Reinforcement learning (RL)
Inductive Logic Programming (ILP)
Ensembles: Bagging, Boosting, and Stacking
History of Machine Learning (cont.)
◼ 2000s:
Kernel methods
◼ Support vector machines
Graphical models
Statistical relational learning
Transfer learning
◼ Applications
Adaptive software agents and web applications
Learning in robotics and vision
E-mail management (spam detection)
…
What is Machine Learning ?
◼ A computer program M is said to learn from experience E with
respect to some class of tasks T and performance measure P, if its
performance on tasks in T in an environment Z, as measured by P,
improves with experience E.
◼ Example:
T: Cancer diagnosis
E: A set of diagnosed cases
P: Accuracy of diagnosis on new cases
Z: Noisy measurements, occasionally misdiagnosed training cases
M: A program that runs on a general purpose computer; the
learner
Why Machine Learning ?
◼ Solving tasks that require a system to be adaptive
Speech, face, or handwriting recognition
Environment changes over time
◼ Understanding human and animal learning
How do we learn a new language ? Recognize people ?
◼ Some tasks are best taught by demonstration
Driving a car, or, landing an airplane
◼ Objective of Real Artificial Intelligence:
“If an intelligent system–brilliantly designed, engineered and
implemented– cannot learn not to repeat its mistakes, it is not as
intelligent as a worm or a sea anemone or a kitten.” (Oliver
Selfridge)
Kinds of Learning
◼ Based on the information available
Association
Supervised Learning
◼ Classification
◼ Regression
Reinforcement Learning
Unsupervised Learning
Semi-supervised learning
◼ Based on the role of the learner
Passive Learning
Active Learning
Major paradigms of machine learning
◼ Rote learning – “Learning by memorization.”
Employed by the first machine learning systems in the 1950s
◼ Samuel’s Checkers program
◼ Supervised learning – Use specific examples to reach general conclusions or
extract general rules
◼ Classification (Concept learning)
◼ Regression
◼ Unsupervised learning (Clustering) – Unsupervised identification of natural
groups in data
◼ Reinforcement learning – Feedback (positive or negative reward) given at the end
of a sequence of steps
◼ Analogy – Determine correspondence between two different representations
◼ Discovery – Unsupervised, specific goal not given
◼ …
Rote Learning is Limited
◼ Memorize I/O pairs and perform exact matching with new
inputs
◼ If a computer has not seen the precise case before, it cannot
apply its experience
◼ We want computers to “generalize” from prior experience
Generalization is the most important factor in learning
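To make the limitation concrete, here is a minimal sketch (a toy example, not from the slides) of rote learning as an exact-match lookup table:

```python
# Rote learning as a lookup table: the "learner" memorizes I/O pairs
# verbatim and can only answer queries it has literally seen before.
memory = {}

def rote_learn(x, y):
    memory[x] = y            # memorize the exact input/output pair

def rote_predict(x):
    return memory.get(x)     # exact matching only; no generalization

rote_learn((2, 3), "spam")
print(rote_predict((2, 3)))  # 'spam' -- this exact case was seen
print(rote_predict((2, 4)))  # None   -- unseen case, no answer
```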
The inductive learning problem
◼ Extrapolate from a given set of examples to make accurate
predictions about future examples
◼ Supervised versus unsupervised learning
Learn an unknown function f(X) = Y, where X is an input
example and Y is the desired output.
Supervised learning implies we are given a training set
of (X, Y) pairs by a “teacher”
Unsupervised learning means we are only given the Xs.
Semi-supervised learning: mostly unlabelled data
Learning Associations
◼ Basket analysis:
P(Y | X): the probability that somebody who buys X also buys Y,
where X and Y are products/services.
Example: P(sugar | tea) = 0.7
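As an illustration, P(Y | X) can be estimated by counting over transactions; the baskets below are invented toy data:

```python
# Estimating the association P(sugar | tea) from transaction data.
baskets = [
    {"tea", "sugar"}, {"tea", "sugar", "milk"}, {"tea"},
    {"coffee", "sugar"}, {"tea", "sugar"}, {"tea", "lemon"},
]

tea = sum(1 for b in baskets if "tea" in b)
tea_and_sugar = sum(1 for b in baskets if {"tea", "sugar"} <= b)

# P(sugar | tea) = count(tea and sugar) / count(tea)
print(tea_and_sugar / tea)  # 0.6 on this toy data
```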
Supervised Learning
◼ Training experience: a set of labeled examples of the form
< x1, x2, …, xn, y >
◼ where xj are values for input variables and y is the output
◼ This implies the existence of a “teacher” who knows the
right answers
◼ What to learn: A function f : X1 × X2 × … × Xn → Y , which
maps the input variables into the output domain
◼ Goal: minimize the error (loss function) on the test
examples
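A miniature version of this setup (one input variable, hypothetical data): pick, from a small hypothesis space of threshold rules, the one that minimizes 0/1 loss on the labeled examples; the chosen rule would then be judged on separate test examples.

```python
# A supervised learner in miniature: hypothesis space of threshold
# rules "predict 1 if x > t"; choose the t with the smallest 0/1 loss
# on the labeled training pairs (toy values below).
train_data = [(1.0, 0), (2.0, 0), (2.5, 0), (3.0, 1), (4.0, 1)]

def loss(t):
    return sum((x > t) != y for x, y in train_data)

best_t = min((x for x, _ in train_data), key=loss)
print(best_t, loss(best_t))  # 2.5 0 -- separates this toy data perfectly
```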
Types of supervised learning
[Figure: fruit plotted by x1 = size and x2 = color, with two labeled groups, Tangerines and Oranges]
a) Classification:
• We are given the labels of the training objects: {(x1, x2, y = T/O)}
• We are interested in classifying future objects (x1’, x2’) with the
correct label, i.e., find y’ for a given (x1’, x2’).
[Figure: the same feature space, with groups labeled Tangerines and Not Tangerines]
b) Concept Learning:
• We are given positive and negative samples for the concept we want
to learn (e.g., Tangerine): {(x1, x2, y = +/-)}
• We are interested in classifying future objects as members of the
class (positive examples of the concept) or not, i.e., answer +/- for
a given (x1’, x2’).
Types of Supervised Learning
◼ Regression
The target function is continuous rather than a class membership.
For example, you have the selling prices of houses as their size
(sq-mt) varies in a particular location. You may hypothesize that the
prices are governed by a particular function f(x). Once you have a
function that “explains” this relationship, you can guess the value
of a given house from its size. The learning here is the selection of
this function f(). Note that the problem becomes more meaningful and
challenging if you imagine several input parameters, resulting in a
multi-dimensional input space.
[Figure: y = price plotted against x = size (sq-mt), with sample
houses at sizes 60, 70, 90, 120, 150 and a fitted curve f(x)]
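A minimal least-squares sketch of this idea, restricted to a line f(x) = a·x + b; the sizes come from the figure, while the prices are invented for illustration:

```python
# Fit f(x) = a*x + b to (size, price) pairs by least squares.
sizes  = [60, 70, 90, 120, 150]
prices = [110, 125, 160, 210, 265]   # hypothetical, in thousands

n = len(sizes)
mx = sum(sizes) / n                  # mean size
my = sum(prices) / n                 # mean price
num = sum((x - mx) * (y - my) for x, y in zip(sizes, prices))
den = sum((x - mx) ** 2 for x in sizes)
a = num / den                        # slope
b = my - a * mx                      # intercept

print(f"f(x) = {a:.2f}*x + {b:.2f}")
print("predicted price for 100 sq-mt:", round(a * 100 + b, 1))
```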
Classification
◼ Example: Credit scoring
◼ Differentiating between
low-risk and high-risk
customers from their
income and savings
Discriminant: IF income > θ1 AND savings > θ2
THEN low-risk ELSE high-risk
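The discriminant above, written as code; the threshold values θ1 and θ2 are placeholders standing in for values a learner would estimate from data:

```python
# The slide's rule-based discriminant for credit scoring.
THETA1, THETA2 = 30_000, 5_000       # hypothetical learned thresholds

def credit_risk(income, savings):
    if income > THETA1 and savings > THETA2:
        return "low-risk"
    return "high-risk"

print(credit_risk(45_000, 8_000))    # low-risk
print(credit_risk(45_000, 2_000))    # high-risk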
Classification: Applications
◼ Pattern Recognition
◼ Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair
style
◼ Character recognition: Different handwriting styles.
◼ Speech recognition: Temporal dependency.
Use of a dictionary or the syntax of the language.
Sensor fusion: Combine multiple modalities, e.g., visual (lip image) and acoustic
signals for speech
◼ Medical diagnosis: From symptoms to illnesses
◼ Biometrics: Recognition/authentication using physical and/or behavioral
characteristics: Face, iris, signature, etc
Face Recognition
[Figure: training examples of one person and test images, from the ORL
face dataset, AT&T Laboratories, Cambridge UK]
Supervised Learning: Uses
◼ Prediction of future cases: Use the rule or model to predict the
output for future inputs
◼ Knowledge extraction: The rule is easy to understand
◼ Compression: The rule is simpler than the data it explains
◼ Outlier detection: Exceptions that are not covered by the rule,
e.g., fraud
Unsupervised Learning
◼ Learning “what normally happens”
◼ Training experience: no output, unlabeled data
◼ Clustering: Grouping similar instances
◼ Example applications
Customer segmentation in CRM
Image compression: Color quantization
Bioinformatics: Learning motifs
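A minimal k-means sketch (k = 2, one feature, made-up points) showing how similar instances are grouped without any labels:

```python
# k-means with k=2 on one feature: alternate between assigning points
# to the nearest center and recomputing the centers.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9]     # toy, unlabeled data
c = [points[0], points[3]]                   # init centers from data

for _ in range(10):                          # a few refinement rounds
    clusters = [[], []]
    for p in points:                         # assign to nearest center
        clusters[abs(p - c[0]) > abs(p - c[1])].append(p)
    c = [sum(cl) / len(cl) for cl in clusters]   # recompute centers

print(c)  # roughly [1.0, 5.07]: two natural groups found
```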
Reinforcement Learning
◼ Training experience: interaction with an environment; learning agent
receives a numerical reward
Learning to play chess: moves are rewarded if they lead to WIN, else
penalized
No supervised output but delayed reward
◼ What to learn: a way of behaving that is highly rewarding in the long run,
i.e., learning a policy: a sequence of outputs
◼ Goal: estimate and maximize the long-term cumulative reward
◼ Credit assignment problem
◼ Robot in a maze, game playing
◼ Multiple agents, partial observability, ...
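A tiny Q-learning sketch (states, rewards, and parameters all invented for illustration): a 5-state corridor with reward only at the far end, so the delayed reward must propagate backwards through the value estimates (the credit assignment problem):

```python
import random

# 5-state corridor; reward 1.0 only on reaching the rightmost state.
N_STATES, ALPHA, GAMMA = 5, 0.5, 0.9
Q = {(s, a): 0.0 for s in range(N_STATES) for a in (-1, +1)}

for _ in range(500):                       # episodes
    s = 0
    while s != N_STATES - 1:
        a = random.choice((-1, +1))        # explore randomly
        s2 = min(max(s + a, 0), N_STATES - 1)
        r = 1.0 if s2 == N_STATES - 1 else 0.0
        # Q-learning update: move Q(s,a) toward r + gamma*max_a' Q(s',a')
        best_next = max(Q[(s2, -1)], Q[(s2, +1)])
        Q[(s, a)] += ALPHA * (r + GAMMA * best_next - Q[(s, a)])
        s = s2

# State values rise toward the goal: roughly [0.73, 0.81, 0.9, 1.0, 0.0]
print([round(max(Q[(s, -1)], Q[(s, +1)]), 2) for s in range(N_STATES)])
```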
Passive Learning and Active Learning
◼ Traditionally, learning algorithms have been passive learners, which
take a given batch of data and process it to produce a hypothesis or
a model
◼ Data → Learner → Model
◼ Active learners are instead allowed to query the environment
Ask questions
Perform experiments
◼ Open issues: how to query the environment optimally? how to
account for the cost of queries?
Learning: Key Steps
• data and assumptions
– what data is available for the learning task?
– what can we assume about the problem?
• representation
– how should we represent the examples to be classified?
• method and estimation
– what are the possible hypotheses?
– what learning algorithm to use to infer the most likely hypothesis?
– how do we adjust our predictions based on the feedback?
• evaluation
– how well are we doing?
Evaluation of Learning Systems
◼ Experimental
Conduct controlled cross-validation experiments to compare
various methods on a variety of benchmark datasets.
Gather data on their performance, e.g., test accuracy,
training time, testing time, …
Analyze differences for statistical significance.
◼ Theoretical
Analyze algorithms mathematically and prove theorems about
their:
◼ Computational complexity
◼ Ability to fit training data
◼ Sample complexity (number of training examples needed to
learn an accurate function)
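A k-fold cross-validation skeleton of the kind used in such experiments; `train` and `evaluate` are placeholders for whatever learner and metric are being compared:

```python
from collections import Counter

def cross_validate(data, train, evaluate, k=5):
    scores = []
    for i in range(k):
        test = data[i::k]                      # held-out fold
        rest = [x for j, x in enumerate(data) if j % k != i]
        model = train(rest)                    # fit on remaining folds
        scores.append(evaluate(model, test))
    return sum(scores) / k                     # mean score across folds

# Toy usage: a "learner" that always predicts the majority training label.
data = [(x, x >= 7) for x in range(10)]
train = lambda d: Counter(y for _, y in d).most_common(1)[0][0]
evaluate = lambda m, t: sum(y == m for _, y in t) / len(t)
print(cross_validate(data, train, evaluate))   # 0.7 on this toy data
```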
Measuring Performance
Performance of the learner can be measured in one of the
following ways, as suitable for the application:
Classification Accuracy
◼ Number of mistakes
◼ Mean Squared Error
◼ Loss functions
Solution quality (length, efficiency)
Speed of performance
…
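For concreteness, two of these measures computed on hypothetical predictions:

```python
# Classification accuracy on made-up labels.
y_true = [1, 0, 1, 1, 0]
y_pred = [1, 0, 0, 1, 0]
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Mean squared error on made-up continuous targets.
f_true = [2.0, 3.5, 5.0]
f_pred = [2.5, 3.0, 5.5]
mse = sum((t - p) ** 2 for t, p in zip(f_true, f_pred)) / len(f_true)

print(accuracy, mse)  # 0.8 and 0.25
```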
Textbook / Reference Materials
1. Introduction to Machine Learning (MIT Press) by Ethem Alpaydin