ENSEMBLE METHODS: INCREASING THE CLASSIFICATION ACCURACY
Ensemble methods
An ensemble for classification is a composite model, made up of a combination of classifiers.
Use a combination of models to increase accuracy
Popular ensemble methods:
Bagging: averaging the prediction over a collection of classifiers
Boosting: weighted vote with a collection of classifiers
Random Forests: a collection of decision trees
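As a concrete point of reference, the sketch below shows how these three popular ensemble styles can be instantiated with scikit-learn. The library choice, parameter values, and the synthetic data set are assumptions for illustration, not part of these notes.

# Hedged sketch: the three popular ensemble methods, instantiated with scikit-learn.
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier, AdaBoostClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

models = {
    "bagging": BaggingClassifier(n_estimators=25, random_state=0),             # vote/average over bootstrapped base classifiers
    "boosting": AdaBoostClassifier(n_estimators=25, random_state=0),           # weighted vote of sequentially learned classifiers
    "random forest": RandomForestClassifier(n_estimators=25, random_state=0),  # decision trees split on random attribute subsets
}

for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name}: mean cross-validated accuracy = {scores.mean():.3f}")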
ENSEMBLE METHODS
It combines a series of k learned models, M1, M2, ..., Mk, with the aim of creating an improved model M*.
A given data set, D, is used to create k training sets, D1, D2, ..., Dk, where Di (1 ≤ i ≤ k) is used to generate classifier Mi.
Given a new data tuple to classify, the base classifiers each vote by returning a class prediction.
The ensemble returns a class prediction based on the votes of the base classifiers.
An ensemble tends to be more accurate than its base classifiers.
BAGGING: BOOTSTRAP AGGREGATION
Analogy: Diagnosis based on multiple doctors’ majority vote
Training
Given a set D of d tuples, at each iteration i, a training set Di of d tuples is sampled with replacement from D (i.e., a bootstrap sample)
A classifier model Mi is learned for each training set Di
Classification: classify an unknown sample X
Each classifier Mi returns its class prediction
The bagged classifier M* counts the votes and assigns the class
with the most votes to X
Prediction: bagging can also be applied to the prediction of continuous values by taking the average of the individual predictions for a given test tuple
Accuracy
Often significantly better than a single classifier derived from D
For noisy data: not considerably worse, and more robust
Proven to give improved accuracy in prediction
ALGORITHM - BAGGING
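A minimal sketch of the bagging procedure described above, assuming numpy arrays for the data and scikit-learn decision trees as the base classifiers (choices made here for illustration; the slides do not fix a base learner):

# Bagging sketch: k bootstrap samples, k base classifiers, majority vote at prediction time.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def bagging_fit(X, y, k=10, seed=0):
    """Learn k base classifiers, each on a bootstrap sample of the d training tuples."""
    rng = np.random.default_rng(seed)
    d = len(X)
    models = []
    for _ in range(k):
        idx = rng.choice(d, size=d, replace=True)          # sample d tuples with replacement (bootstrap)
        models.append(DecisionTreeClassifier().fit(X[idx], y[idx]))
    return models

def bagging_predict(models, X):
    """Majority vote over the base classifiers (assumes integer class labels 0..C-1)."""
    votes = np.array([m.predict(X) for m in models])       # shape (k, n_test)
    # For prediction of continuous values, one would return votes.mean(axis=0) instead.
    return np.array([np.bincount(col).argmax() for col in votes.T])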
BOOSTING
Analogy: Consult several doctors and combine their weighted diagnoses, where each doctor's weight is based on the accuracy of their previous diagnoses
How does boosting work?
Weights are assigned to each training tuple
A series of k classifiers is iteratively learned
After a classifier Mi is learned, the weights are updated to allow the subsequent classifier, Mi+1, to pay more attention to the training tuples that were misclassified by Mi
The final M* combines the votes of each individual
classifier, where the weight of each classifier's vote is a
function of its accuracy
The basic idea is that when we build a classifier, we want it
to focus more on the misclassified tuples of the previous
round.
Some classifiers may be better at classifying some
“difficult” tuples than others.
In this way, we build a series of classifiers that
complement each other.
ADAPTIVE BOOSTING (AdaBoost)
Suppose we are given D, a data set of d class-labeled tuples, (X1, y1), (X2, y2), ..., (Xd, yd), where yi is the class label of tuple Xi.
Initially, AdaBoost assigns each training tuple an equal weight of
1/d.
Generating k classifiers for the ensemble requires k rounds
through the rest of the algorithm.
In round i, the tuples from D are sampled to form a training set, Di, of size d.
Sampling with replacement is used: the same tuple may be selected more than once.
A classifier model, Mi, is derived from the training tuples of Di.
If a tuple was incorrectly classified, its weight is increased.
If a tuple was correctly classified, its weight is decreased.
These weights will be used to generate the training samples for
the classifier of the next round.
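As a small sketch of the sampling step just described (X and y are assumed to be the full training set D as numpy arrays; the names are illustrative, not from the notes):

import numpy as np

rng = np.random.default_rng(0)
d = len(X)
w = np.full(d, 1.0 / d)                            # initial equal weights of 1/d
idx = rng.choice(d, size=d, replace=True, p=w)     # draw Di of size d, with replacement, biased by the weights
X_i, y_i = X[idx], y[idx]                          # training set for classifier Mi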
ADAPTIVE BOOSTING (AdaBoost)
A tuple's weight reflects how difficult it is to classify: the higher the weight, the more often it has been misclassified
To compute the error rate of model Mi, we sum the weights of each of the tuples in Di that Mi misclassified:
error(Mi) = Σj wj × err(Xj)
where err(Xj) is the misclassification error of tuple Xj: if the tuple was misclassified, then err(Xj) is 1; otherwise, it is 0
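A tiny worked example with illustrative numbers (not from the notes): four tuples with weights 0.1, 0.2, 0.3, 0.4, of which the second and fourth are misclassified by Mi.

import numpy as np

w   = np.array([0.1, 0.2, 0.3, 0.4])    # tuple weights
err = np.array([0,   1,   0,   1])      # err(Xj): 1 if misclassified, else 0
error_Mi = np.sum(w * err)              # 0.2 + 0.4 = 0.6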
ADAPTIVE BOOSTING (AdaBoost)
“Once boosting is complete, how is the ensemble of classifiers
used to predict the class label of a tuple, X?”
Unlike bagging, where each classifier was assigned an equal
vote, boosting assigns a weight to each classifier’s vote, based
on how well the classifier performed
The lower a classifier’s error rate, the more accurate it is
and therefore, the higher its weight for voting should be
The weight of classifier Mi's vote is log((1 - error(Mi)) / error(Mi))
For each class, c, we sum the weights of each classifier that
assigned class c to X
The class with the highest sum is the “winner” and is returned as
the class prediction for tuple X
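A small numeric sketch of the vote weighting and class-sum rule described above (the error rates and predictions are illustrative, not from the notes):

import numpy as np

error = np.array([0.10, 0.25, 0.40])        # error rates of classifiers M1, M2, M3
alpha = np.log((1 - error) / error)         # each classifier's vote weight: log((1 - error(Mi)) / error(Mi))
preds = np.array([1, 0, 1])                 # class label each Mi assigns to tuple X

class_votes = {c: alpha[preds == c].sum() for c in np.unique(preds)}
prediction = max(class_votes, key=class_votes.get)   # the class with the highest weighted sum wins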
ALGORITHM - AdaBoost
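A minimal from-scratch sketch that follows the AdaBoost steps described in these slides: weighted resampling, training Mi, computing error(Mi), updating the tuple weights, and weighting each classifier's vote by log((1 - error(Mi)) / error(Mi)). Decision stumps as base learners, the error(Mi)/(1 - error(Mi)) update factor for correctly classified tuples, and discarding classifiers with error above 0.5 are assumptions taken from the standard AdaBoost.M1 formulation rather than from these slides.

import numpy as np
from sklearn.tree import DecisionTreeClassifier

def adaboost_fit(X, y, k=10, seed=0):
    """Sketch of AdaBoost for integer class labels in {0, 1}."""
    rng = np.random.default_rng(seed)
    d = len(X)
    w = np.full(d, 1.0 / d)                                 # every tuple starts with weight 1/d
    models, alphas, attempts = [], [], 0
    while len(models) < k and attempts < 10 * k:
        attempts += 1
        idx = rng.choice(d, size=d, replace=True, p=w)      # sample Di with replacement, biased by the weights
        m = DecisionTreeClassifier(max_depth=1).fit(X[idx], y[idx])   # decision stump as the base classifier
        miss = m.predict(X) != y
        error = np.sum(w[miss])                             # error(Mi): sum of the weights of misclassified tuples
        if error > 0.5:                                     # too weak: discard Mi and try the round again
            continue
        error = max(error, 1e-10)                           # avoid division by zero when Mi is perfect
        w[~miss] *= error / (1 - error)                     # decrease the weights of correctly classified tuples
        w /= w.sum()                                        # normalize so the weights again sum to 1
        models.append(m)
        alphas.append(np.log((1 - error) / error))          # weight of Mi's vote
    return models, np.array(alphas)

def adaboost_predict(models, alphas, X):
    votes = np.zeros((len(X), 2))                           # weighted vote totals for classes 0 and 1
    for m, a in zip(models, alphas):
        votes[np.arange(len(X)), m.predict(X)] += a
    return votes.argmax(axis=1)                             # class with the highest weighted sum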
RANDOM FOREST (BREIMAN 2001)
Random Forest:
Each classifier in the ensemble is a decision tree classifier and is
generated using a random selection of attributes at each node to
determine the split
During classification, each tree votes and the most popular class is
returned
Two Methods to construct Random Forest:
Forest-RI (random input selection): randomly select, at each node, F attributes as candidates for the split. The CART methodology is used to grow the trees to maximum size
Forest-RC (random linear combinations): Creates new attributes (or
features) that are a linear combination of the existing attributes
(reduces the correlation between individual classifiers)
Comparable in accuracy to AdaBoost, but more robust to errors and
outliers
Insensitive to the number of attributes selected for consideration at
each split, and faster than bagging or boosting
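A brief scikit-learn sketch of the Forest-RI idea: each tree is grown on a bootstrap sample, and each split considers only a random subset of the attributes (here F = the square root of the number of attributes, a common default). The library, parameter names, and data set are assumptions for illustration.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# n_estimators trees; at each node, max_features randomly selected attributes are candidates for the split
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt", random_state=0)
print(cross_val_score(rf, X, y, cv=5).mean())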
CLASSIFICATION OF CLASS-IMBALANCED DATA SETS
Class-imbalance problem: rare positive examples but numerous negative ones, e.g., medical diagnosis, fraud, oil-spill, fault, etc.
Traditional methods assume a balanced distribution of
classes and equal error costs: not suitable for class-imbalanced data
Typical methods for imbalanced data in two-class classification:
Oversampling: re-sample data from the positive class
Under-sampling: randomly eliminate tuples from the negative class
Threshold-moving: move the decision threshold, t, so that rare-class tuples are easier to classify, and hence there is less chance of costly false negative errors
Ensemble techniques: combine multiple classifiers, e.g., the ensemble methods introduced above
The class-imbalance problem remains difficult for multiclass tasks
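A small sketch of two of the methods listed above, oversampling and threshold-moving, assuming numpy arrays, a 0/1 class encoding with 1 as the rare positive class, and an illustrative threshold of 0.2 (none of these values come from the notes):

import numpy as np

def oversample_positive(X, y, pos_label=1, seed=0):
    """Re-sample the rare positive class (assumed to be the minority) until it matches the negative class."""
    rng = np.random.default_rng(seed)
    pos = np.where(y == pos_label)[0]
    neg = np.where(y != pos_label)[0]
    extra = rng.choice(pos, size=len(neg) - len(pos), replace=True)   # duplicate positive tuples with replacement
    idx = np.concatenate([neg, pos, extra])
    return X[idx], y[idx]

def threshold_moving_predict(model, X, t=0.2, pos_label=1):
    """Lower the decision threshold t so rare-class tuples are easier to classify as positive.
    Works with any fitted classifier exposing predict_proba, e.g., scikit-learn estimators."""
    p = model.predict_proba(X)[:, 1]            # probability of the positive class (classes assumed to be {0, 1})
    return np.where(p >= t, pos_label, 0)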