Experiment # 10

The document outlines a lab experiment focused on using Support Vector Machines (SVMs) for data classification in R, detailing objectives, theoretical background, and practical implementation steps. It includes an evaluation sheet for assessing student performance across various knowledge components and provides a structured lab report format. Additionally, it discusses the e1071 package in R, which facilitates the implementation of SVMs, and lists applications of SVMs in real-world scenarios.

Uploaded by

Ali Raza

Lab Name: To perform classification of data using classification algorithm named support vector machines (SVMs) in R.
Course title: Soft Computing and Data mining Lab
Total Marks: 20
Practical No. 10
Date of experiment performed: ____________
Course teacher/Lab Instructor: Engr. Muhammad Usman
Date of marking: ____________
Student Name: __________________________
Registration no.: __________________________

Marking Evaluation Sheet

Knowledge components | Domain | Taxonomy level | Domain contribution | Max. marks | Obtained marks
1. Student is aware of the requirements and use of the apparatus involved in the experiment. | Psychomotor | Imitation (P1) | 70% | 3 |
2. Student has conducted the experiment by practicing the hands-on skills as per instructions. | Psychomotor | Manipulate (P2) | 70% | 11 |
3. Student has achieved the required accuracy in performance. | Psychomotor | Precision (P3) | 70% | - |
4. Student is aware of discipline and safety rules and follows them during the experiment. | Affective | Receiving (A1) | 20% | 2 |
5. Student has responded well and contributed effectively in the respective lab activity. | Affective | Respond (A2) | 20% | 2 |
6. Student understands the use of modern programming languages and software environment for Data Mining (DM). | Cognitive | Understand (C2) | 10% | 2 |
Total | | | | 20 |
Normalized marks out of 5 | | | | 5 |

Signed by Course teacher/ Lab Instructor

EXPERIMENT # 10
To perform classification of data using classification algorithm named support vector
machines (SVMs) in R

PRE LAB TASK

Objective:
1. To become familiar with classification of data and support vector machines (SVMs).
2. To become familiar with the package e1071, which is helpful for classification of data in
the modern programming language R.
3. To know how to use e1071 for classification of data in R.
Theory:

1. Classification of data:
Classification is the task of labeling new examples with the appropriate class. In machine
learning, statistical classification is the technique of identifying which of a set of categories
(sub-populations) an observation (or observations) belongs to. Classification is a useful tool in
machine learning and data mining. In general, the goal of classification is to use an object's
characteristics to identify which class (or group) it belongs to. A. A. Soofi and Arshad Awan
define it as follows in “Soofi, A. A., & Awan, A. (2017). Classification techniques in machine
learning: applications and issues. J. Basic Appl. Sci, 13, 459-465.”: “Classification is a data
mining (machine learning) technique used to predict group membership for data instances.”
Classification is one of the most studied problems among researchers in the machine learning
and data mining fields.

Machine learning can be categorised into supervised and unsupervised methods. Classification
is a key category of supervised machine learning techniques. The supervised machine learning
process involves (1) asking a question, (2) gathering data, (3) developing a hypothesis based
on our research, and (4) analysing the data. If the data support the hypothesis, it can be
accepted. If not, it can be rejected or modified. The goal is to find a model whose predictions
match the observed data. The hypothesis space in classification contains every function that
categorises data into classes, but most of these hypotheses are incorrect. Training and fitting
models are necessary to filter out bad hypotheses, and each learning algorithm narrows down
the hypothesis space in its own way.

Different classification learning algorithms exist, each focusing on specific hypotheses. No
single form of classification is appropriate for all data sets, hence a large toolkit of
classification algorithms has been developed. A. A. Soofi and Arshad Awan (2017), in
“Classification Techniques in Machine Learning: Applications and Issues”, list five basic
supervised learning (classification) techniques along with their associated classification
methods (learning algorithms), and detail the strengths, weaknesses, potential applications,
and issues of these learning algorithms together with the available solutions.
1.1 Examples:
Some of the classification examples are assigning a given email to the "spam" or "non-spam"
class, and assigning a diagnosis to a given patient based on observed characteristics of the
patient (sex, blood pressure, presence or absence of certain symptoms, etc.).

Classification and clustering are examples of the more general problem of pattern recognition,
which is the assignment of some sort of output value to a given input value. Other examples
are regression, which assigns a real-valued output to each input; sequence labeling, which
assigns a class to each member of a sequence of values (for example, part of speech tagging,
which assigns a part of speech to each word in an input sentence); parsing, which assigns a
parse tree to an input sentence, describing the syntactic structure of the sentence; etc.
2. Support Vector Machines (SVMs):
In machine learning, support vector machines (SVMs) are supervised learning models with
associated learning algorithms that analyze data for classification and regression analysis.
They were developed by Vladimir Naumovich Vapnik and colleagues: the original linear
algorithm dates to 1963, and the kernel-based form was developed at AT&T Bell Laboratories
in the 1990s. An SVM implements the following idea: it maps the input vectors of the training
examples to points in space so as to maximise the width of the gap between the two categories.
In this space, an optimal separating hyperplane is constructed. New examples are then mapped
into that same space and predicted to belong to a category based on which side of the gap they fall.
The original maximum-margin hyperplane algorithm proposed by Vapnik in 1963 constructed
a linear classifier. In addition to performing linear classification, SVMs can efficiently perform
non-linear classification using what is called the kernel trick, implicitly mapping their inputs
into high-dimensional feature spaces. Unlike nonlinear classifiers such as kernel methods, which
map data to a higher dimensional space, linear classifiers work directly on data in the original
input space. While linear classifiers fail to handle some inseparable data, they may be sufficient
for data in a rich dimensional space. An important advantage of linear classification is that
training and testing procedures are much more efficient. Therefore, linear classification can be
very useful for some large-scale applications.
In general, an SVM plots input data objects as points in an n-dimensional space, where the
dimensions represent the various features of the object. The algorithm then attempts to
iteratively find a function that represents a hyperplane that can act as a separator between the
regions occupied by different target output classes. An SVM model is a representation of the
input data objects in this space with a clear gap between groups of points representing
different categories. This division is made by the hyperplane, which is a line (in 2D space)
or a plane (in 3D space). The hyperplane is a decision boundary that splits the space so that
it clearly signifies which section of the space is occupied by which category.
The following is an example of a trained SVM model.

Fig. 1. Support Vector Machines (SVMs) Model

In the figure above, the hyperplane has two parallel dotted lines on either side of it. The
perpendicular distance between these two lines is called the margin. Each dotted line passes
through the data points of one category that lie closest to the hyperplane, so the margin is the
gap between the nearest points of the two different categories. The data points closest to the
hyperplane have the largest impact on the position of the hyperplane. These points are called
support vectors.
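The margin described above can also be written as a short formula. What follows is the standard textbook formulation, not something taken from this lab sheet. The symbols w (weight vector), b (offset), ξ_i (slack variables) and C are introduced here for illustration, with class labels y_i ∈ {-1, +1}. In e1071, C corresponds to the cost argument of svm().

```latex
% Separating hyperplane and the two margin boundaries (the dotted lines)
\mathbf{w}^{\top}\mathbf{x} + b = 0, \qquad \mathbf{w}^{\top}\mathbf{x} + b = \pm 1

% Perpendicular distance between the two margin boundaries
\text{margin} = \frac{2}{\lVert \mathbf{w} \rVert}

% Soft-margin training problem: wide margin, few violations
\min_{\mathbf{w},\, b,\, \boldsymbol{\xi}} \ \frac{1}{2}\lVert \mathbf{w} \rVert^{2} + C \sum_{i=1}^{n} \xi_i
\quad \text{subject to} \quad
y_i\left(\mathbf{w}^{\top}\mathbf{x}_i + b\right) \ge 1 - \xi_i, \quad \xi_i \ge 0
```

A larger C penalises margin violations more heavily, which tends to give a narrower margin and fewer support vectors.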
2.1 Applications:
Like many other machine learning algorithms, SVMs have found widespread applications in
the real world, where they help to solve many day-to-day classification problems. Some of
these applications are given below.
1. Handwriting detection: Many handwriting detection programs use SVMs to identify
handwritten characters.
2. Image based searching: SVMs are an avenue for improving image based searching.
3. Face detection: Many smartphone cameras have a face detection feature these days.
An SVM separates the faces from the rest of the picture.
4. Bioinformatics: SVMs are used to classify people based on genes and other biological
features.
5. Cancer detection: SVMs can distinguish malignant tumors from benign ones by considering
their images.
6. Classification of satellite data: Classification of satellite data such as SAR data can be
performed using supervised SVMs.
3. Package e1071:
Package e1071 is an open source package for the R programming language that provides
functions for statistical and probabilistic algorithms such as a fuzzy classifier, naive Bayes
classifier, bagged clustering, short-time Fourier transform, support vector machine, etc.

When it comes to SVMs, there are many packages available in R to implement them. Among
these, e1071 is a widely used and intuitive choice. The svm() function of the e1071 package
provides an interface to the libsvm library. This interface makes implementing SVMs very
quick and simple. It supports non-linear classification through the kernel trick, can return
class probability estimates, and provides the most common kernels: linear, radial basis
function (RBF), sigmoid, and polynomial.
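As a rough illustration of the kernels listed above, the sketch below fits one model per kernel and compares training accuracy. It assumes the e1071 package is installed and uses the built-in iris data set purely as a stand-in, since this lab sheet does not prescribe a data set at this point. The names kernels, train_acc, fit_prob and probs are hypothetical.

```r
library(e1071)

# Assumption: the built-in iris data set is used purely for illustration.
# "radial" is the name e1071 uses for the RBF kernel.
kernels <- c("linear", "polynomial", "radial", "sigmoid")

# Fit one SVM per kernel and record its accuracy on the training data.
train_acc <- sapply(kernels, function(k) {
  fit <- svm(Species ~ ., data = iris, kernel = k)
  mean(predict(fit, iris) == iris$Species)
})
print(round(train_acc, 3))

# Class probability estimates must be requested both when fitting and when predicting.
fit_prob <- svm(Species ~ ., data = iris, kernel = "radial", probability = TRUE)
pred <- predict(fit_prob, iris, probability = TRUE)
probs <- attr(pred, "probabilities")   # one row per observation, one column per class
head(probs, 3)
```

Training accuracy is only a quick sanity check here. Comparing kernels properly would need held-out data.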
4. Practical Implementation of SVM in R:
Let us now create an SVM model in R to learn it more thoroughly by means of practical
implementation. We will be using the e1071 package for this.
The following steps form the procedure for implementing an SVM in R.
• Step 1: Install package e1071:
• Step 2: Load data set and package e1071:
• Step 3: Select columns of the data set:
• Step 4: Encoding the target feature:
• Step 5: Split the data set:
• Step 6: Feature Scaling:
• Step 7: Fitting SVM to training set:
• Step 8: Predicting the test set result:
• Step 9: Making confusion matrix:
• Step 10: Visualising the training set results:
• Step 11: Visualising the test set results:
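The steps above can be sketched end to end as follows. This is a minimal illustration under stated assumptions, not the procedure used later in the lab session: it uses the built-in iris data restricted to two species and two features, a 75/25 split done with base R, and a cost of 1. All of these choices, and the variable names, are hypothetical.

```r
# Step 1 (run once): install.packages("e1071")
library(e1071)                                   # Step 2: load the package
set.seed(42)

# Step 3: select two feature columns and the target; keep two species only.
df <- subset(iris, Species != "setosa",
             select = c(Petal.Length, Petal.Width, Species))

# Step 4: encode the target as a factor (this also drops the unused level).
df$Species <- factor(df$Species)

# Step 5: split into 75% training and 25% test rows.
train_idx <- sample(nrow(df), size = round(0.75 * nrow(df)))
train <- df[train_idx, ]
test  <- df[-train_idx, ]

# Steps 6 and 7: svm() scales the features itself when scale = TRUE.
model <- svm(Species ~ ., data = train, kernel = "linear",
             cost = 1, scale = TRUE)

# Step 8: predict the classes of the held-out test rows.
pred <- predict(model, newdata = test)

# Step 9: confusion matrix of predicted against actual classes.
cm <- table(Predicted = pred, Actual = test$Species)
print(cm)

# Steps 10 and 11: decision regions with training and test points overlaid.
plot(model, train)
plot(model, test)
```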
LAB SESSION

Lab Task:
1. To perform classification of data using classification algorithm named support vector
machines (SVMs) in R.
Apparatus:
• Laptop
• R

Experimental Procedure:

1. How to Setup R:

1. Start up Microsoft Windows.

2. Open the website http://cran.r-project.org, or use a pen drive to access the software
folder containing R-4.2.2-win.exe.
3. Open the software folder, double click on the ‘R-4.2.2-win.exe’ file, and run
the setup.
4. Press Next in each setup window, accepting the default options.
5. Finally, choose Finish and close the installation.

2. Get started with R:

1. Start R by double-clicking the R icon on your desktop. It will open the following window
on your PC, as shown in the image.

Fig. 2. R Startup GUI window

2. Install package (e1071).

> install.packages("e1071")

3. Load the data set and package e1071. Instead of importing data let us generate some 2-
dimensional data. We will generate 20 random observations of 2 variables in the form
of a 20 by 2 matrix. This gives us 20 objects with 2 features each.

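> library(e1071)
> set.seed(100)                      # make the random data reproducible
> x <- matrix(rnorm(40), 20, 2)      # 20 observations of 2 features
> y <- rep(c(-1, 1), c(10, 10))      # class labels: ten -1s followed by ten 1s
> x[y == 1, ] <- x[y == 1, ] + 1     # shift class 1 so the two classes separate
> plot(x, col = y + 3, pch = 19)     # scatter plot with points coloured by class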

4. Encode the target data as factor and convert data into data frame.

> data = data.frame(x, y = as.factor(y))

5. Split the data set into a training set and a test set. The caTools package below can be
used for this purpose. However, we do not split here because the data set is small.
> install.packages('caTools')
> library(caTools)
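Had the data set been larger, the split could be done along the following lines. This is a sketch that assumes caTools is installed. The SplitRatio of 0.7 and the names is_train, training_set and test_set are illustrative choices, not values prescribed by this lab.

```r
library(caTools)

# Recreate the lab's simulated data so the example is self-contained.
set.seed(100)
x <- matrix(rnorm(40), 20, 2)
y <- rep(c(-1, 1), c(10, 10))
x[y == 1, ] <- x[y == 1, ] + 1
data <- data.frame(x, y = as.factor(y))

# sample.split() returns a logical vector: TRUE marks rows for the training set.
# It splits within each class, so both sets keep a similar class balance.
is_train <- sample.split(data$y, SplitRatio = 0.7)
training_set <- subset(data, is_train)
test_set <- subset(data, !is_train)

table(training_set$y)   # class counts in the training set
table(test_set$y)       # class counts in the test set
```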

6. Feature scaling: as both features of our data are already on a similar, small scale, we
set the scale argument to FALSE when fitting the model.
7. Create the model by fitting an SVM to the data using the svm() function. Specify the
kernel as linear and the cost as 10.
> data.svm <- svm(y ~ ., data = data, kernel = "linear", cost = 10, scale = FALSE)

8. Print a summary of the fitted model.
> print(data.svm)

svm(formula = y ~ ., data = data, kernel = "linear", cost = 10, scale = FALSE)

Parameters:
SVM-Type: C-classification
SVM-Kernel: linear
cost: 10
Number of Support Vectors: 5
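The support vectors counted in the summary can also be inspected directly. The sketch below assumes the fitted object exposes the index, SV, coefs, rho and tot.nSV components described in the e1071 documentation for svm objects. The names w, b and margin_width are introduced here for illustration, and the computation of w is only meaningful for a linear kernel fitted with scale = FALSE.

```r
library(e1071)

# Recreate the lab's data and model so the example is self-contained.
set.seed(100)
x <- matrix(rnorm(40), 20, 2)
y <- rep(c(-1, 1), c(10, 10))
x[y == 1, ] <- x[y == 1, ] + 1
data <- data.frame(x, y = as.factor(y))
data.svm <- svm(y ~ ., data = data, kernel = "linear", cost = 10, scale = FALSE)

data.svm$index     # row numbers of the observations that are support vectors
data.svm$SV        # feature values of those support vectors

# For a linear kernel the hyperplane is w . x + b = 0, where w is a weighted
# sum of the support vectors and b is the negated rho component.
w <- drop(t(data.svm$coefs) %*% data.svm$SV)
b <- -data.svm$rho

# Perpendicular distance between the two margin boundaries.
margin_width <- 2 / sqrt(sum(w^2))
print(margin_width)
```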

9. Predict the results using the predict() function. Since the data were not split, predictions
are made on the training set.
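A minimal sketch of this prediction step, together with the confusion matrix listed in the pre-lab steps, might look like the following. The names pred, conf_mat and accuracy are illustrative. Because the predictions are made on the same rows used for fitting, the resulting accuracy is optimistic.

```r
library(e1071)

# Recreate the lab's data and model so the example is self-contained.
set.seed(100)
x <- matrix(rnorm(40), 20, 2)
y <- rep(c(-1, 1), c(10, 10))
x[y == 1, ] <- x[y == 1, ] + 1
data <- data.frame(x, y = as.factor(y))
data.svm <- svm(y ~ ., data = data, kernel = "linear", cost = 10, scale = FALSE)

# Predict a class label for every row of the training data.
pred <- predict(data.svm, data)

# Confusion matrix: rows are predicted classes, columns are actual classes.
conf_mat <- table(Predicted = pred, Actual = data$y)
print(conf_mat)

# Share of correctly classified observations (the diagonal of the matrix).
accuracy <- sum(diag(conf_mat)) / sum(conf_mat)
print(accuracy)
```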
10. Visualise the results by plotting the model using the plot() function.
> plot(data.svm, data)

Extra Credit Points:

(Following a similar procedure and using the data from the PRE LAB TASK session, complete
the tasks provided to you as exercises.)

EXPERIMENT DOMAIN:

Domain | Attribute | Taxonomy level | Marks distribution
Psychomotor (70%) | Realization of Experiment (Awareness) | P1 | 3
Psychomotor (70%) | Conducting Experiment (Act) | P2 | 5
Psychomotor (70%) | Data Collection (Use Instrument) | P2 | 3
Psychomotor (70%) | Data Analysis (Perform) | P2 | 3
Affective (20%) | Discipline (Receiving) | A1 | 3
Affective (20%) | Individual Participation (Respond/Contribute) | A2 | 1
Cognitive (10%) | Understand | C2 | 2
LAB REPORT
Prepare the Lab Report as below:
TITLE:

OBJECTIVE:

APPARATUS:

PROCEDURE:
(Note: Use all the steps you studied in the LAB SESSION of this lab to write the procedure
and to complete the experiment)
DISCUSSION:

Q1: List the broad categories of Machine Learning (ML).

________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________

Q2: List the activities involved in the supervised machine learning process.

Conclusion /Summary
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________

Domain | Attribute | Taxonomy level | Marks distribution | Obtained marks
Psychomotor (70%) | Realization of Experiment (Awareness) | P1 | 3 |
Psychomotor (70%) | Conducting Experiment (Act) | P2 | 5 |
Psychomotor (70%) | Data Collection (Use Instrument) | P2 | 3 |
Psychomotor (70%) | Data Analysis (Perform) | P2 | 3 |
Affective (20%) | Discipline (Receiving) | A1 | 2 |
Affective (20%) | Individual Participation (Respond/Contribute) | A2 | 2 |
Cognitive (10%) | Understand | C2 | 2 |
