ML Lab1 Theory

EXPERIMENT - 1

Introduction to the Iris Dataset

The Iris dataset is one of the most famous datasets in data science and machine learning.
It was introduced by Sir Ronald Fisher in 1936 and is often used for learning and testing
classification algorithms. The dataset contains 150 samples of iris flowers, 50 from each of
three species: Setosa, Versicolor, and Virginica.

Dataset Description

Each flower in the dataset is described by four features: sepal length, sepal width, petal
length, and petal width, all measured in centimeters. These features are used to classify each
flower into its correct species. The dataset is small, clean, and evenly balanced across the
three classes, which makes it well suited for beginners practicing classification.
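For reference, the dataset ships with scikit-learn and can be loaded in a few lines. The sketch below assumes scikit-learn (and pandas, for the DataFrame form) is installed; it simply confirms the figures described above.

    # Minimal sketch: load the built-in copy of the Iris dataset and inspect it.
    from sklearn.datasets import load_iris

    iris = load_iris(as_frame=True)
    df = iris.frame                                   # 150 rows: 4 features + target
    df["species"] = df["target"].map(dict(enumerate(iris.target_names)))

    print(iris.feature_names)              # sepal/petal length and width, in cm
    print(df["species"].value_counts())    # 50 samples per species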

k-Nearest Neighbors (k-NN) Algorithm

k-NN is a simple and widely used machine learning algorithm for classification. It works by
finding the k closest data points to a new sample and predicting its class by majority vote
among those neighbors. It is a distance-based method, meaning it uses the closeness of data
points in feature space to make predictions.
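A minimal sketch of such a classifier using scikit-learn's KNeighborsClassifier is shown below; the choice of k = 5 and the 80/20 train/test split are illustrative assumptions, not fixed by the experiment.

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.neighbors import KNeighborsClassifier

    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=42, stratify=y)

    knn = KNeighborsClassifier(n_neighbors=5)   # k = 5, Euclidean distance by default
    knn.fit(X_train, y_train)
    print("Test accuracy:", knn.score(X_test, y_test))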

Data Visualization

Visualizing data helps in understanding patterns and relationships between features. Pair plots
show pairwise scatter plots of the features, colored by species, so class separation is easy to
see. Box plots display the spread of each feature's values, while heatmaps show the correlation
between numerical features. These visualizations make it easier to analyze the dataset before
building a model.
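The plots mentioned above can be produced roughly as follows; seaborn and matplotlib are assumed to be available, and the styling choices are placeholders.

    import seaborn as sns
    import matplotlib.pyplot as plt
    from sklearn.datasets import load_iris

    iris = load_iris(as_frame=True)
    df = iris.frame
    df["species"] = df["target"].map(dict(enumerate(iris.target_names)))
    features = df.drop(columns=["target", "species"])

    sns.pairplot(df.drop(columns="target"), hue="species")   # pairwise feature relationships
    plt.show()

    features.plot(kind="box")                                 # spread of each feature
    plt.show()

    sns.heatmap(features.corr(), annot=True)                   # correlations between features
    plt.show()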

Cross-Validation

Cross-validation is a method to check how well a model works on unseen data. In 5-fold
cross-validation, the dataset is split into 5 parts. The model is trained on 4 parts and tested on
the remaining part, and the process is repeated 5 times so that each part serves as the test set
exactly once. The resulting scores are averaged to obtain a more reliable estimate of accuracy.
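A sketch of 5-fold cross-validation with scikit-learn's cross_val_score, again assuming k = 5 for the classifier:

    from sklearn.datasets import load_iris
    from sklearn.model_selection import cross_val_score
    from sklearn.neighbors import KNeighborsClassifier

    X, y = load_iris(return_X_y=True)
    scores = cross_val_score(KNeighborsClassifier(n_neighbors=5), X, y, cv=5)

    print("Fold accuracies:", scores)        # one accuracy score per fold
    print("Mean accuracy:", scores.mean())   # averaged estimate on unseen data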

Evaluation Metrics

The model's performance is assessed using multiple metrics: Accuracy (percentage of correct
predictions), Precision (proportion of correct positive predictions), Recall (proportion of
actual positives correctly identified), and F1-score (harmonic mean of precision and recall).
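These metrics can be computed on a held-out test split with scikit-learn's metrics module; the 80/20 split and k = 5 below are illustrative assumptions.

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.metrics import accuracy_score, classification_report

    iris = load_iris()
    X_train, X_test, y_train, y_test = train_test_split(
        iris.data, iris.target, test_size=0.2, random_state=42, stratify=iris.target)

    y_pred = KNeighborsClassifier(n_neighbors=5).fit(X_train, y_train).predict(X_test)

    print("Accuracy:", accuracy_score(y_test, y_pred))
    # Per-class precision, recall, and F1-score, plus averages.
    print(classification_report(y_test, y_pred, target_names=iris.target_names))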
