Aychew Chernet

Classification is a supervised machine learning method that predicts the label or category of new data based on patterns learned from training data consisting of inputs and labels. Clustering is an unsupervised learning method that groups unlabeled data points based on similarities without referring to predefined labels, with the goal of discovering hidden patterns in the data. Regression finds relationships between features and continuous outcomes to predict future trends or values from new data based on patterns from labeled training data.

Uploaded by

aychewchernet


MADDA WALABU UNIVERSITY

INDIVIDUAL ASSIGNMENT

NAME: AYCHEW

What is Classification?
Classification is a supervised machine learning method in which the model tries to predict the correct label for a given input.
The model is first trained on the training data and then evaluated on held-out test data before it is used to make predictions on new, unseen data.
For instance, an algorithm can learn to predict whether a given email is spam or ham (not spam).

Example of classification with Python

In this example, we focus on logistic regression. Logistic regression is a method that statistically models a binary classification task. It predicts the probability p that the input features belong to a specific class.
Mathematically, the logistic regression model is:

$$p = \frac{1}{1 + e^{-z}}$$

Here, $z$ is the weighted linear combination of the input features and is calculated as follows:

$$z = w_0 + w_1 x_1 + w_2 x_2 + \dots + w_n x_n$$
An optimization algorithm, such as gradient descent, finds the values of the weights w_0, ..., w_n that maximize the likelihood of the observed data.
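To make the two formulas concrete, the short sketch below computes z and p for a single input. The weights and feature values are hypothetical numbers chosen only for illustration; they are not learned from any data set.

```python
import numpy as np

def sigmoid(z):
    """Map a real-valued score z to a probability in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical weights and features, chosen only for illustration.
w0 = -1.0                     # intercept w_0
w = np.array([0.5, 0.25])     # weights w_1, w_2
x = np.array([2.0, 4.0])      # input features x_1, x_2

z = w0 + np.dot(w, x)         # z = w_0 + w_1*x_1 + w_2*x_2
p = sigmoid(z)                # predicted probability of the positive class

print(f"z = {z:.2f}, p = {p:.3f}")
```

With these values z equals 1, so p is roughly 0.73. A score of z = 0 would give p = 0.5, which is the usual decision boundary between the two classes.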

Let’s see how this can be done using Python:

1 # Importing libraries and dataset
2 import numpy as np
3 from sklearn.datasets import load_iris
4 from sklearn.linear_model import LogisticRegression
5 from sklearn.model_selection import train_test_split
6 #from sklearn import metrics
7
8 # Load the Iris dataset
9 iris = load_iris()
10 X = iris.data
11 y = iris.target
12
13 # Splitting the data into training and testing sets
14 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
15
16 # Creating the logistic regression model
17 model = LogisticRegression()
18
19 # Training the model
20 model.fit(X_train, y_train)

Lines 4–5: We import LogisticRegression and train_test_split from the sklearn library.
Line 14: We split the features X and target y into training and test datasets. The training dataset trains the model, while the test dataset evaluates its performance.
Lines 17–20: We create a logistic regression model and train the classifier on the training data X_train and y_train. The Iris dataset has three classes, so scikit-learn fits a multiclass form of the binary model described above.
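The listing above stops after training, although the test set exists to evaluate the model. The sketch below is one possible way to complete that step using scikit-learn's accuracy_score. The max_iter=200 setting is an assumption added here to reduce the chance of a convergence warning; it is not part of the original listing.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Same data and split as in the listing above.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# max_iter=200 is an assumption, not a setting from the original listing.
model = LogisticRegression(max_iter=200)
model.fit(X_train, y_train)

# Evaluate on the held-out test set.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"Test accuracy: {accuracy:.2f}")

# Class probabilities for the first three test samples (one column per class).
probabilities = model.predict_proba(X_test[:3])
print(probabilities.round(3))
```

Each row of the predict_proba output sums to 1, and predict returns the class with the highest probability in that row.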

What is regression?
Regression is a method for understanding the relationship between
independent variables or features and a dependent variable or outcome.
Outcomes can then be predicted once the relationship between
independent and dependent variables has been estimated.

Regression is a field of study in statistics that forms a key part of forecast models in machine learning. It is used to predict continuous outcomes in predictive modelling, which makes it useful for forecasting and predicting outcomes from data. Machine learning regression generally involves fitting a line of best fit through the data points. The best-fit line is the one that minimises the distance between each point and the line, typically measured as the sum of squared vertical distances.
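As a small illustration of the line of best fit, the sketch below computes the least-squares slope and intercept directly with NumPy. The five data points are invented for the example, and ordinary least squares is assumed as the fitting criterion.

```python
import numpy as np

# Hypothetical data points, invented for illustration.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([3.1, 4.9, 7.2, 8.8, 11.0])

# Closed-form least-squares estimates for the line y = slope * x + intercept.
x_mean, y_mean = x.mean(), y.mean()
slope = np.sum((x - x_mean) * (y - y_mean)) / np.sum((x - x_mean) ** 2)
intercept = y_mean - slope * x_mean

# Sum of squared vertical distances between the points and the fitted line.
residuals = y - (slope * x + intercept)
sse = np.sum(residuals ** 2)

print(f"slope = {slope:.3f}, intercept = {intercept:.3f}, SSE = {sse:.4f}")
```

Any other choice of slope or intercept gives a larger sum of squared distances on these points, which is what "best fit" means under this criterion.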

Alongside classification, regression is one of the main applications of the supervised type of machine learning (https://www.seldon.io/four-types-of-machine-learning-algorithms-explained/).

Regression analysis is used to understand the relationship between different independent variables and a dependent variable or outcome. Models that forecast or predict trends and outcomes are trained using regression techniques. These models learn the relationship between input and output data from labelled training data. They can then forecast future trends, predict outcomes from unseen input data, or be used to fill gaps in historic data.

As with all supervised machine learning, special care should be taken to ensure the labelled training data is representative of the overall population. If the training data is not representative, the predictive model will be overfit to data that doesn't represent new and unseen data. This will result in inaccurate predictions once the model is deployed. Because regression analysis involves the relationships of features and outcomes, care should be taken to include the right selection of features too.

Example of regression

# Python code to illustrate regression using a data set
import matplotlib.pyplot as plt
import pandas as pd
from sklearn import linear_model

# Load the CSV file and select the feature and target columns
df = pd.read_csv("Housing.csv")
X = df['lotsize'].values.reshape(-1, 1)
Y = df['price'].values.reshape(-1, 1)

# Split the data into training/testing sets (the last 250 rows are held out)
X_train = X[:-250]
X_test = X[-250:]

# Split the targets into training/testing sets
Y_train = Y[:-250]
Y_test = Y[-250:]

# Create the linear regression object
regr = linear_model.LinearRegression()

# Train the model using the training sets
regr.fit(X_train, Y_train)

# Plot the test data and the fitted regression line
plt.scatter(X_test, Y_test, color='black')
plt.plot(X_test, regr.predict(X_test), color='red', linewidth=3)
plt.title('Test Data')
plt.xlabel('Size')
plt.ylabel('Price')
plt.xticks(())
plt.yticks(())
plt.show()
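The example above depends on a local Housing.csv file and only plots the result. The sketch below is a self-contained variant that also reports error metrics. The synthetic lot sizes and prices are assumptions standing in for the real file, and the use of train_test_split instead of slicing off the last 250 rows is a design choice made here, not the original author's method.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the housing data (an assumption, not the real file):
# price grows linearly with lot size, plus random noise.
rng = np.random.default_rng(0)
lotsize = rng.uniform(2000, 10000, size=500).reshape(-1, 1)
price = 30000 + 6.0 * lotsize[:, 0] + rng.normal(0, 5000, size=500)

# A random split avoids bias if the rows happen to be stored in sorted order.
X_train, X_test, y_train, y_test = train_test_split(
    lotsize, price, test_size=0.2, random_state=0
)

regr = LinearRegression()
regr.fit(X_train, y_train)
y_pred = regr.predict(X_test)

# Quantify the fit on the held-out data.
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f"slope = {regr.coef_[0]:.2f}, intercept = {regr.intercept_:.0f}")
print(f"test MSE = {mse:.0f}, test R^2 = {r2:.3f}")
```

The fitted slope should land close to the value of 6.0 used to generate the data, and an R^2 near 1 on the test set indicates that the line explains most of the variation in price.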

What is Clustering?
Introduction to Clustering: Clustering is a type of unsupervised learning method (https://www.geeksforgeeks.org/supervised-unsupervised-learning/). An unsupervised learning method is one in which we draw inferences from datasets consisting of input data without labeled responses. Generally, it is used to find meaningful structure, explanatory underlying processes, generative features, and groupings inherent in a set of examples.

Clustering is the task of dividing the population or data points into a number of groups such that data points in the same group are more similar to one another than to data points in other groups. It is basically a grouping of objects on the basis of the similarity and dissimilarity between them.
For example, data points that lie close together in a graph can be classified into a single group. In the accompanying picture, three such clusters can be distinguished.

Clusters do not need to be spherical.

Example of Clustering

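import numpy as np

# Euclidean distance between two points.
# This helper is an assumption; the original listing calls it without defining it.
def distance(p1, p2):
    return np.sqrt(np.sum((p1 - p2) ** 2))

# Implementing the E-step: assign each point to its nearest cluster center
def assign_clusters(X, clusters):
    k = len(clusters)  # number of clusters (assumed to equal the size of `clusters`)
    for idx in range(X.shape[0]):
        dist = []
        curr_x = X[idx]
        for i in range(k):
            dis = distance(curr_x, clusters[i]['center'])
            dist.append(dis)
        curr_cluster = np.argmin(dist)
        clusters[curr_cluster]['points'].append(curr_x)
    return clusters

# Implementing the M-step: move each center to the mean of its assigned points
def update_clusters(X, clusters):
    k = len(clusters)
    for i in range(k):
        points = np.array(clusters[i]['points'])
        if points.shape[0] > 0:
            new_center = points.mean(axis=0)
            clusters[i]['center'] = new_center
            # Reset the point list for the next iteration
            clusters[i]['points'] = []
    return clusters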
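The two steps above are meant to be repeated until the centers stop moving. The sketch below is a compact, vectorized version of that loop, offered as an illustration under stated assumptions rather than as the original author's code: the kmeans function, its parameters, the random initialisation from data points, and the two synthetic blobs are all hypothetical.

```python
import numpy as np

def kmeans(X, k, n_iters=20, seed=0):
    """Run a fixed number of k-means iterations and return (centers, labels)."""
    rng = np.random.default_rng(seed)
    # Initialise the centers with k distinct data points chosen at random.
    centers = X[rng.choice(X.shape[0], size=k, replace=False)]
    for _ in range(n_iters):
        # E-step: distance from every point to every center, then pick the nearest.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = np.argmin(dists, axis=1)
        # M-step: move each center to the mean of its points (skip empty clusters).
        for i in range(k):
            members = X[labels == i]
            if members.shape[0] > 0:
                centers[i] = members.mean(axis=0)
    return centers, labels

# Two well-separated synthetic blobs, invented for illustration.
rng = np.random.default_rng(42)
blob_a = rng.normal(loc=[0.0, 0.0], scale=0.5, size=(50, 2))
blob_b = rng.normal(loc=[5.0, 5.0], scale=0.5, size=(50, 2))
X = np.vstack([blob_a, blob_b])

centers, labels = kmeans(X, k=2)
print(np.round(centers, 2))
```

On data this cleanly separated, the two centers should settle near the blob means. On harder data the result depends on the initial centers, so running the algorithm several times with different seeds is a common precaution.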
