Machine Learning Final Manual
Now, the task is to find the line that best fits the scatter plot of the data so that we can
predict the response for any new feature value (i.e., a value of x not present in the dataset).
This line is called the regression line.
PROCEDURE:
Importing required libraries such as pandas and numpy for data analysis and manipulation, and
seaborn and matplotlib for data visualization.
Visualizing the variables in order to draw business/domain inferences.
Splitting the data into training and test sets so that a regression line can be fitted on the
training subset.
Rescaling the features: a method used to normalize the range of numerical variables with
varying magnitudes (a short sketch follows this procedure).
Residual analysis of the training data tells us how the errors are distributed across the
model. A good residual analysis will show residuals centred around a mean of 0.
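The two procedure points above (rescaling and residual analysis) can be illustrated with a short sketch; the arrays below are made-up examples for illustration only, not part of the lab data.
import numpy as np
from sklearn.preprocessing import MinMaxScaler

# Rescaling: squeeze each numerical column into the range [0, 1]
data = np.array([[10.0, 200.0], [20.0, 400.0], [30.0, 800.0]])
scaler = MinMaxScaler()
scaled = scaler.fit_transform(data)   # every column now lies between 0 and 1
print(scaled)

# Residual analysis: residuals = actual y - predicted y on the training data
x = np.array([0, 1, 2, 3, 4], dtype=float)
y = np.array([1.0, 3.0, 2.0, 5.0, 7.0])
b_1, b_0 = np.polyfit(x, y, 1)        # slope and intercept of the fitted line
residuals = y - (b_0 + b_1 * x)
print("Mean of residuals:", residuals.mean())  # should be close to 0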
PROGRAM:
import numpy as np
import matplotlib.pyplot as plt

def estimate_coef(x, y):
    # number of observations/points
    n = np.size(x)
    # mean of x and y vectors
    m_x = np.mean(x)
    m_y = np.mean(y)
    # calculating cross-deviation and deviation about x
    SS_xy = np.sum(y*x) - n*m_y*m_x
    SS_xx = np.sum(x*x) - n*m_x*m_x
    # calculating regression coefficients
    b_1 = SS_xy / SS_xx
    b_0 = m_y - b_1*m_x
    return (b_0, b_1)

def plot_regression_line(x, y, b):
    # plotting the actual points as scatter plot
    plt.scatter(x, y, color="m", marker="o", s=30)
    # predicted response vector
    y_pred = b[0] + b[1]*x
    # plotting the regression line
    plt.plot(x, y_pred, color="g")
    # putting labels
    plt.xlabel('x')
    plt.ylabel('y')
    # function to show plot
    plt.show()

def main():
    # observations / data
    x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
    y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])
    # estimating coefficients
    b = estimate_coef(x, y)
    print("Estimated coefficients:\nb_0 = {} \nb_1 = {}".format(b[0], b[1]))
    # plotting regression line
    plot_regression_line(x, y, b)

if __name__ == "__main__":
    main()
OUTPUT:
RESULT:
Thus the program to implement the linear regression model was written and executed
successfully.
EX.NO:2 BINARY CLASSIFICATION MODEL DATE:
AIM:
To write a program to implement the binary classification model using python.
PROCEDURE:
Step 1: Define explanatory and target variables
Step 2: Split the dataset into training and testing sets
Step 3: Normalize the data for numerical stability
Step 4: Fit a logistic regression model to the training data
Step 5: Make predictions on the testing data
Step 6: Calculate the accuracy score by comparing the actual values and predicted values.
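The procedure above describes a logistic-regression workflow, while the program that follows trains a perceptron from scratch. As a complement, here is a minimal hedged sketch of Steps 1-6 using scikit-learn; the breast-cancer dataset is used only as an assumed example of a binary classification problem and is not part of the lab exercise.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Step 1: explanatory (X) and target (y) variables
X, y = load_breast_cancer(return_X_y=True)

# Step 2: split into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Step 3: normalize the data for numerical stability
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Step 4: fit a logistic regression model to the training data
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# Step 5: make predictions on the testing data
y_pred = model.predict(X_test)

# Step 6: accuracy = fraction of predictions matching the actual values
print("Accuracy:", accuracy_score(y_test, y_pred))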
PROGRAM:
import numpy as np

class Perceptron(object):
    """ Perceptron Classifier

    Parameters
    rate : float
        Learning rate (ranging from 0.0 to 1.0)
    number_of_iterations : int
        Number of iterations over the input dataset.

    Attributes:
    weight_matrix : 1d-array
        Weights after fitting.
    errors_list : list
        Number of misclassifications in every epoch (one full training cycle on the training set).
    """

    def __init__(self, rate=0.01, number_of_iterations=100):
        self.rate = rate
        self.number_of_iterations = number_of_iterations

    def fit(self, X, y):
        """ Fit training data

        Parameters:
        X : array-like, shape = [n_samples, n_features]
            Training vectors.
        y : array-like, shape = [n_samples]
            Target values.

        Returns
        self : object
        """
        self.weight_matrix = np.zeros(1 + X.shape[1])
        self.errors_list = []
        for _ in range(self.number_of_iterations):
            errors = 0
            for xi, target in zip(X, y):
                update = self.rate * (target - self.predict(xi))
                self.weight_matrix[1:] += update * xi
                self.weight_matrix[0] += update
                errors += int(update != 0.0)
            self.errors_list.append(errors)
        return self

    def dot_product(self, X):
        """ Calculate the dot product """
        return (np.dot(X, self.weight_matrix[1:]) + self.weight_matrix[0])

    def predict(self, X):
        """ Predicting the label for the input data """
        return np.where(self.dot_product(X) >= 0.0, 1, 0)

if __name__ == '__main__':
    X = np.array([[0, 0, 0], [0, 0, 1], [0, 1, 0], [0, 1, 1], [1, 0, 0], [1, 0, 1], [1, 1, 0]])
    y = np.array([0, 1, 1, 1, 1, 1, 1])
    p = Perceptron()
    p.fit(X, y)
    print("Predicting the output of [1, 1, 1] = {}".format(p.predict([1, 1, 1])))
OUTPUT:
Predicting the output of [1, 1, 1] = 1
RESULT:
Thus the program to implement the binary classification model was written and executed
successfully.
EX.NO:3 CLASSIFICATION WITH NEAREST NEIGHBOURS DATE:
AIM:
To write the program for the implementation of the k-nearest neighbor algorithm
ALGORITHM:
Step 1 − For implementing any algorithm, we need dataset. So during the first step of KNN, we
must load the training as well as test data.
Step 2 − Next, we need to choose the value of K, i.e., the number of nearest data points to consider. K can be
any integer.
Step 3 − For each point in the test data, do the following −
3.1 − Calculate the distance between the test data point and each row of the training data using
any of the distance metrics, namely Euclidean, Manhattan, or Hamming distance. The most
commonly used method to calculate distance is Euclidean.
3.2 − Now, based on the distance value, sort them in ascending order.
3.3 − Next, choose the top K rows from the sorted array.
3.4 − Now, assign a class to the test point based on the most frequent class among these
rows.
Step 4 − End
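Before the scikit-learn program below, here is a minimal from-scratch sketch of the steps above (Euclidean distance, sort, take the top K, majority vote). The tiny dataset is made up purely for illustration.
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_test, k=3):
    # Step 3.1: Euclidean distance from the test point to every training row
    distances = np.sqrt(((X_train - x_test) ** 2).sum(axis=1))
    # Steps 3.2-3.3: sort by distance and take the top K rows
    nearest = np.argsort(distances)[:k]
    # Step 3.4: assign the most frequent class among those K rows
    return Counter(y_train[nearest]).most_common(1)[0][0]

X_train = np.array([[1.0, 1.0], [1.2, 0.8], [4.0, 4.0], [4.2, 3.9]])
y_train = np.array([0, 0, 1, 1])
print(knn_predict(X_train, y_train, np.array([1.1, 0.9]), k=3))  # expected: 0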
PROGRAM:
# Import necessary modules
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris

# Loading data
irisData = load_iris()

# Create feature and target arrays
X = irisData.data
y = irisData.target

# Split into training and test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

knn = KNeighborsClassifier(n_neighbors=7)
knn.fit(X_train, y_train)

# Predict on dataset which model has not seen before
print(knn.predict(X_test))
OUTPUT
[1 0 2 1 1 0 1 2 2 1 2 0 0 0 0 1 2 1 1 2 0 2 0 2 2 2 2 2 0 0]
PERFORMANCE
# Loading data
irisData = load_iris()

knn = KNeighborsClassifier(n_neighbors=7)
knn.fit(X_train, y_train)

# Accuracy of the trained model on the unseen test data
print(knn.score(X_test, y_test))
OUTPUT:
0.9666666666666667
MODEL ACCURACY:
# Import necessary modules
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
import numpy as np
import matplotlib.pyplot as plt

irisData = load_iris()

# Create feature and target arrays
X = irisData.data
y = irisData.target

# Split into training and test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

neighbors = np.arange(1, 9)
train_accuracy = np.empty(len(neighbors))
test_accuracy = np.empty(len(neighbors))

# Loop over K values
for i, k in enumerate(neighbors):
    knn = KNeighborsClassifier(n_neighbors=k)
    knn.fit(X_train, y_train)
    # Compute accuracy on the training and testing sets
    train_accuracy[i] = knn.score(X_train, y_train)
    test_accuracy[i] = knn.score(X_test, y_test)

# Generate plot
plt.plot(neighbors, test_accuracy, label='Testing dataset Accuracy')
plt.plot(neighbors, train_accuracy, label='Training dataset Accuracy')
plt.legend()
plt.xlabel('n_neighbors')
plt.ylabel('Accuracy')
plt.show()
OUTPUT:
RESULT :
Thus the program for the implementation of the k-nearest neighbor algorithm was verified and
executed successfully.
EX.NO:4 EXPERIMENT WITH VALIDATION SET AND TEST SET DATE:
AIM:
To write an experiment with validation sets and test sets for the given dataset.
PROCEDURE:
Training Dataset
The sample of data used to fit the model: the actual dataset that we use to train the model
(the weights and biases in the case of a neural network). The model sees and learns from this data.
Validation Dataset
The sample of data used to provide an unbiased evaluation of a model fit on the training dataset
while tuning model hyperparameters. The evaluation becomes more biased as skill on the
validation dataset is incorporated into the model configuration.
Test Dataset: The sample of data used to provide an unbiased evaluation of a final model fit on
the training dataset. The test dataset provides the gold standard used to evaluate the model. It is
only used once a model is completely trained (using the train and validation sets).
PROGRAM:
# Importing numpy & scikit-learn
import numpy as np
from sklearn.model_selection import train_test_split

# Sample data: 8 samples with 2 features each (matching the output below)
x = np.arange(16).reshape((8, 2))
y = range(8)

# Splitting dataset in 80-20 fashion, i.e.
# Training set is 80% of total data
# Testing set is 20% of total data
x_train, x_test, y_train, y_test = train_test_split(x, y,
                                                    train_size=0.8,
                                                    random_state=42)
# Training set
print("Training set x: ", x_train)
print("Training set y: ", y_train)
print(" ")
# Testing set
print("Testing set x: ", x_test)
print("Testing set y: ", y_test)

# Second dataset: 8 samples with 3 features each, split into
# training (80%), validation (10%) and testing (10%) sets
x = np.arange(24).reshape((8, 3))
y = range(8)

x_train, x_combine, y_train, y_combine = train_test_split(x, y,
                                                          train_size=0.8,
                                                          random_state=42)
x_val, x_test, y_val, y_test = train_test_split(x_combine, y_combine,
                                                test_size=0.5,
                                                random_state=42)
# Training set
print("Training set x: ", x_train)
print("Training set y: ", y_train)
print(" ")
# Testing set
print("Testing set x: ", x_test)
print("Testing set y: ", y_test)
print(" ")
# Validation set
print("Validation set x: ", x_val)
print("Validation set y: ", y_val)
OUTPUT:
Training set x: [[ 0 1]
[14 15]
[ 4 5]
[ 8 9]
[ 6 7]
[12 13]]
Training set y: [0, 7, 2, 4, 3, 6]
Testing set x: [[ 2 3]
[10 11]]
Testing set y: [1, 5]
Training set x: [[ 0 1 2]
[21 22 23]
[ 6 7 8]
[12 13 14]
[ 9 10 11]
[18 19 20]]
Training set y: [0, 7, 2, 4, 3, 6]
RESULT:
Thus the program for the implementation of an experiment with validation sets and test sets for
the given dataset was verified and executed successfully.
EX.NO:5 K-MEANS CLUSTERING DATE:
AIM:
To write a program for the implementation of K-means clustering on the given dataset.
PROCEDURE:
Step-1: Select the number K to decide the number of clusters.
Step-2: Select random K points or centroids. (It can be other from the input dataset).
Step-3: Assign each data point to their closest centroid, which will form the predefined K
clusters.
Step-4: Calculate the variance and place a new centroid of each cluster.
Step-5: Repeat the third step, which means reassigning each data point to the new closest centroid
of each cluster.
Step-6: If any reassignment occurs, then go to step-4 else go to FINISH.
Step-7: The model is ready.
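Before the scikit-learn program below, here is a minimal from-scratch sketch of the loop in Steps 2-6. The 2-D points are made up for illustration; the actual program uses sklearn's KMeans on the Iris data.
import numpy as np

def simple_kmeans(X, k=3, seed=0):
    rng = np.random.default_rng(seed)
    # Step 2: choose K random points from the data as initial centroids
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    labels = np.zeros(len(X), dtype=int)
    while True:
        # Step 3: assign each point to its closest centroid
        distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = distances.argmin(axis=1)
        # Step 6: stop when no reassignment occurs
        if np.array_equal(new_labels, labels):
            return centroids, labels
        labels = new_labels
        # Steps 4-5: move each centroid to the mean of its assigned points
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = X[labels == j].mean(axis=0)

X = np.array([[1.0, 1.0], [1.5, 2.0], [8.0, 8.0], [8.5, 9.0], [0.5, 1.5], [9.0, 8.5]])
print(simple_kmeans(X, k=2))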
PROGRAM
from sklearn.cluster import KMeans
import pandas as pd
from sklearn.preprocessing import MinMaxScaler
from matplotlib import pyplot as plt
from sklearn.datasets import load_iris
%matplotlib inline
iris = load_iris()
df = pd.DataFrame(iris.data,columns=iris.feature_names)
df.head()
OUTPUT:
df['flower'] = iris.target
df.head()
OUTPUT:
km = KMeans(n_clusters=3)
yp = km.fit_predict(df)
yp
OUTPUT:
df['cluster'] = yp
df.head(2)
OUTPUT:
df.cluster.unique()
OUTPUT:
df1 = df[df.cluster==0]
df2 = df[df.cluster==1]
df3 = df[df.cluster==2]
sse = []
k_rng = range(1,10)
for k in k_rng:
    km = KMeans(n_clusters=k)
    km.fit(df)
    sse.append(km.inertia_)
plt.xlabel('K')
plt.ylabel('Sum of squared error')
plt.plot(k_rng,sse)
OUTPUT:
RESULT :
Thus the program for the implementation of K-means clustering on the given dataset was verified and
executed successfully.
EX.NO:6 NAIVE BAYES CLASSIFIER DATE:
AIM:
To write a program to implement the Naïve Bayes model.
NAIVE BAYES CLASSIFIER ALGORITHM
Naive Bayes is one of the simplest and most powerful classification algorithms, based on
Bayes' theorem with an assumption of independence among the predictors.
The Naive Bayes classifier assumes that the presence of a feature in a class is not
related to any other feature.
Naive Bayes is a classification algorithm for binary and multi-class classification
problems.
Bayes' Theorem
Based on prior knowledge of conditions that may be related to an event, Bayes'
theorem describes the probability of the event. The conditional probability can be
found as follows. Assume we have a hypothesis (H) and evidence (E).
According to Bayes' theorem, the relationship between the probability of the
hypothesis before getting the evidence, P(H), and the probability of the hypothesis
after getting the evidence, P(H|E), is:
P(H|E) = P(E|H) * P(H) / P(E)
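A tiny worked example of the formula above; the probabilities here are made-up illustrative numbers, not values from the experiment.
# H = "patient has the disease", E = "test is positive"
p_h = 0.01           # P(H): prior probability of the hypothesis
p_e_given_h = 0.95   # P(E|H): probability of the evidence if H is true
p_e_given_not_h = 0.05

# P(E) by total probability: P(E|H)P(H) + P(E|not H)P(not H)
p_e = p_e_given_h * p_h + p_e_given_not_h * (1 - p_h)

# Bayes' theorem: P(H|E) = P(E|H) * P(H) / P(E)
p_h_given_e = p_e_given_h * p_h / p_e
print(round(p_h_given_e, 4))  # about 0.161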
Step 4: Putting it all together
Finally, we tie all the steps together and form our own Naive Bayes classifier model.
PROGRAM:
X, Y = zip(*in_time)        # in_time: (minutes, count) pairs, defined earlier (not shown)
X2, Y2 = zip(*too_late)     # too_late: (minutes, count) pairs, defined earlier (not shown)

bar_width = 0.9
plt.bar(X, Y, bar_width, color="blue", alpha=0.75, label="in time")
bar_width = 0.8
plt.bar(X2, Y2, bar_width, color="red", alpha=0.75, label="too late")
plt.legend(loc='upper right')
plt.show()

in_time_dict = dict(in_time)
too_late_dict = dict(too_late)

def catch_the_train(min):
    s = in_time_dict.get(min, 0)
    if s == 0:
        return 0
    else:
        m = too_late_dict.get(min, 0)
        return s / (s + m)
OUTPUT:
-1 0
0 1.0
1 1.0
2 1.0
3 1.0
4 1.0
5 1.0
6 0.6
7 0.4375
8 0.25
9 0.15
10 0.14285714285714285
11 0.11764705882352941
12 0
RESULT:
Thus the program to implement the Naïve Bayes classifier has been verified and executed
successfully.
Ex.No:7. MINI PROJECT. DATE:
ABSTRACT
1. INTRODUCTION
Deep learning is a subset of machine learning and artificial intelligence (AI) that mimics how a
human brain functions, and it allows computers to address complex patterns that create new insights and
solutions. If you’ve used technology like a digital assistant on your phone, received a text alerting you of
credit card fraud, or ridden in a self-driving car, you’ve used deep learning. A deep learning model is a
compilation of nodes that connect and layer in neural networks, much like the human brain. These networks
pass information through each layer, sending and receiving data to identify patterns. Deep learning models
use different types of neural networks to achieve specific solutions.
Deep learning models typically have three or more layers of neural networks to help process data.
These models have the ability to process data that’s unstructured or unlabeled, creating their own methods
for identifying and understanding the information without a person telling the computer what to look for or
solve. Because deep learning models can identify both higher and lower-level information, they can take
data sets that are difficult to understand and create simpler, more efficient categories. This ability allows the
deep learning model to grow more accurate over time. Deep learning is the key to the advancement of
artificial intelligence.
Natural language processing: This is a computer’s ability to understand text copy. You can use natural
language processing for translation services, chatbots, and keyword indexing.
Deep learning models work by interacting with immense sets of data and extracting patterns and
solutions from them through learning styles similar to what humans naturally do. They use artificial neural
networks to parse and process data sets. The networks operate using algorithms, which provide the
opportunity for the computer to adapt and learn on its own without needing a human to guide the learning.
Each type of deep learning model applies to different uses, but they all have the same learning and
training process in common. To train a deep learning model, huge sets of data need to feed into the network.
This information passes from neuron to neuron, allowing the computer to analyze and understand the data as
it moves through the network. Deep learning models are scalable and fast, so they can handle
whatever data sets you might want processed without needing a lot of setup or maintenance.
Pneumonia impacts the elderly and young people everywhere. With the high growth in the
popularity of neural networks, engineers and researchers have been able to build state-of-the-art products for
computer vision. Artificial intelligence helps us to automate analysis techniques, which is only possible now
because of the technology of deep learning. Exposure to pneumonia is quite high for many people, mainly
in economically underdeveloped and developing countries where the majority are deprived of a nutritious
diet. The World Health Organization states that more than 4 million untimely deaths per year occur from
diseases caused by air pollution.
The purpose of this project is to build an AI network which takes the pixel values of a
given X-ray image as input and then performs linear operations and activations on each of
them. Taking all of these operations and multiplying them across each layer of the neural
network and its nodes, you quickly have millions of operations. The objective, automated
detection of pneumonia represents a serious challenge in medical imaging, because the
signs of the illness are not obvious in CT or X-ray scans. Furthermore, it is an important task, since
millions of people die of pneumonia every year. The main goal of this work is to propose a solution
to the above-mentioned problem using a novel deep neural network architecture. The proposed
novelty consists in the use of dropout in the convolutional part of the network. The proposed method
was trained and tested on a set of 5856 labeled images available at one of Kaggle's many medical
imaging challenges. The chest X-ray images (anterior-posterior) were selected from retrospective
cohorts of pediatric patients, aged between one and five years, from Guangzhou Women and
Children's Medical Center, Guangzhou, China. Results achieved by our network would have placed
first in the Kaggle competition with the following metrics. A related study performed a four-class
classification of breast CT images based on two-dimensional fractional Fourier entropy features and
obtained a 92.3% averaged score.
2. LITERATURE REVIEW
Wang et al. [18] deployed a deep rank-based average pooling network that combined an n-conv rank-
based average pooling module with a CNN model inspired by the VGG network, reaching an average
score of 95.5% in a study elaborated on 1164 CT scans of 521 subjects.
Horry et al. [19] performed a comparative study of imaging modalities within the pneumonia detection
framework and found that pneumonia can be best detected from ultrasound scans. Rajaraman et al. [20]
presented a framework in which various CNN models deployed through transfer learning were involved
in ensemble learning. Their ensembles were trained using 16,700 CXR images originating from 4 public
databases, and achieved various accuracy rates between 94% and 99%.
Singh and Yow [21] proposed an interpretable deep learning neural network approach based on two
existing models, ProtoPNet and NP-ProtoPNet, and achieved an accuracy of 87.27%.
Most of the solutions presented above deploy transfer learning, meaning that the networks were
previously trained on images not related to pneumonia. There are several machine vision applications that
use CNNs built from scratch, which proves that a smaller architecture may obtain better accuracy than
several pretrained larger models deployed via transfer learning. In this study we propose to build a novel
CNN architecture to provide an accurate solution to the problem of pneumonia detection. The main
novelty consists in the fact that our CNN model uses dropout in the convolutional part of the network,
unlike the majority of existing architectures, which use dropout exclusively in the fully connected part of
the network, where the main part of the parameters is trained. This paper shows that the proposed model
can provide accurate classification despite the reduced number of trained parameters and can even benefit
from this property in terms of efficiency.
3. SOFTWARE REQUIREMENT SPECIFICATION
This software requirement specification (SRS) report expresses a complete description of
the proposed system. This document includes all the functions and specifications with their
explanations to solve the related problems.
SOFTWARE REQUIREMENTS
● Google Colab
● TensorFlow v2.7.0
● CUDA v11.5
● cuDNN v8.3
HARDWARE REQUIREMENTS
● 8 GB RAM
● Intel Core i5 / AMD Ryzen 5
● CUDA-enabled GeForce GTX 1650
● 256 GB SSD and 1 TB HDD
Project Purpose
The purpose of the project is to develop a system that provides an efficient and effective solution over
the conventional way of detecting pneumonia, using a CNN.
Project Scope
Assumptions and Dependencies
• Assumptions:
3. User-Friendly.
• Dependencies:
1. All necessary software is available for implementing and use of the system.
2. The proposed system would be designed, developed and implemented based on the software
requirements specifications document.
3. End users should have basic knowledge of computers and we also assure that the users will be given
software training documentation and reference material.
FUNCTIONAL REQUIREMENT
System Feature 1 (Functional Requirement)
A functional requirement describes features, functioning, and usage of a product/system or
software from the perspective of the product and its user. Functional requirements are also called
functional specifications, where the synonym for specification is design. Provide a user-friendly and
interactive interface as per standards.
NON-FUNCTIONAL REQUIREMENT
Performance Requirements
• High Speed: The system should process requested tasks in parallel for various actions to give a quick
response; the system must then wait for process completion.
• Accuracy: The system should correctly execute the process and display the result accurately. System
output should be in the user-required format.
• Safety Requirements:
The data safety must be ensured by arranging for a secure and reliable transmission medium.
Security Requirements
1. Runtime System Qualities: Runtime System Qualities can be measured as the system executes.
2. Functionality: The ability of the system to do the work for which it was intended.
3. Performance: The response time, utilization, and throughput behavior of the system. Not to be
confused with human performance or system delivery time.
4. Security: A measure of the system's ability to resist unauthorized attempts at usage or behavior
modification, while still providing service to legitimate users.
5. Availability: (Reliability quality attributes fall under this category.) The measure of time that the
system is up and running correctly; the length of time between failures and the length of time
needed to resume operation after a failure.
6. Usability: The ease of use and of training the end users of the system. Subqualities: learnability,
efficiency, affect, helpfulness, control.
7. Interoperability: The ability of two or more systems to cooperate at runtime.
SYSTEM REQUIREMENT
ANALYSIS MODELS (SDLC MODEL TO BE APPLIED)
One of the basic notions of the software development process is SDLC models, which stands for
Software Development Life Cycle models. SDLC is a continuous process which starts from the
moment the decision to launch the project is made and ends at the moment of its full removal
from exploitation. There is no single SDLC model. They are divided into main groups, each
with its features and weaknesses. Evolving from the first and oldest waterfall SDLC model, their
variety has significantly expanded.
The diversity of SDLC models is determined by the wide range of product types, from web
application development to complex medical software. Whichever of the SDLC models
mentioned below you take as the basis, it should be adjusted to the features of the product, project,
and company.
The most used, popular and important SDLC models are given below:
1. Waterfall Model
2. Iterative Model
3. Spiral Model
4. V-shaped Model
5. Agile Model
Waterfall Model:
Waterfall is a cascade SDLC model in which the development process looks like a flow, moving
step by step through the phases of analysis, projecting, realization, testing, implementation, and support.
This SDLC model includes gradual and complete execution of every stage. The process is strictly documented
and predefined, with features expected of every phase of this software development life cycle model.
Figure 3.1: Waterfall Model
Each software development life cycle model starts with the analysis, in which the
stakeholders of the process discuss the requirements for the final product. The goal of this stage is the
detailed definition of the system requirements. Besides, it is necessary to make sure that all the process
participants have clearly understood the tasks and how every requirement is going to be
implemented. Often, the discussion involves the QA specialists, who can contribute to the process with
additions even during the development stage if necessary.
The programming stage itself consists of four steps:
• Algorithm development
• Source code writing
• Compilation
• Testing and debugging
4. Testing
The testing phase includes the debugging process. All the code flaws missed during
development are detected here, documented, and passed back to the developers to fix. The testing
process repeats until all the critical issues are removed and the software works stably.
5. Deployment
When the program is finalised and has no critical issues, it is time to launch it for the end users.
After the new program version is released, the tech support team joins. This department handles user
feedback and consults and supports users during the time of exploitation. Moreover, the update of selected
components is included in this phase, to make sure that the software is up-to-date and is invulnerable to
security breaches.
DESIGN AND MODELLING OF SYSTEM
The Unified Modelling Language (UML) is analogous to the blueprints used in other fields and consists
of different types of diagrams. In the aggregate, UML diagrams describe the boundary, structure, and
behaviour of the system and the objects within it.
Following UML diagrams have been designed for the project:
1. Use Case Diagram
2. Class Diagram
3. Sequence diagram
4. Activity Diagram
5. State Diagram
Use Case Diagram
Use case diagrams are usually referred to as behaviour diagrams used to describe a set of
actions that some system or systems should or can perform in collaboration with one or more external
users of the system. Each use case should provide some observable and valuable result to the actors or
other stakeholders of the system. Figure 1 shows the use case diagram of the system.
Use Case Diagram
Class Diagram
A class diagram is an illustration of the relationships and source code dependencies among
classes in the Unified Modelling Language (UML). In this context, a class defines the methods and
variables in an object, which is a specific entity in a program or the unit of code representing that
entity. Figure 2 shows the class diagram of the project; the various classes used in the diagram are
User, Student, Teacher, Image, Cloud, and Face Recognition.
Fig 4.2. Class Diagram
Sequence Diagram
Sequence diagrams are sometimes called event diagrams or event scenarios. A sequence
diagram shows, as parallel vertical lines (lifelines), different processes or objects that live
simultaneously, and, as horizontal arrows, the messages exchanged between them, in the order in which
they occur.
The activity diagram is another important diagram in UML to describe the dynamic aspects of
the system. An activity diagram is basically a flowchart representing the flow from one activity to another
activity. The activity can be described as an operation of the system.
Fig 4.4. Activity Diagram
State Diagram
A state diagram, sometimes known as a state machine diagram, is a type of behavioural diagram in
the Unified Modelling Language (UML) that shows transitions between the various states of an object.
Fig 4.5. State Diagram
5. ALGORITHM AND METHODS
Data Acquisition
The database used to evaluate the performance of the model contains a total of 5863 X-ray
images from Kaggle.
Fig 5.1 Examples from the data set: (a) normal cases, (b) pneumonia cases
Data Pre-processing
The pre-processing strategies used throughout this work are listed in Table 2. In our study, rescale is a value
by which we multiply the data before any other processing. Our original images consist of RGB
coefficients in the range 0-255, but such values would be too high for our models to process (given a
typical learning rate), so we target values between 0 and 1 instead by scaling with a factor of 1/255. The
shear range is for randomly applying shearing transformations, the zoom range is for randomly zooming
inside pictures, and horizontal flip is for randomly flipping half of the images horizontally, which is relevant
when there are no assumptions of horizontal asymmetry (e.g. real-world pictures). A sketch of these settings
is given below.
Table 2: Data pre-processing techniques used in this study
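A sketch of the pre-processing settings described above, using Keras' ImageDataGenerator. The directory path, target size and batch size are placeholders/assumptions, not the exact values used in the project.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_datagen = ImageDataGenerator(
    rescale=1./255,        # bring RGB values from 0-255 down to 0-1
    shear_range=0.2,       # randomly apply shearing transformations
    zoom_range=0.2,        # randomly zoom inside pictures
    horizontal_flip=True)  # randomly flip half of the images horizontally

test_datagen = ImageDataGenerator(rescale=1./255)  # only rescale the test data

train_generator = train_datagen.flow_from_directory(
    'chest_xray/train',            # placeholder path to the training images
    target_size=(500, 500),
    color_mode='grayscale',
    batch_size=32,
    class_mode='binary')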
Proposed Network
In this study, we designed a CNN model to extract the features of chest X-ray images and use
those features to detect whether a patient suffers from pneumonia. In our CNN architecture, we began with
a lower filter value of 32 and increased it layer-wise. We constructed the model with a layer of Conv2D
followed by a layer of MaxPooling. The kernel size is preferred to be an odd number, such as 3x3.
Tanh, ReLU, etc. can be used as the activation function, but ReLU is the most preferred one. The
input shape takes in the image width and height, with the last dimension as the colour channel. We then
flatten the input after the CNN layers and add ANN (dense) layers.
ReLU: f(x) = max(0, x)
Sigmoid: S(x) = 1 / (1 + e^(-x))
We used the softmax activation function for the last (ANN) layer, with the number of units defined as the
total number of classes; for binary classification we used sigmoid and set the number of units to 1. A
sketch of such an architecture is given below.
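A sketch of the kind of architecture described above: Conv2D blocks starting at 32 filters and increasing layer-wise, 3x3 kernels, ReLU, max pooling, dropout inside the convolutional part, then Flatten and dense (ANN) layers with a single sigmoid unit for binary classification. The exact filter counts, dropout rates and input size are assumptions, not the project's final model.
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Dropout, Flatten, Dense

model = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(500, 500, 1)),
    MaxPooling2D((2, 2)),
    Dropout(0.2),                      # dropout in the convolutional part
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    Dropout(0.2),
    Flatten(),
    Dense(128, activation='relu'),     # ANN layer
    Dense(1, activation='sigmoid')     # 1 unit + sigmoid for binary output
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
model.summary()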
operations. The first layer filters the image with several convolution kernels and returns "feature maps",
which are then normalized (with an activation function) and/or resized.
Fig 5.4.1 Conv Layer: image and convolved feature (feature map)
Pool Layers
The second block is not characteristic of a CNN: it is in fact found at the end of all the neural
networks used for classification. The input vector values are transformed (with several linear
combinations and activation functions) to return a new vector as output. This last vector contains as
many elements as there are classes: element i represents the probability that the image belongs to class
i. Each element is therefore between 0 and 1, and the sum of all of them is 1. These probabilities are
calculated by the last layer of this block (and therefore of the network), which uses a sigmoid function
(binary classification) or a softmax function (multi-class classification) as an activation function.
between inputs and outputs. Common activation functions include the sigmoid, ReLU, and tanh
functions.
ReLU
The rectified linear unit is the most widely used and preferred activation function right now. Its output
ranges from 0 to infinity; all the negative values are converted into zero.
f(x) = max(0, x)
Sigmoid
The sigmoid function, also called the logistic function, can take any real value and map it to a value
between 0 and 1. It decides which values to pass as output and which not to pass. Both functions are
sketched below.
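A small NumPy sketch of the two activation functions described above, applied to a few made-up sample values.
import numpy as np

def relu(x):
    # ReLU: negative values become zero, positives pass through unchanged
    return np.maximum(0, x)

def sigmoid(x):
    # Sigmoid maps any real value into the range (0, 1)
    return 1 / (1 + np.exp(-x))

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))     # [0.  0.  0.  0.5 2. ]
print(sigmoid(x))  # values between 0 and 1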
VGG16 Model
VGG16 is a popular convolutional neural network architecture for image classification tasks. It
consists of 16 layers, including 13 convolutional layers and 3 fully connected layers. The convolutional
layers use small 3x3 filters with a stride of 1 and are followed by max pooling layers. The fully connected
layers at the end of the network perform the classification task. The VGG16 model has achieved
state-of-the-art performance on several image recognition benchmarks, making it a popular choice for
computer vision tasks.
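A sketch of how VGG16 could be reused for a binary chest X-ray task via transfer learning. Using ImageNet weights and a 224x224x3 input is an assumption here, not necessarily how the project's model was built.
from tensorflow.keras.applications import VGG16
from tensorflow.keras import Model
from tensorflow.keras.layers import GlobalAveragePooling2D, Dense

base = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
base.trainable = False                      # freeze the 13 convolutional layers

x = GlobalAveragePooling2D()(base.output)   # pool the final feature maps
output = Dense(1, activation='sigmoid')(x)  # binary classification head

model = Model(inputs=base.input, outputs=output)
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])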
Data Acquisition
Fig 6.1.1 Mount Drive
Fig 6.1 Import Train and Test Data
Data Augmentation
Fig 6.2.3 Augmented Dataset Images
We have analyzed the performance of the following CNN architectures: AlexNet, ResNet-50, and
VGG Net-16. [3]
Fig 6.3 ResNet-50, Fig 6.4 AlexNet, Fig 6.5 VGG16
Image pre-processing
We'll use an image augmentation approach to artificially boost the size of the image training dataset. Image
augmentation increases the size of the data set by creating modified versions of the existing training set
photos, which increases data set variation and, as a result, improves the model's ability to generalize.
7. Confusion Matrix
Let us interpret the output of the confusion matrix. The upper left (TP) denotes the number of
images correctly predicted as normal cases and the bottom right (TN) denotes the number of images
correctly predicted as cases of pneumonia. The upper right denotes the number of images incorrectly
predicted as pneumonia that were actually normal cases, and the lower left denotes the number of images
incorrectly predicted as normal that were actually pneumonia cases. A sketch of how such a matrix is
computed is given below.
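A small sketch of computing a confusion matrix with scikit-learn; the label arrays here are made up (0 = normal, 1 = pneumonia) and are not the project's test results.
from sklearn.metrics import confusion_matrix

y_true = [0, 0, 0, 1, 1, 1, 1, 0]   # actual classes
y_pred = [0, 0, 1, 1, 1, 0, 1, 0]   # model predictions

# Rows are the actual classes, columns are the predicted classes:
# [[correctly predicted normal,     normal predicted as pneumonia],
#  [pneumonia predicted as normal,  correctly predicted pneumonia]]
print(confusion_matrix(y_true, y_pred))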
Mathematical Model
Fig 7.3 Predicted Images
● This provides a percentage estimate for an individual image, which you may load straight
from your hard drive by specifying its path.
● After importing the image as we did previously, we must recreate all of the data pre-processing
procedures in order to feed the test image into the model and obtain a prediction.
Importing the tensorflow.keras.preprocessing.image class is required for pre-processing.
● Import an image with dimensions of (500, 500) and a grayscale colour channel.
User Interface Using Flask
Flask is a micro web framework written in Python that allows developers to quickly
build web applications. It provides simple and easy-to-use tools and libraries for routing requests,
handling HTTP requests and responses, and rendering templates. Flask is known for its flexibility and
extensibility, making it a popular choice among developers for building web applications.
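A minimal sketch of a Flask front end for the model: one route that accepts an uploaded X-ray image and returns the predicted class. The model filename, image size and route are assumptions, not the project's exact code.
import numpy as np
from flask import Flask, request
from tensorflow.keras.models import load_model
from tensorflow.keras.preprocessing.image import load_img, img_to_array

app = Flask(__name__)
model = load_model('pneumonia_cnn.h5')   # placeholder path to the trained model

@app.route('/predict', methods=['POST'])
def predict():
    file = request.files['image']                  # uploaded X-ray image
    file.save('upload.png')
    img = load_img('upload.png', target_size=(500, 500), color_mode='grayscale')
    x = img_to_array(img) / 255.0                  # same rescaling as training
    x = np.expand_dims(x, axis=0)                  # add the batch dimension
    prob = float(model.predict(x)[0][0])           # sigmoid output in [0, 1]
    return {'prediction': 'PNEUMONIA' if prob >= 0.5 else 'NORMAL',
            'probability': prob}

if __name__ == '__main__':
    app.run(debug=True)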
Fig 7.4 Frontend User Interface with Flask
8. CONCLUSION
This study describes a CNN-based model aiming to diagnose pneumonia on a chest X-ray image
set. The contributions in this paper are listed as follows. We designed a CNN model to extract the features
from original images or previous feature maps, which contained only six layers combining the ReLU
activation function, dropout operations, and max-pooling layers. The obtained accuracy rate of
92.07% and precision rate of 91.41% show that our proposed model performs well in comparison to
state-of-the-art CNN model architectures. To illustrate the performance of our proposed model, several
comparisons of different input shapes and loss functions were provided.
In the future, we will continue the research to explore more accurate classification architectures
to diagnose the two types of pneumonia, viral and bacterial. According to the discussion
above, the CNN-based model is a promising method to diagnose the disease through X-rays.
9. BIBLIOGRAPHY
[1] Vandecia Fernandes et al., "Bayesian convolutional neural network estimation for pediatric
pneumonia detection and diagnosis", Computer Methods and Programs in Biomedicine, Elsevier, 2021.
[5] Md. Jahid Hasan et al., "Deep Learning-based Detection and Segmentation of COVID-19 &
Pneumonia on Chest X-ray Image", 2021 International Conference on Information and Communication
Technology for Sustainable Development (ICICT4SD), 27-28 February 2021.
[6] https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia
[7] LeCun, Y.; Boser, B.; Denker, J.S.; Henderson, D.; Howard, R.E.; Hubbard, W.; Jackel, L.D.
Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989, 1, 541–551.
[8] Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural
networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105.
[10] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh and D. Batra, "Grad-CAM: Visual
Explanations from Deep Networks via Gradient-Based Localization," 2017 IEEE International
Conference on Computer Vision (ICCV), Venice, 2017, pp. 618–626.