
03/09/2025, 08:04 question2ml.ipynb - Colab

'''3. Data Preparation

Download the "Spambase Data Set" from the UCI Machine Learning Repository
(https://archive.ics.uci.edu/ml/datasets/Spambase). This dataset contains email
messages, where the goal is to predict whether a message is spam or not based on
several input features.

Implementation:
Implement Bernoulli Naive Bayes, Multinomial Naive Bayes, and Gaussian Naive Bayes
classifiers using the scikit-learn library.

Results:
Report the following performance metrics for each classifier:
Accuracy, Precision, Recall, F1 score, Confusion matrix'''

import pandas as pd

# Spambase ships without a header row; the last column is the label
df = pd.read_csv("/content/drive/MyDrive/MLlabfiles/spambase.data", header=None)

X = df.iloc[:, :-1]  # Features
y = df.iloc[:, -1]   # Labels (1 = spam, 0 = not spam)

from sklearn.naive_bayes import BernoulliNB
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import classification_report, confusion_matrix

# Create the model and generate out-of-fold predictions
bnb = BernoulliNB()
y_pred_bnb = cross_val_predict(bnb, X, y, cv=10)
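As a side note, `cross_val_predict` returns exactly one out-of-fold prediction per sample, so the metrics computed later score predictions made by models that never saw the corresponding sample during training. A minimal self-contained sketch on toy data (the arrays here are made up, not the Spambase features):

```python
import numpy as np
from sklearn.naive_bayes import BernoulliNB
from sklearn.model_selection import cross_val_predict

# Toy binary data (hypothetical, just to show the return shape)
rng = np.random.default_rng(0)
X_toy = rng.integers(0, 2, size=(20, 5))
y_toy = np.array([0, 1] * 10)  # balanced so stratified CV folds are valid

preds = cross_val_predict(BernoulliNB(), X_toy, y_toy, cv=5)
print(preds.shape)  # one out-of-fold prediction per sample: (20,)
```

Because each prediction is out-of-fold, scoring `preds` against `y_toy` behaves like an aggregated cross-validated evaluation rather than a training-set score.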

from sklearn.naive_bayes import MultinomialNB

mnb = MultinomialNB()
y_pred_mnb = cross_val_predict(mnb, X, y, cv=10)

from sklearn.naive_bayes import GaussianNB

gnb = GaussianNB()
y_pred_gnb = cross_val_predict(gnb, X, y, cv=10)


from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

results = {
    "BernoulliNB": {
        "accuracy": accuracy_score(y, y_pred_bnb),
        "precision": precision_score(y, y_pred_bnb),
        "recall": recall_score(y, y_pred_bnb),
        "f1": f1_score(y, y_pred_bnb)
    },
    "MultinomialNB": {
        "accuracy": accuracy_score(y, y_pred_mnb),
        "precision": precision_score(y, y_pred_mnb),
        "recall": recall_score(y, y_pred_mnb),
        "f1": f1_score(y, y_pred_mnb)
    },
    "GaussianNB": {
        "accuracy": accuracy_score(y, y_pred_gnb),
        "precision": precision_score(y, y_pred_gnb),
        "recall": recall_score(y, y_pred_gnb),
        "f1": f1_score(y, y_pred_gnb)
    }
}

df_results = pd.DataFrame(results).T  # Transpose to get models as rows
print(df_results)
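The problem statement also asks for confusion matrices, which the notebook never prints. One way to report them from cross-validated predictions is sketched below on hypothetical stand-in arrays; in the notebook, `y` and the `y_pred_*` arrays above would take their place:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical stand-ins for y and y_pred_bnb from the notebook
y_true = np.array([0, 0, 1, 1, 1])
y_hat = np.array([0, 1, 1, 1, 0])

# Rows are true classes, columns are predicted classes:
# [[TN, FP],
#  [FN, TP]]
cm = confusion_matrix(y_true, y_hat)
print(cm)
```

Looping the same call over each classifier's predictions would complete the reporting the docstring asks for.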

https://colab.research.google.com/drive/1uvmteLkTYx5yoQrWldQIe345NsQXKt6j
               accuracy  precision    recall        f1
BernoulliNB    0.883938   0.881336  0.815223  0.846991
MultinomialNB  0.786351   0.732363  0.721456  0.726869
GaussianNB     0.821778   0.700444  0.956977  0.808858



'''4. Download the "LLM - Detect AI Generated Text" dataset, which contains both
AI-generated and human-written essays.
Dataset link: https://www.kaggle.com/datasets/sunilthite/llm-detect-ai-generated-text-dataset
The dataset contains more than 28,000 essays, written by students or generated by AI.

Features:
text: the essay text
generated: the target label. 0 = human-written essay, 1 = AI-generated essay

Implementation:
Implement Bernoulli Naive Bayes, Multinomial Naive Bayes, and Gaussian Naive Bayes
classifiers using the scikit-learn library in Python. Use 10-fold cross-validation
to evaluate the performance of each classifier on the dataset. You should use the
default hyperparameters for each classifier.'''
import pandas as pd
import numpy as np

df = pd.read_csv("/content/drive/MyDrive/MLlabfiles/Training_Essay_Data.csv")
X_text = df['text']    # Essay text
y = df['generated']    # Labels: 0 = human-written, 1 = AI-generated

from sklearn.feature_extraction.text import TfidfVectorizer

# Cap the vocabulary at 5000 terms; this limit can be tuned
tfidf = TfidfVectorizer(stop_words='english', max_features=5000)
X_tfidf = tfidf.fit_transform(X_text)
from sklearn.naive_bayes import BernoulliNB
from sklearn.model_selection import cross_val_predict
from sklearn.metrics import classification_report, confusion_matrix

bnb = BernoulliNB()
y_pred_bnb = cross_val_predict(bnb, X_tfidf, y, cv=10)
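Worth noting: `BernoulliNB` binarizes its input at the default threshold `binarize=0.0`, so the TF-IDF weights are effectively reduced to word presence/absence before fitting. A tiny illustration on made-up values:

```python
import numpy as np
from sklearn.naive_bayes import BernoulliNB

# Made-up TF-IDF-like weights; BernoulliNB sees them as 0/1 presence flags
X_demo = np.array([[0.0, 0.3],
                   [0.7, 0.0]])
y_demo = np.array([0, 1])

clf = BernoulliNB().fit(X_demo, y_demo)
print(clf.predict(X_demo))
```

This is why BernoulliNB can behave quite differently from MultinomialNB on the same TF-IDF matrix: the former models which words occur, the latter weights how strongly they occur.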

from sklearn.naive_bayes import MultinomialNB

mnb = MultinomialNB()
y_pred_mnb = cross_val_predict(mnb, X_tfidf, y, cv=10)

from sklearn.naive_bayes import GaussianNB

# GaussianNB does not accept sparse input, so densify the TF-IDF matrix
gnb = GaussianNB()
y_pred_gnb = cross_val_predict(gnb, X_tfidf.toarray(), y, cv=10)

from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

results = {
    "BernoulliNB": {
        "accuracy": accuracy_score(y, y_pred_bnb),
        "precision": precision_score(y, y_pred_bnb),
        "recall": recall_score(y, y_pred_bnb),
        "f1": f1_score(y, y_pred_bnb)
    },
    "MultinomialNB": {
        "accuracy": accuracy_score(y, y_pred_mnb),
        "precision": precision_score(y, y_pred_mnb),
        "recall": recall_score(y, y_pred_mnb),
        "f1": f1_score(y, y_pred_mnb)
    },
    "GaussianNB": {
        "accuracy": accuracy_score(y, y_pred_gnb),
        "precision": precision_score(y, y_pred_gnb),
        "recall": recall_score(y, y_pred_gnb),
        "f1": f1_score(y, y_pred_gnb)
    }
}

df_results = pd.DataFrame(results).T  # Transpose to get models as rows
print(df_results)

               accuracy  precision    recall        f1
BernoulliNB    0.951484   0.946536  0.931082  0.938745
MultinomialNB  0.924104   0.901850  0.908825  0.905324
GaussianNB     0.928495   0.877977  0.953424  0.914147
