I Implementation of Regression

The document outlines the implementation of regression models using the California Housing dataset, including Linear Regression, Random Forest, and Support Vector Regression. It details the process of loading the dataset, training the models, making predictions, and calculating the mean squared error for each model. Finally, it presents visual comparisons of the models' performance through bar charts and scatter plots of predicted versus true values.

Uploaded by Yuvarani Aruchamy

I IMPLEMENTATION OF REGRESSION

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import fetch_california_housing
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.ensemble import RandomForestRegressor
from sklearn.svm import SVR
from sklearn.metrics import mean_squared_error
import pandas as pd

# Load the California Housing dataset

housing = fetch_california_housing()

# Convert the dataset into a DataFrame for easier inspection

housing_df = pd.DataFrame(housing.data,
                          columns=housing.feature_names)

# Display the shape of the dataset

print(f"Shape of the dataset: {housing_df.shape}")

# Display the first few rows of the dataset

print("First few rows of the dataset:")
print(housing_df.head())

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(
    housing.data, housing.target, test_size=0.2, random_state=42)

# Train a linear regression model

lr = LinearRegression()
lr.fit(X_train, y_train)
# Make predictions on the testing set
y_pred_lr = lr.predict(X_test)

# Calculate the mean squared error for the linear regression model

mse_lr = mean_squared_error(y_test, y_pred_lr)
print(f"Linear Regression Mean Squared Error: {mse_lr:.2f}")

# Train a random forest regression model

rf = RandomForestRegressor(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)

# Make predictions on the testing set

y_pred_rf = rf.predict(X_test)

# Calculate the mean squared error for the random forest regression model

mse_rf = mean_squared_error(y_test, y_pred_rf)
print(f"Random Forest Mean Squared Error: {mse_rf:.2f}")

# Train a Support Vector Regression model

svr = SVR()
svr.fit(X_train, y_train)

# Make predictions on the testing set

y_pred_svr = svr.predict(X_test)

# Calculate the mean squared error for the SVR model

mse_svr = mean_squared_error(y_test, y_pred_svr)
print(f"Support Vector Regression Mean Squared Error: {mse_svr:.2f}")

# Plotting the comparison of the models

models = ['Linear Regression', 'Random Forest',
          'Support Vector Regression']
mse_values = [mse_lr, mse_rf, mse_svr]

# Bar chart of Mean Squared Errors for all models

plt.figure(figsize=(8, 6))
plt.bar(models, mse_values, color=['blue', 'green', 'orange'])
plt.title('Mean Squared Error Comparison')
plt.ylabel('Mean Squared Error')
plt.show()

# Plotting predicted vs true values for all models

plt.figure(figsize=(18, 6))

# Linear Regression
plt.subplot(1, 3, 1)
plt.scatter(y_test, y_pred_lr, color='blue', alpha=0.5)
plt.plot([min(y_test), max(y_test)], [min(y_test), max(y_test)],
         color='red', linestyle='--')
plt.title('Linear Regression: Predicted vs True')
plt.xlabel('True Values')
plt.ylabel('Predicted Values')

# Random Forest
plt.subplot(1, 3, 2)
plt.scatter(y_test, y_pred_rf, color='green', alpha=0.5)
plt.plot([min(y_test), max(y_test)], [min(y_test), max(y_test)],
         color='red', linestyle='--')
plt.title('Random Forest: Predicted vs True')
plt.xlabel('True Values')
plt.ylabel('Predicted Values')

# Support Vector Regression

plt.subplot(1, 3, 3)
plt.scatter(y_test, y_pred_svr, color='orange', alpha=0.5)
plt.plot([min(y_test), max(y_test)], [min(y_test), max(y_test)],
         color='red', linestyle='--')
plt.title('SVR: Predicted vs True')
plt.xlabel('True Values')
plt.ylabel('Predicted Values')

plt.tight_layout()
plt.show()
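
The script above fits SVR() directly on the raw features. SVR with its default RBF kernel is sensitive to feature scale, so a common variation is to standardize the features first. The following is a minimal sketch of that idea, not part of the original record: the helper name build_scaled_svr and the synthetic data (two features on very different scales, standing in for the housing data so the example runs without a download) are assumptions made for illustration.

```python
import numpy as np
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR


def build_scaled_svr():
    """Hypothetical helper: standardize features, then fit a default SVR."""
    return make_pipeline(StandardScaler(), SVR())


# Synthetic stand-in data (an assumption, not the housing dataset):
# feature 0 has unit scale, feature 1 has a scale of about 1000.
rng = np.random.default_rng(42)
X = np.column_stack([rng.normal(0, 1, 500), rng.normal(0, 1000, 500)])
y = 2.0 * X[:, 0] + 0.002 * X[:, 1] + rng.normal(0, 0.1, 500)

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, random_state=42)

# Same model class, fitted without and with standardization
raw_svr = SVR().fit(X_tr, y_tr)
scaled_svr = build_scaled_svr().fit(X_tr, y_tr)

mse_raw = mean_squared_error(y_te, raw_svr.predict(X_te))
mse_scaled = mean_squared_error(y_te, scaled_svr.predict(X_te))

print(f"SVR on raw features, MSE: {mse_raw:.2f}")
print(f"SVR on standardized features, MSE: {mse_scaled:.2f}")
```

To try this on the housing data, svr = SVR() in the script could be replaced by svr = make_pipeline(StandardScaler(), SVR()); the rest of the script would stay the same, since the pipeline exposes the same fit and predict methods. Whether and how much the error changes on that dataset would need to be checked by running it.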

Output:

Shape of the dataset: (20640, 8)

First few rows of the dataset:

   MedInc  HouseAge  AveRooms  AveBedrms  Population  AveOccup  Latitude  Longitude
0  8.3252      41.0  6.984127   1.023810       322.0  2.555556     37.88    -122.23
1  8.3014      21.0  6.238137   0.971880      2401.0  2.109842     37.86    -122.22
2  7.2574      52.0  8.288136   1.073446       496.0  2.802260     37.85    -122.24
3  5.6431      52.0  5.817352   1.073059       558.0  2.547945     37.85    -122.25
4  3.8462      52.0  6.281853   1.081081       565.0  2.181467     37.85    -122.25

Linear Regression Mean Squared Error: 0.56

Random Forest Mean Squared Error: 0.26
Support Vector Regression Mean Squared Error: 1.33
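
The mean squared errors above are in squared target units, which makes them hard to read directly. The target of the California Housing dataset is the median house value in units of $100,000, so taking the square root (RMSE) and multiplying by 100,000 gives a typical error in dollars. A short sketch of that conversion, using the three MSE values copied from the output above; the function name rmse_in_dollars and the unit parameter are illustrative choices, not part of the original script.

```python
import math

# MSE values copied from the printed output above
mse_by_model = {
    "Linear Regression": 0.56,
    "Random Forest": 0.26,
    "Support Vector Regression": 1.33,
}


def rmse_in_dollars(mse, unit=100_000):
    """Convert an MSE in (target units)^2 to an RMSE in dollars.

    Assumes the target is expressed in multiples of `unit` dollars,
    as it is for the California Housing dataset.
    """
    return math.sqrt(mse) * unit


for name, mse in mse_by_model.items():
    print(f"{name}: RMSE of roughly ${rmse_in_dollars(mse):,.0f}")
```

Because the printed MSE values are rounded to two decimals, the dollar figures are approximate.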