0% found this document useful (0 votes)

39 views3 pages

Housing Prices Linear Regression

The document outlines a Python script that uses the scikit-learn library to perform linear regression on a housing dataset. It includes data preprocessing steps such as label encoding for categorical variables and normalization of features. The model is trained and evaluated using metrics like mean absolute error, mean squared error, and R-squared score.

Uploaded by

rananavdeep65

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views3 pages

Housing Prices Linear Regression

Uploaded by

rananavdeep65

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

from sklearn.

linear_model import LinearRegression

from sklearn.metrics import mean_squared_error,mean_absolute_error,r2_score
from sklearn.model_selection import train_test_split

import pandas as pd
data=pd.read_csv("Housing.csv")
data

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2

3 12215000 7500 4 2 2 yes no yes no yes 3

4 11410000 7420 4 1 2 yes yes yes no yes 2

... ... ... ... ... ... ... ... ... ... ... ...

540 1820000 3000 2 1 1 yes no yes no no 2

541 1767150 2400 3 1 1 no no no no no 0

542 1750000 3620 2 1 1 yes no no no no 0

543 1750000 2910 3 1 1 no no no no no 0

544 1750000 3850 3 1 2 yes no no no no 0

545 rows × 13 columns

data.head(5) #first 5 rows will be printed.

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3 no

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2 no

data.head(10)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2

5 10850000 7500 3 3 1 yes no yes no yes 2 yes

6 10150000 8580 4 3 4 yes no no no yes 2 yes

7 10150000 16200 5 3 2 yes no no no no 0

8 9870000 8100 4 1 2 yes yes yes no yes 2 yes

9 9800000 5750 3 2 4 yes yes no no yes 1 yes

data.shape #tells us the number of rows and columns present in the csv file.

(545, 13)

data.info() #this returns not null values,column,datatype,and information about the data.
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 545 entries, 0 to 544
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 price 545 non-null int64
1 area 545 non-null int64
2 bedrooms 545 non-null int64
3 bathrooms 545 non-null int64
4 stories 545 non-null int64
5 mainroad 545 non-null object
6 guestroom 545 non-null object
7 basement 545 non-null object
8 hotwaterheating 545 non-null object
9 airconditioning 545 non-null object
10 parking 545 non-null int64
11 prefarea 545 non-null object
12 furnishingstatus 545 non-null object
dtypes: int64(6), object(7)
memory usage: 55.5+ KB

from sklearn.preprocessing import LabelEncoder, MinMaxScaler #this command will convert object datatype into integer
le=LabelEncoder() #it converts the categorical entries into numerical entries.
data["mainroad"]=le.fit_transform(data["mainroad"])
data
#change raw feature vectors into a representation that is more suitable for the downstream estimators-sklearn.preproc

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 no no no yes 2

1 12250000 8960 4 4 4 1 no no no yes 3

2 12250000 9960 3 2 2 1 no yes no no 2

3 12215000 7500 4 2 2 1 no yes no yes 3

4 11410000 7420 4 1 2 1 yes yes no yes 2

... ... ... ... ... ... ... ... ... ... ... ...

540 1820000 3000 2 1 1 1 no yes no no 2

541 1767150 2400 3 1 1 0 no no no no 0

542 1750000 3620 2 1 1 1 no no no no 0

543 1750000 2910 3 1 1 0 no no no no 0

544 1750000 3850 3 1 2 1 no no no no 0

545 rows × 13 columns

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["guestroom"]=le.fit_transform(data["guestroom"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 no no yes 2 yes

1 12250000 8960 4 4 4 1 0 no no yes 3 no

2 12250000 9960 3 2 2 1 0 yes no no 2 yes

3 12215000 7500 4 2 2 1 0 yes no yes 3 yes

4 11410000 7420 4 1 2 1 1 yes no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["basement"]=le.fit_transform(data["basement"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 no yes 2 yes

1 12250000 8960 4 4 4 1 0 0 no yes 3 no

2 12250000 9960 3 2 2 1 0 1 no no 2 yes

3 12215000 7500 4 2 2 1 0 1 no yes 3 yes

4 11410000 7420 4 1 2 1 1 1 no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler
data["hotwaterheating"]=le.fit_transform(data["hotwaterheating"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 yes 2 yes

1 12250000 8960 4 4 4 1 0 0 0 yes 3 no

2 12250000 9960 3 2 2 1 0 1 0 no 2 yes

3 12215000 7500 4 2 2 1 0 1 0 yes 3 yes

4 11410000 7420 4 1 2 1 1 1 0 yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["prefarea"]=le.fit_transform(data["prefarea"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 1 2

1 12250000 8960 4 4 4 1 0 0 0 1 3

2 12250000 9960 3 2 2 1 0 1 0 0 2

3 12215000 7500 4 2 2 1 0 1 0 1 3

4 11410000 7420 4 1 2 1 1 1 0 1 2

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["furnishingstatus"]=le.fit_transform(data["furnishingstatus"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 1 2

1 12250000 8960 4 4 4 1 0 0 0 1 3

2 12250000 9960 3 2 2 1 0 1 0 0 2

3 12215000 7500 4 2 2 1 0 1 0 1 3

4 11410000 7420 4 1 2 1 1 1 0 1 2

x=data.drop(columns=["price"])
y=data["price"]
y=y.values.reshape(-1,1)

scaler=MinMaxScaler()
x=scaler.fit_transform(x)
y=scaler.fit_transform(y)

lr=LinearRegression()
x_train,x_test,y_train,y_test=train_test_split(x,y,test_size=0.2)
lr.fit(x_train,y_train)
y_predict=lr.predict(x_test)

mae=mean_absolute_error(y_test,y_predict)
mse=mean_squared_error(y_test,y_predict)
r2=r2_score(y_test,y_predict)
print(mae,mse,r2)

0.06995281320799962 0.007960782075320859 0.6594122430015953

Loading [MathJax]/jax/output/CommonHTML/fonts/TeX/fontdata.js

Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
Housing Linear
No ratings yet
Housing Linear
3 pages
A
No ratings yet
A
2 pages
House Price Prediction: # Importing Necessary Libraries
No ratings yet
House Price Prediction: # Importing Necessary Libraries
18 pages
DA Lab2
No ratings yet
DA Lab2
5 pages
Data Analysis for Beginners
No ratings yet
Data Analysis for Beginners
1 page
Data Scientists' Guide to Predicting House Prices
No ratings yet
Data Scientists' Guide to Predicting House Prices
9 pages
Chirag HOusing Price Pred
No ratings yet
Chirag HOusing Price Pred
12 pages
Code 1
No ratings yet
Code 1
3 pages
Report
No ratings yet
Report
40 pages
House Price Prediction with Python
No ratings yet
House Price Prediction with Python
6 pages
178 - Regulinear - Ipynb - Colab
No ratings yet
178 - Regulinear - Ipynb - Colab
3 pages
House Price Prediction Models
No ratings yet
House Price Prediction Models
16 pages
ML Regression
No ratings yet
ML Regression
9 pages
T2 Summary VHA
No ratings yet
T2 Summary VHA
14 pages
House Price Prediction Guide
No ratings yet
House Price Prediction Guide
14 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
ML Manual
No ratings yet
ML Manual
9 pages
Ash Regression
No ratings yet
Ash Regression
11 pages
ML Beginners: Predict House Prices
No ratings yet
ML Beginners: Predict House Prices
32 pages
Unit 3 5
No ratings yet
Unit 3 5
4 pages
House Price Prediction Analysis
No ratings yet
House Price Prediction Analysis
14 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Machine Learning Project: TITLE: Predicting The Sale Price of A House Using Linear Regression
No ratings yet
Machine Learning Project: TITLE: Predicting The Sale Price of A House Using Linear Regression
20 pages
Deep Learning - House Price Prediction
No ratings yet
Deep Learning - House Price Prediction
17 pages
Week 12
No ratings yet
Week 12
2 pages
QB 1
No ratings yet
QB 1
11 pages
ML Manual
No ratings yet
ML Manual
30 pages
DT As Regressor-Follow
No ratings yet
DT As Regressor-Follow
4 pages
1 Data Mining 2 Lab - 2 3 Vinay Sirohi 4 2139472 5 Select Appropriate Dataset and Apply Data Reduction
No ratings yet
1 Data Mining 2 Lab - 2 3 Vinay Sirohi 4 2139472 5 Select Appropriate Dataset and Apply Data Reduction
7 pages
Intro to Pandas for Data Science
No ratings yet
Intro to Pandas for Data Science
6 pages
Machine Learning - Code - Jupiter
No ratings yet
Machine Learning - Code - Jupiter
14 pages
BCA 5th Sem Lab (ML)
No ratings yet
BCA 5th Sem Lab (ML)
20 pages
ML Guide: Boston House Price Prediction
100% (1)
ML Guide: Boston House Price Prediction
15 pages
Exercise - First Machine Learning Model
No ratings yet
Exercise - First Machine Learning Model
2 pages
Intro to ML with Sklearn & Python
No ratings yet
Intro to ML with Sklearn & Python
10 pages
DL - LR - 1.ipynb - Colab
No ratings yet
DL - LR - 1.ipynb - Colab
5 pages
Project 4 - House Price Prediction - Ipynb - Colab
No ratings yet
Project 4 - House Price Prediction - Ipynb - Colab
5 pages
DMV - 3 - Jupyter Notebook
No ratings yet
DMV - 3 - Jupyter Notebook
2 pages
Multiple - Linear - Regression - AirBNB - Solution-0.2 - New - Ipynb - Colaboratory
No ratings yet
Multiple - Linear - Regression - AirBNB - Solution-0.2 - New - Ipynb - Colaboratory
11 pages
California Housing Price Prediction .
No ratings yet
California Housing Price Prediction .
1 page
Document From Jahnavi
No ratings yet
Document From Jahnavi
20 pages
Houses Prices Prediction Model
No ratings yet
Houses Prices Prediction Model
11 pages
1 - Lab Manual (ML)
No ratings yet
1 - Lab Manual (ML)
42 pages
Python Real Estate Data Analysis
No ratings yet
Python Real Estate Data Analysis
10 pages
HOUSEPRICENB - Ipynb - Colab
No ratings yet
HOUSEPRICENB - Ipynb - Colab
2 pages
Setup: Chapter 2 - End-To-End Machine Learning Project
No ratings yet
Setup: Chapter 2 - End-To-End Machine Learning Project
31 pages
Airbnb Pricing Model Analysis
No ratings yet
Airbnb Pricing Model Analysis
8 pages
Real Estate Valuation Data Set: Section Order
No ratings yet
Real Estate Valuation Data Set: Section Order
17 pages
Lab 14 Questions
No ratings yet
Lab 14 Questions
4 pages
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
No ratings yet
Unit 1: Shobana T S Assistant Professor Dept. of ISE, BMSCE
127 pages
Linear Regression Analysis - Polynomial Regression
No ratings yet
Linear Regression Analysis - Polynomial Regression
25 pages
Assignment-2: Pandas PD Numpy NP Seaborn Sns Matplotlib - Pyplot PLT
No ratings yet
Assignment-2: Pandas PD Numpy NP Seaborn Sns Matplotlib - Pyplot PLT
14 pages
Capstone Project Report
No ratings yet
Capstone Project Report
187 pages
Melbourne Housing Price Prediction
No ratings yet
Melbourne Housing Price Prediction
1 page
Decision Tree Algorithm in Machine Learning
No ratings yet
Decision Tree Algorithm in Machine Learning
13 pages
One Hot Encoding
No ratings yet
One Hot Encoding
12 pages
Data Science Record - 05
No ratings yet
Data Science Record - 05
20 pages
ML Merged
No ratings yet
ML Merged
28 pages
Benefits of ICT in Education
No ratings yet
Benefits of ICT in Education
3 pages
OTN - Advance Testing & Dividing The Network: MT1000A MT1100A MU100010A MU110010A MU110011A MU110012A
No ratings yet
OTN - Advance Testing & Dividing The Network: MT1000A MT1100A MU100010A MU110010A MU110011A MU110012A
6 pages
Automation & Bim
No ratings yet
Automation & Bim
32 pages
Crear y Gestionar Jobs en ABAP
No ratings yet
Crear y Gestionar Jobs en ABAP
7 pages
How To Install Nonpdrm Plugin
No ratings yet
How To Install Nonpdrm Plugin
1 page
CEA - CEP Project Report Template
No ratings yet
CEA - CEP Project Report Template
19 pages
Skill Sheet LangGraph Developer - Munshot
No ratings yet
Skill Sheet LangGraph Developer - Munshot
3 pages
3BSE041586-510 - en Compact 800 Engineering Compact Control Builder AC 800M 5.1 Product Guide
No ratings yet
3BSE041586-510 - en Compact 800 Engineering Compact Control Builder AC 800M 5.1 Product Guide
130 pages
Splunk vs. Dynatrace: Four Paradigm Shifts Customers Make After Splunk Migration To Dynatrace
No ratings yet
Splunk vs. Dynatrace: Four Paradigm Shifts Customers Make After Splunk Migration To Dynatrace
5 pages
Intel Atom Processor S1200 Datasheet Vol 2
No ratings yet
Intel Atom Processor S1200 Datasheet Vol 2
332 pages
Synergy2: Leading Reinsurance Platform
No ratings yet
Synergy2: Leading Reinsurance Platform
7 pages
AVR Programming With GNU Tools
No ratings yet
AVR Programming With GNU Tools
49 pages
Suresh Ramakrishnaiah Experience: Principal Software Engineer, Confidential, Mountain View, CA Nov 2016 - Present
No ratings yet
Suresh Ramakrishnaiah Experience: Principal Software Engineer, Confidential, Mountain View, CA Nov 2016 - Present
3 pages
User Manual Control JT-901 Smart Eng ED 23.08.02
100% (1)
User Manual Control JT-901 Smart Eng ED 23.08.02
28 pages
Module 6: Etherchannel: Instructor Materials
100% (1)
Module 6: Etherchannel: Instructor Materials
35 pages
Networking Thesis Title
100% (4)
Networking Thesis Title
7 pages
Service Oriented Architecture: Lecture 7: BPEL
No ratings yet
Service Oriented Architecture: Lecture 7: BPEL
62 pages
Reference Books Ies Ese
No ratings yet
Reference Books Ies Ese
2 pages
Call-Papers Ic2em'2023
No ratings yet
Call-Papers Ic2em'2023
1 page
Rexroth Indradrive Firmware For Drive Controllers Mph-04, Mpb-04, Mpd-04
No ratings yet
Rexroth Indradrive Firmware For Drive Controllers Mph-04, Mpb-04, Mpd-04
928 pages
Verilog HDL 9-Bit UART Design
100% (1)
Verilog HDL 9-Bit UART Design
23 pages
AJAX Incremental Search Innovations
No ratings yet
AJAX Incremental Search Innovations
19 pages
A Brief Blog For Smart Parking Using ESP32 With Camera Module and Some Recommended A Devices That Makes Easy To Make The Smart Parking System
No ratings yet
A Brief Blog For Smart Parking Using ESP32 With Camera Module and Some Recommended A Devices That Makes Easy To Make The Smart Parking System
9 pages
6ES71366BA010CA0 Datasheet en
No ratings yet
6ES71366BA010CA0 Datasheet en
3 pages
Embedded Systems Protocol Guide
No ratings yet
Embedded Systems Protocol Guide
5 pages
Does Gzip Add Integrity CRC Check To Tar - Stackexchange
No ratings yet
Does Gzip Add Integrity CRC Check To Tar - Stackexchange
2 pages
Class Notes CN 3
No ratings yet
Class Notes CN 3
4 pages
E@syfile TC Installation Trouble Shooting Guide.
No ratings yet
E@syfile TC Installation Trouble Shooting Guide.
3 pages
Cisco WS-C2960-24TC-L Switch Specs
No ratings yet
Cisco WS-C2960-24TC-L Switch Specs
5 pages
JavaScript Error Log Analysis
No ratings yet
JavaScript Error Log Analysis
1,231 pages

Housing Prices Linear Regression

Uploaded by

Housing Prices Linear Regression

Uploaded by

from sklearn.

linear_model import LinearRegression

0 13300000 7420 4 2 3 yes no no no yes 2

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2

3 12215000 7500 4 2 2 yes no yes no yes 3

4 11410000 7420 4 1 2 yes yes yes no yes 2

540 1820000 3000 2 1 1 yes no yes no no 2

541 1767150 2400 3 1 1 no no no no no 0

542 1750000 3620 2 1 1 yes no no no no 0

543 1750000 2910 3 1 1 no no no no no 0

544 1750000 3850 3 1 2 yes no no no no 0

545 rows × 13 columns

data.head(5) #first 5 rows will be printed.

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3 no

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2 no

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2

5 10850000 7500 3 3 1 yes no yes no yes 2 yes

6 10150000 8580 4 3 4 yes no no no yes 2 yes

7 10150000 16200 5 3 2 yes no no no no 0

8 9870000 8100 4 1 2 yes yes yes no yes 2 yes

9 9800000 5750 3 2 4 yes yes no no yes 1 yes

0 13300000 7420 4 2 3 1 no no no yes 2

1 12250000 8960 4 4 4 1 no no no yes 3

2 12250000 9960 3 2 2 1 no yes no no 2

3 12215000 7500 4 2 2 1 no yes no yes 3

4 11410000 7420 4 1 2 1 yes yes no yes 2

540 1820000 3000 2 1 1 1 no yes no no 2

541 1767150 2400 3 1 1 0 no no no no 0

542 1750000 3620 2 1 1 1 no no no no 0

543 1750000 2910 3 1 1 0 no no no no 0

544 1750000 3850 3 1 2 1 no no no no 0

545 rows × 13 columns

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0 13300000 7420 4 2 3 1 0 no no yes 2 yes

1 12250000 8960 4 4 4 1 0 no no yes 3 no

2 12250000 9960 3 2 2 1 0 yes no no 2 yes

3 12215000 7500 4 2 2 1 0 yes no yes 3 yes

4 11410000 7420 4 1 2 1 1 yes no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0 13300000 7420 4 2 3 1 0 0 no yes 2 yes

1 12250000 8960 4 4 4 1 0 0 no yes 3 no

2 12250000 9960 3 2 2 1 0 1 no no 2 yes

3 12215000 7500 4 2 2 1 0 1 no yes 3 yes

4 11410000 7420 4 1 2 1 1 1 no yes 2 no

0 13300000 7420 4 2 3 1 0 0 0 yes 2 yes

1 12250000 8960 4 4 4 1 0 0 0 yes 3 no

2 12250000 9960 3 2 2 1 0 1 0 no 2 yes

3 12215000 7500 4 2 2 1 0 1 0 yes 3 yes

4 11410000 7420 4 1 2 1 1 1 0 yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0.06995281320799962 0.007960782075320859 0.6594122430015953

You might also like