0% found this document useful (0 votes)

95 views22 pages

Machine Learning

The document contains a list of 13 programming tasks related to machine learning algorithms. The tasks include: 1) Preparing a scatter plot using the Iris dataset 2) Finding and removing null values from a dataset 3) Converting categorical values in a dataset to numeric format 4) Implementing simple linear regression to predict house prices 5) Implementing multiple linear regression on a housing dataset 6) Implementing polynomial regression on a housing dataset

Uploaded by

tina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

95 views22 pages

Machine Learning

Uploaded by

tina

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

INDEX

Sr.no Program
1 Write a python program to Prepare Scatter Plot (Use Forge Dataset / Iris
Dataset)

2 Write a python program to find all null values in a given data set and
remove them.
3 Write a python program the Categorical values in numeric format for a
given dataset.
4 Write a python program to implement simple Linear Regression for
predicting house price.
5 Write a python program to implement multiple Linear Regression for a
given dataset.
6 Write a python program to implement Polynomial Regression for given
dataset.
7 Write a python program to Implement Naïve Bayes.

8 Write a python program to Implement Decision Tree whether or not to

play tennis.
9 Write a python program to implement linear SVM.

10 Write a python program to find Decision boundary by using a neural

network with 10 hidden units on two moons dataset
11 Write a python program to transform data with Principal Component
Analysis (PCA)
12 Write a python program to implement k-nearest Neighbors ML algorithm
to build prediction model (Use Forge Dataset)
13 Write a python program to implement k-means algorithm on a synthetic
dataset.
14 Write a python program to implement Agglomerative clustering on a
synthetic dataset.
1. Write a python program to Prepare Scatter Plot (Use Forge
Dataset Iris Dataset)

import matplotlib.pyplot as plt

import numpy as np
import pandas as pd

from sklearn.datasets import load_iris

iris = load_iris()

df= pd.DataFrame(data= np.c_[iris['data'], iris['target']],

columns= iris['feature_names'] + ['target'])

# select setosa and versicolor

y = df.iloc[0:100, 4].values
y = np.where(y == 'Iris-setosa', 0, 1)

# extract sepal length and petal length

X = df.iloc[0:100, [0, 2]].values

# plot data
plt.scatter(X[:50, 0], X[:50, 1],
color='blue', marker='o', label='Setosa')
plt.scatter(X[50:100, 0], X[50:100, 1],
color='green', marker='s', label='Versicolor')

plt.xlabel('Sepal length [cm]')

plt.ylabel('Petal length [cm]')
plt.legend(loc='upper left')

# plt.savefig('images/02_06.png', dpi=300)
plt.show()
Output:
2. Write a python program to find all null values in a given data set and
remove them.

import pandas as pd

df = pd.read_csv('data.csv')

null_mask = df.isnull()

null_columns = df.columns[null_mask.any()]

null_rows = df.loc[:, null_columns]

df = df.dropna(axis=0, subset=null_columns)

df.to_csv('clean_data.csv', index=False)

Output:
3. Write a python program the Categorical values in numeric format for a
given dataset.

from sklearn.preprocessing import LabelEncoder

import pandas as pd

def convert_categorical_to_numeric(dataframe, column_name):

# Create a label encoder object

encoder = LabelEncoder()

# Fit the encoder to the categorical column

encoder.fit(dataframe[column_name])

# Transform the categorical column to numeric format

dataframe[column_name] = encoder.transform(dataframe[column_name])

# Example usage

df = pd.DataFrame({'col1': ['cat', 'dog', 'bird', 'cat', 'dog', 'bird'], 'col2': [1, 2, 3, 4, 5, 6]})

convert_categorical_to_numeric(df, 'col1')

print(df)

Output:

col1 col2
0 1 1
1 2 2
2 0 3
3 1 4
4 2 5
5 0 6
4. Write a python program to implement simple Linear Regression for
predicting house price.

import numpy as np

from sklearn.linear_model import LinearRegression

# Assume that we have a dataset with two columns: 'area' and 'price',

# where 'area' is the size of the house in square feet and 'price' is the price of the house

X = np.array([[1000], [1500], [2000], [2500], [3000]])

y = np.array([300000, 400000, 500000, 600000, 700000])

# Create a Linear Regression model

model = LinearRegression()

# Fit the model to the training data

model.fit(X, y)

# Predict the price of a house with an area of 3500 square feet

prediction = model.predict([[3500]])

print(prediction)

Output:

[800000.]
5. Write a python program to implement multiple Linear Regression for a
given dataset.

import pandas as pd

from sklearn.linear_model import LinearRegression

# Assume that we have a dataset with three columns: 'area', 'bedrooms', and 'price',

# where 'area' is the size of the house in square feet, 'bedrooms' is the number of bedrooms,

# and 'price' is the price of the house

df = pd.DataFrame({'area': [1000, 1500, 2000, 2500, 3000],

'bedrooms': [3, 4, 5, 6, 7],

'price': [300000, 400000, 500000, 600000, 700000]})

# Create a Linear Regression model

model = LinearRegression()

# Split the data into training and test sets

X = df[['area', 'bedrooms']]

y = df['price']

model.fit(X, y)

# Predict the price of a house with an area of 3500 square feet and 4 bedrooms

prediction = model.predict([[3500, 4]])

print(prediction)

Output:

[799998.4000064]
6. Write a python program to implement Polynomial Regression for given
dataset.

import numpy as np

import pandas as pd

from sklearn.preprocessing import PolynomialFeatures

from sklearn.linear_model import LinearRegression

# Assume that we have a dataset with two columns: 'area' and 'price',

# where 'area' is the size of the house in square feet and 'price' is the price of the house

df = pd.DataFrame({'area': [1000, 1500, 2000, 2500, 3000],

'price': [300000, 400000, 500000, 600000, 700000]})

# Create a Linear Regression model

model = LinearRegression()

# Create a polynomial transformer object with a degree of 2

poly_transformer = PolynomialFeatures(degree=2)

# Transform the 'area' column to polynomial features

X_poly = poly_transformer.fit_transform(df[['area']])

# Fit the model to the transformed data

model.fit(X_poly, df['price'])

# Predict the price of a house with an area of 3500 square feet

prediction = model.predict(poly_transformer.transform([[3500]]))

print(prediction)

Output:
[799998.4000064]
7. Write a python program to Implement Naïve Bayes.

import numpy as np

from sklearn.naive_bayes import GaussianNB

# Assume that we have a dataset with two features: 'age' and 'income',

# and a target variable: 'purchased'

X = np.array([[20, 50000], [30, 60000], [40, 80000], [50, 100000], [60, 120000]])

y = np.array(['yes', 'no', 'yes', 'no', 'yes'])

# Create a Gaussian Naive Bayes model

model = GaussianNB()

# Fit the model to the training data

model.fit(X, y)

# Predict whether a person with an age of 25 and an income of 55000 will purchase a product

prediction = model.predict([[25, 55000]])

print(prediction)

Output:

['yes']
8. Write a python program to Implement Decision Tree whether or
not to play tennis.

#numpy and pandas initialization

import numpy as np

import pandas

PlayTennis = pandas.read_csv('playtennis.csv')

print(PlayTennis)

from sklearn.preprocessing import LabelEncoder

Le = LabelEncoder()

PlayTennis['outlook'] = Le.fit_transform(PlayTennis['outlook'])

PlayTennis['temp'] = Le.fit_transform(PlayTennis['temp'])

PlayTennis['humidity'] = Le.fit_transform(PlayTennis['humidity'])

PlayTennis['windy'] = Le.fit_transform(PlayTennis['windy'])

PlayTennis['play'] = Le.fit_transform(PlayTennis['play'])

print(PlayTennis)

y = PlayTennis['play']

X = PlayTennis.drop(['play'],axis=1)

# Fitting the model

from sklearn import tree

clf = tree.DecisionTreeClassifier(criterion = 'entropy')

clf = clf.fit(X, y)

# We can visualize the tree using tree.plot_tree

tree.plot_tree(clf)
# The predictions are stored in X_pred

X_pred = clf.predict(X)

# verifying if the model has predicted it all right.

X_pred == y

Output:
outlook temp humidity windy play
0 sunny hot high False no
1 sunny hot high True no
2 overcast hot high False yes
3 rainy mild high False yes
4 rainy cool normal False yes
5 rainy cool normal True no
6 overcast cool normal True yes
7 sunny mild high False no
8 sunny cool normal False yes
9 rainy mild normal False yes
10 sunny mild normal True yes
11 overcast mild high True yes
12 overcast hot normal False yes
13 rainy mild high True no

outlook temp humidity windy play

0 2 1 0 0 0
1 2 1 0 1 0
2 0 1 0 0 1
3 1 2 0 0 1
4 1 0 1 0 1
5 1 0 1 1 0
6 0 0 1 1 1
7 2 2 0 0 0
8 2 0 1 0 1
9 1 2 1 0 1
10 2 2 1 1 1
11 0 2 0 1 1
12 0 1 1 0 1
13 1 2 0 1 0
9. Write a python program to implement linear SVM.

import numpy as np

from sklearn.svm import LinearSVC

# Assume that we have a dataset with two features: 'age' and 'income',

# and a target variable: 'purchased'

X = np.array([[20, 50000], [30, 60000], [40, 80000], [50, 100000], [60, 120000]])

y = np.array(['yes', 'no', 'yes', 'no', 'yes'])

# Create a Linear SVM model

model = LinearSVC()

# Fit the model to the training data

model.fit(X, y)

# Predict whether a person with an age of 25 and an income of 55000 will purchase a product

prediction = model.predict([[25, 55000]])

print(prediction)

Output:

['no']
10. Write a python program to find Decision boundary by using a neural
network with 10 hidden units on two moons dataset

import matplotlib.pyplot as plt

from sklearn.datasets import make_moons

from sklearn.neural_network import MLPClassifier

import numpy as np

# Generate the two moons dataset

X, y = make_moons(n_samples=1000, noise=0.1)

# Create a neural network with 10 hidden units

model = MLPClassifier(hidden_layer_sizes=(10,), solver='lbfgs', random_state=1)

# Fit the model to the training data

model.fit(X, y)

# Plot the decision boundary

plt.scatter(X[:, 0], X[:, 1], c=y, cmap=plt.cm.RdBu, edgecolor='k')

h = 0.01

x_min, x_max = X[:, 0].min() - 0.1, X[:, 0].max() + 0.1

y_min, y_max = X[:, 1].min() - 0.1, X[:, 1].max() + 0.1

xx, yy = np.meshgrid(np.arange(x_min, x_max, h), np.arange(y_min, y_max, h))

Z = model.predict(np.c_[xx.ravel(), yy.ravel()])

Z = Z.reshape(xx.shape)

plt.contour(xx, yy, Z, cmap=plt.cm.Paired)

plt.show()
Output:
11. Write a python program to transform data with Principal Component
Analysis (PCA)

import numpy as np

from sklearn.decomposition import PCA

# Assume that we have a dataset with three features: 'length', 'width', and 'height'

X = np.array([[2, 3, 4], [3, 4, 5], [4, 5, 6], [5, 6, 7]])

# Create a PCA transformer object with a target dimension of 2

pca = PCA(n_components=2)

# Transform the data to the new lower-dimensional space

X_transformed = pca.fit_transform(X)

print(X_transformed)

Output:

[[ 2.59807621 0. ]
[ 0.8660254 0. ]
[-0.8660254 0. ]
[-2.59807621 -0. ]]
12. Write a python program to implement k-nearest Neighbors ML
algorithm to build prediction model (Use Forge Dataset)

import numpy as np

import pandas as pd

import matplotlib.pyplot as plt

from sklearn.datasets import make_blobs

from sklearn.neighbors import KNeighborsClassifier

from sklearn.model_selection import train_test_split

X, y = make_blobs(n_samples = 500, n_features = 2, centers = 4,cluster_std = 1.5, random_state = 4)

plt.style.use('seaborn')

plt.figure(figsize = (10,10))

plt.scatter(X[:,0], X[:,1], c=y, marker= '*',s=100,edgecolors='black')

plt.show()

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state = 0)

knn5 = KNeighborsClassifier(n_neighbors = 5)

knn1 = KNeighborsClassifier(n_neighbors=1)

knn5.fit(X_train, y_train)

knn1.fit(X_train, y_train)

y_pred_5 = knn5.predict(X_test)

y_pred_1 = knn1.predict(X_test)

from sklearn.metrics import accuracy_score

print("Accuracy with k=5", accuracy_score(y_test, y_pred_5)*100)

print("Accuracy with k=1", accuracy_score(y_test, y_pred_1)*100)

plt.figure(figsize = (15,5))

plt.subplot(1,2,1)

plt.scatter(X_test[:,0], X_test[:,1], c=y_pred_5, marker= '*', s=100,edgecolors='black')

plt.title("Predicted values with k=5", fontsize=20)

plt.subplot(1,2,2)

plt.scatter(X_test[:,0], X_test[:,1], c=y_pred_1, marker= '*', s=100,edgecolors='black')

plt.title("Predicted values with k=1", fontsize=20)

plt.show()

Output:
13. Write a python program to implement k-means algorithm on a synthetic
dataset.

import matplotlib.pyplot as plt

from sklearn.datasets import make_blobs

from sklearn.cluster import KMeans

# Generate the synthetic dataset

X, y = make_blobs(n_samples=300, centers=4, random_state=0, cluster_std=1.0)

# Create a KMeans model with 4 clusters

model = KMeans(n_clusters=4)

# Fit the model to the data

model.fit(X)

# Predict the cluster labels for each point

y_pred = model.predict(X)

# Plot the data and the cluster labels

plt.scatter(X[:, 0], X[:, 1], c=y_pred, cmap=plt.cm.RdBu, edgecolor='k')

plt.show()
Output:
14. Write a python program to implement Agglomerative clustering on a
synthetic dataset.

import matplotlib.pyplot as plt

from sklearn.datasets import make_blobs

from sklearn.cluster import AgglomerativeClustering

# Generate the synthetic dataset

X, y = make_blobs(n_samples=300, centers=4, random_state=0, cluster_std=1.0)

# Create an Agglomerative Clustering model with 4 clusters

model = AgglomerativeClustering(n_clusters=4)

# Fit the model to the data

model.fit(X)

# Predict the cluster labels for each point

y_pred = model.fit_predict(X)

# Plot the data and the cluster labels

plt.scatter(X[:, 0], X[:, 1], c=y_pred, cmap=plt.cm.RdBu, edgecolor='k')

plt.show()
Output:

Train
No ratings yet
Train
17 pages
ML Manual
No ratings yet
ML Manual
30 pages
BCSL606 Machine Learning Lab
No ratings yet
BCSL606 Machine Learning Lab
33 pages
Praveen Ai
No ratings yet
Praveen Ai
6 pages
ML Record
No ratings yet
ML Record
19 pages
ML Lab Manual for CSE Students
No ratings yet
ML Lab Manual for CSE Students
32 pages
MLLab Manual
No ratings yet
MLLab Manual
24 pages
Linear Regression with Boston Housing Data
No ratings yet
Linear Regression with Boston Housing Data
14 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
9 pages
ML Lab Mannual1
No ratings yet
ML Lab Mannual1
37 pages
ML Lab Manual
No ratings yet
ML Lab Manual
43 pages
ML Lab Manual
No ratings yet
ML Lab Manual
25 pages
Practical (Data Science)
No ratings yet
Practical (Data Science)
13 pages
ML Lab Manual
No ratings yet
ML Lab Manual
36 pages
ML Full For Print New 1
No ratings yet
ML Full For Print New 1
38 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
LAB MANUAL For Machine Learning
No ratings yet
LAB MANUAL For Machine Learning
15 pages
Data Science Lab Experiments
No ratings yet
Data Science Lab Experiments
32 pages
ML File
No ratings yet
ML File
37 pages
ML Manual
No ratings yet
ML Manual
24 pages
ML - Datascience Manual
No ratings yet
ML - Datascience Manual
64 pages
ML Shristi File
No ratings yet
ML Shristi File
49 pages
Predicting Housin Main Project Ediglobe
No ratings yet
Predicting Housin Main Project Ediglobe
4 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
33 pages
Mlalllabprgs
No ratings yet
Mlalllabprgs
17 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
26 pages
R22 ML Lab Manual
No ratings yet
R22 ML Lab Manual
25 pages
Ai Class 12 Practical 2
0% (1)
Ai Class 12 Practical 2
21 pages
Big Data Practical
No ratings yet
Big Data Practical
20 pages
Set 1 ML Lab+Viva
No ratings yet
Set 1 ML Lab+Viva
14 pages
Project 4 - House Price Prediction - Ipynb - Colab
No ratings yet
Project 4 - House Price Prediction - Ipynb - Colab
5 pages
Lab Manual ML
No ratings yet
Lab Manual ML
23 pages
Machinelearning
No ratings yet
Machinelearning
26 pages
Machine Learning Programs
No ratings yet
Machine Learning Programs
10 pages
Data Analysis for Beginners
No ratings yet
Data Analysis for Beginners
1 page
External
No ratings yet
External
11 pages
Programming Exercises for Students
No ratings yet
Programming Exercises for Students
45 pages
Machine Learning Lab Manaul BCSL606
No ratings yet
Machine Learning Lab Manaul BCSL606
27 pages
ML Assignment 2025 (2022 25)
No ratings yet
ML Assignment 2025 (2022 25)
1 page
ML Yogesh
No ratings yet
ML Yogesh
23 pages
ML Lab Record
No ratings yet
ML Lab Record
17 pages
ML Final Prac
No ratings yet
ML Final Prac
47 pages
ML Lab - BCSL606
No ratings yet
ML Lab - BCSL606
67 pages
Machine Learning Practicals
No ratings yet
Machine Learning Practicals
30 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
22 pages
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
No ratings yet
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
24 pages
LAB-Skill Advanced Course Machine Learning With Python Experiments
No ratings yet
LAB-Skill Advanced Course Machine Learning With Python Experiments
23 pages
Index
No ratings yet
Index
2 pages
Aayushi ML File
No ratings yet
Aayushi ML File
37 pages
Ankit Python
No ratings yet
Ankit Python
26 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
23 pages
ML Manual
No ratings yet
ML Manual
9 pages
Python Lab PRG
No ratings yet
Python Lab PRG
20 pages
22K61A0654 2 Sasi Auto
No ratings yet
22K61A0654 2 Sasi Auto
24 pages
Study Material IP XII
No ratings yet
Study Material IP XII
116 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
ML Lab Question Set - 21
No ratings yet
ML Lab Question Set - 21
4 pages
ML Lab Manual
No ratings yet
ML Lab Manual
23 pages
The Viscoelastic Natures of Beeswax, Candelilla Wax, Camauba Wax PDF
No ratings yet
The Viscoelastic Natures of Beeswax, Candelilla Wax, Camauba Wax PDF
16 pages
SEM Analysis for Researchers
No ratings yet
SEM Analysis for Researchers
30 pages
Doing Analysis How To Conduct Research 2.1.5 (1) - Watermark
No ratings yet
Doing Analysis How To Conduct Research 2.1.5 (1) - Watermark
5 pages
Spark ML
No ratings yet
Spark ML
110 pages
Chapter 3 - Testbank
50% (4)
Chapter 3 - Testbank
62 pages
Statistical Mediation Analysis Using The Sobel Test and Hayes SPSS Process Macro
No ratings yet
Statistical Mediation Analysis Using The Sobel Test and Hayes SPSS Process Macro
20 pages
Night Trading 1
No ratings yet
Night Trading 1
14 pages
C8511 BSC First Year Examination 2012 Research Skills in Psychology
No ratings yet
C8511 BSC First Year Examination 2012 Research Skills in Psychology
22 pages
Telecommunication Masts' Health Impact
No ratings yet
Telecommunication Masts' Health Impact
9 pages
Malaysian Grid Code Demand Forecasting
No ratings yet
Malaysian Grid Code Demand Forecasting
47 pages
Module 2.2 Randomized Assignment
No ratings yet
Module 2.2 Randomized Assignment
10 pages
Rethinking Some of The Rethinking of Partial Least Squares
No ratings yet
Rethinking Some of The Rethinking of Partial Least Squares
19 pages
Advanced Business Stats for Grads
No ratings yet
Advanced Business Stats for Grads
7 pages
IEOR E4709 Spring 2016 Syllabus
No ratings yet
IEOR E4709 Spring 2016 Syllabus
1 page
SRM Formula Sheet
No ratings yet
SRM Formula Sheet
16 pages
On Relationship Between Multicollinearity and Separation in Logistic Regression
No ratings yet
On Relationship Between Multicollinearity and Separation in Logistic Regression
10 pages
Veterinary Internal Medicne - 2020 - Murphy - Utility of Point of Care Lung Ultrasound For Monitoring Cardiogenic Pulmonary
No ratings yet
Veterinary Internal Medicne - 2020 - Murphy - Utility of Point of Care Lung Ultrasound For Monitoring Cardiogenic Pulmonary
10 pages
A Quick Review of Machine Learning Algorithms: Susmita Ray
No ratings yet
A Quick Review of Machine Learning Algorithms: Susmita Ray
5 pages
ECON 153 - Learning Notes 2
No ratings yet
ECON 153 - Learning Notes 2
4 pages
09w77 gblg5
No ratings yet
09w77 gblg5
749 pages
Credit Risk Estimation Model Development Process Main Steps and Model Improvementengineering Economics
No ratings yet
Credit Risk Estimation Model Development Process Main Steps and Model Improvementengineering Economics
8 pages
Forecasting for Operations Management
No ratings yet
Forecasting for Operations Management
24 pages
Teacher Licensure Exam Insights
No ratings yet
Teacher Licensure Exam Insights
5 pages
Biotic Research and Methology
No ratings yet
Biotic Research and Methology
5 pages
Lecture Note 5 - Non-Stationary Time Series and Unit Root Testing
No ratings yet
Lecture Note 5 - Non-Stationary Time Series and Unit Root Testing
21 pages
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
No ratings yet
Regression: Introduction: Basic Idea: Use Data To Identify Among Variables and Use These Relationships To Make
23 pages
Python For Data Science - Unit 6 - Week 4 - Assignment
No ratings yet
Python For Data Science - Unit 6 - Week 4 - Assignment
5 pages
Effect of Human Capital Development Policy On Staff Performance in Nigerian National Petroleum Corporation Limited (NNPCL)
No ratings yet
Effect of Human Capital Development Policy On Staff Performance in Nigerian National Petroleum Corporation Limited (NNPCL)
23 pages
New Algorithm For Earthquake Prediction
No ratings yet
New Algorithm For Earthquake Prediction
7 pages
Total Quality Management Practices and Organisational Performance in Jordanian Courier Services
No ratings yet
Total Quality Management Practices and Organisational Performance in Jordanian Courier Services
19 pages