0% found this document useful (0 votes)

57 views5 pages

Chapter 4 Exercise 11

The document analyzes mpg data from automobiles. It finds mpg is anti-correlated with cylinders, weight, displacement, and horsepower. It then uses LDA, QDA, logistic regression, and KNN models to predict high vs. low mpg using those variables, obtaining test error rates between 12-16%. KNN with 100 nearest neighbors performed best with 14.3% error.

Uploaded by

krisjooniejin tan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

57 views5 pages

Chapter 4 Exercise 11

Uploaded by

krisjooniejin tan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

11

library(ISLR)
summary(Auto)

## mpg cylinders displacement horsepower

## Min. : 9.0 Min. :3.00 Min. : 68 Min. : 46.0
## 1st Qu.:17.0 1st Qu.:4.00 1st Qu.:105 1st Qu.: 75.0
## Median :22.8 Median :4.00 Median :151 Median : 93.5
## Mean :23.4 Mean :5.47 Mean :194 Mean :104.5
## 3rd Qu.:29.0 3rd Qu.:8.00 3rd Qu.:276 3rd Qu.:126.0
## Max. :46.6 Max. :8.00 Max. :455 Max. :230.0
##
## weight acceleration year origin
## Min. :1613 Min. : 8.0 Min. :70 Min. :1.00
## 1st Qu.:2225 1st Qu.:13.8 1st Qu.:73 1st Qu.:1.00
## Median :2804 Median :15.5 Median :76 Median :1.00
## Mean :2978 Mean :15.5 Mean :76 Mean :1.58
## 3rd Qu.:3615 3rd Qu.:17.0 3rd Qu.:79 3rd Qu.:2.00
## Max. :5140 Max. :24.8 Max. :82 Max. :3.00
##
## name
## amc matador : 5
## ford pinto : 5
## toyota corolla : 5
## amc gremlin : 4
## amc hornet : 4
## chevrolet chevette: 4
## (Other) :365

attach(Auto)
mpg01 = rep(0, length(mpg))
mpg01[mpg > median(mpg)] = 1
Auto = data.frame(Auto, mpg01)

cor(Auto[, -9])
## mpg cylinders displacement horsepower weight
## mpg 1.0000 -0.7776 -0.8051 -0.7784 -0.8322
## cylinders -0.7776 1.0000 0.9508 0.8430 0.8975
## displacement -0.8051 0.9508 1.0000 0.8973 0.9330
## horsepower -0.7784 0.8430 0.8973 1.0000 0.8645
## weight -0.8322 0.8975 0.9330 0.8645 1.0000
## acceleration 0.4233 -0.5047 -0.5438 -0.6892 -0.4168
## year 0.5805 -0.3456 -0.3699 -0.4164 -0.3091
## origin 0.5652 -0.5689 -0.6145 -0.4552 -0.5850
## mpg01 0.8369 -0.7592 -0.7535 -0.6671 -0.7578
## acceleration year origin mpg01
## mpg 0.4233 0.5805 0.5652 0.8369
## cylinders -0.5047 -0.3456 -0.5689 -0.7592
## displacement -0.5438 -0.3699 -0.6145 -0.7535
## horsepower -0.6892 -0.4164 -0.4552 -0.6671
## weight -0.4168 -0.3091 -0.5850 -0.7578
## acceleration 1.0000 0.2903 0.2127 0.3468
## year 0.2903 1.0000 0.1815 0.4299
## origin 0.2127 0.1815 1.0000 0.5137
## mpg01 0.3468 0.4299 0.5137 1.0000

pairs(Auto) # doesn't work well since mpg01 is 0 or 1

Anti-correlated with cylinders, weight, displacement, horsepower. (mpg, of course)

train = (year%%2 == 0) # if the year is even

test = !train
Auto.train = Auto[train, ]
Auto.test = Auto[test, ]
mpg01.test = mpg01[test]

d
# LDA
library(MASS)
lda.fit = lda(mpg01 ~ cylinders + weight + displacement +
horsepower, data = Auto,
subset = train)
lda.pred = predict(lda.fit, Auto.test)
mean(lda.pred$class != mpg01.test)

## [1] 0.1264

12.6% test error rate.

# QDA
qda.fit = qda(mpg01 ~ cylinders + weight + displacement +
horsepower, data = Auto,
subset = train)
qda.pred = predict(qda.fit, Auto.test)
mean(qda.pred$class != mpg01.test)

## [1] 0.1319

13.2% test error rate.

# Logistic regression
glm.fit = glm(mpg01 ~ cylinders + weight + displacement +
horsepower, data = Auto,
family = binomial, subset = train)
glm.probs = predict(glm.fit, Auto.test, type = "response")
glm.pred = rep(0, length(glm.probs))
glm.pred[glm.probs > 0.5] = 1
mean(glm.pred != mpg01.test)

## [1] 0.1209

12.1% test error rate.

g
library(class)
train.X = cbind(cylinders, weight, displacement, horsepower)
[train, ]
test.X = cbind(cylinders, weight, displacement, horsepower)[test,
]
train.mpg01 = mpg01[train]
set.seed(1)
# KNN(k=1)
knn.pred = knn(train.X, test.X, train.mpg01, k = 1)
mean(knn.pred != mpg01.test)

## [1] 0.1538

# KNN(k=10)
knn.pred = knn(train.X, test.X, train.mpg01, k = 10)
mean(knn.pred != mpg01.test)

## [1] 0.1648

# KNN(k=100)
knn.pred = knn(train.X, test.X, train.mpg01, k = 100)
mean(knn.pred != mpg01.test)

## [1] 0.1429

k=1, 15.4% test error rate. k=10, 16.5% test error rate. k=100, 14.3% test error rate. K of 100
seems to perform the best. 100 nearest neighbors.

Lab 4
No ratings yet
Lab 4
4 pages
Assignment Auto
No ratings yet
Assignment Auto
6 pages
Fall 2023-2024 IE 451 Homework 2 Solutions
No ratings yet
Fall 2023-2024 IE 451 Homework 2 Solutions
20 pages
CMSC 177 - Regressionlr&Svm
No ratings yet
CMSC 177 - Regressionlr&Svm
30 pages
DMPM-LAB-03-Assignment: Rcode
No ratings yet
DMPM-LAB-03-Assignment: Rcode
9 pages
Assignment
No ratings yet
Assignment
49 pages
Regression Models Assignment 1
No ratings yet
Regression Models Assignment 1
5 pages
Regression Models Assignment 1
No ratings yet
Regression Models Assignment 1
5 pages
R Studio
No ratings yet
R Studio
4 pages
R Studio
No ratings yet
R Studio
5 pages
Manual vs Auto Transmission MPG Analysis
No ratings yet
Manual vs Auto Transmission MPG Analysis
5 pages
Mtcars Dataset: Multilinear Regression Analysis
No ratings yet
Mtcars Dataset: Multilinear Regression Analysis
13 pages
Car Transmission & MPG Analysis
No ratings yet
Car Transmission & MPG Analysis
6 pages
Practical 5
No ratings yet
Practical 5
5 pages
Big Data Analytics Practical Guide
No ratings yet
Big Data Analytics Practical Guide
41 pages
R
No ratings yet
R
3 pages
Bda File
No ratings yet
Bda File
54 pages
R Program
No ratings yet
R Program
2 pages
As Data Manipulation With Dplyr-2
No ratings yet
As Data Manipulation With Dplyr-2
6 pages
HW3 Isye 7406
No ratings yet
HW3 Isye 7406
8 pages
Data Science Lab
No ratings yet
Data Science Lab
28 pages
Regression
No ratings yet
Regression
5 pages
Topic
No ratings yet
Topic
9 pages
Introduction To R Program and Output
No ratings yet
Introduction To R Program and Output
6 pages
Regression Models Project Sid Jas
No ratings yet
Regression Models Project Sid Jas
7 pages
Data Analysis for Auto Enthusiasts
No ratings yet
Data Analysis for Auto Enthusiasts
8 pages
R Analysis of mtcars Dataset
No ratings yet
R Analysis of mtcars Dataset
4 pages
STA1040 Assignment
No ratings yet
STA1040 Assignment
9 pages
Multi Regression
No ratings yet
Multi Regression
12 pages
'Horsepower' "?" 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower'
No ratings yet
'Horsepower' "?" 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower' 'Horsepower'
5 pages
Exercises 2 Unfinished
No ratings yet
Exercises 2 Unfinished
8 pages
Mtcars Dataset Analysis in R
No ratings yet
Mtcars Dataset Analysis in R
4 pages
R Lab Ex 1 To 5
No ratings yet
R Lab Ex 1 To 5
26 pages
7406HW03
No ratings yet
7406HW03
2 pages
Regression Models Project
No ratings yet
Regression Models Project
5 pages
Lab2 Revathy Report
No ratings yet
Lab2 Revathy Report
5 pages
Statistics Introduction
No ratings yet
Statistics Introduction
8 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
22 pages
Car Price Analysis and Modeling
No ratings yet
Car Price Analysis and Modeling
8 pages
Coursera Regression Models Course Project: Subha Shree S R 08/10/2020
No ratings yet
Coursera Regression Models Course Project: Subha Shree S R 08/10/2020
7 pages
Motor Trend Car Road Tests
No ratings yet
Motor Trend Car Road Tests
5 pages
DS On MTCARS Solutions
No ratings yet
DS On MTCARS Solutions
3 pages
Week2 Submission Assignment Solution AshaA-3
No ratings yet
Week2 Submission Assignment Solution AshaA-3
2 pages
Economics 400 Computer Exercise
No ratings yet
Economics 400 Computer Exercise
7 pages
20231CBC0033 LabSheet 4
No ratings yet
20231CBC0033 LabSheet 4
6 pages
ISyE7406 Homework3
No ratings yet
ISyE7406 Homework3
20 pages
Course2 - DataAnalysis With Python - Week3 - Exploratory Data Analysis
No ratings yet
Course2 - DataAnalysis With Python - Week3 - Exploratory Data Analysis
23 pages
Data Science Using R
No ratings yet
Data Science Using R
11 pages
R11
No ratings yet
R11
2 pages
Miles Per Gallon
No ratings yet
Miles Per Gallon
11 pages
Se Python - Merged
No ratings yet
Se Python - Merged
77 pages
Introduction to Base R Programming
No ratings yet
Introduction to Base R Programming
10 pages
Data Analytics Solution - Assignment - 1
No ratings yet
Data Analytics Solution - Assignment - 1
3 pages
Activity 2
No ratings yet
Activity 2
16 pages
Statisitics Project 3
No ratings yet
Statisitics Project 3
22 pages
ML Foram
No ratings yet
ML Foram
17 pages
Homework Mtcars Challenging
No ratings yet
Homework Mtcars Challenging
2 pages
Jadual Math
No ratings yet
Jadual Math
1 page
ADS & A Unit-1 Study Material
No ratings yet
ADS & A Unit-1 Study Material
13 pages
Antennas: Antenna Theory and Design, 2
No ratings yet
Antennas: Antenna Theory and Design, 2
46 pages
Tutorial 5
No ratings yet
Tutorial 5
14 pages
OOAD Essentials for Developers
No ratings yet
OOAD Essentials for Developers
3 pages
Linear Algebra: Least Squares
No ratings yet
Linear Algebra: Least Squares
13 pages
Handwritten Marathi Compound Character PDF
No ratings yet
Handwritten Marathi Compound Character PDF
6 pages
Class 10 Math Exam Paper
No ratings yet
Class 10 Math Exam Paper
5 pages
Thornton 1990 A Q3
No ratings yet
Thornton 1990 A Q3
3 pages
CH 11 Algebra and Forumulae
No ratings yet
CH 11 Algebra and Forumulae
12 pages
Miniaturized UWB Monopole Microstrip Antenna Design by The Combination of Giusepe Peano and Sierpinski Carpet Fractals
No ratings yet
Miniaturized UWB Monopole Microstrip Antenna Design by The Combination of Giusepe Peano and Sierpinski Carpet Fractals
4 pages
The Voyage of The Vega Round Asia and Europe Volume I and Volume II 1st Edition Nils Adolf Erik Nordenskiöld PDF Download
No ratings yet
The Voyage of The Vega Round Asia and Europe Volume I and Volume II 1st Edition Nils Adolf Erik Nordenskiöld PDF Download
71 pages
6.2VolumesbyDisks and Washers
No ratings yet
6.2VolumesbyDisks and Washers
15 pages
Circular Motion ELP 01-04 1721145705732
No ratings yet
Circular Motion ELP 01-04 1721145705732
9 pages
Math Exam for Grade 8 Students
No ratings yet
Math Exam for Grade 8 Students
2 pages
Chapter 2 Statistics Estimation Final
No ratings yet
Chapter 2 Statistics Estimation Final
13 pages
Conceptual Estimating Techniques
No ratings yet
Conceptual Estimating Techniques
21 pages
Accuracy and Precision
No ratings yet
Accuracy and Precision
3 pages
Exp 5
No ratings yet
Exp 5
6 pages
Microstrip Antenna Array With Four Port Butler Matrix For Switched Beam Base Station Application
No ratings yet
Microstrip Antenna Array With Four Port Butler Matrix For Switched Beam Base Station Application
6 pages
RSA Public-Key Encryption and Signature Lab
No ratings yet
RSA Public-Key Encryption and Signature Lab
8 pages
Navigation
100% (5)
Navigation
212 pages
Fiitjee: Permutation & Combination
No ratings yet
Fiitjee: Permutation & Combination
5 pages
Ligação
No ratings yet
Ligação
5 pages
Chapter 8 - The Poisson and Other Discrete Random Variables
No ratings yet
Chapter 8 - The Poisson and Other Discrete Random Variables
12 pages
Heat Exchanger Effectiveness NTU
No ratings yet
Heat Exchanger Effectiveness NTU
7 pages
Risk Management: Backtesting ES
100% (1)
Risk Management: Backtesting ES
13 pages
Cpps Unit-3 Arrays Array:: Declaration: Syntax: Data Type Array - Name (Size of The Array)
No ratings yet
Cpps Unit-3 Arrays Array:: Declaration: Syntax: Data Type Array - Name (Size of The Array)
24 pages
Digital Leadership and Organizations Performance The Mediating Role of Innovation Capability
No ratings yet
Digital Leadership and Organizations Performance The Mediating Role of Innovation Capability
16 pages
Advanced Quantum Mechanics Course Contents - 2
No ratings yet
Advanced Quantum Mechanics Course Contents - 2
2 pages

Chapter 4 Exercise 11

Uploaded by

Chapter 4 Exercise 11

Uploaded by

11

## mpg cylinders displacement horsepower

pairs(Auto) # doesn't work well since mpg01 is 0 or 1

train = (year%%2 == 0) # if the year is even

12.6% test error rate.

13.2% test error rate.

12.1% test error rate.

You might also like