Uploaded by

7013222908s

ASSIGNMENT # 4

PROBLEM # 1

1. Generating the graph using R

> Steepness <- c(0.629999995, 0.699999988, 0.819999993, 0.879999995,
+   1.149999976, 1.5, 4.400000095, 7.300000191, 11.30000019)
> GranuleDiameter <- c(0.170000002, 0.189999998, 0.219999999,
+   0.234999999, 0.234999999, 0.300000012, 0.349999994, 0.419999987,
+   0.850000024)
> plot(Steepness, GranuleDiameter)
> plot(Steepness, GranuleDiameter, main = "Steepness vs Granule Diameter")
> plot(Steepness, GranuleDiameter, main = "Steepness vs Granule Diameter",
+   xlab = "Steepness of the beach", ylab = "Diameter of granule")
> abline(lm(GranuleDiameter ~ Steepness))

2. Generating the model to predict the diameter of the granule from the steepness of the beach:

> model <- lm(GranuleDiameter ~ Steepness)
> summary(model)

Call:
lm(formula = GranuleDiameter ~ Steepness)

Residuals:
     Min       1Q   Median       3Q      Max
-0.12826 -0.02434  0.01307  0.02739  0.08950

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.160913   0.030102   5.346  0.00107 **
Steepness   0.053061   0.006288   8.438 6.48e-05 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 0.06739 on 7 degrees of freedom
Multiple R-squared: 0.9105, Adjusted R-squared: 0.8977
F-statistic: 71.2 on 1 and 7 DF, p-value: 6.475e-05

> sqrt(0.9105)
[1] 0.9542012
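
The square root of R-squared gives only the magnitude of the correlation; its sign follows the sign of the slope. As a cross-check, the following minimal sketch reuses the Problem 1 data and compares that value with the one returned by `cor()`. The names `r_from_model` and `r_direct` are introduced here purely for illustration.

```r
# Beach data from Problem 1
Steepness <- c(0.629999995, 0.699999988, 0.819999993, 0.879999995,
               1.149999976, 1.5, 4.400000095, 7.300000191, 11.30000019)
GranuleDiameter <- c(0.170000002, 0.189999998, 0.219999999,
                     0.234999999, 0.234999999, 0.300000012, 0.349999994,
                     0.419999987, 0.850000024)

model <- lm(GranuleDiameter ~ Steepness)

# r recovered from the model: sqrt(R^2), signed by the slope
r_squared <- summary(model)$r.squared
r_from_model <- sign(coef(model)[["Steepness"]]) * sqrt(r_squared)

# r computed directly from the two variables
r_direct <- cor(Steepness, GranuleDiameter)

print(c(r_from_model = r_from_model, r_direct = r_direct))
```

For a simple regression with one predictor, the two values should agree up to floating-point error.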

In order to support my belief that the relationship is linear, I am creating a residual plot to confirm it.

> plot(model)
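
`plot(model)` steps through several diagnostic plots, the first of which is residuals against fitted values. As a rough sketch, that single plot can also be drawn by hand from the model's fitted values and residuals. The names `fitted_values` and `residual_values` and the axis labels are choices made for this example.

```r
# Beach data from Problem 1
Steepness <- c(0.629999995, 0.699999988, 0.819999993, 0.879999995,
               1.149999976, 1.5, 4.400000095, 7.300000191, 11.30000019)
GranuleDiameter <- c(0.170000002, 0.189999998, 0.219999999,
                     0.234999999, 0.234999999, 0.300000012, 0.349999994,
                     0.419999987, 0.850000024)

model <- lm(GranuleDiameter ~ Steepness)

# Fitted value and residual for each of the nine beaches
fitted_values <- fitted(model)
residual_values <- resid(model)

# Residuals against fitted values, with a dashed reference line at 0
plot(fitted_values, residual_values,
     main = "Residuals vs Fitted",
     xlab = "Fitted granule diameter", ylab = "Residual")
abline(h = 0, lty = 2)

# Print the residuals so the pattern can also be inspected numerically
print(round(residual_values, 5))
```

With only nine observations, any pattern in this plot should be read with caution.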

From the summary of the model and the residual plot, we have met the assumptions that are necessary for linear regression analysis. They are:
1. The correlation coefficient ‘r’ is strong: r = 0.954 is close to 1.
2. The relationship is linear. This is confirmed by the residual plot, where the residuals are scattered around the 0 line.
3. The relationship is causal.

As these assumptions are met, we can now write a formula that predicts the granule size from the steepness of the beach.
The formula is:
Granule Diameter ^ = 0.161 + 0.053 * steepness of the beach
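
To show how the formula is used, here is a minimal sketch that predicts the granule diameter for one beach in two ways: with `predict()` on the fitted model, and by hand with the coefficients rounded to three decimals. The steepness value of 2 is a hypothetical input chosen inside the observed range; `new_beach` and `by_hand` are illustrative names.

```r
# Beach data from Problem 1
Steepness <- c(0.629999995, 0.699999988, 0.819999993, 0.879999995,
               1.149999976, 1.5, 4.400000095, 7.300000191, 11.30000019)
GranuleDiameter <- c(0.170000002, 0.189999998, 0.219999999,
                     0.234999999, 0.234999999, 0.300000012, 0.349999994,
                     0.419999987, 0.850000024)

model <- lm(GranuleDiameter ~ Steepness)

# Hypothetical beach with a steepness of 2 (assumed value, not from the data)
new_beach <- data.frame(Steepness = 2)

# Prediction from the fitted model, using full-precision coefficients
predicted <- predict(model, newdata = new_beach)

# Same prediction by hand, using coefficients rounded to three decimals
b <- round(coef(model), 3)
by_hand <- b[["(Intercept)"]] + b[["Steepness"]] * new_beach$Steepness

print(c(predict = unname(predicted), by_hand = by_hand))
```

The two results differ only by the rounding of the coefficients. Predictions for steepness values far outside the observed range would be extrapolations.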

PROBLEM # 2

1. Generating the graph

> povertyPercentage <- c(20.1, 7.1, 16.1, 14.9, 16.7, 8.8, 9.7,
+   10.3, 22, 16.2, 12.1, 10.3, 14.5, 12.4, 9.6, 12.2, 10.8, 14.7, 19.7,
+   11.2, 10.1, 11, 12.2, 9.2, 23.5, 9.4, 15.3, 9.6, 11.1, 5.3, 7.8,
+   25.3, 16.5, 12.6, 12, 11.5, 17.1, 11.2, 12.2, 10.6, 19.9, 14.5,
+   15.5, 17.4, 8.4, 10.3, 10.2, 12.5, 16.7, -1.5, 12.2)
> BirthRate <- c(54.5, 39.5, 61.2, 59.9, 41.1, 47, 25.8, 46.3, 69.1,
+   44.5, 55.7, 38.2, 39.1, 42.2, 44.6, 32.5, 43, 51, 58.1, 25.4, 35.4,
+   23.3, 34.8, 27.5, 64.7, 44.1, 36.4, 37, 53.9, 20, 26.8, 62.4, 29.5,
+   52.2, 27.2, 39.5, 58, 36.8, 31.6, 35.6, 53, 38, 54.3, 64.4, 36.8,
+   24.2, 37.6, 33, 45.5, 32.3, 39.9)
> plot(povertyPercentage, BirthRate)
> plot(povertyPercentage, BirthRate, main = "POVERTY VS BIRTH RATE")
> plot(povertyPercentage, BirthRate, main = "POVERTY VS BIRTH RATE",
+   xlab = "Percentage of people living in poverty", ylab = "Teen Birth Rate")
> abline(lm(BirthRate ~ povertyPercentage))

2. Generating the model:

> model <- lm(BirthRate ~ povertyPercentage)
> summary(model)

Call:
lm(formula = BirthRate ~ povertyPercentage)

Residuals:
     Min       1Q   Median       3Q      Max
-19.0644  -7.4246  -0.4238   7.8092  15.5325

Coefficients:
                  Estimate Std. Error t value Pr(>|t|)
(Intercept)        19.4172     3.7970   5.114 5.23e-06 ***
povertyPercentage   1.7665     0.2765   6.390 5.85e-08 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 9.19 on 49 degrees of freedom
Multiple R-squared: 0.4545, Adjusted R-squared: 0.4434
F-statistic: 40.83 on 1 and 49 DF, p-value: 5.851e-08

> plot(model)

By examining the dataset, the graph, and the residual plot, I identified an outlier that is a data error: the percentage of people living in poverty in a state can never be negative. The lowest possible value is 0 percent.
So I am removing the negative poverty percentage (the outlier) and the corresponding teen birth rate observation from the dataset and generating the regression model again.
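
The invalid observation can also be located with a range check instead of by eye. The sketch below assumes the two vectors are paired by position and keeps them together in a data frame, so that dropping a row removes both values at once. The names `states`, `valid`, and `cleaned` are introduced for this example.

```r
# Problem 2 data: 51 paired observations
povertyPercentage <- c(20.1, 7.1, 16.1, 14.9, 16.7, 8.8, 9.7,
  10.3, 22, 16.2, 12.1, 10.3, 14.5, 12.4, 9.6, 12.2, 10.8, 14.7, 19.7,
  11.2, 10.1, 11, 12.2, 9.2, 23.5, 9.4, 15.3, 9.6, 11.1, 5.3, 7.8,
  25.3, 16.5, 12.6, 12, 11.5, 17.1, 11.2, 12.2, 10.6, 19.9, 14.5,
  15.5, 17.4, 8.4, 10.3, 10.2, 12.5, 16.7, -1.5, 12.2)
BirthRate <- c(54.5, 39.5, 61.2, 59.9, 41.1, 47, 25.8, 46.3, 69.1,
  44.5, 55.7, 38.2, 39.1, 42.2, 44.6, 32.5, 43, 51, 58.1, 25.4, 35.4,
  23.3, 34.8, 27.5, 64.7, 44.1, 36.4, 37, 53.9, 20, 26.8, 62.4, 29.5,
  52.2, 27.2, 39.5, 58, 36.8, 31.6, 35.6, 53, 38, 54.3, 64.4, 36.8,
  24.2, 37.6, 33, 45.5, 32.3, 39.9)

# Keep each poverty value together with its birth rate
states <- data.frame(povertyPercentage, BirthRate)

# A percentage must lie between 0 and 100
valid <- states$povertyPercentage >= 0 & states$povertyPercentage <= 100

# Show the rows that fail the check, then keep only the valid rows
print(states[!valid, ])
cleaned <- states[valid, ]
print(nrow(cleaned))
```

Filtering with a logical vector, rather than a negative index, also behaves sensibly when no row fails the check.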

Generating the graph after removal of the outlier:

> PovertyPercentage <- c(20.1, 7.1, 16.1, 14.9, 16.7, 8.8, 9.7, 10.3,
+   22, 16.2, 12.1, 10.3, 14.5, 12.4, 9.6, 12.2, 10.8, 14.7, 19.7, 11.2,
+   10.1, 11, 12.2, 9.2, 23.5, 9.4, 15.3, 9.6, 11.1, 5.3, 7.8, 25.3,
+   16.5, 12.6, 12, 11.5, 17.1, 11.2, 12.2, 10.6, 19.9, 14.5, 15.5,
+   17.4, 8.4, 10.3, 10.2, 12.5, 16.7, 12.2)
> # Drop the teen birth rate that belonged to the removed -1.5 observation
> # (position 50), so that both vectors again have 50 values
> BirthRate <- BirthRate[-50]
> plot(PovertyPercentage, BirthRate)
> plot(PovertyPercentage, BirthRate, main = "POVERTY VS BIRTH RATE")
> plot(PovertyPercentage, BirthRate, main = "POVERTY VS BIRTH RATE",
+   xlab = "Percentage of people living in poverty", ylab = "Teen Birth Rate")
> abline(lm(BirthRate ~ PovertyPercentage))

Generating the model:

> model <- lm(BirthRate ~ PovertyPercentage)
> summary(model)

Call:
lm(formula = BirthRate ~ PovertyPercentage)

Residuals:
     Min       1Q   Median       3Q      Max
-19.5956  -6.7355  -0.6259   7.5750  15.7252

Coefficients:
                  Estimate Std. Error t value Pr(>|t|)
(Intercept)        15.7266     4.1482   3.791 0.000419 ***
PovertyPercentage   2.0224     0.2991   6.762 1.71e-08 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 8.937 on 48 degrees of freedom
Multiple R-squared: 0.4879, Adjusted R-squared: 0.4772
F-statistic: 45.72 on 1 and 48 DF, p-value: 1.705e-08

> sqrt(0.4879)
[1] 0.6984984

> plot(model)

After generating the regression model again without the outlier, I have seen that:
1. The correlation coefficient ‘r’ is moderately strong, i.e., r = 0.698 is reasonably close to 1.
2. The relationship is linear. This is confirmed by the residual plot, where the residuals are scattered around the 0 line.
3. In my opinion, the relationship is not causal, as the percentage of people living in poverty is not a causing factor for the teen birth rate. For the purposes of the course, however, I am assuming this relationship is causal and carrying out the linear regression analysis.

Our formula is:
Teen Birth rate ^ = 15.73 + 2.02 * percentage of people living in poverty
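
To see how much the single invalid observation moved the fitted line, both models can be fitted side by side. This is only a sketch under the assumption that the outlier is the row with the negative poverty percentage; `states`, `fit_all`, `fit_clean`, and `comparison` are names chosen for this example.

```r
# Problem 2 data: 51 paired observations, including the invalid -1.5 value
states <- data.frame(
  povertyPercentage = c(20.1, 7.1, 16.1, 14.9, 16.7, 8.8, 9.7,
    10.3, 22, 16.2, 12.1, 10.3, 14.5, 12.4, 9.6, 12.2, 10.8, 14.7, 19.7,
    11.2, 10.1, 11, 12.2, 9.2, 23.5, 9.4, 15.3, 9.6, 11.1, 5.3, 7.8,
    25.3, 16.5, 12.6, 12, 11.5, 17.1, 11.2, 12.2, 10.6, 19.9, 14.5,
    15.5, 17.4, 8.4, 10.3, 10.2, 12.5, 16.7, -1.5, 12.2),
  BirthRate = c(54.5, 39.5, 61.2, 59.9, 41.1, 47, 25.8, 46.3, 69.1,
    44.5, 55.7, 38.2, 39.1, 42.2, 44.6, 32.5, 43, 51, 58.1, 25.4, 35.4,
    23.3, 34.8, 27.5, 64.7, 44.1, 36.4, 37, 53.9, 20, 26.8, 62.4, 29.5,
    52.2, 27.2, 39.5, 58, 36.8, 31.6, 35.6, 53, 38, 54.3, 64.4, 36.8,
    24.2, 37.6, 33, 45.5, 32.3, 39.9)
)

# Model on all rows, and model on rows with a non-negative percentage
fit_all <- lm(BirthRate ~ povertyPercentage, data = states)
fit_clean <- lm(BirthRate ~ povertyPercentage,
                data = subset(states, povertyPercentage >= 0))

# Intercept and slope of both fits in one table
comparison <- rbind(all_rows = coef(fit_all),
                    without_outlier = coef(fit_clean))
print(round(comparison, 4))
```

The two rows of the table should correspond to the coefficients in the two model summaries, which makes the effect of one bad data point on the slope and intercept easy to read off.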

Since there is no causing factor, we cannot predict the birth rate from the
percentage of people living in poverty. However, I have created the model for the
purpose of practice.
