Statistical Tests in R

CMT 315

1 Correlation
Correlation measures the strength and direction of the relationship between two variables. The most
common types of correlation are Pearson's correlation coefficient and Spearman's rank correlation
coefficient.

1.1 Pearson Correlation Coefficient


The Pearson correlation coefficient, denoted as r, measures the linear relationship between two continuous variables.

1.1.1 Formula
The Pearson correlation coefficient is given by:

r = ∑(Xi − X̄)(Yi − Ȳ) / √( ∑(Xi − X̄)² ∑(Yi − Ȳ)² )    (1)
where:

• Xi ,Yi are individual data points,

• X̄, Ȳ are the means of X and Y ,

• The numerator is proportional to the covariance between X and Y,

• The denominator scales the covariance by the product of the standard deviations, so r always lies in [−1, 1].

1.1.2 Interpretation
• r = 1: Perfect positive correlation

• r = −1: Perfect negative correlation

• r = 0: No linear correlation

1.2 R Code for Pearson Correlation


# Sample Data
x <- c(10, 20, 30, 40, 50)
y <- c(5, 15, 25, 35, 45)

# Compute Pearson correlation
cor(x, y, method = "pearson")
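As a quick sanity check, Equation (1) can be evaluated by hand and compared with the built-in function. A minimal sketch (the name r_manual is just illustrative):

```r
# Sample data (same as above)
x <- c(10, 20, 30, 40, 50)
y <- c(5, 15, 25, 35, 45)

# Numerator: sum of products of deviations (proportional to the covariance)
num <- sum((x - mean(x)) * (y - mean(y)))
# Denominator: square root of the product of the sums of squared deviations
den <- sqrt(sum((x - mean(x))^2) * sum((y - mean(y))^2))

r_manual <- num / den
all.equal(r_manual, cor(x, y, method = "pearson"))  # TRUE
```

Since y = x − 5 here, the relationship is perfectly linear and r is exactly 1.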

1.3 Spearman Rank Correlation


The Spearman rank correlation coefficient, denoted as ρ (rho), measures the strength and direction of
a monotonic relationship between two variables by ranking the data.

1.3.1 Formula
ρ = 1 − 6 ∑ di² / (n(n² − 1))    (2)
where:
• di is the difference between the ranks of corresponding values of X and Y ,
• n is the number of observations.

1.3.2 Interpretation
Similar to Pearson correlation, ρ ranges from −1 to 1, with the same interpretation.

1.3.3 R Code for Spearman Correlation


# Compute Spearman correlation
cor(x, y, method = "spearman")
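Equation (2) can likewise be checked against cor() by ranking the data explicitly (a small sketch; rho_manual is an illustrative name):

```r
x <- c(10, 20, 30, 40, 50)
y <- c(5, 15, 25, 35, 45)

d <- rank(x) - rank(y)                        # rank differences d_i
n <- length(x)
rho_manual <- 1 - 6 * sum(d^2) / (n * (n^2 - 1))

rho_manual == cor(x, y, method = "spearman")  # TRUE
```

Note that the di² formula is exact only when there are no ties; with ties, cor(..., method = "spearman") computes the Pearson correlation of the ranks instead.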

1.4 Differences Between Pearson and Spearman Correlation


• Pearson measures linear relationships, while Spearman measures monotonic relationships.
• Pearson uses actual data values, while Spearman uses ranked values.
• Spearman is more robust to outliers and non-normal distributions.

1.5 Conclusion
Both Pearson and Spearman correlation coefficients are useful depending on the nature of the data. Pearson is appropriate for linear relationships, whereas Spearman is suitable for monotonic relationships and non-parametric data.

2 Simple Linear Regression


2.1 Introduction
Simple Linear Regression (SLR) is a statistical method that models the relationship between a dependent variable (Y) and a single independent variable (X) using a linear equation.

2.2 Model Formulation


The simple linear regression model is given by:
Y = β0 + β1 X + ε (3)
where:
• Y is the dependent variable (response variable),
• X is the independent variable (predictor),
• β0 is the intercept,
• β1 is the slope,
• ε is the random error term.

2.3 Estimation of Parameters
Using the method of least squares, the estimates of β0 and β1 are obtained as:

β̂1 = ∑(Xi − X̄)(Yi − Ȳ) / ∑(Xi − X̄)²    (4)

β̂0 = Ȳ − β̂1 X̄    (5)


where:

• X̄ = (1/n) ∑ Xi is the mean of X,

• Ȳ = (1/n) ∑ Yi is the mean of Y.
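Equations (4) and (5) translate directly into R. The following sketch (the names b1_hat and b0_hat are illustrative) reproduces the coefficients that lm() reports for the data used in Section 2.5:

```r
X <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)
Y <- c(2.3, 2.5, 3.1, 3.8, 4.2, 4.8, 5.0, 5.6, 6.1, 6.8)

# Least-squares estimates from Equations (4) and (5)
b1_hat <- sum((X - mean(X)) * (Y - mean(Y))) / sum((X - mean(X))^2)
b0_hat <- mean(Y) - b1_hat * mean(X)

# Compare with lm(): coef() returns (intercept, slope)
all.equal(unname(coef(lm(Y ~ X))), c(b0_hat, b1_hat))  # TRUE
```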

2.4 Goodness of Fit: R²

The coefficient of determination (R²) measures how well the regression line fits the data:

R² = 1 − SSres / SStot    (6)
where:

• SStot = ∑(Yi − Ȳ)² (Total Sum of Squares),

• SSres = ∑(Yi − Ŷi)² (Residual Sum of Squares).
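Equation (6) can be verified against the R-squared value that summary() reports. A small sketch using the same data as Section 2.5:

```r
X <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)
Y <- c(2.3, 2.5, 3.1, 3.8, 4.2, 4.8, 5.0, 5.6, 6.1, 6.8)
fit <- lm(Y ~ X)

ss_res <- sum(residuals(fit)^2)   # SSres: residual sum of squares
ss_tot <- sum((Y - mean(Y))^2)    # SStot: total sum of squares
r2_manual <- 1 - ss_res / ss_tot

all.equal(r2_manual, summary(fit)$r.squared)  # TRUE
```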

2.5 Implementation in R
# Load necessary library
library(ggplot2)

# Sample data
X <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10)
Y <- c(2.3, 2.5, 3.1, 3.8, 4.2, 4.8, 5.0, 5.6, 6.1, 6.8)

# Fit linear model
model <- lm(Y ~ X)

# Display model summary
summary(model)

# Plot the regression line (base graphics)
plot(X, Y, main = "Simple Linear Regression", xlab = "X", ylab = "Y", pch = 19, col = "blue")
abline(model, col = "red", lwd = 2)

# Same plot with ggplot2
ggplot(data.frame(X, Y), aes(x = X, y = Y)) +
  geom_point(color = "blue") +
  geom_smooth(method = "lm", col = "red") +
  labs(title = "Simple Linear Regression", x = "X", y = "Y")

3 Wilcoxon Signed-Rank Test (One-Sample & Paired)
The Wilcoxon Signed-Rank Test is a non-parametric alternative to the one-sample and paired t-tests.
It is used when the assumptions of normality are violated. This test assesses whether the median
of a single sample differs from a specified value (one-sample test) or whether the median difference
between paired observations is zero (paired test).

3.1 Assumptions
• The data are paired (for paired tests) or from a single sample (for one-sample tests).

• The differences (for paired tests) or sample values (for one-sample tests) are measured on at
least an ordinal, ideally continuous, scale.

• The differences are symmetrically distributed around the median.

3.2 Hypotheses
For the one-sample test:

H0 : The median of the sample is equal to a specified value m0 .


HA : The median of the sample is different from m0 .

For the paired test:

H0 : The median of the differences is zero.


HA : The median of the differences is not zero.

3.3 Test Statistic


For a paired test:

1. Compute differences di = Xi −Yi .

2. Rank absolute differences |di |, ignoring zeros.

3. Compute the test statistic:

W = ∑ R+    (7)

where the sum runs over the ranks R+ assigned to the positive differences.

3.4 Decision Rule


Compare the test statistic W to the critical value from the Wilcoxon Signed-Rank table (for small
samples) or use the normal approximation for large samples.
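The three steps above can be traced by hand in R. A self-contained sketch (the sample x matches the one used in the R code below; V mirrors the name wilcox.test() gives the one-sample statistic):

```r
x <- c(10, 12, 14, 15, 17, 18, 19, 21, 23, 25)
d <- x - 15          # differences from the hypothesised median m0 = 15
d <- d[d != 0]       # step 2: ignore zero differences
r <- rank(abs(d))    # rank the absolute differences (ties get average ranks)
W <- sum(r[d > 0])   # step 3: sum of ranks of positive differences

W  # 34.5, the value wilcox.test(x, mu = 15) reports as V
```

Because of the zero difference and the tied ranks, wilcox.test() warns that an exact p-value cannot be computed and falls back to the normal approximation.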

3.5 R Code
# One-sample test
x <- c(10, 12, 14, 15, 17, 18, 19, 21, 23, 25)
wilcox.test(x, mu = 15, alternative = "two.sided")

# Paired test
y <- c(9, 11, 13, 14, 16, 17, 18, 20, 22, 24)
wilcox.test(x, y, paired = TRUE, alternative = "two.sided")

4 Mann-Whitney U Test (Wilcoxon Rank-Sum Test)


Used for comparing two independent groups.

4.1 Formula
U = n1 n2 + n1(n1 + 1)/2 − R1    (8)
where R1 is the sum of ranks for group 1.

4.2 R Code
group1 <- c(10, 15, 14, 18, 20)
group2 <- c(12, 17, 16, 19, 22)
wilcox.test(group1, group2, alternative = "two.sided")
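Equation (8) can be computed by hand and related to the W statistic that wilcox.test() prints. A sketch (note the two use different but equivalent conventions):

```r
group1 <- c(10, 15, 14, 18, 20)
group2 <- c(12, 17, 16, 19, 22)
n1 <- length(group1)
n2 <- length(group2)

R1 <- sum(rank(c(group1, group2))[seq_len(n1)])  # rank sum of group 1
U  <- n1 * n2 + n1 * (n1 + 1) / 2 - R1           # Equation (8)

W <- unname(wilcox.test(group1, group2)$statistic)
c(U = U, W = W)  # U = 16, W = 9; they satisfy U + W = n1 * n2
```

R's wilcox.test() reports W = R1 − n1(n1 + 1)/2, whereas Equation (8) gives the U statistic for the other group; the two always sum to n1·n2, and either can be used with the appropriate table.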

5 Kruskal-Wallis Test
Used for comparing more than two independent groups (non-parametric ANOVA).

5.1 Formula
H = 12/(N(N + 1)) ∑ (Rj²/nj) − 3(N + 1)    (9)

where R j is the sum of ranks for group j and n j is the sample size for group j.

5.2 R Code
groupA <- c(15, 18, 21, 24, 27)
groupB <- c(17, 20, 23, 26, 29)
groupC <- c(19, 22, 25, 28, 31)
data <- data.frame(values = c(groupA, groupB, groupC),
group = factor(rep(1:3, each = 5)))
kruskal.test(values ~ group, data = data)
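Equation (9) can be verified against kruskal.test() when there are no ties, as is the case here. A minimal sketch:

```r
values <- c(15, 18, 21, 24, 27,   # group A
            17, 20, 23, 26, 29,   # group B
            19, 22, 25, 28, 31)   # group C
g <- factor(rep(1:3, each = 5))

N  <- length(values)
Rj <- tapply(rank(values), g, sum)   # rank sum R_j per group
nj <- tapply(values, g, length)      # group sizes n_j
H  <- 12 * sum(Rj^2 / nj) / (N * (N + 1)) - 3 * (N + 1)

all.equal(H, unname(kruskal.test(values, g)$statistic))  # TRUE
```

With tied values, kruskal.test() additionally applies a tie correction, so the plain formula would then differ slightly.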

6 Spearman’s Rank Correlation


Measures the strength and direction of a monotonic relationship.

6.1 Formula
ρ = 1 − 6 ∑ di² / (n(n² − 1))    (10)
where di is the rank difference.

6.2 R Code
x <- c(1, 2, 3, 4, 5)
y <- c(2, 3, 5, 6, 7)
cor.test(x, y, method = "spearman")

7 Kendall’s Tau Correlation


Used for ordinal data or small sample sizes.

7.1 Formula
τ = (C − D) / (C + D)    (11)
where C is the number of concordant pairs and D is the number of discordant pairs.

7.2 R Code
cor.test(x, y, method = "kendall")
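Counting the concordant and discordant pairs explicitly shows how Equation (11) arises. A small sketch using the data from Section 6.2:

```r
x <- c(1, 2, 3, 4, 5)
y <- c(2, 3, 5, 6, 7)

p <- combn(length(x), 2)   # all pairs of indices (i, j)
s <- sign(x[p[1, ]] - x[p[2, ]]) * sign(y[p[1, ]] - y[p[2, ]])
C <- sum(s > 0)            # concordant pairs
D <- sum(s < 0)            # discordant pairs

tau <- (C - D) / (C + D)
all.equal(tau, unname(cor.test(x, y, method = "kendall")$estimate))  # TRUE
```

Here both sequences are strictly increasing, so all 10 pairs are concordant and τ = 1.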

8 Friedman Test
Used for comparing repeated measures across multiple treatments.

8.1 Formula
Q = 12/(nk(k + 1)) ∑ Rj² − 3n(k + 1)    (12)

where n is the number of blocks (subjects), k is the number of treatments (conditions), and Rj is the
rank sum for treatment j.

8.2 R Code
treatment1 <- c(15, 20, 25)
treatment2 <- c(18, 22, 28)
treatment3 <- c(16, 21, 27)
friedman.test(y = c(treatment1, treatment2, treatment3),
groups = rep(1:3, each = 3),
blocks = rep(1:3, 3))
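Equation (12) can be traced by hand by ranking within each block. A sketch with the same data arranged as a matrix (rows = blocks/subjects, columns = treatments):

```r
# Rows = blocks (subjects), columns = treatments
m <- cbind(t1 = c(15, 20, 25),
           t2 = c(18, 22, 28),
           t3 = c(16, 21, 27))

R  <- apply(m, 1, rank)   # rank within each block; columns of R are blocks
Rj <- rowSums(R)          # rank sum per treatment
n  <- nrow(m)             # number of blocks
k  <- ncol(m)             # number of treatments

Q <- 12 * sum(Rj^2) / (n * k * (k + 1)) - 3 * n * (k + 1)
Q  # 6, matching friedman.test(m)$statistic
```

Note that friedman.test() also accepts a matrix directly (treatments in columns, blocks in rows), which is often less error-prone than the y/groups/blocks form shown above.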
