THE BIG PICTURE OF STATISTICS
Theory
Question to answer / Hypothesis to test
Design Research Study
Collect Data
(measurements, observations)
Organize and make sense of the #s
USING STATISTICS!
Depends on our goal:
DESCRIPTIVE STATISTICS: describe characteristics; organize, summarize, and condense data
INFERENTIAL STATISTICS: test hypotheses, make conclusions, interpret data, understand relations
Some Definitions:
Construct: abstract, theoretical, hypothetical; can't be observed or measured directly (example: Intelligence)
Variable: reflects a construct, but is directly measurable and can differ from subject to subject (not a constant); variables can be Discrete or Continuous (examples: IQ, Vocabulary, Achievement)
Operational Definition: concrete and measurable; defines a variable by the specific operations used to measure it (examples: WISC score, SAT Vocabulary Test, Grades)
Types of Variables
Quantitative: measured in amounts (e.g., height, weight, test score)
Qualitative: measured in categories (e.g., gender, race, diagnosis)
Discrete: separate categories (e.g., letter grade)
Continuous: infinite values in between (e.g., GPA)
Scales of Measurement
Nominal Scale: Categories or labels; data carry no numerical value
Ordinal Scale: Rank-ordered data, but no information about the distance between ranks
Interval Scale: The degree of distance between scores can be assessed with standard-sized intervals
Ratio Scale: Same as an interval scale, but with an absolute zero point
Errors in Hypothesis Testing
Type I Errors
You reject a null hypothesis when you
shouldn’t
You conclude that you have an effect when
you really do not
The alpha level determines the probability of
a Type I Error (hence, called an “alpha error”)
Type II Errors
Failure to reject a false null hypothesis
Sometimes called a “Beta” Error.
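A minimal simulation sketch (Python, with hypothetical data) of the alpha idea: when the null hypothesis is actually true, a test run at alpha = .05 rejects the null on roughly 5% of samples.

# Monte Carlo sketch: when H0 is true, we reject about alpha of the time (Type I errors)
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
alpha = 0.05
n_sims, n = 10_000, 30
rejections = 0

for _ in range(n_sims):
    # sample from a population where H0 (mu = 0) is actually true
    sample = rng.normal(loc=0.0, scale=1.0, size=n)
    t_stat, p_value = stats.ttest_1samp(sample, popmean=0.0)
    if p_value < alpha:
        rejections += 1  # a Type I error

print(f"Empirical Type I error rate: {rejections / n_sims:.3f} (alpha = {alpha})")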
Statistical Power
How sensitive is a test to detecting real
effects?
A powerful test decreases the chances of
making a Type II Error
Ways of Increasing Power:
Increase sample size
Make alpha level less conservative
Use a one-tailed rather than a two-tailed test
Assumptions of Parametric
Hypothesis Tests (z, t, ANOVA)
Random sampling or random assignment
was used
Independent Observations
Variability is not changed by experimental
treatment (homogeneity of variance)
Distribution of Sample Means is normal
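A brief sketch (Python, hypothetical data) of how two of these assumptions are commonly checked in practice, using the Shapiro-Wilk test for normality and Levene's test for homogeneity of variance:

# Sketch: checking normality and homogeneity of variance before a parametric test
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
group_a = rng.normal(50, 10, size=40)  # hypothetical scores
group_b = rng.normal(55, 10, size=40)

# Shapiro-Wilk: H0 = the sample comes from a normal distribution
print("Shapiro-Wilk A p-value:", stats.shapiro(group_a).pvalue)
print("Shapiro-Wilk B p-value:", stats.shapiro(group_b).pvalue)

# Levene's test: H0 = the groups have equal variances (homogeneity of variance)
print("Levene p-value:", stats.levene(group_a, group_b).pvalue)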
Measuring Effect Size
Statistical significance alone does not imply a substantial
effect; just one larger than chance
Cohen’s d is the most common technique for assessing
effect size
Cohen's d = (difference between the means) / (population standard deviation)
d > 0.8 indicates a large effect
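A small sketch of the calculation with made-up scores; when the population standard deviation is unknown, the pooled sample standard deviation is often substituted:

# Cohen's d = (difference between means) / standard deviation
import numpy as np

treatment = np.array([34, 38, 41, 35, 39, 42, 37])   # hypothetical scores
control   = np.array([30, 33, 31, 35, 29, 32, 34])

mean_diff = treatment.mean() - control.mean()

# pooled standard deviation (substitute for the population SD when it is unknown)
n1, n2 = len(treatment), len(control)
pooled_var = ((n1 - 1) * treatment.var(ddof=1) + (n2 - 1) * control.var(ddof=1)) / (n1 + n2 - 2)
d = mean_diff / np.sqrt(pooled_var)

print(f"Cohen's d = {d:.2f}")  # d of about 0.8 or above would be considered a large effect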
Introduction to the t Statistic
Since we usually do not know the population variance,
we must use the sample variance to estimate the
standard error
Remember: s² = SS/(n – 1) = SS/df
Estimated standard error: s_M = √(s²/n)
t = (M – μ0)/s_M
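A minimal sketch (Python, hypothetical sample) that implements these formulas directly and checks the result against scipy.stats.ttest_1samp:

# One-sample t: s^2 = SS/(n-1), s_M = sqrt(s^2/n), t = (M - mu0)/s_M
import numpy as np
from scipy import stats

scores = np.array([101, 96, 108, 112, 99, 105, 110, 94])  # hypothetical sample
mu0 = 100                                                  # value under the null hypothesis

n = len(scores)
M = scores.mean()
SS = np.sum((scores - M) ** 2)
s2 = SS / (n - 1)              # sample variance (SS/df)
s_M = np.sqrt(s2 / n)          # estimated standard error of the mean
t = (M - mu0) / s_M

print(f"t = {t:.3f}, df = {n - 1}")
print("scipy check:", stats.ttest_1samp(scores, popmean=mu0).statistic)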
Differences between the distribution of
the t statistic and the normal curve
The t distribution only approximates the normal distribution when n is very large. Why?
The more statistics you have in a formula, the more sources of
sampling fluctuation you will have.
M is the only statistic in the z formula, so z will be normal
whenever the distribution of sample means is normal
In “t” you have things fluctuating in both the numerator and the
denominator
Thus, there are as many different t distributions as there are
possible sample sizes. You have to know the degrees of
freedom (df) to know which distribution of t to use in a problem.
All t distributions are unimodal and symmetrical around zero.
Comparing Differences between
Means with t Tests
There are two kinds of t tests:
t Tests for Independent Samples
Also known as a “Between-Subjects” Design
Two totally different groups of subjects are compared;
randomly assigned if an experiment
t Tests for Related Samples
Also known as a “Repeated Measures” or “Within-Subjects”
or “Paired Samples” or “Matched Groups” Design
A group of subjects is compared to themselves in a different
condition
Each individual in one sample is matched to a specific
individual in the other sample
Paired Sample T-Test
• The paired sample t-test, sometimes called the dependent
sample t-test, is a statistical procedure used to determine
whether the mean difference between two sets of observations
is zero. In a paired sample t-test, each subject or entity is
measured twice, resulting in pairs of observations. Common
applications of the paired sample t-test include case-control
studies or repeated-measures designs. Suppose you are
interested in evaluating the effectiveness of a company training
program. One approach you might consider would be to
measure the performance of a sample of employees before and
after completing the program, and analyze the differences using
a paired sample t-test.
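A sketch of the training-program example with hypothetical before/after scores, using scipy.stats.ttest_rel for the paired-samples t test:

# Paired-samples t test: each employee measured before and after training (hypothetical data)
import numpy as np
from scipy import stats

before = np.array([62, 70, 58, 75, 66, 68, 71, 64])
after  = np.array([68, 74, 61, 79, 70, 67, 76, 69])

result = stats.ttest_rel(after, before)
print(f"mean difference = {np.mean(after - before):.2f}")
print(f"t = {result.statistic:.3f}, p = {result.pvalue:.4f}")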
Independent T-Test
• The Independent Samples t Test compares the means of two independent groups in order to determine whether there is statistical evidence that the associated population means are significantly different. The Independent Samples t Test is a parametric test.
• When using a two-tailed test, regardless of the direction of the relationship you hypothesize, you are testing for the possibility of the relationship in both directions. For example, we may wish to compare the mean of a sample to a given value x using a t-test. Our null hypothesis is that the mean is equal to x. A two-tailed test will test both whether the mean is significantly greater than x and whether the mean is significantly less than x. A one-tailed test will test either whether the mean is significantly greater than x or whether the mean is significantly less than x, but not both.
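A sketch (hypothetical group scores) using scipy.stats.ttest_ind; the alternative argument, available in recent SciPy versions, selects a one-tailed test:

# Independent-samples t test on two hypothetical groups
import numpy as np
from scipy import stats

group1 = np.array([82, 75, 90, 68, 77, 85, 80, 73])
group2 = np.array([70, 65, 72, 60, 68, 74, 66, 71])

# two-tailed test (default): difference in either direction
two_tailed = stats.ttest_ind(group1, group2)
print(f"two-tailed: t = {two_tailed.statistic:.3f}, p = {two_tailed.pvalue:.4f}")

# one-tailed test: only testing whether group1's mean is greater
one_tailed = stats.ttest_ind(group1, group2, alternative="greater")
print(f"one-tailed: t = {one_tailed.statistic:.3f}, p = {one_tailed.pvalue:.4f}")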
Advantages of Independent Sample
Designs
Independent Designs have no carryover effects
Independent designs do not suffer from fatigue or
practice effects
You do not have to worry about getting people to
show up more than once
Demand characteristics may be stronger in repeated-measures studies than in independent designs
Since more individuals participate in independent-design studies, the results may be more generalizable
Disadvantages of Independent
Sample Designs
Usually requires more subjects (larger n)
The effect of a variable cannot be assessed for each
individual, but only for groups as a whole
There will be more individual differences between
groups, resulting in more variability
Advantages of Paired-Sample
Designs
Requires fewer subjects
Reduces variability/more statistically efficient
Good for measuring changes over time
Eliminates problems caused by individual
differences
Effects of variables can be assessed for each
individual
Disadvantages of Paired Sample
Designs
Carryover effects (2nd measure influenced by 1st
measure)
Progressive Error (Fatigue, practice effects)
Counterbalancing is a way of controlling carryover and practice
effects
Getting people to show up more than once
Demand characteristics may be stronger
What is really going on with t Tests?
Essentially the difference between the means of the two
groups is being compared to the estimated standard
error.
t = (difference between group means) / (estimated standard error)
t = (variability due to the independent variable + variability due to chance) / (variability due to chance alone)
The t distribution is the sampling distribution of
differences between sample means. (comparing
obtained difference to standard error of differences)
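A sketch (hypothetical data) of this idea for the independent-samples case: compute the difference between group means, divide it by the estimated standard error of that difference (here using a pooled variance estimate), and check the result against scipy.stats.ttest_ind:

# t = (difference between group means) / (estimated standard error of the difference)
import numpy as np
from scipy import stats

g1 = np.array([12, 15, 11, 14, 16, 13])   # hypothetical group scores
g2 = np.array([9, 10, 12, 8, 11, 10])

n1, n2 = len(g1), len(g2)
pooled_var = ((n1 - 1) * g1.var(ddof=1) + (n2 - 1) * g2.var(ddof=1)) / (n1 + n2 - 2)
se_diff = np.sqrt(pooled_var / n1 + pooled_var / n2)   # estimated standard error of the difference
t_manual = (g1.mean() - g2.mean()) / se_diff

print(f"manual t = {t_manual:.3f}")
print("scipy t  =", round(stats.ttest_ind(g1, g2).statistic, 3))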
Assumptions underlying t Tests
Observations are independent of each other (except
between paired scores in paired designs)
Homogeneity of Variance
Samples drawn from a normally distributed
population
At least interval level numerical data
Analysis of Variance (ANOVA)
Use when comparing the differences between means
from more than two groups
The independent variable is known as a “Factor”
The different conditions of this variable are known as
“levels”
Can be used with independent groups: completely randomized single-factor ANOVA
Can be used with paired groups: repeated-measures ANOVA
The F Ratio (ANOVA)
F = (variance between groups) / (variance within groups)
F = (treatment effect + differences due to chance) / (differences due to chance)
F = (variance among sample means) / (variance due to chance or error)
The denominator of the F Ratio is known as the “error
term”
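A sketch (hypothetical scores for three levels of one factor) computing the F ratio from between- and within-group variance and checking it against scipy.stats.f_oneway; the final line anticipates the r² effect-size measure described below:

# F = (variance between groups) / (variance within groups), checked against scipy.stats.f_oneway
import numpy as np
from scipy import stats

groups = [np.array([4, 5, 6, 5]),      # hypothetical scores for three treatment levels
          np.array([7, 8, 6, 9]),
          np.array([10, 9, 11, 10])]

all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
k, N = len(groups), len(all_scores)

ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)

ms_between = ss_between / (k - 1)        # variance between groups
ms_within = ss_within / (N - k)          # variance within groups (the "error term")
F = ms_between / ms_within

print(f"manual F = {F:.3f}")
print("scipy F  =", round(stats.f_oneway(*groups).statistic, 3))
print(f"r^2 (effect size) = {ss_between / (ss_between + ss_within):.3f}")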
Evaluation of the F Ratio
Obtained F is compared with a critical value
If you get a significant F, all it tells you is that at
least one of the means is different from one of
the others
To figure out exactly where the differences are,
you must use Multiple Comparison Tests
Multiple Comparison Tests
The issue of “Experimentwise Error”
Results from an accumulation of “per comparison
errors”
Planned Comparisons
Can be done with t tests (must be few in number)
Unplanned Comparisons (Post Hoc tests)
Protect against experimentwise error
Examples:
Tukey’s HSD Test
The Scheffé Test
Fisher’s LSD Test
Newman-Keuls Test
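A post-hoc sketch with hypothetical groups; recent SciPy versions are assumed to provide scipy.stats.tukey_hsd (statsmodels' pairwise_tukeyhsd is an alternative):

# Post-hoc Tukey HSD after a significant F (assumes a recent SciPy with stats.tukey_hsd)
import numpy as np
from scipy import stats

g1 = np.array([4, 5, 6, 5])
g2 = np.array([7, 8, 6, 9])
g3 = np.array([10, 9, 11, 10])

res = stats.tukey_hsd(g1, g2, g3)
print(res)            # pairwise mean differences, confidence intervals, and p-values
print(res.pvalue)     # matrix of pairwise p-values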
Measuring Effect Size in ANOVA
Most common technique is r²
Tells you what percent of the variance is due to the treatment
r² = SS between groups / SS total
Single-Factor ANOVA
(One-Way ANOVA)
Can be Independent Measures
Can be Repeated Measures
Decision flowchart for choosing a test:
Do you know the population SD? Yes → use the Z test. No → ask how many groups you are comparing.
More than 2 groups → use ANOVA. If F is not significant, retain the null hypothesis; if F is significant, reject the null hypothesis and compare means with multiple comparison tests.
Only 2 groups → do you have independent data? Yes → use the independent-samples t test; No → use the paired-samples t test. If the t test is significant, reject the null hypothesis and compare the means; if not, retain the null hypothesis.
Correlational Method
No manipulation: just observe 2+
variables, then measure relationship
Also called:
Descriptive, Non-experimental, Naturalistic, Observational, or Survey design
Advantages & Disadvantages of
Correlational Methods
ADVANTAGE: Efficient for collecting lots of data in a
short time
ADVANTAGE: Can study problems you cannot study
experimentally
DISADVANTAGE: Leaves Cause-Effect Relationship
Ambiguous
DISADVANTAGE: No control over extraneous variables
The Uses of Correlation
Predicting one variable from another
Validation of Tests
Are test scores correlated with what they say
they measure?
Assessing Reliability
Consistency over time, across raters, etc
Hypothesis Testing
Correlation Coefficients
Can range from -1.0 to +1.0
The DIRECTION of a relationship is indicated by the sign
of the coefficient (i.e., positive vs. negative)
The STRENGTH of the relationship is indicated by how
closely the number approaches -1.0 or +1.0
The size of the correlation coefficient indicates the
degree to which the points on a scatterplot approximate
a straight line
As correlations increase, standard error of estimate gets smaller
& prediction becomes more accurate
The closer the correlation coefficient is to zero, the
weaker the relationship between the variables.
Types of Correlation Coefficients
The Pearson r
Most common correlation
Use with scale data (interval & ratio)
Only detects linear relationships
The coefficient of determination (r²) measures the proportion of variability in one variable accounted for by the other variable.
Used to measure “effect size” in ANOVA
The Spearman Correlation
Use with ordinal level data
Can assess correlations that are not linear
The Point-Biserial Correlation
Use when one variable is scale data but other variable is
nominal/categorical
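A sketch computing each coefficient on hypothetical data with SciPy:

# Pearson (linear, scale data), Spearman (ordinal/monotonic), and point-biserial correlations
import numpy as np
from scipy import stats

study_hours = np.array([2, 5, 1, 7, 4, 6, 3, 8])        # hypothetical scale data
exam_score  = np.array([55, 70, 50, 88, 66, 80, 60, 90])
passed      = np.array([0, 1, 0, 1, 1, 1, 0, 1])         # hypothetical dichotomous variable

r, p = stats.pearsonr(study_hours, exam_score)
print(f"Pearson r = {r:.3f}, r^2 = {r**2:.3f}, p = {p:.4f}")

rho, p = stats.spearmanr(study_hours, exam_score)
print(f"Spearman rho = {rho:.3f}, p = {p:.4f}")

rpb, p = stats.pointbiserialr(passed, exam_score)
print(f"Point-biserial r = {rpb:.3f}, p = {p:.4f}")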
Problems with Interpreting Pearson’s r
Cannot draw cause-effect conclusions
Restriction of range
Correlations can be misleading if you do not
have the full range of scores
The problem of outliers
Extreme outliers can disrupt correlations,
especially with a small n.
Introduction to Regression
In any scatterplot, there is a line that provides the “best
fit” for the data
This line identifies the “central tendency” of the data and it can
be used to make predictions in the following form:
Y = bX + a
"b" is the slope of the line, and "a" is the Y intercept (the value of Y when X = 0)
The statistical technique for finding the best fitting line is
called “linear regression,” or “regression”
What defines whether a line is the best fit or not?
The “least squares solution” (finding the line with the smallest
summed squared deviations between the line and data points)
The Standard Error of Estimate
Measure of “average error;” tells you the precision of your
predictions
As correlations increase, standard error of estimate gets smaller
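A sketch (hypothetical X and Y values) of fitting the least-squares line with scipy.stats.linregress and using it for prediction:

# Least-squares regression line Y = bX + a, with scipy.stats.linregress
import numpy as np
from scipy import stats

X = np.array([1, 2, 3, 4, 5, 6, 7, 8])                   # hypothetical predictor
Y = np.array([2.1, 2.9, 4.2, 4.8, 6.1, 6.9, 8.2, 8.8])   # hypothetical outcome

fit = stats.linregress(X, Y)
print(f"slope b = {fit.slope:.3f}, intercept a = {fit.intercept:.3f}")
print(f"r = {fit.rvalue:.3f}, r^2 = {fit.rvalue**2:.3f}, p = {fit.pvalue:.4f}")

# predict Y for a new X value using Y = bX + a
x_new = 10
print(f"predicted Y at X = {x_new}: {fit.slope * x_new + fit.intercept:.2f}")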
Simple Regression
Discovers the regression line that provides
the best possible prediction (line of best fit)
Tells you if the predictor variable is a
significant predictor
Tells you exactly how much of the
variance the predictor variable accounts
for
Multiple Regression
Gives you an equation that tells you how
well multiple variables predict a target
variable in combination with each other.
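A minimal sketch of the idea with two hypothetical predictors, fitting ordinary least-squares coefficients with NumPy (a full analysis would normally use a dedicated regression routine that also reports significance tests):

# Multiple regression sketch: predict a target from two predictors via ordinary least squares
import numpy as np

# hypothetical data: predict exam score from hours studied and hours slept
hours_studied = np.array([2, 5, 1, 7, 4, 6, 3, 8])
hours_slept   = np.array([7, 6, 5, 8, 6, 7, 5, 8])
exam_score    = np.array([55, 70, 50, 90, 66, 82, 58, 95])

# design matrix with a column of ones for the intercept
X = np.column_stack([np.ones_like(hours_studied, dtype=float), hours_studied, hours_slept])
coefs, residuals, rank, _ = np.linalg.lstsq(X, exam_score, rcond=None)

intercept, b_study, b_sleep = coefs
print(f"score = {intercept:.2f} + {b_study:.2f}*studied + {b_sleep:.2f}*slept")

# proportion of variance accounted for (R^2)
predicted = X @ coefs
ss_res = np.sum((exam_score - predicted) ** 2)
ss_tot = np.sum((exam_score - exam_score.mean()) ** 2)
print(f"R^2 = {1 - ss_res / ss_tot:.3f}")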
Nonparametric Statistics
Used when the assumptions for a
parametric test have not been met:
Data not on an interval or ratio scale
Observations not drawn from a normally
distributed population
Variance in groups being compared is not
homogeneous
The Chi-Square test is the most commonly used nonparametric test when nominal-level data are collected
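A sketch of a chi-square test of independence on hypothetical nominal counts, using scipy.stats.chi2_contingency:

# Chi-square test of independence on hypothetical nominal data (treatment x outcome)
import numpy as np
from scipy import stats

# rows: treatment A / treatment B; columns: improved / not improved (hypothetical counts)
observed = np.array([[30, 10],
                     [18, 22]])

chi2, p, dof, expected = stats.chi2_contingency(observed)
print(f"chi-square = {chi2:.3f}, df = {dof}, p = {p:.4f}")
print("expected counts under independence:\n", expected)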