Chapter 7

statistics

Uploaded by

Ahmed Mohamed

INTRODUCTION TO
STATISTICS & PROBABILITY

Chapter 7: Inference for Distributions

Dr. Nahid Sultana

7.1 Inference for the Mean of a Population

• The t Distributions
• One-Sample t Confidence Interval
• One-Sample t Test
• Matched Pairs t Procedures

Example – Sweetening colas

Cola manufacturers want to test how much the sweetness of a new cola
drink is affected by storage. The sweetness difference due to storage was
evaluated by 10 professional tasters (by comparing the sweetness before
and after storage):

Taster   Sweetness difference (D = Before – After)
1         2.0
2         0.4
3         0.7
4         2.0
5        −0.4
6         2.2
7        −1.3
8         1.2
9         1.1
10        2.3

We want to test if storage results in a loss of sweetness:

$H_0: \mu_{\text{diff}} = 0$ ; $H_a: \mu_{\text{diff}} > 0$

This looks familiar. However, here we do not know the population
parameter σ. This situation is very common with real data.

When σ is unknown
The sample s.d. s provides an estimate of the population s.d. σ.

• When the sample size is large, s is a good estimate of σ. The sample is
  likely to contain elements representative of the whole population.
• But when the sample size is small, s is a poor estimate of σ. The sample
  contains only a few individuals.

The t Distributions

When the sampling distribution of x̄ is close to Normal, we can find
probabilities involving x̄ by standardizing:

$z = \dfrac{\bar{x} - \mu}{\sigma / \sqrt{n}}$

When we don’t know σ, we can estimate it using the sample standard
deviation sx. What happens when we standardize?

$?? = \dfrac{\bar{x} - \mu}{s_x / \sqrt{n}}$

This new statistic does not have a Normal distribution!

Standard deviation s – standard error s/√n

• When σ is unknown, we estimate it with the sample standard deviation s.
• Then we estimate the standard deviation of x̄ by s/√n.
• This quantity is called the standard error of the sample mean, and we
  denote it by SEM.

Example: A simple random sample of five female basketball players is
selected. Their heights (in cm) are 170, 175, 169, 183, and 177.
What is the standard error of the mean of these height measurements?

Solution: Sample mean = 174.8
Sample s.d., s = √32.2 ≈ 5.675
SEM = s/√n = √(32.2/5) = 2.538.
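
The arithmetic in this solution can be checked with a few lines of Python. This is only a sketch using the standard library; the variable names are illustrative and do not come from the slides.

```python
import math
import statistics

heights_cm = [170, 175, 169, 183, 177]   # the five heights from the example

n = len(heights_cm)
x_bar = statistics.mean(heights_cm)      # sample mean
s = statistics.stdev(heights_cm)         # sample s.d. (divides by n - 1)
sem = s / math.sqrt(n)                   # standard error of the mean

print(f"mean = {x_bar:.1f}, s^2 = {s**2:.1f}, SEM = {sem:.3f}")
```

Note that `statistics.stdev` uses the n − 1 divisor, which is what the solution above assumes when it reports s = √32.2.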

The t Distributions

Suppose that an SRS of size n is drawn from an N(µ, σ) population.

• When σ is known, the sampling distribution of x̄ is N(µ, σ/√n).
• When σ is estimated by the sample standard deviation s, the standardized
  statistic t = (x̄ − µ)/(s/√n) follows a t distribution with n − 1
  degrees of freedom.

The t Distributions
When comparing the density curves of the standard Normal distribution
and t distributions, several facts are apparent:

• The density curves of the t distributions are similar in shape to the
  standard Normal curve.
• The spread of the t distributions is a bit greater than that of the
  standard Normal distribution.
• The t distributions have more probability in the tails and less in the
  center than does the standard Normal.
• As the degrees of freedom increase, the t density curve approaches the
  standard Normal curve even more closely.

We can use Table D to determine critical values t* for t distributions with
different degrees of freedom.

Standardizing the data before using Table D

As with the Normal distribution,

• the first step is to standardize the data.
• Then we can use Table D to obtain the area under the curve.

The standardized value is

$t = \dfrac{\bar{x} - \mu}{s / \sqrt{n}}$

Here, µ is the mean (center) of the sampling distribution, and the
standard error of the mean s/√n is its standard deviation (width).

Using Table D

Suppose you want to construct a 95% confidence interval for the mean
µ of a Normal population based on an SRS of size n = 12.
What critical t* should you use?

In Table D, we consult the row corresponding to df = n – 1 = 11.
We move across that row to the entry that is directly above the 95%
confidence level.

        Upper-tail probability p
df      .05     .025    .02     .01
10      1.812   2.228   2.359   2.764
11      1.796   2.201   2.328   2.718
12      1.782   2.179   2.303   2.681
z*      1.645   1.960   2.054   2.326
        90%     95%     96%     98%
        Confidence level C

The critical value is therefore t* = 2.201.
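
As a cross-check on the Table D lookup, the sketch below recovers t* numerically with the Python standard library only. The helper names (`t_pdf`, `t_upper_tail`, `t_critical`) are illustrative and are not part of the slides. The tail area is approximated with Simpson's rule and then inverted by bisection, so the result is an approximation and the search assumes t* is below 50.

```python
import math

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """Approximate P(T > t) using Simpson's rule on the interval [0, t]."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

def t_critical(upper_tail_p, df):
    """Find t* with P(T > t*) = upper_tail_p by bisection (assumes t* < 50)."""
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_upper_tail(mid, df) > upper_tail_p:
            lo = mid    # tail area still too large, so t* lies further right
        else:
            hi = mid
    return (lo + hi) / 2

n = 12
confidence = 0.95
t_star = t_critical((1 - confidence) / 2, df=n - 1)
print(f"t* for df = {n - 1} at {confidence:.0%} confidence: {t_star:.3f}")
```

The upper-tail probability passed in is (1 − C)/2, which is why the 95% column of Table D sits under p = .025.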

One-Sample t Confidence Interval

• For a level C confidence interval, C is the area between −t* and t*.
• We find t* from Table D for df = n − 1 and confidence level C.

The One-Sample t Interval for a Population Mean

Choose an SRS of size n from a population having unknown mean µ.
A level C confidence interval for µ is:

$\bar{x} \pm t^* \dfrac{s_x}{\sqrt{n}}$

where t* is the critical value for the t(n – 1) distribution.

The margin of error is $t^* \dfrac{s_x}{\sqrt{n}}$.

This interval is exact when the population distribution is Normal and
approximately correct for large n in other cases.

Example: Listening to music on cell phones.
U.K. subscribers with 3G phones spent an average of 8.3 hours per month
listening to full-track music on their cell phones.
Suppose we want to determine a 95% confidence interval for the U.S.
average and draw the following random sample of size 8 from the U.S.
population of 3G subscribers: 5 6 0 4 11 9 2 3

Here the sample mean is x̄ = 5, the s.d. is s = 3.63, and df = n − 1 = 7.

The standard error is s/√n = 3.63/√8 ≈ 1.28.

t* = 2.365

The 95% CI is x̄ ± t* · s/√n = 5 ± (2.365)(1.28) ≈ 5 ± 3.0 = (2.0, 8.0)
hours per month.
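
The interval for the music example can be reproduced as follows. This is a minimal sketch: the critical value 2.365 is taken from the example itself rather than computed, and the variable names are illustrative.

```python
import math
import statistics

hours = [5, 6, 0, 4, 11, 9, 2, 3]   # sample of U.S. 3G subscribers
t_star = 2.365                       # critical value quoted in the example (df = 7, 95%)

n = len(hours)
x_bar = statistics.mean(hours)
s = statistics.stdev(hours)          # sample s.d. with the n - 1 divisor
se = s / math.sqrt(n)                # standard error of the mean
margin = t_star * se                 # margin of error

low, high = x_bar - margin, x_bar + margin
print(f"x_bar = {x_bar}, s = {s:.2f}, SE = {se:.2f}")
print(f"95% CI: ({low:.2f}, {high:.2f})")
```

Because the code carries s at full precision instead of the rounded 3.63, its endpoints can differ in the second decimal place from a hand calculation.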

Example

A manufacturer of high-resolution video terminals must control the tension
on the mesh of fine wires that lies behind the surface of the viewing
screen. The tension is measured by an electrical device with output
readings in millivolts (mV). A random sample of 20 screens has the
following mean and standard deviation:

x̄ = 306.32 mV and sx = 36.21 mV

We want to estimate the true mean tension µ of all the video terminals
produced this day at a 90% confidence level.

Solution: Since n = 20, we use the t distribution with df = 19 to find the
critical value. From Table D, we find t* = 1.729.

        Upper-tail probability p
df      .10     .05     .025
18      1.330   1.734   2.101
19      1.328   1.729   2.093
20      1.325   1.725   2.086
        80%     90%     95%
        Confidence level C

Therefore, the 90% confidence interval for µ is:

$\bar{x} \pm t^* \dfrac{s_x}{\sqrt{n}} = 306.32 \pm 1.729 \dfrac{36.21}{\sqrt{20}} = 306.32 \pm 14 = (292.32, 320.32)$

We are 90% confident that the interval from 292.32 to 320.32 mV captures
the true mean tension in the entire batch of video terminals produced
that day.
Example

Assume that you don’t know what the population standard deviation is. You
draw a sample of 30 screws and calculate their mean length. The mean for
your sample is 4.8, and the standard deviation of your sample (s) is 0.4
centimeters.

What is the 98% confidence interval for the population mean? Round
your answer to two decimal places.

The One-Sample t Test

$t = \dfrac{\bar{x} - \mu_0}{s_x / \sqrt{n}}$

As in the previous chapter, a test of hypotheses requires a few steps:

1. Stating the null and alternative hypotheses (H0 versus Ha)

2. Calculating t and its degrees of freedom

3. Finding the area under the curve with Table D

4. Stating the P-value and interpreting the result

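The One-Sample t Test (Cont…)
The P-value is calculated as the corresponding area under the t(n − 1)
curve, one-tailed or two-tailed depending on Ha:

• Ha: µ > µ0: P-value = P(T ≥ t)
• Ha: µ < µ0: P-value = P(T ≤ t)
• Ha: µ ≠ µ0: P-value = 2P(T ≥ |t|)
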
Example

The level of dissolved oxygen (DO) in a river is an important indicator of
the water’s ability to support aquatic life.
A researcher measures the DO level at 15 randomly chosen locations along
a river. Here are the results in milligrams per liter:
4.53 5.04 3.29 5.23 4.13 5.50 4.83 4.40
5.42 6.38 4.01 4.66 2.87 5.73 5.55
A dissolved oxygen level below 5 mg/l puts aquatic life at risk.

We want to perform a test at the α = 0.05 significance level of:

H0 : µ = 5
Ha : µ < 5
where µ is the actual mean dissolved oxygen level in this river.

Example (Cont…)
The sample mean and standard deviation are: x̄ = 4.771 and sx = 0.9396

H0: µ = 5; Ha: µ < 5

Test statistic:

$t = \dfrac{\bar{x} - \mu_0}{s_x / \sqrt{n}} = \dfrac{4.771 - 5}{0.9396 / \sqrt{15}} = -0.94$

P-value: The P-value is the area to the left of t = –0.94 under the
t distribution curve with df = 15 – 1 = 14.
But P(t < –0.94) = P(t > 0.94).

        Upper-tail probability p
df      .25     .20     .15
13      .694    .870    1.079
14      .692    .868    1.076
15      .691    .866    1.074
        50%     60%     70%
        Confidence level C

The P-value is between 0.15 and 0.20, which is greater than our α = 0.05
significance level.
→ We fail to reject H0.
→ We don’t have enough evidence to conclude that the mean DO level in the
  stream is less than 5 mg/l.
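
The dissolved oxygen test can also be worked from the raw data. The sketch below uses only the Python standard library; `t_pdf` and `t_upper_tail` are illustrative helper names (not from the slides), and the tail area comes from Simpson's rule, so the P-value is approximate.

```python
import math
import statistics

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """Approximate P(T > t) using Simpson's rule on the interval [0, t]."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

do_levels = [4.53, 5.04, 3.29, 5.23, 4.13, 5.50, 4.83, 4.40,
             5.42, 6.38, 4.01, 4.66, 2.87, 5.73, 5.55]   # mg/l
mu_0 = 5.0      # hypothesized mean under H0
alpha = 0.05

n = len(do_levels)
x_bar = statistics.mean(do_levels)
s = statistics.stdev(do_levels)
t_stat = (x_bar - mu_0) / (s / math.sqrt(n))

# Ha: mu < 5, so the P-value is the lower-tail area P(T < t).
# By symmetry of the t curve, P(T < t) = P(T > -t).
p_value = t_upper_tail(-t_stat, df=n - 1)

print(f"x_bar = {x_bar:.3f}, s = {s:.4f}, t = {t_stat:.2f}, P = {p_value:.3f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

The computed P-value should land inside the 0.15 to 0.20 bracket read from Table D above.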
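
Example – Sweetening colas (cont…)
Is there evidence that storage results in sweetness loss for the new cola
recipe at the α = 5% level of significance?
H0: µ = 0 versus Ha: µ > 0 (one-sided test)

From the 10 differences, x̄ = 1.02 and s ≈ 1.196, so
t = (1.02 − 0)/(1.196/√10) ≈ 2.70 with df = 9. From Table D, the one-sided
P-value lies between 0.01 and 0.02, which is below 0.05, so we reject H0.

There is a significant loss of sweetness, on average, following storage.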
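
The cola conclusion can be checked against the ten sweetness differences listed at the start of the chapter. This is a sketch with the Python standard library; `t_pdf` and `t_upper_tail` are illustrative helper names (not from the slides), and the P-value is a numerical approximation.

```python
import math
import statistics

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """Approximate P(T > t) using Simpson's rule on the interval [0, t]."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

# Sweetness differences (before minus after storage) for the 10 tasters
diffs = [2.0, 0.4, 0.7, 2.0, -0.4, 2.2, -1.3, 1.2, 1.1, 2.3]
mu_0 = 0.0
alpha = 0.05

n = len(diffs)
d_bar = statistics.mean(diffs)
s_d = statistics.stdev(diffs)
t_stat = (d_bar - mu_0) / (s_d / math.sqrt(n))

# Ha: mu > 0, so the P-value is the upper-tail area P(T > t).
p_value = t_upper_tail(t_stat, df=n - 1)

print(f"mean diff = {d_bar:.2f}, s = {s_d:.3f}, t = {t_stat:.2f}, P = {p_value:.4f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

Because the data are within-taster differences, this one-sample calculation is already a matched pairs analysis.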

Matched Pairs t Procedures

• Comparative studies are more convincing than single-sample
  investigations. For that reason, one-sample inference is less common
  than comparative inference.
• Study designs that involve making two observations on the same
  individual, or one observation on each of two similar individuals,
  result in paired data.
• When paired data result from measuring the same quantitative variable
  twice, as in the Sweetening colas study, we can make comparisons by
  analyzing the differences in each pair.

Example: Pre-test and post-test studies look at data collected on the same
sample elements before and after some experiment is performed.

To compare the responses to the two treatments in a matched-pairs design,
find the difference between the responses within each pair. Then apply the
one-sample t procedures to these differences.

Conceptually, this is not different from tests on one population.

Example – Sweetening colas (revisited)
The sweetness loss due to storage was evaluated by 10 professional
tasters (comparing the sweetness before and after storage).

Although the text didn’t mention it explicitly, this is a pre-/post-test
design, and the variable is the difference in cola sweetness before minus
after storage. A matched pairs test of significance is indeed just like a
one-sample test.

Example: Does lack of caffeine increase depression?
Individuals diagnosed as caffeine dependent are assigned to receive daily
pills. Sometimes the pills contain caffeine, and other times they contain
a placebo. Depression was assessed.
For each individual in the sample, we have calculated a difference in
depression score (placebo minus caffeine).

Caffeine deprivation causes a significant increase in depression.

Matched Pairs t Procedures Example

The MeasureMind 3D MultiSensor metrology software is used by various
companies to measure complex machine parts. GE Healthcare discovered that
unchecking one software option reduced measurement time by 10%. To further
investigate this, the researchers measured 51 parts using the software
both with and without the option checked. (The order of checking the
option or not was randomized.) The difference in the measurements with the
option checked and unchecked was recorded.

We want to perform a test at the α = 0.05 significance level of

H0 : µ = 0
Ha : µ ≠ 0
where µ is the actual mean difference for the entire population of parts.

Matched Pairs t Procedures Example
The sample mean and standard deviation are x̄ = 0.0504 and sx = 0.6943.

Test statistic:

$t = \dfrac{\bar{x} - \mu_0}{s_x / \sqrt{n}} = \dfrac{0.0504 - 0}{0.6943 / \sqrt{51}} = 0.52$

P-value: The P-value is twice the area to the right of t = 0.52 under the
t distribution curve with df = 51 – 1 = 50.

        Upper-tail probability p
df      .25     .20     .15
40      0.681   0.851   1.050
50      0.679   0.849   1.047
60      0.679   0.848   1.045
        50%     60%     70%
        Confidence level C

The P-value is greater than 2 × 0.25 = 0.50.

Because this is greater than our α = 0.05 significance level, we fail to
reject H0. We do not have enough evidence to conclude that unchecking the
option has an impact on the measurement times.

However, a lack of statistical significance does not prove the null
hypothesis is true. The company may want to consider a larger study to
improve precision.
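
The two-sided P-value for this example can be approximated from the summary statistics alone. The sketch below uses only the Python standard library; `t_pdf` and `t_upper_tail` are illustrative helper names (not from the slides), and the tail area is computed with Simpson's rule, so the result is approximate.

```python
import math

def t_pdf(x, df):
    """Density of the t distribution with df degrees of freedom."""
    log_coef = math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
    coef = math.exp(log_coef) / math.sqrt(df * math.pi)
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)

def t_upper_tail(t, df, steps=2000):
    """Approximate P(T > t) using Simpson's rule on the interval [0, t]."""
    h = t / steps
    total = t_pdf(0.0, df) + t_pdf(t, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 - total * h / 3

# Summary statistics of the 51 paired differences (from the example)
n = 51
d_bar = 0.0504
s_d = 0.6943
mu_0 = 0.0
alpha = 0.05

t_stat = (d_bar - mu_0) / (s_d / math.sqrt(n))

# Ha: mu != 0, so the P-value is twice the upper-tail area beyond |t|.
p_value = 2 * t_upper_tail(abs(t_stat), df=n - 1)

print(f"t = {t_stat:.2f}, df = {n - 1}, two-sided P = {p_value:.3f}")
print("reject H0" if p_value < alpha else "fail to reject H0")
```

Doubling the tail beyond |t| keeps the calculation valid whichever sign the observed mean difference has.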
