Chapter 3 Using Numerical Measures To Describe Data
Chapter 3 Using Numerical Measures To Describe Data
BUSINESS
PROGRAM: UPM
COURSE: Busines Statistics1
PROFESSOR: Hend Ghazzai
TERM Fall 2021
:
1
Statistics for
Business and Economics
8th Edition
Chapter 3
COPYRIGHT © 2013 PEARSON EDUCATION, INC. PUBLISHING AS PRENTICE HALL Ch. 2-2
Chapter Topics
Measures of central tendency, variation, and shape
◦ Mean, median, mode, geometric mean
◦ Quartiles
◦ Range, interquartile range, variance and standard deviation, coefficient of variation
◦ Symmetric and skewed distributions
Mode Variance
Standard Deviation
Coefficient of Variation
Population
values
Population size
Observed
values
Sample size
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Mean = 3 Mean = 4
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Median = 3 Median = 3
Note that is not the value of the median, only the position
of the median in the ranked data
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 1 2 3 4 5 6
No Mode
Mode = 9
Copyright © 2013 Pearson Education, Inc.
CH. 2-11
Publishing as Prentice Hall
Review Example
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
House Prices:
Mean: ($3,000,000/5)
$2,000,000
500,000
= $600,000
300,000
100,000
100,000 Median: middle value of ranked data
Sum 3,000,000 = $300,000
◦ Example: Median home prices may be reported for a region – less sensitive to
outliers
Measures of shape
◦ Symmetric or skewed
Misleading result
Does not represent the
variation of your money after
two years
Accurate result
18
Exercises
To practice more solve
Exercises 2.6 and 2.7 on page 48.
19
Measures of Relative Standing
Percentiles
Quartiles
20
Percentiles and Quartiles
Example: An IQ score at the 90th percentile means that 10% of the population has a
higher IQ score and 90% have a lower IQ score.
Q1 Q2 Q3
■ The first quartile, Q1, is the value for which 25% of the
observations are smaller and 75% are larger
■ Q2 is the same as the median (50% are smaller, 50% are larger)
■ Only 25% of the observations are greater than the third quartile
(n = 9)
Q1 = is in the 0.25(9+1) = 2.5 position of the ranked data
so use the value half way between the 2nd and 3rd values,
so Q1 = 12.5
minimum
first quartile
median
third quartile
maximum
26
2.2
Measures of Variability
Variation
Same center,
Copyright © 2013 Pearson Education, Inc.
different variation
CH. 2-27
Publishing as Prentice Hall
Range
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 14 - 1 = 13
7 8 9 10 11 12 7 8 9 10 11 12
Range = 12 - 7 = 5 Range = 12 - 7 = 5
Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Eliminate high- and low-valued observations and calculate the range of the
middle 50% of the data
IQR = Q3 - Q1
Example:
Median X
X Q1 Q3
(Q2) maximum
minimum
12 30 45 57 70
34
Population Variance
◦ Population variance:
◦ Sample variance:
n=8 Mean = x = 16
11 12 13 14 15 16 17 18 19 20 21
s = 3.338
(compare to the two cases
Data A below)
11 12 13 14 15 16 17 18 19 20 21
s = 0.926
(values are concentrated
Data B near the mean)
11 12 13 14 15 16 17 18 19 20 21
s = 4.570
(values are dispersed far
Data C from the mean)
Copyright © 2013 Pearson Education, Inc.
CH. 2-41
Publishing as Prentice Hall
Advantages of Variance and
Standard Deviation
◦ Select:
data / data analysis / descriptive statistics
◦ Enter details in dialog box
Click OK
Microsoft Excel
descriptive statistics output,
using the house price data:
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Can be used to compare two or more sets of data measured in different units
49
Chebychev’s Theorem
For any population with mean μ and standard deviation σ , and k > 1 ,
the percentage of observations that fall within the interval
[μ + kσ]
Is at least
◦ Examples:
At least within
68%
95% 99.7%
54
z-Score
61
2.4 Measures of Relationships Between Variables
Covariance
◦ a measure of the direction of a linear relationship between two variables
Correlation Coefficient
◦ a measure of both the direction and the strength of a linear relationship between
two variables
Unit free
X X X
r = -1 r = -.6 r=0
Y
Y Y
X X X
r = +1 r = +.3 r=0
Copyright © 2013 Pearson Education, Inc.
CH. 2-67
Publishing as Prentice Hall
Using Excel to Find
the Correlation Coefficient
Select Data / Data Analysis
r = .733
There is a relatively
strong positive linear
relationship between
test score #1
and test score #2
Students who scored high on the first test tended to score
high on second test
71
Chapter Summary
Described measures of central tendency
◦ Mean, median, mode