0% found this document useful (0 votes)

123 views119 pages

Ix. Introduction To Statistical Concepts: Frequency Distribution Measures of Central Tendency Measures of Variability

This document introduces several key statistical concepts. It discusses frequency distributions, which organize a set of data into categories and their frequencies. It outlines the process for constructing grouped frequency distributions for quantitative data, including determining class intervals. The document also introduces measures of central tendency, specifically discussing how to calculate the arithmetic mean. The mean is defined as the sum of all values divided by the total number of values. Examples are provided to demonstrate calculating the mean from raw data sets and from grouped frequency distributions.

Uploaded by

AppleRoseAgravante

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

123 views119 pages

Ix. Introduction To Statistical Concepts: Frequency Distribution Measures of Central Tendency Measures of Variability

Uploaded by

AppleRoseAgravante

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 119

IX.

INTRODUCTION TO
STATISTICAL CONCEPTS

A . I N T R O D U C T I ON
B.FREQUENCY DISTRIBUTION
C . M E A S U R E S O F C E N T R A L T E N D E N CY
D.MEASURES OF VARIABILITY
Statistical Concepts

 Meaning of Statistics
 Basic Terms in Statistics
 Branches of Statistics
Why do we study Statistics?

 “people need to develop a discerning

sense of rational thought that will
enable them to evaluate data as to
make intelligent decisions, inferences
and generalizations” (J.T. McClave &
F. Dietrich Jr., 1985)
Meaning of Statistics

 entities that comprise numerical

information

 the study of rules, techniques or

methods use to collect, present, analyze
and interpret a set of numerical data

 numerical measures derived from a

smaller portion of a data set
What is Statistics?

 Tate(1955) has beautifully summarized

the different meaning of statistics…

“It’s all perfectly clear, you compute

statistics from statistics by statistics.”
Meaning of Statistics

“It’s all perfectly clear, you compute

statistics (mean, median, mode,
etc)
from statistics (numerical facts)
by statistics (statitical
metods).”(Tate, 1955)
Statistics as a discipline it is roughly
defined as a “science of data”
 Data means observations or evidences of a
characteristic of a population

 The scientific educational researches

require the data by means of some
standardized research tools or self-
designed instrument.

 Data are both qualitative and quantitative

in nature.
Basic Terms in Statistics

Population
set of - the totality of all

all
set of
observations or
entities under

votes consideration
some
votes Sample
- representative portion
of a population
Basic Terms in Statistics

Parameter
set - a number that
describes a
of all
set of
characteristic of a
population
votes
some
votes Statistic
- a number that describes a
characteristic of a sample
Branches of Statistics

Descriptive Statistics
-methods concerned with
collecting and describing a set of
data
Braches of Statistics

Inferential Statistics
-methods concerned with the analysis
of a sample leading to a
conclusion/generalization/inference
of the entire population
After getting data from the sample, how do
we organize and present them?

Using Frequency
Distribution
IX. INTRODUCTION TO
STATISTICAL CONCEPTS

B. FREQUENCY
DISTRIBUTION
What is a Frequency Distribution (FD)?

 A frequency distribution is a tabular

arrangement of data indicating the
different classes or categories and the
corresponding frequencies.

 A way of organizing set of data.

What is a Frequency Distribution (FD)?

 Two types of frequency distribution can

arise depending on whether the data
gathered are qualitative or quantitative.

 Qualitative data - variable can vary

only in “quality” or “attribute”
 Quantitative data - the variable can
vary in quantity
How do we construct FD?

For qualitative data, the

construction of the frequency
distribution is straightforward since
the categories are already defined.
Example of FD for qualitative variable marital status
Can we use the same procedure for quantitative data?

 Sometimes, but most of the times, the procedure is different

since reporting each score as a category may result into a table
with so many categories.

 Instead of reporting each score as a category, we condense

the scores into some categories.
Say for this example…
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 1. Find the range R of the

scores, where:

R = highest score – lowest score.

Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 2. Decide on the number of class

intervals or classes. We will denote
the number of class intervals by k.
 The ideal number according to some authors
is between 10 to 15 class intervals.
 Sturges’ formula
 Or
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 3. Determine the class size or

class width of the interval. If c
represents the class size, we use the
formula
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 4. Determine the lower limit LL and the upper

limit UL of the lowest class interval.
The lowest class interval should contain the lowest
score or value of the data set.
Hence LL is chosen as the score or value that is less
than or equal to the lowest score. Once the value of LL
has been determine, the value of the upper limit UL is
obtained using the equation
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 4.
Thus,

LL=65 (closest to 67 and a multiple of the class

size c=5)

UL=LL+(c – 1)
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 5. Determine the upper class

intervals by consecutively adding the class
size c to the values of LL and UL of the
lowest class interval until we get a class
interval containing the highest score of
the data set. The interval containing the
highest value is referred to as the
highest class interval.
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Step 6. Make a tally of the scores using

the obtained class intervals.

Step 7. Summarize the tallies by way of a

table similar to Table 2 in Example 1.
Procedure for Constructing a Grouped Frequency
Distribution for Quantitative Data

Summary of Steps
1. Find the Range (R= HS – LS)
2. Decide on the number of class interval (k=√𝑛)
𝑅
3. Identify the class size (𝑐 = )
𝑘
4. Find LL and UL. LL = LS, UL = LL + (c-1)
5. Find the frequency (f) of each interval.

See demonstration one the board

Next slide ( SCORES)
Say for this example…
NOTE…

The two numbers that define a class interval are

called apparent limits. To reflect the continuity of
the scores, the true limits or class boundaries
are indicated. These are obtained by adding all upper
limits and subtracting all lower limits by “one-half of
the difference between the apparent upper limit of
any class interval and the apparent lower limit of
the succeeding class interval”.
NOTE…

For instance, 79 is the apparent upper limit of

the class interval 75 – 79 and 80 is the
apparent lower limit of the succeeding class
interval . So, half of the difference or 79 and
80 is 0.5 which is the number to be added to
all upper limits and subtracted from all lower
limits to obtain the class boundaries of the
class intervals.
The class
boundaries
remove the
discontinuity
between class
intervals.
NOTE…

In addition to the frequency associated with

each class interval, other statistical
information such as the class midpoint or
class mark, the less than cumulative
frequency (<cf), the greater than
cumulative frequency (>cf), and the
relative frequency (rf) may be reflected in
the final table.
IX. INTRODUCTION TO
STATISTICAL CONCEPTS

C. MEASURES OF
CENTRAL TENDENCY
Measures of Central Tendency

In our everyday life, we are more often than not,

exposed to statements involving the concept of
“average” such as:
 the average price of car,
 the average amount of rainfall in the Philippines or
 the average salary of the nurses abroad.

Somehow, informed individuals generally make

decisions based on their understanding of this
important idea.
What is an “average”?

 values on the scale where most of

the other scores are clustered

 the most typical score

 can be classified as the mean,

median and mode
How is an “average” computed?

can be computed or merely inspected from

a set of data;

 either in their original form (called raw data)

 from that have been organized into frequency

distribution (called grouped data)
Measure of Central Tendency

Mean
 the balancer of the distribution
 if, the entire distribution is likened to a “see-
saw”, the mean serves as the “fulcrum”

• can be classified as an arithmetic mean,

weighted mean, harmonic mean, etc
Mean

Arithmetic Mean is obtained by adding the scores in a

distribution and dividing it by the number of scores,
that is:
𝑠𝑢𝑚 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠
𝑀𝑒𝑎𝑛 =
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠
Mean
Example 1. Compute the average score in
science of the 12 students who obtained the
following scores: 22, 24, 17, 21, 19, 18, 21, 30,
30, 17, 15, and 20.

In research, the rule is for us to round-up heavily,

which means that we only have to retain the
digits that are meaningful.
Mean
Example 2. The following data represent the
net take home pay of five rank and file
employees of a certain company. Determine the
average net take home pay of the employees.
NET TAKE HOME PAY: P4,750; P4,535;
P4,380; P3,895; and P9,307.
Mean
Finding mean of a frequency distribution.
 How do we estimate the mean of the data if
the scores are not available?
 The mean of a grouped data can be
computed by:

 where f = class frequency; X = class

midpoint; and N is the total number of
cases in the frequency distribution
Mean

Example 3. Find the mean of the data as shown in

the frequency distribution below.
Mean
Solution:
• First, we expand the table by constructing the
columns for X (midpoints) and
• fX, (the products of corresponding midpoints and
class frequencies).
Mean
We rounded off the
computed mean to the
nearest hundredths
because the class
intervals actually extend
up to the nearest tenths
below and above the
Using the summary given apparent limits.
values from the table,
the mean of the 40
scores is given by
Median (Raw Data)

 the score where half of the total number of scores is

found below it and the other half above it.
 the middle score when all the scores have been
ranked.
 no common symbol for the median. (but we use Md
for median)
 an appropriate measure of central tendency when
the variable is measured in at least the ordinal
scale.
Median (Raw Data)

Common application of median

Suppose you are tasked to put the
following words in a backdrop, which
letters will you start for it to be
symmetrical?
HAPPY
SUMMER
VACATION
Median (Raw Data)
The following are the steps in finding the median
based on raw data:

1. Arrange the data or scores in ascending order

(from lowest to highest);

2. If n is odd, there will be a middle score. This

middle score is the median.

If n is even, there will be two middle scores and the

median is taken as the arithmetic average of
the two middle scores.
Note:

Because the median depends only on

the number of cases, it is more
preferred than the mean whenever
extreme values occur in a data set.
Median (Raw Data)
Example 6. The scores of nine students in a
science test are: 22, 24, 17, 21, 19, 18, 21, 30, 30 .
Find the median score.
Solution: Arranging the scores in ascending order
and identifying the middle score, the median score is 21.
Median (Raw Data)
Example 7. The age (in years) of six teachers are
listed below 42, 23, 24, 30, 27, and 34. Find the
median age of the six teachers.
Solution: Because the number of cases is 6 (even), there will
be two middle scores. The age in ascending order are listed
below. From the arranged data, the two middle scores are 27
and 30. Hence, the median is 28.5.
Median (Raw Data)
When n is large, locating the position of the median by
ocular inspection may not be easy. However, based on
the definition given and the parity (oddness or
evenness) of n, we can use the following formulas for
locating the middle score (s).
Median (Raw Data)
 n 1
Thus, if n=157 (odd) formula if n is odd  th
 2 

 157  1 
The median is the  th  79th score
 2 
Thus, if n=346 (even)
n n 
formula if n is even  th and   1th score
2 2 
 346   346 
The median is the    173   1  174
 2   2 
Hence, the median is the average of the 173rd and 174th scores.
Median (Grouped Data)

The formula can be used in finding the median of a frequency

distribution

where,
• LL = true lower limit or lower class boundary of the
median class;
• Fb = the sum of all frequencies below the median class
(or the <cf directly below the median class)
• f = frequency corresponding to the median class; and
• c= class size.
Median (Grouped Data)

Example 8. Find the median of the following

frequency distribution.
Median (Grouped Data)

Solution: We note that = 50%(30) = 15. Looking at

the <cf, 15 is between 7 and 17.
Hence the median class is 80 – 84. With reference to
the median class, we have ; Fb = 7; f = 10; and c = 5.
Therefore, the value of the median is given by
Mode (Raw Data)

 the value or the score that occurs most

frequently in a collection of scores.

 represented by the tallest column on a

histogram or the highest peak in a
frequency polygon.

 appropriate when the variable is

measured in the nominal scale.
Mode (Raw Data)

Example 10. Find the modal age of the 10 Grade

III pupils whose ages are listed below:
Age x (in years): 10.25, 9.0, 10.25, 9.5, 9.0, 10, 9.0,
9.25, 10.75, 10.
Solution: We first arrange the scores in ascending order and
take note of the frequency of occurrence of each score.
Mode (Raw Data)

Find the mode of the following data set.

 1, 2, 3, 4, 5 1, 2, 3, 3, 4, 4
No Mode Modes = 3 and 4
 1, 2, 3, 4, 4
2, 2, 3, 3, 4, 4
No Mode
Mode = 4 2, 2, 2, 2, 2
Mode = 2
Mode (Grouped Data)

For data that are summarized in a frequency

distribution, an approximate value of the
mode called the “crude mode” is defined
as the midpoint of the class interval with the
highest class frequency (called the modal
class), and thus is also easily obtained by
inspection.
Mode (Grouped Data)
A more accurate value of the mode (exact mode) for grouped
data is also obtained by linear interpolation.
The resulting formula is given by

 where, LL = true lower limit or lower boundary of the modal

class;
 d1 = absolute difference between the frequencies of the
modal class and the lower class interval (interval just below
it);
 d2 = absolute difference between the frequencies of the
modal class and the higher class interval (interval just above
it);
 c = the class size.
Mode (Grouped Data)
Example 11. Find the exact mode based on the same
frequency distribution given in Example 3.
Solution: The modal class of the frequency distribution
shown below is 85 – 89 since it has the largest value of f.

Modal
Class
Mode (Grouped Data)
 Thus, with reference to the
modal class, the frequency of
the lower class interval is 7
Modal while the frequency of the
Class higher class interval is 8. The
values needed to compute the
exact mode are:

 LL = 84.5; d1 = |12 - 7| = 5 and d2

= |12 – 8| = 4 and c = 5.
Mode (Grouped Data)
Note…
There is also a rule in statistics called the
“empirical rule” which allows us to
compute the value of the mode of
grouped data when the mean and the
median are available. This rule is given
by the formula
Mode (Grouped Data)

For instance…
If the mean is 85.38 while the median is
86.17. Using these values, the value of the
mode using the empirical rule would be
Summary of Measures of Central Tendency
IX. INTRODUCTION TO
STATISTICAL CONCEPTS

D. MEASURES OF
RELATIVE POSITION
AND VARIABILITY
What have we learned?
Measures of a Distribution
 Measures of Central Tendency
 Mean
 Arithmetic Mean (Raw Scores and Frequency
Distribution)
 Weighted Mean
 Median

 Raw Scores and Frequency Distribution

 Mode
 Raw Scores and Frequency Distribution
 Relationship of the Three Measures of Central Tendency

 Summary (When to use, advantages and disadvantages)

What to discuss?
 Measures of Relative Position
 Quantiles
 Percentile,
Decile and Quartile
 How to compute Percentile Value (PV)

 How to find Percentile Rank

 Box Plot

 Measures of Variability
Range
Mean Absolute Deviation
Variance and Standard Deviation
Measures of Relative Position
Suppose your score in the LET is the 75th percentile
value, what does this mean? (actual LET 2009
question)

A. You got a score of 75 in the test.

B. You got 75% of the total number of items correct.
C. You are higher than 75% of those who took the test.
D. You are one of the top 75% of all the takers.

Remember your choice as we discuss “QUANTILES”…

What are “quantiles”?

 referred as measure of relative position

 values on a scale or distribution which

we can find a certain percent of all other
scores below this value

 these values divide the entire

distribution into “q” equal parts
for instance

lowest highest
score score

median
1 2
Median is a quantile that divides a distribution into two equal
parts which 50% of the entire scores fall below it.
Common Types of Quantiles

Number of Type of Symbol Number of

Partitions (q) Quantile Quantiles

2 Median M 1
3 Tertiles T 2
4 Quartiles Q 3

10 Deciles D 9
100 Percentile P 99
Deciles

 The values that divide the distribution

into 10 equal parts.

Formula:
𝑘
𝐷𝑘 = (𝑛 + 1)𝑡ℎ
10
*Linear Interpolation
𝐷𝑘 = LV + 𝑏(𝐻𝑉 − 𝐿𝑉
Quartiles

 The values that divide the distribution

into 4 equal parts.

Formula:
𝑘
𝑄𝑘 = (𝑛 + 1)𝑡ℎ
4
*Linear Interpolation
𝑄𝑘 = LV + 𝑏(𝐻𝑉 − 𝐿𝑉
Percentiles

 The values that divide the distribution

Into 100 equal parts.

Formula:
𝑘
𝑃𝑘 = (𝑛 + 1)𝑡ℎ
100
*Linear Interpolation
𝑃𝑘 = LV + 𝑏(𝐻𝑉 − 𝐿𝑉
Percentile

 The values that divide the distribution

into 100 equal parts are called percentiles.

 If Px denotes the xth percentile value, then

 Px = value on the scale below which we

can find x% of the scores.
Percentile
Thus, P90 or the 90th percentile value is the value in
the distribution below which we can find 90% of all
the other scores.
In a class consisting of 50 pupils, a pupil
whose final grade corresponds to or is greater
than P90 is said to belong to the upper 10% of
the entire pupils in the class.

This also means that his grade is better than

90%(50) = 45 pupils in the class.
Example

Find Q1, Q3 and P99 for the following data:

95 81 59 68 100 92 75 67 85 79

71 88 100 94 87 65 93 72 83 91

Solution: We first arrange the data in ascending order as

follows:
59 65 67 68 71 72 75 79 81 83

85 87 88 91 92 93 94 95 100 100
THINK ABOUT THIS 
Is it enough to simply describe a set of data using
averages?
Consider this instance…
Suppose manufacturers of matches claim that there
are 50 matchsticks in every matchbox.
 You took 7 samples of a certain brand, say Rizal, and
observed the following numbers:
45 46 47 49 47 48 47
 You took another brand (Fuego) and observed the
following sample
47 47 45 44 52
 Number of matchsticks for a sample of matchboxes:

 Rizal: 45 46 47 49 47 48 47
 Fuego: 47 47 45 44 52

 What Brand would you more likely select the next

time you buy a match?

 What possible statistics will help you decide on this?

 Finding the average number of matchsticks in each
set:

 Rizal: 45 46 47 49 47 48 47
 Fuego: 47 47 45 44 52

329
 Mean (R) = =47
7
235
 Mean (F) = =47
5

 In fact, both sets have a median and mode of 47.

NOTE…

 The situation shows two possible sets with

the same averages (mean, median and mode)
but of different values.

 Generally, the measure of central tendency

is not enough in describing a set of data.

 We must also consider how the scores vary

among each other.
 A number that describes how scores
vary among in a data
measure of variability

 Can also be called as the measure of

 spread

 scattering

 variation

 dispersion

 consistency

 heterogeneity
Measures of Variability

Range
 a crude estimate of variability

 difference between the highest score

and the lowest score

 R=HS – LS
Measures of Variability
 Finding the range of the number of matchsticks
in each set:
Rizal: 45 46 47 49 47 48 47
Fuego: 47 47 45 44 52

 Range (R) = 49 – 45 =4
 Range (F) = 52 – 44 = 8

 Since the range of Fuego is larger than that

of Rizal, the number of matchsticks of the
former is more varied than the latter.
Measures of Variability
 Since the range is just a crude estimate, there are
times that it may not compare different variations
correctly.

For instance, consider the following sets:

A: 1 2 3 4 5 6 7
B: 1 1 1 1 1 1 1 1 1 1 1 8

 What is the range of each set?

 How do you compare the ranges relative to the actual
variation of the set?
How do we solve this problem?

 we consider all the scores in finding a

measure of variation
 this is done by finding first a number that will
serve as reference point and where all the
scores will be assessed relative to this point
 what number can we use?
 It’s the MEAN, why?
 afterwards, we can now identify how far each
score is away from the mean
We take the case of Fuego…

What is the mean

of these scores?
-3 - 2 5
47
44 45 47 52

How do we identify
mean
the deviation of each
score (x) from the
mean (𝑥)?
ҧ
In tabular form…
How do we get a single
Scor Deviation
value that summarizes
e (score –
these deviations?
mean)
44 -3
45 -2 Answer:
47 0 we get the
47 0 average of these
52 5 deviations
Since the sum of the deviations from the mean
equals zero, we need to remove the negative sign.
How?

Scor Deviation Absolute

Answer:
(score – Deviation
e we get the
mean) absolute value
of these
44 -3 3 deviations
45 -2 2
47 0 0
47 0 0
52 5 5
We can now find the average of these
absolute deviations .

Score Deviation Absolute

(score – Deviation Mean Absolute
mean) Deviation or
44 -3 3 MAD
45 -2 2
3+2+0+0+5
47 0 0 =2
5
47 0 0
52 5 5
Mean Absolute Deviation

Average of the absolute deviations from the mean

𝑠𝑢𝑚 𝑜𝑓 𝑡ℎ𝑒 𝑎𝑏𝑠𝑜𝑙𝑢𝑡𝑒 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛𝑠 𝑜𝑓 𝑒𝑎𝑐ℎ 𝑠𝑐𝑜𝑟𝑒 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑚𝑒𝑎𝑛

MAD=
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠

∑ 𝒔𝒄𝒐𝒓𝒆−𝒎𝒆𝒂𝒏
MAD=
𝒏
But… Getting an absolute value is
NOT used in higher
Score Deviation statistics…
(score –
mean)
What mathematical
44 -3 operation can we use to
45 -2 eliminate the negative sign?
47 0
47 0 Answer: square the deviations
52 5
Squaring the deviations and getting the average of these
squared deviations…

Score Deviation Squared Mean Squared

(score – Deviations Deviation or
mean) MSD
44 -3 9
45 -2 4
9 + 4 + 0 + 0 + 25
47 0 0 = 7.6
5
47 0 0 commonly
52 5 25 known as the
VARIANCE
Variance
Average of the squared deviations
from the mean

𝑠𝑢𝑚 𝑜𝑓 𝑡ℎ𝑒 𝑠𝑞𝑢𝑎𝑟𝑒𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛𝑠 𝑜𝑓 𝑒𝑎𝑐ℎ 𝑠𝑐𝑜𝑟𝑒 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑚𝑒𝑎𝑛

MSD=
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑐𝑜𝑟𝑒𝑠

∑(𝒔𝒄𝒐𝒓𝒆−𝒎𝒆𝒂𝒏)𝟐
MSD=
𝒏
NOTE:
There is two types of variance

Biased Estimate s 2

 x  x
2

Unbiased Estimate s 2

 x  x
2

n 1

s  2
 2

n  x   x 
2

nn  1
Squaring the deviations and getting the average of these
squared deviations…

Score Deviation Squared Mean Squared

(score – Deviations Deviation or
mean) MSD
44 -3 9
45 -2 4
9 + 4 + 0 + 0 + 25
47 0 0 = 9.5
5−1
47 0 0 commonly known as
the VARIANCE
52 5 25 (Unbiased Estimate)
However…

 How do you compare the value of the

variance relative to the deviations?

 What do you observe with the units of

the scores, the mean and the variance?
Answer…
 How do you compare the value of the variance
relative to the deviations?

The variance is extremely different

compared with the deviations.

 What do you observe with the units of the

scores, the mean and the variance?

The unit of the variance is inconsistent

with that of the scores and the mean.
How do we solve the problem?

 We get the square root of the variance

and call the value as the standard
deviation…
 Standard deviation (s/sd) = 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒
∑(𝑠𝑐𝑜𝑟𝑒−𝑚𝑒𝑎𝑛)2
 𝑠𝑑 = (biased estimate)
𝑛

∑(𝑠𝑐𝑜𝑟𝑒−𝑚𝑒𝑎𝑛)2
 𝑠𝑑 = (unbiased estimate)
𝑛−1
Therefore…
If the variance are
7.6 (biased estimate)
9.5 (unbiased estimate)

The standard deviation are

Biased Estimate
𝟕. 𝟔 = 𝟐. 𝟕𝟔 matchsticks

Unbiased Estimate
𝟗. 𝟓 = 𝟑. 𝟎𝟖 matchsticks
We find the measure of
variability of the Rizal
brand using MS Excel…
CHARACTERISTICS OF DISTRIBUTIONS

The distribution of data derived from

the measurement of a continuous or
interval variable can be further
characterized using the graph of the
distribution.
By studying the graph closely, we can
describe its symmetry, peakedness, as
well as the modality.
1. Symmetry and Skewness

A symmetric distribution arises when a

vertical line drawn from the highest point
of the graph to the base line divides the
graph into two portions which are mirror
images of each other with respect to the
vertical line drawn.
1. Symmetry and Skewness

Skewness is the degree of departure of

the distribution from symmetry.

In general, distributions can either be

positively skewed or negatively skewed.
1. Skewness

A distribution is said to be positively

skewed when it tapers to the right.

Positive skewness can occur when most

scores or values in the distribution are
low and only few are high.
1. Skewness

A distribution is said to be negatively

skewed when it tapers to the left.

Negative skewness can occur when most

scores or values in the distribution are
high and only few are low.
2. Kurtosis

Kurtosis is defined as the peakedness of a

curve (highly peaked or lowly peaked)
Platykurtic Distribution

When the values in a set of data are

highly heterogeneous, the resulting graph
will be low peaked.
Leptokurtic Distribution

When the values in a set of data are

highly homogeneous, the resulting graph
will be high peaked.
Mesokurtic Distribution

A distribution that is neither highly

peaked nor low peaked and is said to have
an average kurtosis.
Modality of a Distribution

Characterized by the number of peaks of a

distribution.
Modality of a Distribution

-a distribution with
only one peak

-most common
Modality of a Distribution

-a distribution
with two peaks

-may not be of
the same heights
Relationship of the Three Measures of Central
Tendency

When the distribution of a set of data is

symmetric, the three measures of central
tendency have the same values
Relationship of the Three Measures of Central
Tendency

For skewed distributions…

“If you can’t quantify
what you are saying,
you don’t know what
you are saying”
Lord Kelvin

2
No ratings yet
2
2 pages
QM Statistic Notes
No ratings yet
QM Statistic Notes
24 pages
C-1 Introduction To Statistics
No ratings yet
C-1 Introduction To Statistics
212 pages
Gen Exam CH 1 SOLUTION
No ratings yet
Gen Exam CH 1 SOLUTION
6 pages
The Audit Standards' Setting Process
No ratings yet
The Audit Standards' Setting Process
39 pages
Statistics for Business Students
100% (1)
Statistics for Business Students
25 pages
Financial Accounting, 3e: Weygandt, Kieso, & Kimmel
No ratings yet
Financial Accounting, 3e: Weygandt, Kieso, & Kimmel
47 pages
Chapter 09
No ratings yet
Chapter 09
16 pages
Simple Linear Regression Analysis
No ratings yet
Simple Linear Regression Analysis
21 pages
Probability Distributions Guide
No ratings yet
Probability Distributions Guide
10 pages
Binomial Probability Distributions Guide
No ratings yet
Binomial Probability Distributions Guide
7 pages
Lecture Notes Learning Objectives - After Reading This Chapter, Students Should Be Able To
No ratings yet
Lecture Notes Learning Objectives - After Reading This Chapter, Students Should Be Able To
17 pages
Section: An Overview of Statistics
No ratings yet
Section: An Overview of Statistics
17 pages
Chap 007
0% (1)
Chap 007
77 pages
AK - STATISTIKA - 01 - Describing Data
No ratings yet
AK - STATISTIKA - 01 - Describing Data
26 pages
Tutoring Session 2023 - Statistics For Business
No ratings yet
Tutoring Session 2023 - Statistics For Business
65 pages
ch01 Descriptive Statistics
No ratings yet
ch01 Descriptive Statistics
48 pages
Internet Programming Project
No ratings yet
Internet Programming Project
4 pages
Statistics For Business and Economics: Describing Data: Numerical
No ratings yet
Statistics For Business and Economics: Describing Data: Numerical
40 pages
Book-Sher Muhammad Chaudary - 89-133 PDF
100% (1)
Book-Sher Muhammad Chaudary - 89-133 PDF
45 pages
ME-All Slides
No ratings yet
ME-All Slides
470 pages
Stats Chap12 Notes
No ratings yet
Stats Chap12 Notes
89 pages
Chapter - Two - Data Types and Data Collection v2
No ratings yet
Chapter - Two - Data Types and Data Collection v2
51 pages
Key Concepts in Descriptive Statistics
No ratings yet
Key Concepts in Descriptive Statistics
7 pages
Chapter7 MQM100 MultipleChoice PDF
No ratings yet
Chapter7 MQM100 MultipleChoice PDF
15 pages
Module 3 Topic 3 Lesson 2B Weighted Graphs PDF
No ratings yet
Module 3 Topic 3 Lesson 2B Weighted Graphs PDF
14 pages
BA Module 02 - 2.1 + 2.2
No ratings yet
BA Module 02 - 2.1 + 2.2
12 pages
Choosing The Correct Statistical Test
No ratings yet
Choosing The Correct Statistical Test
26 pages
Cse 205 All Java Slides
No ratings yet
Cse 205 All Java Slides
260 pages
Basic Business Statistics: Analysis of Variance
No ratings yet
Basic Business Statistics: Analysis of Variance
85 pages
Intro to Descriptive Statistics
No ratings yet
Intro to Descriptive Statistics
5 pages
UGC Statistics Curriculum 2001
No ratings yet
UGC Statistics Curriculum 2001
101 pages
Claasification of Costs
No ratings yet
Claasification of Costs
21 pages
Stat Quiz
0% (1)
Stat Quiz
5 pages
Past Exam 1
No ratings yet
Past Exam 1
6 pages
Chapter 1-Basic Statistical Concepts
No ratings yet
Chapter 1-Basic Statistical Concepts
30 pages
Skewness, Moments and Kurtosis
No ratings yet
Skewness, Moments and Kurtosis
23 pages
Albright DADM 6e - PPT - Ch05
No ratings yet
Albright DADM 6e - PPT - Ch05
48 pages
Statistics For Business and Economics: Estimation: Single Population
No ratings yet
Statistics For Business and Economics: Estimation: Single Population
46 pages
Exercises Java Advanced Features
No ratings yet
Exercises Java Advanced Features
26 pages
Confidence Interval Estimationnew
No ratings yet
Confidence Interval Estimationnew
12 pages
Bayes' Law and Probability Concepts
No ratings yet
Bayes' Law and Probability Concepts
7 pages
5 6221861509576786069 PDF
No ratings yet
5 6221861509576786069 PDF
20 pages
Statistics For Business and Economics: Hypothesis Testing II
No ratings yet
Statistics For Business and Economics: Hypothesis Testing II
48 pages
Statistics For Business and Economics: Discrete Random Variables and Probability Distributions
No ratings yet
Statistics For Business and Economics: Discrete Random Variables and Probability Distributions
59 pages
CH 03 - Descriptive Statistics: Numerical: Page 1
100% (3)
CH 03 - Descriptive Statistics: Numerical: Page 1
72 pages
Describing Data: Displaying and Exploring Data
100% (1)
Describing Data: Displaying and Exploring Data
30 pages
Statistics For Business and Economics,: 11E Anderson/Sweeney/Williams
100% (1)
Statistics For Business and Economics,: 11E Anderson/Sweeney/Williams
57 pages
Test Bank For Business Statistics in Practice 8th Edition by Bowerman Chapters 1 18
100% (2)
Test Bank For Business Statistics in Practice 8th Edition by Bowerman Chapters 1 18
27 pages
Probability
No ratings yet
Probability
36 pages
1.1 Introduction To Statistics and Data Gatherings
No ratings yet
1.1 Introduction To Statistics and Data Gatherings
102 pages
Karl Pearson's Measure of Skewness
No ratings yet
Karl Pearson's Measure of Skewness
27 pages
Business Statistics: A First Course: Fifth Edition
No ratings yet
Business Statistics: A First Course: Fifth Edition
20 pages
Estimation
No ratings yet
Estimation
53 pages
Chap 13
No ratings yet
Chap 13
60 pages
L24 - Example of Chi-Square Goodness-of-Fit Test, Chi-Square Cross Table, Association
No ratings yet
L24 - Example of Chi-Square Goodness-of-Fit Test, Chi-Square Cross Table, Association
24 pages
Lecture Notes 1 Cpe106 Assessment in Student Learning 2
No ratings yet
Lecture Notes 1 Cpe106 Assessment in Student Learning 2
21 pages
Research 3 Quarter 3 - MELC 1 Week 1-2 Inferential Statistics
No ratings yet
Research 3 Quarter 3 - MELC 1 Week 1-2 Inferential Statistics
39 pages
Psychological Statistics Midterm - 2023 2024
No ratings yet
Psychological Statistics Midterm - 2023 2024
7 pages
Statistical Analysis With Software Application - Week2
No ratings yet
Statistical Analysis With Software Application - Week2
76 pages
Brandon Jones 2017
No ratings yet
Brandon Jones 2017
6 pages
Applied Stats Exercises & Solutions
100% (1)
Applied Stats Exercises & Solutions
130 pages
Assessing Learning in A Classroom Environment
No ratings yet
Assessing Learning in A Classroom Environment
6 pages
Mean Median Mode
No ratings yet
Mean Median Mode
36 pages
2015 Scheme of Work Add Maths f4
100% (1)
2015 Scheme of Work Add Maths f4
11 pages
Math 7 Lesson Plan: Central Tendency
No ratings yet
Math 7 Lesson Plan: Central Tendency
4 pages
Continuous Probability Distribution Solutions
71% (7)
Continuous Probability Distribution Solutions
8 pages
INTRODUCTION TO STATISTICS Notes
No ratings yet
INTRODUCTION TO STATISTICS Notes
16 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
44 pages
Important MCQ's + S.Q + L.Q Class 10th-1
No ratings yet
Important MCQ's + S.Q + L.Q Class 10th-1
24 pages
STAT - 101 - TUTORIAL - 4 Solutions
No ratings yet
STAT - 101 - TUTORIAL - 4 Solutions
10 pages
Assignment: Whatsapp
No ratings yet
Assignment: Whatsapp
13 pages
Mean Median Mode and Dispersion Question For Exam
No ratings yet
Mean Median Mode and Dispersion Question For Exam
5 pages
Mathematics Project: Data Handling
No ratings yet
Mathematics Project: Data Handling
15 pages
LECTURE NO. 2 (Chapter 2 Normal Distribution)
100% (1)
LECTURE NO. 2 (Chapter 2 Normal Distribution)
10 pages
Biostatistics Mcqs With Key
100% (2)
Biostatistics Mcqs With Key
14 pages
Height Data Analysis and Statistics
No ratings yet
Height Data Analysis and Statistics
4 pages
Statistics Practice Paper 2024
No ratings yet
Statistics Practice Paper 2024
4 pages
Epsc 311
No ratings yet
Epsc 311
3 pages
Chapter 6 II Stastitic III ENHANCE
No ratings yet
Chapter 6 II Stastitic III ENHANCE
55 pages
To Locate Median Graphically
100% (1)
To Locate Median Graphically
4 pages
Recap
No ratings yet
Recap
75 pages
McGrath 2008. Schizophrenia A Concise Overview of Incidence, Prevalence, and Mortality
No ratings yet
McGrath 2008. Schizophrenia A Concise Overview of Incidence, Prevalence, and Mortality
10 pages
HW3 Solution
No ratings yet
HW3 Solution
8 pages
MMW Finals Reviewer
No ratings yet
MMW Finals Reviewer
3 pages
Partnership Operation Exercises With Answers and Solutions Merged
No ratings yet
Partnership Operation Exercises With Answers and Solutions Merged
2 pages
Gamma Extended Frechet Distribution
No ratings yet
Gamma Extended Frechet Distribution
23 pages
TREND User Guide
No ratings yet
TREND User Guide
29 pages
Biology Laboratory Manual 11th Edition Vodopich Solutions Manual
No ratings yet
Biology Laboratory Manual 11th Edition Vodopich Solutions Manual
11 pages
Quality Control & Reliability Guide
No ratings yet
Quality Control & Reliability Guide
38 pages