[go: up one dir, main page]

0% found this document useful (0 votes)
122 views30 pages

Data Organization and Presentation-1

1. The document discusses various methods for organizing and presenting statistical data, including through frequency distribution tables, graphs, and measures of central tendency. 2. It provides steps for constructing a frequency distribution table from a sample data set and includes examples of various statistical graphs like histograms, pie charts, and ogives. 3. The document also discusses measures used to analyze distributions, such as mean, median, mode, standard deviation, and coefficients of variation and skewness.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
122 views30 pages

Data Organization and Presentation-1

1. The document discusses various methods for organizing and presenting statistical data, including through frequency distribution tables, graphs, and measures of central tendency. 2. It provides steps for constructing a frequency distribution table from a sample data set and includes examples of various statistical graphs like histograms, pie charts, and ogives. 3. The document also discusses measures used to analyze distributions, such as mean, median, mode, standard deviation, and coefficients of variation and skewness.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 30

STATISTICS

Data Organization and


Presentation
Introduction:
Gathered data can be made more interesting
by presenting them in a better form like tables
and graphs. This is usually considered because
the readers do not appreciate reading a
statistical report on the current population if
the report is just a list of numbers.
Objective:

In this chapter we are going to learn how to


present the data in tabular and graphical
forms.
After gathering all data needed
in the research study, the next step
is to organized and present them
in a better from.
Organization and Presentation
of Data
Frequency Distribution
A frequency distribution is a device for
organizing and presenting grouped data. When the
set of data contains 30 or more cases, frequency
distribution table may be constructed to make the
task more manageable.

Data that are not presented in a frequency


distribution table are called ungrouped data.
Steps in Setting up a Frequency
Distribution Table
1. Find the Range, R, Using the formula:
R = Highest observation(HO) – Lowest Observation(LO)
2. Compute for the number of class intervals, k
k = 1 + 3.3 x log (n)
where n = no. of observations
3. Compute for the class size, i
i = R/k
4. Using the lowest observation as lower limit,
add (i -1) to it to obtain the upper limit of
the desired class interval.
5.The succeeding class intervals can be
obtained by adding i to both the lower and
upper limits of the preceding class interval.

6.Terminate this process when the highest


observation has been contained.

7. When the desired number of classes has been


reached, tally the raw data and indicate the
frequency for each class by counting the
elements. Write these under the column
denoted by f (for frequency).
Sample Problem
The following data show the ages of a sample of 50
customers of a certain supermarket. Construct a
frequency distribution.

51 23 43 38 35 37 28 29 37 29

10 21 27 23 35 33 20 27 39 53

40 53 23 11 13 35 18 19 50 47

41 55 36 48 39 25 17 34 46 37

59 42 27 28 60 48 26 47 14 52
n = 50
LO = 10
HO = 60
Range = 60 -10 = 50
K = 1+ 3.3 log(50) = 6.6 = 7
i = 50/7 = 7.14 = 7
First lower limit = 10
First upper limit = 10 + (i – 1) = 16
= 10 + (7-1) = 10 + 6 = 16
Frequency Distribution Table
LL UL Tally f x cf< cf> cp< cp>
10 16 IIII 4 13 4 50 8% 100%
17 23 IIIII III 8 20 12 46 24% 92%
24 30 IIIII IIII 9 27 21 38 42% 76%
31 37 IIIII IIII 9 34 30 29 60% 58%
38 44 IIIII II 7 41 37 20 74% 40%
45 51 IIIII II 7 48 44 13 88% 26%
52 58 IIII 4 55 48 6 96% 12%
59 65 II 2 62 50 2 100% 4%
n 50
Other Important Elements of a Frequency Distribution
Table

1. Class Mark (x)


-It is the midpoint of a class interval
LL  UL
x= 2

2. Class Boundaries
-Also known as exact limits. The lower boundary can
be obtained by subtracting 0.5 from the lower limit of
the interval and adding 0.5 to the upper limit for the
upper boundary.
3. Relative Frequency
- Indicates how many percent of the data fall
within each category. A relative frequency (p) is
obtained by the using the formula:
f
p=  100%, f = frequency
n
n = total no. of cases

4. Cumulative Frequency
- Shows the accumulation of the
frequencies of the class intervals.
Two methods of accumulating
frequencies:

a. Less than cumulative frequency (cf<)


- Add the frequencies successively from the lowest
class interval to the highest class interval.

b. Greater than cumulative frequency (cf>)


- Add the frequencies successively from the highest
class interval to the lowest class interval.

Note: The highest cumulative frequency is equal to the


total number of cases (n).
5. Cumulative Percentage Frequency
- The percent form of the cumulative
frequency. The cumulative percentage frequency
(cp) is obtained by using the formula:
cf
cp = x 100 %
n
The cumulative percentage frequency provides data
as to how many percent of the data in a
distribution are above or below a certain class
interval.
Data: No. of Students per class in Statistics
at the ABC University.
45 44 23 50 25 35 20 49 40
45 54 55 50 45 36 38 50 42
55 55 50 45 47 45 28 51 51
46 41 52 52 50 25 35 35 35
40 40 45 45 46 46 47 29 36

Set up a frequency distribution table. Show


your computations for: Range, k, and i.
Graphical Presentation

Frequency Polygon
A graph between the class marks (x) representing
the different class interval and the frequencies (f ).
The frequency will be represented by a point in
the vertical axis and the class mark in the horizontal
axis. The vertices of the polygon are determined by
the ordered pairs (x, f) and the sides of the polygon
are formed by connecting consecutive vertices.
Frequency Histogram
A graph between the class boundaries and
the frequencies. The graph is made up of
connected bars.
Each bar represents a class interval with its
corresponding frequency. The height of a bar
is determined by the frequency and its base
is determined by class boundary.
Pie-Chart
A pie-chart is useful when presenting the sizes
of components that make up a certain whole entity.
The components represent the different class
intervals and the size of each component is
determined by the relative frequency.
To determine the size of each slice of the pie,
the relative frequency (p) is converted into degrees
(d ). Use the transformation:

d = p x 3.6 degrees
Cumulative Percentage Ogive
a. Less than cumulative percentage ogive
This graph is constructed by plotting the less
than cumulative percentages (cp<) against their
corresponding upper class boundaries.

b. Greater than cumulative percentage ogive


This graph is constructed by plotting the greater
than cumulative percentages (cp>) against their
corresponding lower class boundaries.

Remark: The ogive is used in estimating the percentage of the data


that falls below or above a given value in the distribution.
Samples of the
Different Statistical
Graphs
Frequency Polygon
5

0
Category Category Category Category Category Category
1 2 3 4 5 6
Frequency Histogram
5
4.5
4
3.5
3
2.5
2
1.5
1
0.5
0
Category 1
Pie-Chart
Sales

1st Qtr
2nd Qtr
3rd Qtr
4th Qtr
5th Qtr
Ogive
120

100

80

60

40

20

0
1 2 3 4 5 6
o ry o ry o ry o ry o ry o ry
eg g g g g g
at ate ate ate ate at
e
C C C C C C
Measures of Central Tendency
(Averages)
An Average is a single measure that best
represents the entire set of data.

Three types of Average:

1. Arithmetic Mean =average(AGES)


2. Median =Median(AGES)
3. Mode =Mode(AGES)
The Standard Deviation
The Standard Deviation is the most important
measure of variation. With the standard
deviation, we will be able to determine the
position of the scores in a frequency
distribution in relation to the mean.

Note that a lower standard deviation implies


the data in the distribution are spread out near
the mean. =STDEV(AGES)
Skewness of a Distribution
A distribution can be normal or skewed. To
determine whether a distribution is normal or
skewed, we can use the coefficient of
skewness (sk) to measure this. In formula:

EXCEL App: =skew(data)


Coefficient of Skewness (Skew)
sk = skew(AGES) = value

Interpretation:
Sk = 0 perfectly normal
Sk = (-0.05, +0.05) considered normal
Sk ≤ -0.05 negatively skewed
Sk ≥ +0.05 positively skewed
Standard Deviation (s)
Formula:
Sample Graphs

You might also like