0% found this document useful (0 votes)

19 views7 pages

Descriptive Statistics-1

Descriptive statistics

Uploaded by

bongani mungadze

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views7 pages

Descriptive Statistics-1

Descriptive statistics

Uploaded by

bongani mungadze

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

DESCRIPTIVE STATISTICS-MEASURES OF CENTRAL TENDENCY AND DISPERSION

INTRODUCTION

The major focus is descriptive statistics which is used to describe the basic features of the data in a study.
They provide simple summaries about the sample and the measures. Together with simple graphics
analysis studied in unit 2, they form the basis of virtually every quantitative analysis of data.

Descriptive Statistics are used to present quantitative descriptions in a manageable form. In a research
study we may have lots of measures. Descriptive statistics help us to simplify large amounts of data in a
sensible way. Each descriptive statistic reduces lots of data into a simpler summary. In this unit we are
going the concentrate on the measures of central tendency (mean, mode and median) and measures of
dispersion (range, standard deviation and coefficient of variation) as measures that provide a summary of
any given quantitative data.

MEASURE OF CENTRAL TENDENCY

The central tendency of a distribution is an estimate of the "centre" of a distribution of values. There are
three major types of estimates of central tendency: Mean. Mode and media

The Mean or average is probably the most commonly used method of describing central tendency
denoted by x. It lends itself to subsequent analysis because it includes all values in the universe but may
not coincide with any value and in certain instances may be unrepresentative due to extreme numbers. We
compute the mean adding up all the values and then divide by the number of values.

We have two formulas that are used to compute the mean:

In mathematical terms, the general formula is denoted by:

Where n is the sample size and the x correspond to the observed valued

For ungrouped data, the formula is: Σx/n

Mean of grouped data   fx

f

Where n= no. of values, f= no of values in an interval and x = midpoint of class interval.

Example

1. Consider the yields obtained by a farmer for his maize enterprise for the past 10 year

Season Yield of maize in Tonnes

2000-2001 16
2001-2002 13
2002-2003 25
2003-2004 24
2004-2005 18
2005-2006 18
2006-2007 12
2007-2008 15
2008-2009 19
2009-2010 26
Total 186

Mean= Σx/n=16+13+25+24+18+18+12+15+19+26
= 18.6
10

The mode is the most frequently occurring value in the set of scores. To determine the mode, you must
order the yields shown in above table, and then count each one.

12, 13, 15, 16,18,18,19,24,25,26

The most frequently occurring value is the mode. In our example, the value 18 occurs twice and is the
model. In some distributions there is more than one modal value. For instance, in a bimodal distribution
there are two values that occur most frequently. If the distribution is truly normal (i.e., bell-shaped), the
mean, median and mode are all equal to each other.

The Median is the score found at the exact middle of the set of values. More precisely, the median is any
middle value in order of size, if n is odd, or the mean of the two middle numbers if n is even. Median is
more representative, when data contain a few very large numbers or small values although it cannot be
used for subsequent calculation unlike the mean. One way to compute the median is to list all scores in
numerical order, and then locate the score in the centre of the sample. For example, if there are 500 scores
in the list, score #250 would be the median. If we order the 10 yields shown above, we would get:

12, 13, 15, 16,18,18,19,24,25,26

There are 10scores and score #5 and #6 represent the halfway point. Since both of these scores are 18, the
median is 18. If the two middle scores had different values, you would have to interpolate to determine
the median.
Example

The dairy herd was weighed and the results were tabulated in the table below:

Live-weight in (KG) Frequency

150-154 08
155-159 16
160-164 43
165-170 29
170-174 04

Calculate the mean weight of the herd.

Working:

1. The first step is to find the midpoints of each weight category (x).

2. Multiply by frequencies of each category (fx)

3. Sum all products of fx

4. Divide by total frequency (Σf)

This can be summarised in a tabular form below:

X= lower limit plus upper limit divided by 2

Live-weight in (KG) Mid point Frequency Fx

(x) (f)
150-154 152 08 1 216
155-159 157 16 2 512
160-164 162 43 6 866
165-169 167 29 4 843
170-174 172 04 688
Total 100 16 125

Mean of grouped data

 fx
f
16 125
=
100

=161.25Kg

The mean weight was found to be 161.25kg for the dairy herd.

MEASURES OF DISPERSION.

Dispersion refers to the spread/ variation/ scatter of the values around the central tendency. This is vital
for:

i) Assessing reliability of the averages of the data.

ii) Serves as a basis for control of variability e.g. in quality control that assess variations in the products.

There are two common measures of dispersion, the range and the standard deviation.

The range is the simplest measure of dispersion which is calculated by simply taking the difference
between the maximum and minimum values in the data set. However, the range only provides
information about the maximum and minimum values and does not say anything about the values in
between.

The Standard Deviation is a more accurate and detailed estimate of dispersion because an outlier can
greatly exaggerate the range. The Standard Deviation shows the relation that set of scores has to the mean
of the sample.

We have different formula used to compute the standard deviation and these are:

For grouped data:

Or alternatively it can be given as:

 fx  fx 
2
2 
 
f   f 

Where +
x is the variable

f is the frequency of responses

VARIANCE

It is defined as sum of squared deviations from the mean. The general formula is given as
3.3.5 VARIANCE AND STANDARD DEVIATION:

Step by Step Simple calculation:

a. Calculate the mean, x.

b. Write a table that subtracts the mean from each observed value.
c. Square each of the differences.
d. Add this column.
e. Divide by n -1 where n is the number of items in the sample. This is the variance.
f. To get the standard deviation we take the square root of the variance

Although this computation may seem convoluted, it's actually quite simple.

The table below is a summary of the steps above using the ungroup data example:

X x - 49.2 (x - 49.2 )2
15 -5.875 34.515625
20 -0.875 0.765625
21 0.125 0.015625
20 -0.875 0.765625
36 15.125 228.765625
15 -5.875 34.516525
25 4.125 17.015625
15 5.875 34.515625
Total Σ= 350.875

Now, s2 = 350.875
= 50.125
8-1

S = √50.125 =7.07990112953

The standard deviation allows some conclusions about specific scores in our distribution.

Assuming that the distribution of scores is normal or bell-shaped (or close to it!), the following
conclusions can be reached:

 approximately 68% of the scores in the sample fall within one standard deviation of the mean
 approximately 95% of the scores in the sample fall within two standard deviations of the mean
 approximately 99% of the scores in the sample fall within three standard deviations of the mean

For instance, since the mean in our example is 20.875 and the standard deviation is 7.0799, an estimation
can be drawn from the above that approximately 95% of the scores will fall in the range of 20.875-
(2*7.0799) to 20.875+(2*7.0799) or between 6.7152 and 35.0348. This kind of information is a critical
stepping stone to enabling comparison between the performances of an individual on one variable with
their performance on another, even when the variables are measured on entirely different scale

 fx   fx 
2 2

standard deviation of grouped data   

f   f 

The sample standard deviation will be denoted by s and the population standard deviation will be denoted
by the Greek letter s.

The sample variance will be denoted by s2 and the population variance will be denoted by s2.

The variance and standard deviation describe how spread out the data is. If the data all lies close to the
mean, then the standard deviation will be small, while if the data is spread out over a large range of
values, s will be large. Having outliers will increase the standard deviation.

One of the flaws involved with the standard deviation, is that it depends on the units that are used. One
way of handling this difficulty, is called the coefficient of variation which is the standard deviation
divided by the mean times 100%

S
CV= x100%
m

In the above example, it is

17
x100% = 34.6%
49.2

CONCLUSION

Measures of central tendency/ location are estimates of centre of distribution of values. These are the
mean, the mode and the median.

Mean is commonly used measure of location and it lends itself to subsequent analysis since it includes all
values.

Dispersion measures the variation/scatter of values around the central tendency. The measures of
dispersion include standard deviation, variance, coefficient of variation and the range.

Standard deviation is more accurate and detailed estimate of dispersion and it shows the relation that a set
of values has to the mean.

Formulae
Mean Mode Median
Most frequently occurring value For un grouped data: any middle
For ungrouped data in the set of scores value in order of size if n is odd
or the mean of two middle values
Σx if n is even
=
n

Mean Of Grouped Data 

 fx
f

Measures of dispersion

Range Variance Standard deviation Coefficient of

variation
Highest value s
minus lowest CV= x100%
m
value

3.8 ACTIVITY
1. State the three measures of central tendency?
2. State the most important measure of location and give a reason(s)
3. State the mean formula of the grouped data.
4. Define the term dispersion and give the formula for the standard deviation.
5. Evaluate the relationships that do exist between standard deviation, variance, mean and coefficient
of variation.
6. Find the mean, mode, median, standard deviation and relative dispersion of the following data
which is the maize height distribution in field
Height Frequency

153- 157 04
158- 162 11
163- 167 20
168- 172 24
173- 177 17
178- 182 4

Measures of Central Tendency and Variability
No ratings yet
Measures of Central Tendency and Variability
38 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
41 pages
Measures of Central Tendency and Dispersion
100% (1)
Measures of Central Tendency and Dispersion
7 pages
Lesson 6c, 7, 8-Print
No ratings yet
Lesson 6c, 7, 8-Print
5 pages
Measure of Central Tendency and Variability
No ratings yet
Measure of Central Tendency and Variability
73 pages
Biostatistics 5
No ratings yet
Biostatistics 5
28 pages
Quantitative Methods Module
No ratings yet
Quantitative Methods Module
22 pages
Descreptive Statistics 1
No ratings yet
Descreptive Statistics 1
74 pages
Measures of Dispersion in Statistics
No ratings yet
Measures of Dispersion in Statistics
26 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
31 pages
Measures of Central Tendency and Dispersion
No ratings yet
Measures of Central Tendency and Dispersion
30 pages
Lesson 6c, 7, 8
No ratings yet
Lesson 6c, 7, 8
46 pages
Measures of Central Tendency and Dispersion/ Variability
No ratings yet
Measures of Central Tendency and Dispersion/ Variability
35 pages
Lecture III-Measures of Dispersion
No ratings yet
Lecture III-Measures of Dispersion
33 pages
Central Tendency & Variability
No ratings yet
Central Tendency & Variability
5 pages
Math2101Stat 2 2
No ratings yet
Math2101Stat 2 2
23 pages
Data Presentation
No ratings yet
Data Presentation
104 pages
Central Tendency
No ratings yet
Central Tendency
11 pages
Basic Statistics
No ratings yet
Basic Statistics
24 pages
Central Tendency + Dispersion
No ratings yet
Central Tendency + Dispersion
28 pages
Inbound 6077067452405302583
No ratings yet
Inbound 6077067452405302583
4 pages
Statistics: Mean & Variance Basics
No ratings yet
Statistics: Mean & Variance Basics
3 pages
Lecture 4. Dispersion
No ratings yet
Lecture 4. Dispersion
6 pages
1 Descriptive Statistics
No ratings yet
1 Descriptive Statistics
23 pages
Descriptive Statistics MBA
100% (3)
Descriptive Statistics MBA
7 pages
Measure of Dispression
100% (1)
Measure of Dispression
36 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
7 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
55 pages
Intro to Descriptive Statistics
No ratings yet
Intro to Descriptive Statistics
9 pages
Chapt3 Overheads
No ratings yet
Chapt3 Overheads
8 pages
Descriptive Statistics & Probability Guide
No ratings yet
Descriptive Statistics & Probability Guide
510 pages
Central Tendency - HU 2023
No ratings yet
Central Tendency - HU 2023
48 pages
Chapter 4 Basic Statistics
No ratings yet
Chapter 4 Basic Statistics
22 pages
2 Mean Median Mode Variance
No ratings yet
2 Mean Median Mode Variance
29 pages
Mohan Maths
No ratings yet
Mohan Maths
16 pages
Unit 4 & 5 8614
No ratings yet
Unit 4 & 5 8614
58 pages
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
No ratings yet
St130: Basic Statistics Week 3: Lecture: School of Computing Information and Mathematical Sciences
62 pages
Ed216 Chapter 7
No ratings yet
Ed216 Chapter 7
31 pages
CH 4
No ratings yet
CH 4
6 pages
Assessment1 FIN
No ratings yet
Assessment1 FIN
9 pages
Measures of Dispersion and Relative Standing
No ratings yet
Measures of Dispersion and Relative Standing
11 pages
4.measures of Dispersion
No ratings yet
4.measures of Dispersion
7 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
CHAPTER 5 Biostatistics Measureof Dispersion-31!12!2024
No ratings yet
CHAPTER 5 Biostatistics Measureof Dispersion-31!12!2024
27 pages
2830a Lecture 3
No ratings yet
2830a Lecture 3
68 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
Chapter 3 - Data Presentation
100% (1)
Chapter 3 - Data Presentation
40 pages
Measures of Dispersion Guide
100% (1)
Measures of Dispersion Guide
11 pages
Stats
No ratings yet
Stats
3 pages
Topic1 3
No ratings yet
Topic1 3
41 pages
Methods of Center Measurement: X N X X X
No ratings yet
Methods of Center Measurement: X N X X X
85 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
93 pages
Measures of Dispersion
100% (1)
Measures of Dispersion
13 pages
Central Tendency and Dispersion
No ratings yet
Central Tendency and Dispersion
8 pages
Lecture 3 Numerical Measures of Data
No ratings yet
Lecture 3 Numerical Measures of Data
36 pages
MTH302 Midterm Solved Subjective With Reference by Uzair
No ratings yet
MTH302 Midterm Solved Subjective With Reference by Uzair
30 pages
Lecture 4 Introduction To Kriging
No ratings yet
Lecture 4 Introduction To Kriging
55 pages
RCBD for Nitrogen Sources on Rice Yield
No ratings yet
RCBD for Nitrogen Sources on Rice Yield
8 pages
SPSS Variables & Measurement Guide
0% (2)
SPSS Variables & Measurement Guide
1 page
Methods in Behavioural Research 3rd Edition Paul C. Cozby - Ebook PDF PDF Download
100% (1)
Methods in Behavioural Research 3rd Edition Paul C. Cozby - Ebook PDF PDF Download
85 pages
Single Variable Calculus Robert Lopez Download
100% (6)
Single Variable Calculus Robert Lopez Download
94 pages
STAT 11 Summative Reviewer
No ratings yet
STAT 11 Summative Reviewer
4 pages
Chapter 03
No ratings yet
Chapter 03
74 pages
Ej 1408181
No ratings yet
Ej 1408181
20 pages
PAPEL PANG Book Bind
No ratings yet
PAPEL PANG Book Bind
52 pages
Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 5 - Multivariate and Time Series Analysis
No ratings yet
Data Exploration and Visualization - AD3301 - Important Questions With Answer - Unit 5 - Multivariate and Time Series Analysis
8 pages
Employability Skills Initiatives in Higher Education - What Effects Do They Have On Graduate Labour Market Outcomes
No ratings yet
Employability Skills Initiatives in Higher Education - What Effects Do They Have On Graduate Labour Market Outcomes
32 pages
Measures of Variability-Ungrouped Data
50% (2)
Measures of Variability-Ungrouped Data
10 pages
Poisson Distribution Explained
0% (1)
Poisson Distribution Explained
22 pages
Bootstrap Methods With Applications in R All-in-One Download
No ratings yet
Bootstrap Methods With Applications in R All-in-One Download
14 pages
Processes-I: Unit 7 Renewal
No ratings yet
Processes-I: Unit 7 Renewal
13 pages
Kamruzzaman, 2012
No ratings yet
Kamruzzaman, 2012
9 pages
Diagnostic Research Design Guide
No ratings yet
Diagnostic Research Design Guide
6 pages
Foucault and Postmodern Conceptions of Reason 1st Edition Laurence Barry Digital Download
No ratings yet
Foucault and Postmodern Conceptions of Reason 1st Edition Laurence Barry Digital Download
142 pages
Islamic University Faculty of Engineering
No ratings yet
Islamic University Faculty of Engineering
4 pages
Jail Case Analysis
100% (2)
Jail Case Analysis
24 pages
Ib Maths Coursework Examples
100% (3)
Ib Maths Coursework Examples
8 pages
Hospital Accreditation Impact On Healthcare Quality Dimensions A Systematic Review
No ratings yet
Hospital Accreditation Impact On Healthcare Quality Dimensions A Systematic Review
14 pages
Fault Diagnosis Based On DPCA and CA (Full Paper) : December 2013
No ratings yet
Fault Diagnosis Based On DPCA and CA (Full Paper) : December 2013
6 pages
Final Stat130 Exam
No ratings yet
Final Stat130 Exam
18 pages
CS Class XI Revision
No ratings yet
CS Class XI Revision
44 pages
(Ebook PDF) Elementary Statistics: A Step by Step Approach 10th Edition Instant Download
50% (2)
(Ebook PDF) Elementary Statistics: A Step by Step Approach 10th Edition Instant Download
56 pages
SANDYA VB TIME SERIES FORECASTING PROJECT - HTML PDF
90% (20)
SANDYA VB TIME SERIES FORECASTING PROJECT - HTML PDF
196 pages
ISE Ethics in Engineering 5th Edition Mike Martin Prof. Updated 2025
No ratings yet
ISE Ethics in Engineering 5th Edition Mike Martin Prof. Updated 2025
115 pages
Week-9 Discrete Probability Distributions
No ratings yet
Week-9 Discrete Probability Distributions
97 pages

Descriptive Statistics-1

Uploaded by

Descriptive Statistics-1

Uploaded by

DESCRIPTIVE STATISTICS-MEASURES OF CENTRAL TENDENCY AND DISPERSION

MEASURE OF CENTRAL TENDENCY

We have two formulas that are used to compute the mean:

In mathematical terms, the general formula is denoted by:

For ungrouped data, the formula is: Σx/n

Mean of grouped data   fx

Where n= no. of values, f= no of values in an interval and x = midpoint of class interval.

Season Yield of maize in Tonnes

12, 13, 15, 16,18,18,19,24,25,26

12, 13, 15, 16,18,18,19,24,25,26

Live-weight in (KG) Frequency

Calculate the mean weight of the herd.

2. Multiply by frequencies of each category (fx)

3. Sum all products of fx

4. Divide by total frequency (Σf)

This can be summarised in a tabular form below:

X= lower limit plus upper limit divided by 2

Live-weight in (KG) Mid point Frequency Fx

Mean of grouped data

i) Assessing reliability of the averages of the data.

For grouped data:

Or alternatively it can be given as:

f is the frequency of responses

Step by Step Simple calculation:

a. Calculate the mean, x.

standard deviation of grouped data   

In the above example, it is

Mean Of Grouped Data 

Range Variance Standard deviation Coefficient of

You might also like