REPUBLIC OF CAMEROON REPUBLIQUE DU CAMEROUN
Peace- Work- Fatherland Paix- Travail- Patrie
………………………… ……………………..
UNIVERSITY OF BAMENDA UNIVERSITE DE BAMENDA
………………………… ……………………
NATIONAL HIGHER POLYTECHNIC ECOLE SUPERIURE NATINALE
INSTITUTE (N.A.H.P.I) POLYTECHNIQUE(E.S.N.P)
…………………………….. ……………………
BAMBILI–BAMENDA BAMBILI–BAMENDA
TOPIC: CALCULATING THE MEAN, MEDIAN, AND MODE OF
GROUPED AND UNGROUPED DATA
COURSE INSTRUCTOR: Dr. NDUKUM PASCALINE
PRESENTED BY : GROUP 4
LEVEL 400
SCHOOL : NAHPI
DEPARTMENT : COME
COURSE TITLE: DATA ANALYSIS
COURSE CODE: COME4108
Name Matricule
RON VAN NTUMSI NGANG UBa21E0436
SAHLA SANDRINE NYUYKIKFE UBa21E0056
SHENYUY GOODNESS-DIVINE TATA UBa21E0057
SIBEN CLETUS WIYSANYUY UBa21E0058
SIGNE JOSUE EMMANUEL UBa21E0059
SUKA NIBA NDE UBa21E0437
TAMABANG REMY GAFORBE UBa21E0438
TCHUIFO AKO ERASTUS UBa21E0439
TEBIT ODILIA ANWEI UBa21E0440
TENENG CEDRIC MULUH UBa21E0060
TIFUHAGOH SHAQUILLE FORD MBA UBa21E0061
TUMASANG MANOEL ACHIBE UBa21E0062
WANDJI JUNIOR FERNANDEZ AWAFOR UBa21E0064
WANGO ADAMU NJINJOH UBa21E0065
WUYIKA FABRICE KIDZENWA UBa21E0067
YVAN HAROLD KWEMO TCHIENGA UBa21E0068
NDIFON TITIANA SIH UBa23E3040
TCHINDA TCHOFFO MEGANE UBa23E3041
MASSODA TATIANA LOIS NGO UBa20E0037
DEFINITION OF MEAN, MEDIAN AND MODE
Mean
The mean, also known as the average, is a measure of central tendency in a data set. To
calculate the mean, you add up all the values in the data set and then divide the sum by the
number of values.
In general, the mean of the set: {𝑥1 , 𝑥2 , 𝑥3 , … , 𝑥𝑛 } is
𝑥1 + 𝑥2 +𝑥3 +⋯+𝑥𝑛
𝑥̅ =
𝑛
For example, consider the data set {2, 4, 4, 4, 5, 5, 7, 9} . The mean would be calculated as follows:
2+4+4+4+5+5+7+9
𝑥̅ = =5
8
So, the mean of this data set is 5. The mean provides a measure of central tendency and is useful for
summarizing a set of values with a single representative value.
Median
The median is another measure of central tendency in a data set. To find the median, you first need to
arrange the values in the data set in ascending order. Then, if the number of values (n) is odd, the
median is the middle value. If the number of values is even, the median is the average of the two middle
values.
For example, consider the data set {4, 9, 4, 5, 2, 5, 4, 7}. After arranging the values in ascending order,
you get {2, 4, 4, 4, 5, 5, 7, 9}. Since there are 8 values (an even number), the median is the average of the
two middle values, which are 4 and 5:
4+5 9
Median = = = 4.5
2 2
So, in this case, the median of the data set is 4.5. The median is less sensitive to extreme values than the
mean and can be a useful measure of central tendency, especially when dealing with skewed
distributions or data sets with outliers.
Mode
The mode of a data set is the value (or values) that appear most frequently. In other words, it is the
value that occurs with the highest frequency in a data set.
A data set may have one mode, more than one mode, or no mode at all.
For example, consider the data set {2, 4, 4, 4, 5, 5, 7, 9}. In this case, the mode is 4 because it appears
more frequently than any other value.
In a situation where there are multiple values with the same highest frequency, the data set is said to be
multimodal, and it has more than one mode. If no value is repeated, the data set is considered to have no
mode.
It's worth noting that unlike the mean and median, the mode does not necessarily provide a measure of
central tendency. It simply identifies the most frequently occurring values in a data set.
Ungrouped Data
Data set from page 551, exercise 3.
Determine for the 10 measurements of lengths
shown below:
(a) the arithmetic mean, (b) the median, (c) the mode, and (d) the standard deviation.
28 m, 20 m, 32 m, 44 m, 28 m, 30 m, 30 m, 26 m, 28 m and 34 m
Mean
∑𝑥 Sum of given data
𝑥̅ = =
𝑛 number of give data
28 + 20 +32 + 44 + 28 + 30 + 30 + 26 + 28 + 34
𝑥̅ =
10
𝑥̅ = 30 m
Median
Arranging data in ascending order.
20 m, 26 m, 28 m, 28 m, 28 m, 30 m, 30 m, 32 m, 34 m, 44 m
since the we have an even data set, the median is the mean of the two meddle values.
Median = (28 + 30) / 2
Medain = 29 m
Mode
The number with the highest occurance is 28 m
Hence, Mode = 28 m
GROUPED DATA
Data from exercise 4
The heights of 100 people are measured correct to
the nearest centimetre with the following results:
150 – 157 5,
158 – 165 18,
166 – 173 42,
174 – 181 27,
182 – 189 8.
Data Organization
Class limits Class Class Frequency (f) f.x Cumulative
(Heights) Boundary midpoints (x) Frequency (cf)
150 - 157 149.5 – 157.5 153.5 5 767.5 5
158 - 165 157.5 – 165.5 161.5 18 2907 23
166 - 173 165.5 – 173.5 169.5 42 7119 65
174 - 181 173.5 – 181.5 177.5 27 4792.5 92
182 - 189 181.5 – 189.5 185.5 8 1484 100
Σf = 100 Σ(f.x)=
17070
Mean
𝑠𝑢𝑚 𝑜𝑓 (𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑖𝑒𝑠 × 𝑐𝑙𝑎𝑠𝑠 𝑚𝑖𝑑𝑝𝑜𝑖𝑛𝑡) Σ(fx)
Mean = =
𝑠𝑢𝑚 𝑜𝑓 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑖𝑒𝑠 Σf
17070
= 100
Mean = 170.7 cm
Median
Let Lcbm = lower class boundary for the modal class
Cfbmc = Cumulative frequency of class before modal class
W = Class width = 8
fm = frequency of modal class = 42
Σf
− 𝐶𝑓𝑏𝑚𝑐
2
median = 𝐿𝑐𝑏𝑚 + ( )𝑊
𝑓𝑚
100
– 23
2
median = 165.5 + ( )8
42
median = 170.6 cm
Mode
modal class = Class with the highest frequency
Let ∆1 = difference between the frequency of the modal class and the class above it.
∆2 = difference between the frequency of the modal class and the class below it
∆1
mode = 𝐿𝑐𝑏𝑚 + ( )𝑊
∆1 − ∆ 2
42−18
mode = 165.5 + ( )8
(42−18) − (42−27)
mode = 170.4cm
Significants of the Mean, Median and Mode, and how to choose an
appropriate of measure of center for a given data set
The mean, median, and mode are measures of central tendency, and each provides insight into different
aspects of a data set. The choice of which measure to use depends on the nature of the data and the
specific characteristics of the distribution.
Mean:
Significance: The mean is the average of all the values in a data set.
Use when: The mean is most appropriate when the data is approximately symmetrically
distributed and does not have extreme outliers. It is sensitive to extreme values and may not be
the best measure if the data set is skewed.
Median:
Significance: The median is the middle value in a data set when it is ordered. It is not affected
by extreme values (outliers) and is a measure of the central position.
Use when: The median is useful when the data set is skewed, has outliers, or is not normally
distributed. It provides a better representation of central tendency in such cases.
Mode:
Significance: The mode is the value(s) that occur most frequently in a data set.
Use when: The mode is suitable for categorical data or discrete data sets. It is also useful when
identifying the most common response or category is important. In some cases, a data set may
have no mode, or it may be multimodal (having more than one mode).
In summary:
Use the mean when the data is approximately normally distributed and there are no significant outliers.
Use the median when the data is skewed or contains outliers, providing a robust measure of central
tendency.
Use the mode when identifying the most frequently occurring category or value is essential, especially
for categorical or discrete data.