2.
3 Frequency tables and grouping data
1. The table below shows the height of a group of 60 students
Height (h cm) Number of students Cumulative
frequency
140-145 3
145-150 15
150-155 20
155-160 11
160-165 9
165-170 2
a) Complete the cumulative frequency column
b) Write down the class boundaries for the third class
c) Work out the mid-point of the fifth group
d) Write down the class width of the second class
2. The time taken by a random sample of females to run 400 m is measured and shown in a table below.
Write down the class boundaries, mid-point and the class width for the class 71-75
Time (sec) frequency Cumulative
frequency
56-65 2
66-70 25
71-75 30
76-90 13
91-95 5
a) Complete the cumulative frequency column
b) Write down the class boundaries for the second class
c) Work out the mid-point of the fifth group
d) Write down the class width of the second class
2.4 Measures of location
Mode: the value that occurs most often
Median: middle value when the data is arranged in order
Mean: the sum of all the observations divided by the number of observations
Example 1
Following are hourly rate pay of the company for different level of workers
£10 £10 £12 £10 £8 £10 £20
Find the mode, median and the mean of the following data
Combining means
If set A, of size 𝑛1 , has the mean 𝑥1 and set B of size 𝑛2 , has the mean 𝑥2 , then the combined mean of the
set A and B is
𝑛1 × 𝑥1 + 𝑛2 × 𝑥2
𝑥=
𝑛1 + 𝑛2
Example 2
The mean of 20 observations is 3.5 and the mean of 15 observations is 4.2. Calculate the mean of all 35
observations
Example 3
The mean marks for a statistical exam were worked out for three classes. Class 1 had 12 students with a
mean mark of 78%. Class 2 had 16 students with a mean mark of 84%. Class 3 had 18 students with a mean
mark of 54%. Work out the mean % mark to the nearest whole number for all 46 students
2.5 Which is the correct measure of location
Mode: when data is qualitative or quantitative with either single mode or bimodal.
Median: used for quantitative data. It is used when there are extreme values.
Mean: used for quantitative data and used for all the pieces of data. It is effected by extreme values
2.6 Measures of location of a frequency distribution table
Example 6
Hassan records the collar size (x) of the male students in his grade. The results are shown below.
Collar size (x) Frequency
15 3
15.5 17
16 29
16.5 34
17 12
Find a) the mode b) the median c) the mean
2.7 estimation of mean, mode and median of grouped data.
̅ where:
The mean of sample of data that is summarized as a grouped frequency distribution is 𝒙
∑ 𝒇𝒙
̅=
𝒙 ∑𝒇
and 𝒙 is the mid-point of the group
To find the median divide 𝒏 by 2 and use interpolation to find the value of corresponding term.
The model class is the class with the highest frequency
Example 7
The length 𝑥 mm, to the nearest mm, of a random sample of pine cones is measured. The data is shown
below.
Length of pine Number of pine Cumulative
cone (mm) cone, 𝑓 frequency
30-31 2
32-33 25
34-36 30
37-37 13
a) Write down the model class. b) Estimate the mean length of pine cone
c) Estimate the median length of pine cone
Example 8
The number of questions answered correctly by children taking a general knowledge test are shown in the
following frequency distribution.
Number of Frequency
correct answers
0-5 4
6-10 15
11-15 5
16-20 2
21-60 0
61-70 1
a) Write down the model class. b) Estimate the mean number of correct answers
c) Estimate the median number of correct answers
2.8 coding
𝒙−𝒂
Coding is normally of the form 𝒚 = where a and b are to be chosen.
𝒃
To find the mean of the original data: find the mean of the coded data, equate this to the coding used
and solve.
∑ 𝒇𝒚
To find the mean of coded grouped data, mean =
∑𝒇
Example 9
a) Find the mean of the following heights, 𝑥 cm.
30 40 50 60 70 80
b) Using the following coding to find the mean of these data.
𝑥 𝑥−20
i) 𝑦 = ii) 𝑦 = iii) 𝑦 = 𝑥 − 25
10 10
c) Using your answer to a, write down the mean of
i) 60, 80, 100, 120, 140 and 160 ii) 15, 20, 25, 30, 35 and 40
Example 10
𝑥−165
Use the coding 𝑦 = to find the mean weight of the following distribution
10
Weight (kg) Frequency Mid-point (x) 𝑥 − 165
𝑦=
10
140-150 9
150-160 21
160-170 37
170-180 28
180-190 11
Example 11
𝑥−45
Data is coded using 𝑦 = . The mean of the coded data is 2.55. Find the mean of the original data
25