Chapter 6
Chapter 6
CHAPTER 6
35 63 61 29 54 46 48 57 43 Example 4
60 39 40 51 68 38 56 53
The heights (in cm) of six students in your
If we arrange the marks in ascending class are
order of magnitude, we get
160, 157, 156, 161, 159, and 162.
29 35 38 39 40 43 46 48 51
53 54 56 57 60 61 63 68
What is the median height?
If we arrange heights in ascending
It follows that 8 students secured less order, we get
than 51 per cent marks and 8 secured
156 157 159 160 161 162
more than this percentage. Thus,
Median = 51 per cent. The two middle most values are the
1
Median is the central value in a sense 3 and 4th. The median = (159 + 160) =
rd
2
different from the arithmetic mean. In 159.5. Compare this with the arithmetic
case of the arithmetic mean it is mean = 159.2 (approx.).
the numerical magnitude of the
deviations that balances. But, for the 6. Characteristics of Median
median it is the number of values greater The main features of the median are as
than the median which balances against follows.
the number of values less than the (i) It is simple and easy to understand.
median. (ii) Like the arithmetic mean, median is
In general, if we have n values of x, rigidly defined.
they can be arranged in ascending order (iii) It is not affected by very large or very
as small values.
(iv) The median cannot be expressed as
x1 < x2 < … < xn. a simple function of the values of the
Suppose n is odd, then given set.
7. Mode
( )
Median = the n + 1 -th value.
2 Mode is defined as the value of the
However, if n is even, we have two variable which occurs most frequently.
middle values, viz., the The following is the frequency array
of ages of students in a cricket team in
There are 4 students of age 16. Since, expressed as a simple function of the
this is the most common age, with largest given values.
frequency, modal value is 16 years.
Sometimes, mode is more meaningful 9. Calculation of the Arithmetic Mean
average than the arithmetic mean or the for Ungrouped Data
median. Consider the following example.
Direct Method
Example 5 Let x 1, x 2 , …, x n be n values of some
variable x. The values have not been
A company which manufactures shoes for
grouped into class intervals. Then, by
adult men would like to concentrate on the
direct method, the arithmetic mean of x
production of shoes of only one size. What
is obtained as
should this size be?
1
n∑ i
The company conducted a survey of x= x;
2000 customers regarding their shoe size
and obtained the following distribution. i.e. we simply add up all values of x and
divide the sum by their number.
TABLE 6.3
Frequency Distribution of Shoe Size Assumed Mean Method
Shoe size Number of customers
Let A be the assumed mean. We obtain
5 55 the deviations of x from A, i.e.
6 164
7 360 d1 = x1 – A, d2 = x2 – A, ..., dn = xn – A.
8 528
9 410 Next, obtain the arithmetic mean of
10 260 deviations
11 137
12
Total
86
2000 d=
∑di
n
In this example, neither the and
arithmetic mean nor the median would be
an appropriate average. It would be better x = A + d.
to use mode. Since the largest frequency In other words, obtain the arithmetic
is 528 the mode is 8. It would be advisable mean of the deviations (d) and add A,
for the company to specialize in the which was earlier subtracted.
manufacture of shoes of size 8.
Step Deviation Method
8. Characteristics of Mode
Firstly, obtain deviations (d) from an
(i) Like the arithmetic mean and median, assumed mean.
mode too has an easy interpretation. Divide the deviations by a common
(ii) Mode is not affected by extreme factor, say h, to obtain
values because it depends only on the d1 d2 dn
largest frequency. d1' = , d2 ' = , ..., dn' =
(iii) Mode, like the median, cannot be h h h
MEASURES OF CENTRAL TENDENCY 53
Then gives
x = A + hd' x = 5550 + 1690 = 7240,
as before.
where
1 Step Deviation Method
n∑ i
d' = d' .
We divide the deviations (d) by a common
In other words, obtain the arithmetic factors, say 1000; and obtain
mean of d1', d2', …, dn'; then multiply it by
d1'=1, d2'=2, d3'=4, d4'= –1 and d5'= 2.45
h (because we had originally divided by h)
and add A (because we had earlier Now
subtracted A). ∑d'i 8.45
d' = = = 1.69;
5 5
Example 6 and
x = A + 1000 d'
Suppose the monthly incomes (x), in = 5550 + 1000 × 1.69
rupees, of 5 individuals, are given as = 7240
follows: as before.
6550, 7550, 9550, 4550 and 8000. We should note that taking deviations
Obtain their arithmetic mean. from an assumed mean, as well as the
step deviation method, help in reducing
Direct Method the burden of calculations a great deal.
They are particularly useful, if the
By direct method, we simply add all numbers are large.
incomes and divide by the number of
individuals. Thus, we obtain 10. Calculation of Arithmetic Mean for
36200 a Frequency Array
x= = 7240,
5 The variable x is discrete and takes values
i.e. the arithmetic mean of incomes of 5 x1, x2, …, xn with frequencies f1, f2, …, fn.
individuals is Rs 7240. (No class intervals are specified.)
Let us write the frequency array as
Calculation of the Arithmetic Mean by TABLE 6. 4
Assumed Mean Method Frequency Array
Next, obtain
1 Calculate the arithmetic mean of size
d = ∑ fidi of households.
N
and Direct Method
x = A + d.
Let us arrange the calculations as in the
Calculations of Arithmetic Mean by Step following table.
Deviation Method TABLE 6.6
Calculation of Arithmetic Mean
Let h be the common factor. (By Direct Method)
Divide each deviation (d) by h, and
obtain x f f×x
d1 dn 1 5 5
d1' = , ..., dn' = 2 15 30
h h 3 25 75
Then 4 35 140
1 5 10 50
d' = N∑ fid'i 6 5 30
7 3 21
and 8 2 16
x = A + hd' Total 100 367
In words, we multiply d' by h
(because we had divided by h earlier), Therefore, the arithmetic mean is
and add A (because earlier we had 367
x= = 3.67
subtracted A). 100
MEASURES OF CENTRAL TENDENCY 55
Therefore, Example 9
1 316000
d = ∑ fidi= – ≈ – 57.45 The frequency distribution of the monthly
N 5500
expenditure of 100 rural households is
and
given below.
x = A + d = 12442.55.
TABLE 6.12
Step Deviation Method Frequency Distribution of Monthly
Expenditure of 100 Households
Let us now calculate the arithmetic mean Monthly expenditure in rupees Number of
of incomes by step deviation method. The (class intervals) households
calculations are shown in the following
75 – 125 10
table. 125 – 175 22
TABLE 6.11 175 – 225 38
Calculation of Arithmetic Mean 225 – 275 16
(By Step Deviation Method) 275 – 325 8
(Assumed Mean = 12500 and Common Factor 500) 325 – 375 4
375 – 425 2
Income Deviations d f f × d' Total 100
x d = x – 12500 d' =
500
11500 -1000 -2 629 -1258 Calculate the arithmetic mean of
12000 -500 -1 1705 -1705
12500 0 0 1800 0 monthly expenditures by
13000 500 1 725 725 (a) direct method
13500 1000 2 420 840 (b) assumed mean method, and
14000 1500 3 127 381
14500 2000 4 85 340 (c) step deviation method.
15000 2500 5 9 45
d=
∑fidi = 500 = 5 Ungrouped Data
N 100 Given a set of values of the variable, we
arrange them in ascending (or,
and
descending) order of magnitude. Median
x = A + d = 200 + 5 = 205. is the middle most value.
If the number of values is odd, say 101,
(c) Step Deviation Method
median is the 101 + 1 = 51-st value. In
Looking at the Column (2) of the above 2
general, if N = 2m+1, the median is the
Table 6.14, we see that the common
(m + 1)-th value. Otherwise, if N = 2m is
factor is 50. In fact, this is the width of even, there are two middle most values,
class intervals. viz. the m-th and (m+1)-th values. Median
The calculations are as shown in the is the arithmetic mean of the two.
following table. We have considered examples before.
58 INTRODUCTION TO STATISTICS
EXERCISES
1. Compare the arithmetic mean, median and mode as measures of central tendency.
Describe situations where one is more suitable than the others.
2. The arithmetic mean is described as the centre of gravity of the distribution of
values of the variable. Explain.
3. Show that the sum of deviations of the values of the variable from their arithmetic
mean is equal to zero.
4. If x1, …, xn are n values of the variable x and A is any arbitrary value, so that
∑(xi – A) is the sum of deviations from A; show that ∑(xi – A) = n(x – A) where
1
x = ∑xi is the arithmetic mean of x. Hence ∑(xi–A) = 0 if and only if, A = x.
n
5. ‘Arithmetic mean is affected by very large and very small values, but median and
mode are not affected by them. Explain?
6. Daily expenditures on vegetables (in rupees) of 20 households, in a certain locality,
are given below.
25.00 26.50 30.25 28.00 23.00
31.40 34.00 33.00 30.50 27.20
28.00 35.00 38.60 34.00 22.50
24.00 23.70 28.00 29.00 32.20
Show that the value of the median lies between the arithmetic mean and the mode.
7. Calculate the arithmetic mean, median and mode for the following frequency
distribution.
Class Interval Frequency
1-3 3
3-5 10
5-7 16
7-9 13
9-11 8
11-13 3
13-15 1
Total 54
Examine their relative positions. Does the median lie between the arithmetic
mean and mode?
8. Out of 50 questionnaires distributed among 50 manufacturing firms only 35
replies were received. In the column inquiring about ‘capacity utilization’ the
figures read.
(In percentages)
54.2 96.8 74.2 57.8 99.7 84 81.2
94.3 93.7 61.6 100.0 110.1 68 71
95.2 98.3 84 91.9 84 94.1 73
84 52.6 49.2 45 62.6 75.1 84
69.2 90.7 85.4 87.3 66.1 73.9 89.3
62 INTRODUCTION TO STATISTICS
(a) Calculate the arithmetic mean, median and mode, and examine their relative
position.
(b) Group the data into appropriate classes, and again calculate the arithmetic
mean, median and mode from grouped data. How do your results differ from
those obtained in (a)? Explain.
9. In a certain examination there were 100 candidates of whom 21 failed, 6 secured
distinction, 43 were placed in the third division and 18 in the second division. It
is known that at least 75 per cent marks are required for distinction, at least 40
per cent for passing, at least 50 per cent for second division and at least 60 per
cent for first division.
Calculate the median of the distribution of marks.
10. Calculate the upper and lower quartiles for the following frequency distribution.
Class interval Frequency
13-25 6
25-37 11
37-49 23
49-61 7
61-73 3
Total 50
11. Calculate the median for the distribution in the Question No.10. Also, derive the
median geometrically, by drawing the two ogives and locating the point of
intersection.
12. The daily expenditure, in rupees, of 50 households is given as follows.
Daily expenditure (Rs) Number of households
100-150 3
150-200 9
200-300 21
300-500 10
500-1000 5
Above 1000 2
Total 50
[In this example the last class is open. Therefore, we cannot determine the mid-
value of that class. The arithmetic mean cannot be calculated for the given data.
However, we may set an upper limit on some basis and then obtain the arithmetic
mean. But, this must be regarded as an approximation. Calculation of median,
mode and quartiles do not pose any difficulty.]
(i) Calculate the arithmetic mean, median and mode. Compare the values.
(ii) Also, compute the upper and lower quartiles.