(Maa 4.4) Linear Regression
(Maa 4.4) Linear Regression
MAA
O. Practice questions
Which statement best represents the relationship between the two variables shown in
each of the scatter diagrams below.
(a) y (b) y
10 10
8 8
6 6
4 4
2 2
0 2 4 6 8 10 x 0 2 4 6 8 10 x
(c) y (d) y
10 10
8 8
6 6
4 4
2 2
0 2 4 6 8 10 x 0 2 4 6 8 10 x
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 1
[MAI 4.4] LINEAR REGRESSION
L1 : regression line of y on x
L2 : regression line of x on y
[2]
(c) For the relation between x and y ,
(i) write down the correlation coefficient
(ii) state a description of this relation [3]
(d) Use L1 to express x in term of y in the form x ay b . What do you notice? [3]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 2
[MAI 4.4] LINEAR REGRESSION
× × × ××
× × × ×
× × ×
× × × ××
× ×
×× ×× ×
× × × ××
×× × × ××
× × ×× × ×× × ×
××
× ×
1 time t 2 time t 3 time t
(a) State which of the diagrams indicate that the pair of variables
(i) is not correlated. (ii) shows strong linear correlation. [2]
(b) A student is given a piece of paper with five numbers written on it. She is told that
three of these numbers are the product moment correlation coefficients for the
three pairs of variables shown above. The five numbers are
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 3
[MAI 4.4] LINEAR REGRESSION
70
60
50
Width
(mm) 40
30
20
10
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 4
[MAI 4.4] LINEAR REGRESSION
40
30
GERMAN
20
10
0 10 20 30 40
FRENCH
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
.................................................................................................................................
Page 5
[MAI 4.4] LINEAR REGRESSION
100
90
80
70
Game 60
score 50
40 M
30
20
10
0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
Mathematics score
(c) Using your graph or otherwise, estimate the score Jane expects on the computer
game, giving your answer to the nearest whole number. [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 6
[MAI 4.4] LINEAR REGRESSION
8. [Maximum mark: 8]
The following table gives the heights and weights of five sixteen-year-old boys.
(a) Find (i) the mean height; (ii) the mean weight. [2]
(b) Plot the above data on the grid below and draw the line of best fit.
190
185
180
175
height
(cm)
170
165
160
0
60 65 70 75
weight (kg)
[4]
(c) Find the Pearson correlation coefficient r. [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 7
[MAI 4.4] LINEAR REGRESSION
Eight students in Mr. O'Neil's Physical Education class did pushups and situps. Their
results are shown in the following table.
Student 1 2 3 4 5 6 7 8
The graph below shows the results for the first seven students.
y
60
50
number
of 40
situps
(y) 30
20
10
O 10 20 30 40 50 60 x
number of pushups (x)
(a) Plot the results for the eighth student on the graph. [1]
(b) Find the equation of the regression line. [2]
(c) Find x and y , and draw a line of best fit on the graph. [4]
(d) A student can do 60 pushups. How many situps can the student be expected to
do? [1]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 8
[MAI 4.4] LINEAR REGRESSION
Student A B C D E F G H I J
Mathematics (x) 8.6 13.4 12.8 9.3 1.3 9.4 13.1 4.9 13.5 9.6
English (y) 33 51 30 48 12 23 46 18 36 50
(a) Find correct to two decimal places, the correlation coefficient (r). [2]
(b) Use your result from part (a) to comment on the statement:
'Those who do well in Mathematics also do well in English. [2]
..................................................................................................................................
..................................................................................................................................
(a) On the scatter diagram below, plot the remaining points. [2]
60
×
Fuel in litres
40
×
× ×
20
0
0 100 200 300 400 500 600 700 800 900 1000
Distance in km
The mean distance travelled is 421 km ( x ), and the mean amount of fuel in the tank is
28 litres ( y ). This point is plotted on the scatter diagram.
(b) Sketch the line of best fit. [3]
(c) A car travelled 350km. Use your line above to estimate the amount of fuel left in
the tank. [1]
..................................................................................................................................
..................................................................................................................................
Page 9
[MAI 4.4] LINEAR REGRESSION
Student 1 2 3 4 5 6 7 8 9 10
Height
155 161 173 150 182 165 170 185 175 145
x cm
Weight
50 75 80 46 81 79 64 92 74 108
y kg
(b) Calculate the mean height and the mean wight [2]
(c) (i) Find the equation of the line of best fit.
(ii) Draw the line of best fit on your graph. [3]
(d) Use your line to estimate
(i) the weight of a student of height 190 cm;
(ii) the height of a student of weight 72 kg. [2]
(e) It is decided to remove the data for student number 10 from all calculations.
Explain briefly what effect this will have on the line of best fit. [1]
Page 10
[MAI 4.4] LINEAR REGRESSION
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 11
[MAI 4.4] LINEAR REGRESSION
(a) Write down the equation of the regression line of shoe size (y) on height (x),
giving your answer in the form y = mx + c. [2]
(b) State an interpretation for the coefficient m of the regression line in (a). [2]
(c) A student is is 162 cm in height
(i) Use your equation in part (a) to predict the shoe size of the student.
(ii) Is this an interpolation or extrapolation? Explain. [3]
(d) Write down the correlation coefficient. [1]
(e) Describe the correlation between height and shoe size. [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 12
[MAI 4.4] LINEAR REGRESSION
Age
32 40 21 45 24 19 17 21 27 54 33 37 23 45 18
(x)
Time (in h)
10 12 8 15 7 8 6 9 11 16 10 13 9 17 5
(y)
(b) Write down the equation of the regression line for y on x in the form y = ax + b. [2]
(c) Use your equation for the regression line to predict
(i) the time that it would take a 30 year old person to reach proficiency, giving
your answer correct to the nearest hour;
(ii) the age of a person who would take 8 hours to reach proficiency, giving
your answer correct to the nearest year. [4]
(d) Find an estimation for the age of the person in question (c)( ii) by using the
regression line of x on y. [4]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 13
[MAI 4.4] LINEAR REGRESSION
(a) Calculate the mean and the standard deviation of the prices
(i) in 1992;
(ii) in 2002. [4]
(b) (i) Find the correlation coefficient.
(ii) Comment on the relationship between the prices. [3]
(c) Find the equation of the line of the best fit in the form y = mx + c. [2]
(d) What would you expect to pay now for an item costing $2.60 in 1992? [1]
(e) Which item would you omit to increase the correlation coefficient? [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 14
[MAI 4.4] LINEAR REGRESSION
(c) Find the equation of the regression line of y on x in the form y = ax + b. [2]
(d) Graph this line on the above graph. [2]
(e) Given that a student receives an 88 on the mathematics test, what would you
expect this student's science score to be? Show how you arrived at your result. [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 15
[MAI 4.4] LINEAR REGRESSION
The table below shows the data for x in increasing order and the corresponding ranks.
38 60 1 1
56 80 2 6
58 78 3 5
73 65 4.5 2
73 90 4.5 9
80 85 6 7.5
90 70 8 3
90 71 8 4
90 85 8 7.5
95 96 10 10
(d) The correlation coefficient between the ranks is known as Spearman rank
correlation coefficient rs. Find its value. [2]
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
..................................................................................................................................
Page 16