CONFIDENTIAL CSIDEC 2018/STA408
 
UNIVERSITI TEKNOLOGI MARA
FINAL EXAMINATION
COURSE STATISTICS FOR SCIENCE AND ENGINEERING
COURSE CODE STA408
EXAMINATION : DECEMBER 2018
3 HOURS
 
INSTRUCTIONS TO CANDIDATES
1 This question paper consists of five (5) questions.
2 ‘Answer ALL questions in the Answer Booklet. Start each answer on a new page.
3 Do not bring any material into the examination room unless permission is given by the
invigitator.
4 Please check to make sure that this examination pack consists of
i) _ the Question Paper
ii) two-page Appendix 1
iil) an Answer Booklet — provided by the Faculty
iv) a Statistical Table — provided by the Faculty
6. ‘Answer ALL questions in English.
 
DO NOT TURN THIS PAGE UNTIL YOU ARE TOLD TO DO SO
This examination paper consists of 7 printed pages
© Hak Cipta Universiti Teknologi MARA CONFIDENTIALCONFIDENTIAL, 2 CSIDEC 2018/STA408
QUESTION 1
a) An agricultural area is divided into many fields. It is found that 25% of the fields are
infested with copra beetles.
|) If 20 fields in this area are randomly selected, what is the probability that more than
five of the fields sampled are infested?
(2 marks)
li) If three fields are randomly selected, find the probability that all three fields are not
infested?
(2 marks)
iii) How many fields would you expect to be infested with copra beetles if 100 fields were
randomly taken as a sample?
(2 marks)
iv) Determine how many fields need to be taken as a sample if the probability that none
of the fields infected is 0.0317.
(3 marks)
b) A study stated that the average weight of grains for a particular type of crop is 0.04
grams. Assume that the weight of grains is normally distributed and the standard
deviation is 0.016 grams.
i) Find the probability that the weight of the grains is between 0.025 grams and 0.04
grams?
(4 marks)
il) If a sample of 100 grains is selected, find the probability that the sample mean of the
weight of grains is below 0.037.
(4 marks)
iii) Assume that the standard deviation remains unchanged, find the new average weight
of the grains if 5% of the weight is less than 0.02 grams.
(3 marks)
QUESTION 2
a) A researcher claimed that the distribution of height of men in a population is normally
distributed with p = 69 inches and o = 2.5 inches. A sample of 100 men drawn randomly
from this population had an average height of 68.5 inches,
i) Is there enough evidence to support the researcher's claim on the population mean?
Use « = 0.05.
(7 marks)
(© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL,CONFIDENTIAL, 3 CSIDEC 2018/STA408
ii) Construct a 98% confidence interval for the population mean. Interpret the interval.
(5 marks)
b) A manufacturer of a cell phone batteries claims that the life of his batteries is
approximately normally distributed with a c more than 1.2 years. If a random sample of
10 of these batteries has a standard deviation of 0.9 year, test whether the researcher's
claim is true. Use a = 0.05.
(7 marks)
QUESTION 3
a) A research was conducted to identify the relationship about the population mean time (in
days) required to recover from a common cold for persons given a daily dose of 4 mg of
Vitamin C versus those who were not given a vitamin supplement. Suppose that 20
adults were randomly selected for each treatment category.
i) Based on the Minitab output below, test at 5% level of significance whether there is a
difference in variability of the time required for the two groups of persons with
 
 
different treatments used.
( marks)
Test and Cl for Two Variances: No Vitamin Supplement, 4 mg Vitamin C ]
Null hypothesis Signa(No Vitamin Supplement) / Signa(4 mg Vitamin ¢) =
Alternative hypothesis Signa(No Vitamin Supplement) / Sigma(4 mg Vitamin C) =1
Significance level Alpha = 0.01
statistics
variable N. stbev Variance
No Vitamin Supplement 20 1.387 1.924
‘ng Vitamin € 20 11282 11568 |
Tests
Test |
Method DEL DF2 Statistic P-Value |
F Test (normal) 1919 1.230.661
 
‘Answer part (ii) and (i) using the following output.
 
‘Two-Sample T-Test and Cl: No Vitamin Supplement, 4 mg Vitamin C
| two-sample 7 for No Vitamin Supplement vs 4 mg Vitamin C |
N Mean StDev SE Mean
No Vitamin Supplement 20 5.85 1.39 0.31 |
ang Vitamin C 20 4:10 1128 0.28 |
Difference = mu (No Vitamin Supplement) - mu (4 mg Vitamin C)
Estimate for difference: 1.750 |
95% lower bound for difference: 1.045
T-Test of difference = 0 (vs >): T-Value = 4.19 P-value = 0.000 DF = 37 |
 
(© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL,CONFIDENTIAL 4 CSIDEC 2018/STA408
ii) Show that the test statistic is 4.19.
(3 marks)
 
i) Is there sufficient evidence at 5% significance level to conclude that the use of
vitamin C reduce the mean time required to recover from a common cold?
(5 marks)
b) A sample of ten 13-year old children were provided with a breakfast of low glycemic
index (Gl) foods on the first day and high GI foods on the second day. The two
breakfasts contained the same quantities of carbohydrate, fat and protein. On each day a
buffet lunch was provided, and the number of calories eaten at lunchtime was recorded.
The objective is to determine whether the kind of breakfast eaten has an effect on the
mean calorie intake. The table below summaries the data for children in the sample. A
hypothesis test is needed to determine whether these results show that there would be
differences in the mean calorie intake for other children who ate low and high GI
breakfasts.
 
Student cis |e roe eee te |G
Lunchtime calorie |
Intake after low GI | 300 | 315 | 330 | 400 | 290 | 310 | 315 | 340 | 350 | 300
breakfast | |
Lunchtime calorie T -
Intake after high | 360 | 370 | 450 | 490 | 500 | 330 | 400 | 470 | s40 | 410
Gl breakfast
©
10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
| Paired T-Test and Cl: Low Gl, High GI
Paired 7 for Low GI - High GI
N Mean stDev SE Mean
Low GI 10 325.0 32.3 10.2
High Gr 10 411.0 63.5 20.1
Difference 10 -86.0 62.3 19.7
958 CI for mean difference: (-130.5, -41.5)
 
T-Test of mean differenci
 
0 (vs #0): T-Value = ~4.37
 
Answer the following questions based on the MINITAB output above.
i) Show that the test statistics is -4.37.
(2 marks)
li) Test at 5% level of significance, whether there is any difference in the mean calorie
intake during lunchtime among the ten children.
(6 marks)
(© Hak Cipta Universiti Teknologi MARA CONFIDENTIALCONFIDENTIAL 5 CSIDEC 2018/STA408
QUESTION 4
a) Determine whether the statement is TRUE or FALSE.
b)
i) The correlation coefficient determines the percentage of total variation of dependent
variable explained by the independent variable.
ii) The simple regression analysis is a statistical technique that can be used to obtain
the linear equation relating to the two variables.
iil) If the scatter diagram is randomly scattered, then the two variables can be assumed
to have no relationship between them.
iv) When the value of correlation equals to zero, it means that any increase or
decrease in the value of one variable will not affect the other variable.
v) The value of the y-intercept in regression equation can be interpreted as the change
in y per unit change in x.
(6 marks)
As a new type of environmentally friendly natural air freshener is being developed, it is
tested to see if the length of time that the air freshener lasts (in days) is affected by the
temperature. The data collected was analyzed and the output is shown below.
 
 
 
 
 
 
 
 
‘Temperature (°C) | Length of time that the air freshener lasts (in days)
18 24
20 - 22
24 24
38 15
33, 18
32 19 1
36 17
 
 
 
 
 
Regression Analysis: Length of time versus Temperature (°C)
Analysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Regression 1 55.230 $5.2304 111.18 0.000
Temperature (°C) 1 55.230 55.2304 111.18 0.000
Error 5 2.884 0.4968
Total 6 57.714
Model Summary
S$ _Resq R-sq(adj) R-sq(pred)
0.704826 95.708 94.888, 90.588
Coefficients
Term Coef SE Coef T-Value P-Value VIF
Constant 30.36 1.07 28.37 0.000
 
 
 
‘Temperature (°C) -0.3805 0.0361 -10.54 0.000 1.00
 
(© Hak Cipta Universiti Teknologi MARA CONFIDENTIALCONFIDENTIAL,
i) State the independent and dependent variables.
CSIDEC 2018/STA408
(2 marks)
ii) Write down the equation of line that describes the relationship between the
independent and dependent variables.
(2 marks)
iil) Determine the correlation coefficient.
(2 marks)
iv) State the coefficient of determination and interpret its meaning.
(2 marks)
v) By using the p-value method, test whether the linear regression model is significant
or not. Use a = 0.05.
(5 marks)
vi) Estimate the length of time that the air freshener lasts if the temperature is 30°C.
QUESTION 5
(2 marks)
a) A group of researchers in Virginia Polytechnic Institute wishes to measure the serum
alkaline phosphates activity levels in children with seizure disorders who receive
anticonvulsant therapy under the care of a private physician. Forty-five subjects were
found for the study and categorized into four drug groups. From blood samples collected
on each subject the serum alkaline phosphates activity level was recorded. (Control =
not receiving anticonvulsant),
 
 
 
 
 
 
 
 
 
 
 
 
 
 
‘Serum Alkaline Phosphates Activity Level
4020 | «454 | 4580 | oes | 2010 | 2050 | e200 | ares | soo] waza
Control
0730 | 10500 | se0s | neo | sess | 7200 | 1970) «515 | 709s | 770
Tasnobar | szor | raao | oeso | ores |soneo| asr | av | o77 | oat
‘Carbama-
Garbama- [cao | was | v250| e300 | 7500] 7050 | zaso | vaao | rars0
Other
anteon- | 11060] 57-0 | 11700| 777 | 15000| sa90 | 11150
‘alse :
 
 
 
 
 
The Minitab output for the above data is as follows,
 
Bnalysis of Variance
Source DF Adj SS Adj MS F-Value P-Value
Factor 3 13939 4646 3.570.022.
Error 41 53376 1302
Total 44 67315
 
© Hak Cipta Universiti Teknologi MARA
CONFIDENTIAL,CONFIDENTIAL
CSIDEC 2018/STA408
i) Based on the above data, show that the total sum of squares is 67315.
(3 marks)
ii) By using the p-value, test at « = 0.05 that the average serum alkaline phosphates
activity levels are the same for the four drug groups.
(6 marks)
b) A factory is considering buying new production machines. Six different operators are to
be assigned to test on the efficiency of the machines. Four different machines are
assigned in a random order to each operator. The operation of the machines requires
physical dexterity, and it is anticipated that there will be a difference among the
operators in the speed with which they operate the machines (nuisance factor). The
amount of time (in seconds) were recorded for assembling the product.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
= Machines
Operators | 1 2 3 4
1 425 | 398 | 402 | 413
2 393 | aoa | 405 | 422
3 306 | 405 | 413 | 435
4 309 | 423 | 434 | 442 |
5 a9 | as | 449 | 45.9
6 436 | 431 | 451 | 423
Bnalysis of Variance
source DF Adj SS Adj MS F-Value P-Value
Machines 3 @ 5.308 5 0,048
Operators 5 42.09 8.417 5.290.005
Error 15 23.85 R
Total PB _61.86
 
‘Answer the following questions based on the MINITAB output above.
i) Identify the factor, block and response variables.
ii) Find the values of P, Q,
Rand,
(3 marks)
(4 marks)
ill) By using the p-value, test at 5% level of significance that the machines perform at the
same mean rate of speed.
(© Hak Cipta Universiti Teknologi MARA
END OF QUESTION PAPER
(5 marks)
CONFIDENTIALCONFIDENTIAL,
APPENDIX 4(1) CSIDEC 2018/STA408
KEY FORMULAS
 
Binomial probability formula
Pox=)=(") at aP x20)1.2,.040
 
Poisson probability formula
ar
P(x= x)=,
 
=0,1,2,
 
 
ONFIDENCE INTERVALS
 
Parameter & description
‘Two-sided (1 - a)100% confidence interval
 
Mean, 1, variance, o? unknown,
‘small samples
): df=n-1
tale
 
 
 
Difference in means, 11; ~ 2,
variances c,? =o,” and unknown
  
  
df=nj+n-2,
 
 
Difference in means, 14; —H,
variances o,? #0,” and unknown
men
Sy
G-Btena fa
ay [otis lng F
Ge/nF , ?/ne
mat) mnt
 
Mean difference for
paired samples, Hg
Attang df =1—1 where nis no. of pairs
 
 
Variance, o?
a
Aarng Kara
 
Ratio of the variances o,?/o,*
 
2
st
+ Sr Fate:van
gira
 
2
set = 7
a Mem at, vg=m=t
(g Faspinva }
 
 
 
‘© Hak Cipta Universiti Teknologi MARA
CONFIDENTIAL,CONFIDENTIAL,
APPENDIX 4(2) CSIDEC 2018/STA408
HYPOTHESIS TESTING
 
{ Null Hypothesis
Test statistic
 
 
 
 
Ho: yy — py =0
variances o,? #0,” and unknown
 
r= [tle siting F
Gin G/F
m1” my-t
 
Ho: Hy =0
df =n—1 where nis no. of pairs
 
 
 
 
 
 
 
CORRELATION AND REGRESSION
 
Product Moment Correlation
Coefficient
SS,
ISS,,SS,
.
where $8,