[go: up one dir, main page]

0% found this document useful (0 votes)
479 views9 pages

STA408

final exam question for statistic

Uploaded by

Najwa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
479 views9 pages

STA408

final exam question for statistic

Uploaded by

Najwa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 9
CONFIDENTIAL CSIDEC 2018/STA408 UNIVERSITI TEKNOLOGI MARA FINAL EXAMINATION COURSE STATISTICS FOR SCIENCE AND ENGINEERING COURSE CODE STA408 EXAMINATION : DECEMBER 2018 3 HOURS INSTRUCTIONS TO CANDIDATES 1 This question paper consists of five (5) questions. 2 ‘Answer ALL questions in the Answer Booklet. Start each answer on a new page. 3 Do not bring any material into the examination room unless permission is given by the invigitator. 4 Please check to make sure that this examination pack consists of i) _ the Question Paper ii) two-page Appendix 1 iil) an Answer Booklet — provided by the Faculty iv) a Statistical Table — provided by the Faculty 6. ‘Answer ALL questions in English. DO NOT TURN THIS PAGE UNTIL YOU ARE TOLD TO DO SO This examination paper consists of 7 printed pages © Hak Cipta Universiti Teknologi MARA CONFIDENTIAL CONFIDENTIAL, 2 CSIDEC 2018/STA408 QUESTION 1 a) An agricultural area is divided into many fields. It is found that 25% of the fields are infested with copra beetles. |) If 20 fields in this area are randomly selected, what is the probability that more than five of the fields sampled are infested? (2 marks) li) If three fields are randomly selected, find the probability that all three fields are not infested? (2 marks) iii) How many fields would you expect to be infested with copra beetles if 100 fields were randomly taken as a sample? (2 marks) iv) Determine how many fields need to be taken as a sample if the probability that none of the fields infected is 0.0317. (3 marks) b) A study stated that the average weight of grains for a particular type of crop is 0.04 grams. Assume that the weight of grains is normally distributed and the standard deviation is 0.016 grams. i) Find the probability that the weight of the grains is between 0.025 grams and 0.04 grams? (4 marks) il) If a sample of 100 grains is selected, find the probability that the sample mean of the weight of grains is below 0.037. (4 marks) iii) Assume that the standard deviation remains unchanged, find the new average weight of the grains if 5% of the weight is less than 0.02 grams. (3 marks) QUESTION 2 a) A researcher claimed that the distribution of height of men in a population is normally distributed with p = 69 inches and o = 2.5 inches. A sample of 100 men drawn randomly from this population had an average height of 68.5 inches, i) Is there enough evidence to support the researcher's claim on the population mean? Use « = 0.05. (7 marks) (© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL, CONFIDENTIAL, 3 CSIDEC 2018/STA408 ii) Construct a 98% confidence interval for the population mean. Interpret the interval. (5 marks) b) A manufacturer of a cell phone batteries claims that the life of his batteries is approximately normally distributed with a c more than 1.2 years. If a random sample of 10 of these batteries has a standard deviation of 0.9 year, test whether the researcher's claim is true. Use a = 0.05. (7 marks) QUESTION 3 a) A research was conducted to identify the relationship about the population mean time (in days) required to recover from a common cold for persons given a daily dose of 4 mg of Vitamin C versus those who were not given a vitamin supplement. Suppose that 20 adults were randomly selected for each treatment category. i) Based on the Minitab output below, test at 5% level of significance whether there is a difference in variability of the time required for the two groups of persons with different treatments used. ( marks) Test and Cl for Two Variances: No Vitamin Supplement, 4 mg Vitamin C ] Null hypothesis Signa(No Vitamin Supplement) / Signa(4 mg Vitamin ¢) = Alternative hypothesis Signa(No Vitamin Supplement) / Sigma(4 mg Vitamin C) =1 Significance level Alpha = 0.01 statistics variable N. stbev Variance No Vitamin Supplement 20 1.387 1.924 ‘ng Vitamin € 20 11282 11568 | Tests Test | Method DEL DF2 Statistic P-Value | F Test (normal) 1919 1.230.661 ‘Answer part (ii) and (i) using the following output. ‘Two-Sample T-Test and Cl: No Vitamin Supplement, 4 mg Vitamin C | two-sample 7 for No Vitamin Supplement vs 4 mg Vitamin C | N Mean StDev SE Mean No Vitamin Supplement 20 5.85 1.39 0.31 | ang Vitamin C 20 4:10 1128 0.28 | Difference = mu (No Vitamin Supplement) - mu (4 mg Vitamin C) Estimate for difference: 1.750 | 95% lower bound for difference: 1.045 T-Test of difference = 0 (vs >): T-Value = 4.19 P-value = 0.000 DF = 37 | (© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL, CONFIDENTIAL 4 CSIDEC 2018/STA408 ii) Show that the test statistic is 4.19. (3 marks) i) Is there sufficient evidence at 5% significance level to conclude that the use of vitamin C reduce the mean time required to recover from a common cold? (5 marks) b) A sample of ten 13-year old children were provided with a breakfast of low glycemic index (Gl) foods on the first day and high GI foods on the second day. The two breakfasts contained the same quantities of carbohydrate, fat and protein. On each day a buffet lunch was provided, and the number of calories eaten at lunchtime was recorded. The objective is to determine whether the kind of breakfast eaten has an effect on the mean calorie intake. The table below summaries the data for children in the sample. A hypothesis test is needed to determine whether these results show that there would be differences in the mean calorie intake for other children who ate low and high GI breakfasts. Student cis |e roe eee te |G Lunchtime calorie | Intake after low GI | 300 | 315 | 330 | 400 | 290 | 310 | 315 | 340 | 350 | 300 breakfast | | Lunchtime calorie T - Intake after high | 360 | 370 | 450 | 490 | 500 | 330 | 400 | 470 | s40 | 410 Gl breakfast © 10 | Paired T-Test and Cl: Low Gl, High GI Paired 7 for Low GI - High GI N Mean stDev SE Mean Low GI 10 325.0 32.3 10.2 High Gr 10 411.0 63.5 20.1 Difference 10 -86.0 62.3 19.7 958 CI for mean difference: (-130.5, -41.5) T-Test of mean differenci 0 (vs #0): T-Value = ~4.37 Answer the following questions based on the MINITAB output above. i) Show that the test statistics is -4.37. (2 marks) li) Test at 5% level of significance, whether there is any difference in the mean calorie intake during lunchtime among the ten children. (6 marks) (© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL CONFIDENTIAL 5 CSIDEC 2018/STA408 QUESTION 4 a) Determine whether the statement is TRUE or FALSE. b) i) The correlation coefficient determines the percentage of total variation of dependent variable explained by the independent variable. ii) The simple regression analysis is a statistical technique that can be used to obtain the linear equation relating to the two variables. iil) If the scatter diagram is randomly scattered, then the two variables can be assumed to have no relationship between them. iv) When the value of correlation equals to zero, it means that any increase or decrease in the value of one variable will not affect the other variable. v) The value of the y-intercept in regression equation can be interpreted as the change in y per unit change in x. (6 marks) As a new type of environmentally friendly natural air freshener is being developed, it is tested to see if the length of time that the air freshener lasts (in days) is affected by the temperature. The data collected was analyzed and the output is shown below. ‘Temperature (°C) | Length of time that the air freshener lasts (in days) 18 24 20 - 22 24 24 38 15 33, 18 32 19 1 36 17 Regression Analysis: Length of time versus Temperature (°C) Analysis of Variance Source DF Adj SS Adj MS F-Value P-Value Regression 1 55.230 $5.2304 111.18 0.000 Temperature (°C) 1 55.230 55.2304 111.18 0.000 Error 5 2.884 0.4968 Total 6 57.714 Model Summary S$ _Resq R-sq(adj) R-sq(pred) 0.704826 95.708 94.888, 90.588 Coefficients Term Coef SE Coef T-Value P-Value VIF Constant 30.36 1.07 28.37 0.000 ‘Temperature (°C) -0.3805 0.0361 -10.54 0.000 1.00 (© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL CONFIDENTIAL, i) State the independent and dependent variables. CSIDEC 2018/STA408 (2 marks) ii) Write down the equation of line that describes the relationship between the independent and dependent variables. (2 marks) iil) Determine the correlation coefficient. (2 marks) iv) State the coefficient of determination and interpret its meaning. (2 marks) v) By using the p-value method, test whether the linear regression model is significant or not. Use a = 0.05. (5 marks) vi) Estimate the length of time that the air freshener lasts if the temperature is 30°C. QUESTION 5 (2 marks) a) A group of researchers in Virginia Polytechnic Institute wishes to measure the serum alkaline phosphates activity levels in children with seizure disorders who receive anticonvulsant therapy under the care of a private physician. Forty-five subjects were found for the study and categorized into four drug groups. From blood samples collected on each subject the serum alkaline phosphates activity level was recorded. (Control = not receiving anticonvulsant), ‘Serum Alkaline Phosphates Activity Level 4020 | «454 | 4580 | oes | 2010 | 2050 | e200 | ares | soo] waza Control 0730 | 10500 | se0s | neo | sess | 7200 | 1970) «515 | 709s | 770 Tasnobar | szor | raao | oeso | ores |soneo| asr | av | o77 | oat ‘Carbama- Garbama- [cao | was | v250| e300 | 7500] 7050 | zaso | vaao | rars0 Other anteon- | 11060] 57-0 | 11700| 777 | 15000| sa90 | 11150 ‘alse : The Minitab output for the above data is as follows, Bnalysis of Variance Source DF Adj SS Adj MS F-Value P-Value Factor 3 13939 4646 3.570.022. Error 41 53376 1302 Total 44 67315 © Hak Cipta Universiti Teknologi MARA CONFIDENTIAL, CONFIDENTIAL CSIDEC 2018/STA408 i) Based on the above data, show that the total sum of squares is 67315. (3 marks) ii) By using the p-value, test at « = 0.05 that the average serum alkaline phosphates activity levels are the same for the four drug groups. (6 marks) b) A factory is considering buying new production machines. Six different operators are to be assigned to test on the efficiency of the machines. Four different machines are assigned in a random order to each operator. The operation of the machines requires physical dexterity, and it is anticipated that there will be a difference among the operators in the speed with which they operate the machines (nuisance factor). The amount of time (in seconds) were recorded for assembling the product. = Machines Operators | 1 2 3 4 1 425 | 398 | 402 | 413 2 393 | aoa | 405 | 422 3 306 | 405 | 413 | 435 4 309 | 423 | 434 | 442 | 5 a9 | as | 449 | 45.9 6 436 | 431 | 451 | 423 Bnalysis of Variance source DF Adj SS Adj MS F-Value P-Value Machines 3 @ 5.308 5 0,048 Operators 5 42.09 8.417 5.290.005 Error 15 23.85 R Total PB _61.86 ‘Answer the following questions based on the MINITAB output above. i) Identify the factor, block and response variables. ii) Find the values of P, Q, Rand, (3 marks) (4 marks) ill) By using the p-value, test at 5% level of significance that the machines perform at the same mean rate of speed. (© Hak Cipta Universiti Teknologi MARA END OF QUESTION PAPER (5 marks) CONFIDENTIAL CONFIDENTIAL, APPENDIX 4(1) CSIDEC 2018/STA408 KEY FORMULAS Binomial probability formula Pox=)=(") at aP x20)1.2,.040 Poisson probability formula ar P(x= x)=, =0,1,2, ONFIDENCE INTERVALS Parameter & description ‘Two-sided (1 - a)100% confidence interval Mean, 1, variance, o? unknown, ‘small samples ): df=n-1 tale Difference in means, 11; ~ 2, variances c,? =o,” and unknown df=nj+n-2, Difference in means, 14; —H, variances o,? #0,” and unknown men Sy G-Btena fa ay [otis lng F Ge/nF , ?/ne mat) mnt Mean difference for paired samples, Hg Attang df =1—1 where nis no. of pairs Variance, o? a Aarng Kara Ratio of the variances o,?/o,* 2 st + Sr Fate:van gira 2 set = 7 a Mem at, vg=m=t (g Faspinva } ‘© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL, CONFIDENTIAL, APPENDIX 4(2) CSIDEC 2018/STA408 HYPOTHESIS TESTING { Null Hypothesis Test statistic Ho: yy — py =0 variances o,? #0,” and unknown r= [tle siting F Gin G/F m1” my-t Ho: Hy =0 df =n—1 where nis no. of pairs CORRELATION AND REGRESSION Product Moment Correlation Coefficient SS, ISS,,SS, . where $8,

You might also like