
Statap Practicetest 27


Chapter 27 Practice Test Name__________________________

Part I: Multiple Choice (Questions 1-10) - Circle the answer of your choice.

1. If the assumptions for regression inference are met, then a normal probability plot of the residuals
should be

(A) Bell shaped


(B) A group of randomly scattered points
(C) Roughly linear
(D) Clearly curved
(E) Skewed right

2. If a test of hypotheses rejects H0: β = 0 in favor of the alternative hypothesis Ha: β > 0, where
β is the population regression slope, then the least-squares regression line

(A) slopes downward and to the right when plotted on the scatter plot of paired observations (x, y).
(B) is useful for predicting y given x (within the limits of x-values covered by the data).
(C) can be extrapolated beyond the limits of the x-values covered by the data to predict y at any
possible x.
(D) is not useful for predicting y given x.
(E) is money.

3. Inference for regression on the population regression slope β is based on which of the following
distributions?

(A) The t distribution with n – 1 degrees of freedom


(B) The standard normal distribution
(C) The chi-square distribution with n – 1 degrees of freedom
(D) The t distribution with n – 2 degrees of freedom
(E) The F distribution

4. A bivariate set of data relates the amount of annual salary raise and previous performance rating.
The least-squares regression equation is ŷ = 1400 + 2000x, where ŷ is the estimated raise and x
is the performance rating. Which of the following statements is not correct?

(A) For each increase of one point in performance rating, the raise will increase on average by
$2,000.
(B) This equation produces predicted raises with an average error of 0.
(C) A rating of 0 will yield a predicted raise of $1,400.
(D) The correlation coefficient of the data is positive.
(E) All of the above are true.

5. A coefficient of determination is found to be 0.81. Which of the following is true?

(A) 81% of the variation between the variables is accounted for in the linear relationship.
(B) 81% of the data points lie on a line.
(C) The correlation coefficient is approximately 0.9.
(D) 19% of the variation between the variables is accounted for in the linear relationship.
(E) All of these are true.

6. If the 90% confidence interval for the slope of a regression line does not contain 0, then which of
the following is a valid conclusion?

(A) The confidence interval is not valid.


(B) A significance test will not be significant at the 10% level.
(C) There is sufficient evidence to conclude that the slope of the true regression line is 0.
(D) There is sufficient evidence to conclude that the slope of the true regression line is not 0.
(E) None of these is valid.

In the next four questions, use the following printout of the linear regression relating the SAT Math
scores of 200 randomly chosen college freshmen and their first-semester GPAs. The assumptions for
inference are satisfied.

The regression equation is

GPA = 1.53 + 0.00170 Math

Predictor   Coeff       StDev       T      P
Constant    1.5264      0.3981      3.83   0.000
Math        0.0016990   0.0006098   2.79   0.006
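
As a quick check on how the columns of a printout like this relate to one another, the T column should equal Coeff divided by StDev for each row. The following is a minimal sketch of that arithmetic; the variable names are illustrative and are not part of the printout.

```python
# Values copied from the Coeff and StDev columns of the GPA printout.
coeff_constant, stdev_constant = 1.5264, 0.3981
coeff_math, stdev_math = 0.0016990, 0.0006098

# Each t ratio is the estimate divided by its standard error.
t_constant = coeff_constant / stdev_constant
t_math = coeff_math / stdev_math

print(f"Constant: T = {t_constant:.2f}")  # compare with 3.83 in the printout
print(f"Math:     T = {t_math:.2f}")      # compare with 2.79 in the printout
```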

7. The value of SEb for this regression is:

(A) 0.0006098
(B) 0.0016990
(C) 0.006
(D) 0.3981
(E) 1.5264

8. The test statistic for a test of significance for a non-zero slope is:

(A) 0.0006098
(B) 0.3891
(C) 2.79
(D) 3.83
(E) None of these.

9. Which of the following is a valid conclusion that could be drawn from this regression analysis?

(A) There is sufficient evidence to reject the hypothesis that β = 0.
(B) There is not sufficient evidence to reject the hypothesis that β = 0.
(C) This test is not significant at the 1% level.
(D) Significance cannot be determined from this printout.
(E) None of these is a valid conclusion.

10. Which of the following is the 95% confidence interval for the population slope?

(A) (0.0005, 0.0029)


(B) (0.0129, 0.0211)
(C) (−0.0170, 0.0340)
(D) (0.0008, 0.0026)
(E) None of these.
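
For readers who want to check an interval or P-value for the GPA printout numerically, here is a minimal standard-library sketch. It assumes the usual t procedure for a regression slope with n - 2 degrees of freedom. The helper names (t_pdf, t_cdf, t_quantile) are hypothetical, and the t distribution is approximated with Simpson's rule and bisection rather than taken from a statistics library, so treat the results as approximate.

```python
import math

def t_pdf(x, df):
    """Density of Student's t with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """Approximate P(T <= x) by Simpson's rule on [0, |x|] plus symmetry."""
    a = abs(x)
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

def t_quantile(p, df):
    """Approximate the p-th quantile by bisection (assumes 0.5 < p < 1)."""
    lo, hi = 0.0, 50.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

# Values from the GPA printout: n = 200 students, slope row "Math".
n = 200
b, se_b = 0.0016990, 0.0006098
df = n - 2

t_star = t_quantile(0.975, df)              # critical value for 95% confidence
ci = (b - t_star * se_b, b + t_star * se_b)

t_stat = b / se_b
p_two_sided = 2 * (1 - t_cdf(t_stat, df))   # two-sided P-value for H0: slope = 0

print(f"t* = {t_star:.3f}")
print(f"95% CI for the slope: ({ci[0]:.4f}, {ci[1]:.4f})")
print(f"two-sided P = {p_two_sided:.3f}")
```

The same three helpers can be reused for the other printouts in this test by changing n, b, and se_b.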

Part II: Free Response (Questions 11-13) - Show your work and explain your results clearly.

11. A new process designed to increase the temperature inside steel girders shows great promise. In a
test of 90 randomly selected girders, the following regression was performed; a partial computer
printout is displayed:

Predictor   Coeff     StDev     T      P
Constant    0.2074    0.2318    0.89   0.373
Temp 1      1.05651   0.02221   ?      ?

S = 0.6009   R-Sq = 96.3%   R-Sq(adj) = 96.2%

Temp 1 is the initial temperature and Temp 2 is the temperature after the process has terminated.

(a) State the regression equation.

(b) Interpret the slope of the regression in the context of the problem.

(c) Interpret the value of R² in words.

(d) Find the values of T and P indicated by the question marks in the printout.

12. A midterm exam in Applied Mathematics consists of problems in 8 topical areas. One of the
teachers believes that the most important of these, and the best indicator of overall performance,
is the section on problem solving. She analyzes the scores of 36 randomly chosen students using
computer software and produces the following printout relating the total score to the problem
solving sub-score, ProbSolv:

Predictor   Coeff    StDev    T      P
Constant    12.960   6.228    2.08   0.045
ProbSolv    4.0162   0.5393   7.45   0.000

S = 11.09   R-Sq = 62.0%   R-Sq(adj) = 60.9%

(a) What is the predicted Total Score if the ProbSolv score is 20?

(b) What is the residual for the data point (10, 55)?

(c) Calculate the 95% confidence interval of the slope of the regression line for all Applied
Mathematics students. Assume the conditions for inference have been checked and satisfied.
(d) Use the information provided to test whether there is a significant relationship between the
problem solving subsection and the total score at the 5% level.

(e) Are the decisions reached through the construction of the confidence interval and through the use
of a significance test consistent? Explain the reasons for your answer.
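
The prediction and residual arithmetic in Question 12 can be sketched in a few lines. This assumes the fitted line is Total = 12.960 + 4.0162 ProbSolv, read from the Coeff column of the printout, and that a residual is observed minus predicted. The function name predict_total is hypothetical.

```python
# Coefficients read from the Question 12 printout.
INTERCEPT = 12.960
SLOPE = 4.0162

def predict_total(prob_solv):
    """Predicted total score for a given problem-solving sub-score."""
    return INTERCEPT + SLOPE * prob_solv

# Predicted total for a sub-score of 20.
pred_20 = predict_total(20)

# Residual = observed - predicted, for the data point (10, 55).
residual = 55 - predict_total(10)

print(f"predicted total at ProbSolv = 20: {pred_20:.3f}")
print(f"residual at (10, 55): {residual:.3f}")
```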
13. Does how long young children remain at the lunch table help predict how much they eat? Data
were collected on a random sample of 20 toddlers observed over several months. “Time” is the
average number of minutes a child spent at the table when lunch was served. “Calories” is the
average number of calories the child consumed during lunch, calculated from careful observation
of what the child ate each day. Partial Minitab output is shown below.

Calories = 560.7 - 3.077 Time

Predictor   Coeff    StDev   T       P
Constant    560.7    29.4    19.09   0.000
Time        -3.077   0.850   -3.62   0.002

S = 23.3980   R-Sq = 42.1%   R-Sq(adj) = 38.9%
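
The printout reports R-Sq but not r itself. A minimal sketch of recovering r, assuming simple linear regression (so that r is the square root of R-Sq, with the same sign as the slope), along with a check of the slope's t ratio; the variable names are illustrative.

```python
import math

# Values read from the Minitab output above.
r_squared = 0.421          # R-Sq = 42.1%
slope, se_slope = -3.077, 0.850
n = 20                     # toddlers in the sample

# In simple linear regression, r takes the sign of the slope.
r = math.copysign(math.sqrt(r_squared), slope)

# t ratio for the slope and its degrees of freedom.
t_stat = slope / se_slope
df = n - 2

print(f"r = {r:.3f}")
print(f"t = {t_stat:.2f} with {df} degrees of freedom")
```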

(a) Interpret each of the following in the context of the problem.

(i) The y-intercept

(ii) r

(iii) s

(iv) The standard error of the slope

(b) The randomness condition has been satisfied and we can assume the independence condition is
satisfied. Check whether the remaining conditions for inference are met using the plots below.

[Figure: Scatterplot of Calories vs Time. Axes: Time (20 to 45), Calories (400 to 520).]
[Figure: Histogram of the residuals (response is Calories). Axes: Residual (-37 to 35), Frequency (0 to 5).]
