Call
Call
Ruppert, 2002) 1
The project described was supported by Grant Number R44 RR12435 from the National Institutes of Health, National
Center for Research Resources. Its contents are solely the responsibility of the authors and do not necessarily
represent the official views of the National Center for Research Resources.
Introduction (@ R.J. Carroll & D. Ruppert, 2002) 2
OUTLINE OF SEGMENT 1
• What is measurement error?
• Some examples
• Effects of measurement error in simple linear regression
• Effects of measurement error in multiple regression
• Analysis of Covariance: effects of measurement error in a covariate on the com-
parisons of populations
• The correction for attenuation: the classic way of correcting for biases caused
by measurement error
Introduction (@ R.J. Carroll & D. Ruppert, 2002) 3
OUTLINE OF SEGMENT 2
• Broad classes of measurement error
OUTLINE OF SEGMENT 3
• Transportability: using other data sets to estimate properties of measurement
error
• Conceptual definition of an exact predictor
• The classical error model
∗ You observe the real predictor plus error
• The Berkson error model
∗ The real predictor is what you observe plus error
• Functional and structural models defined and discussed
Introduction (@ R.J. Carroll & D. Ruppert, 2002) 5
OUTLINE OF SEGMENT 4
• The regression calibration method: replace X by an estimate of it given the
observed data
• Regression calibration is correction for attenuation (Segment 1)in linear re-
gression
• Use of validation, replication and external data
• Logistic and Poisson regression
• Use of an unbiased surrogate to estimate the calibration function
Introduction (@ R.J. Carroll & D. Ruppert, 2002) 6
OUTLINE OF SEGMENT 5
• The SIMEX method
• Motivation from design of experiments
• The algorithm
OUTLINE OF SEGMENT 6
• Instrumental variables:
OUTLINE OF SEGMENT 7
• Likelihood methods
• The Berkson model and the Utah fallout study
∗ The essential parts of a Berkson likelihood analysis
• The classical model and the Framingham study
∗ The essential parts of a classical likelihood analysis
• Model robustness and computational issues
Segment 1 (@ R.J. Carroll & D. Ruppert, 2002) 9
• Radiation Dosimetry
• Unobserved is X
• Key point: The regression model in the observed data is not the same as the
regression model when X is observed
• In symbols, Wi = Xi + Ui
• Y = β0 + βxX + ²
• Wi = Xi + Ui (additive)
• Ui are:
SIMULATION STUDY
• Generate X1, . . . , X50, IID N(0, 1)
• Generate Yi = β0 + βxXi + ²i
• Set Wi = Xi + Ui
10
5 Reliable Data
-5
-6 -2 0 2 4 6
10
Error--prone Data
5 Reliable Data
-5
-6 -2 0 2 4 6
• Note how classical measurement error causes a bias in the least squares regres-
sion coefficient
Segment 1 (@ R.J. Carroll & D. Ruppert, 2002) 24
• Important Factoids:
βc0 = Y − βcxW
−→ µy − λβxµx
= β0 + (1 − λ)βxµx
∗ Y | W ∼ Normal
∗ E(Y | W ) = β0 + (1 − λ)βxµx + λβxW
∗ Var(Y | W ) = σ²2 + (1 − λ)βx2σx2
30
25
20
Sample Size
15
10
Figure 3: Sample Size for 80% Power. True slope βx = 0.75. Variances σx2 = σ²2 = 1.
Segment 1 (@ R.J. Carroll & D. Ruppert, 2002) 29
∗ σx|z
2
= residual variance in regression of X on Z
∗ σx|z
2
≤ σx2 =⇒
2
σx|z σx2
λ1 = 2 ≤ =λ
σx|z + σu2 σx2 + σu2
∗ =⇒ Collinearity accentuates attenuation
Segment 1 (@ R.J. Carroll & D. Ruppert, 2002) 31
βz∗ = βz + (1 − λ1)βxγz ,
∗ X is correlated with Z
∗ Z is a significant predictor were X to be observed
Segment 1 (@ R.J. Carroll & D. Ruppert, 2002) 32
ANALYSIS OF COVARIANCE
• These results have implications for the two group ANCOVA.
∗ X = true covariate
∗ Z = dummy indicator of group
• We are interested in estimating βz , the group effect. Biased estimates of βz :
βz∗ = βz + (1 − λ1)βxγz ,
1.5
1.0
Response
0.5
0.0
-0.5
-4 -2 0 2 4
Predictors
Y = β0 + βzt Z + βxt X + ²
W = X + U is observed instead of X
• In simple linear regression, this means that the ordinary least squares slope is
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 35
∗ Additivity
∗ Homoscedasticity
∗ Normality
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 36
NONDIFFERENTIAL ERROR
• Error is said to be nondifferential if W and T would not be measured if one
could have measured X.
∗ It is not clear how this term arose, but it is in commopn use.
• More formally, (W, T ) are conditionally independent of Y given (X, Z).
∗ The idea: (W, T ) provide no additional information about Y if X were
observed
• This often makes sense, but it may be fairly subtle in each application.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 38
NONDIFFERENTIAL ERROR
• Many crucial theoretical calculations revolve around nondifferential error.
• Consider simple linear regression: Y = β0 + βxX + ², where ² is independent of
X.
E(Y |W ) = E [{E(Y |X, W )} |W ]
= E [{E(Y |X)} |W ] Note
= β0 + βxE(X|W ).
IS THIS NONDIFFERENTIAL?
• From Tosteson et al. (1989).
• Y = I{wheeze}.
IS THIS NONDIFFERENTIAL?
• From Küchenhoff & Carroll
• Y = I{lung irritation}.
IS THIS NONDIFFERENTIAL?
• Y = average daily percentage of calories from fat as measured by a food fre-
quency questionnaire (FFQ).
• Assume Y = β0 + βxX + ²
∗ Along with the FFQ, on 6 days over the course of a year women are inter-
viewed by phone and asked to recall their food intake over the past year
(24–hour recalls). Their average is recorded and denoted by W .
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 43
∗ Nondifferential error
∗ An estimate of the error variance var(U )
• The best way is to get a subsample of the study in which X is observed. This
is called validation.
REPLICATION
• In a replication study, for some of the study participants you measure more
than one W .
• This is an unbalanced 1–factor ANOVA with mean squared error var(U ) esti-
mated by
Pn Pmi 2
c2 i=1 j=1 (Wij − W i• )
σu = Pn .
i=1 (mi − 1)
• Of course, as the proxy or surrogate for Xi one would use the sample mean W i•.
W i• = Xi + U i•
var(U i•) = σu2 /mi.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 45
REPLICATION
• Replication allows you to test whether your model is basically additive with
constant error variance.
• If Wij = Xi +Uij with Uij symmetrically distributed about zero and independent
of Xi, we have a major fact:
• Also, if Uij are normally distributed, then so too are differences Wi1 − Wi2 =
Ui1 − Ui2.
∗ q-q plots of these differences can be used to assess normality of the measure-
ment errors
REPLICATION: WISH
• The WISH study measured caloric intake using a 24–hour recall.
• A plot of the caloric intake data showed that W was no where close to being
normally distributed in the population.
• When plotting standard deviation versus the mean, typical to use the rule that
the method “passes” the test if the essential max–to–min is less than 2.0.
6000
5000
•
••
4000
•
••
s.d.
•
•
•••••
3000 ••••
••
••••
••
••••
•••
•••••
2000 ••••
••••••
••
••
•••••••
•
••••••••
•••••
••
••••••••••
•••
•
•••••••
•••••••••
1000 •••
•••••••
•••••••
••••••••••••
••••••••••
•
• •••
•
• •
-3 -2 -1 0 1 2 3
Figure 5: WISH, CALORIC INTAKE, Q–Q plot of Observed data. Caloric intake is
clearly not normally distributed.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 48
3000
•
•
•
•
2000 ••
••••
•
••
•••
•••
• •••••
•••
••••
••
••••••
1000 ••••
••••
•••••••
•••••
•••••
••
•••••
••••••
•••••
••
••••••
•
0 •
•••••
•••••
••••
s.d.
••
••
••••••
••••
•
••••••
•
••
••••
••
-1000 ••
••••
••
•••
•••
••••
••
••••
•••
•••
••
-2000 •••
•••••••
•
•
•
••
-3000
•
•
-3 -2 -1 0 1 2 3
Figure 6: WISH, CALORIC INTAKE, Q–Q plot of Differenced data. This suggests that
the measurement errors are reasonably normally distributed.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 49
1200
• •
•
•
•
•
1000 ••
•• •
• • • •• • •
• •• • • •
• • • • • • • •
• • • •
• •
800 • •• •
• • •• • •
• • • • • •
• • •
• • • •• •
s.d.
•• ••••
• •
• •• •• • • •
• • • • ••• •••
• •
••• • • •
600 • • •
• • • • • • • •
• • • • • • •
• • • • •• •
• •• • •
• • • • • ••• • •• • ••
• • •• • • •• •••
• • •
• • • •• • •
400 • • • • • • • •• ••
• • ••• •• • • • •• • • •
• •
•
• •• • •
• • ••
• •• • • •
• • • • •
• •• • • •
• • •• • • •• ••• ••
•• • •• • •
•
• •• • •
200 • • • • •• •• •
• • •• •
• •
• •
Figure 7: WISH, CALORIC INTAKE, plot for additivity, loess and OLS. The standard
deviation versus the mean plot suggests lots of non–constant variance. Note how the range of the
fits violates the 2:1 rule.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 50
REPLICATION: WISH
• Taking logarithms improves all the plots.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 51
8.5
•
••
•• •
••
••
8.0 •••••••
••
••••
•
•••
•
••••
••
•••••
•••••
•••••
7.5
•••
•••••••
•
••••••
s.d.
••••
•
•••••••
••
••••
••••
•
•••••
••
••••••
7.0
••
••••
•
••
••
••••
•••
••••••
•••••
•
•••
6.5 ••••
••••
••
••
6.0 •
•
-3 -2 -1 0 1 2 3
Figure 8: WISH, LOG CALORIC INTAKE, Q–Q plot of Observed data. The actual
logged data appears nearly normally distributed.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 52
•
•
••
•
••
••••
9.999392*10^-1 ••••••••
•
•
••••
•••
•
•••••••
•••
••••••••
••
•••••••
•••
•••••
•••••
•
•••••••
•
•••
••••••
-4.053116*10^-5 ••••••
•••••••
•
•
••••••
•••••••
•
•••••••
s.d.
••
•
•••••
••••
••••
•
••
•••••
•••
•
•••••••
••
-1.000020*10^0 •••••
•••••
••••
•••
••••
•
•
-2*10^0
•
-3 -2 -1 0 1 2 3
Figure 9: WISH, LOG CALORIC INTAKE, Q–Q plot of Differenced data. The mea-
surement errors appear normally distributed.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 53
0.8 • •
•
• • •
•
•
• • •
0.6 •
• •
•
• ••
•• • •
• • • • •
• • •• • •
• • • • • •
• • • ••
••• • • • • •
•
s.d.
• • •• •
• •• • • • • •• •• •
0.4 • • • • • ••
• ••••• •
• • ••• • • • •
• • • • • •• • •
•
•• • ••• • •• • • • •
• • •• • •
•• • • ••
• • • • •
••• •• ••• ••••• • • •
•• • • •
• ••• • • • •••• •
•
• • • • • •••• • • • •
• • • • •
• • •
• • • • • •
• • • •• • ••
• •• • • •
0.2 •• • • •• ••• • • • • ••• •• • •
•• • •• •
• • •• • •
• • •• • •• • •• •
• • ••
•• • •
• •
• •
•
•
Figure 10: WISH, LOG CALORIC INTAKE, plot for additivity, loess and OLS. The
2:1 rule is not badly violated, suggested constant variance of the errors. This transformation seems
to work fine.
Segment 2 (@ R.J. Carroll & D. Ruppert, 2002) 54
SUMMARY
• Nondifferential error is an important assumption.
E(Y |W ) = β0 + βxE(X|W )
≈ β0 + βxλW.
TRANSPORTABILITY
• It is tempting to try to use outside data and transport this distribution to your
problem.
∗ Bad idea!!!!!!!!!!!!
σx2
λ= 2
σx + σu2
∗ Internal data, i.e., data from the current study, is ideal since there is no
question about transportability.
∗ Sometimes transportability clearly will not hold. Then the value of the ex-
ternal data is, at best, questionable.
∗ Even is transportability seems to be a reasonable assumption, it is still just
that, an assumption.
Segment 3 (@ R.J. Carroll & D. Ruppert, 2002) 59
∗ National sample
• Since the same instrument is used, error properties should be about the
same.
NHS
50
40
30
20
10
0
10 20 30 40 50
pccal.nhs
ACS
40
30
20
10
0
10 20 30 40 50
pccal.acs
Figure 11: FFQ Histograms of % Calories from Fat in NHS and ACS
Segment 3 (@ R.J. Carroll & D. Ruppert, 2002) 62
X = W + Ub
• Replication possible?
• Using a phantom, all persons of a given height and weight with a given recorded
dose are assigned the same radiation exposure.
Segment 3 (@ R.J. Carroll & D. Ruppert, 2002) 66
• In classical error models, you have to know the structure of the error.
∗ Additive or multiplicative?
∗ Some experimentation is necessary to give information about the measure-
ment error variance.
• With all this information, you have to decide upon a method of estimation.
∗ validation data
∗ instrumental data
∗ replication data
Segment 4 (@ R.J. Carroll & D. Ruppert, 2002) 73
NHANES-I
• The NHANES-I example is from Jones et al., (1987).
• Y = I{breast cancer}.
• Z = (age, poverty index ratio, body mass index, I{use alcohol}, I{family
history of breast cancer}, I{age at menarche ≤ 12}, I{pre-menopause}, race).
• Untransformed surrogate:
NHANES-I—CONTINUED
• w/o adjustment for Z, W appears to have a small protective effect
Healthy
Breast Cancer
20
15
10
5
0
Figure 12: Histograms of log(.05 + Saturated Fat/100) in the NHANES data, for women
with and without breast cancer in 10 year follow–up..
Segment 4 (@ R.J. Carroll & D. Ruppert, 2002) 80
NHANES-I—CONTINUED
• External replication data;
∗ CSFII (Continuous Survey of Food Intake by Individuals).
∗ 24-hour recall (W ) plus three additional 24-hour recall phone interviews,
(T1, T2, T3).
∗ Over 75% of σW2
|Z appears due to measurement error.
• From CSFII:
∗ σcW2
|Z = 0.217.
∗ σcU2 = 0.171 (assuming W = X + U )
∗ Correction for attenuation:
c2
σ
βx = c 2 W |Z c 2 βcw
c
σW |Z − σu
0.217
= (−.97) = −4.67
0.217 − 0.171
∗ How this is done depends, of course, on the type of auxiliary data available.
• Same approach can be used for external validation data, but with the usual
concern for non-transportability.
Segment 4 (@ R.J. Carroll & D. Ruppert, 2002) 82
∗ n individuals
∗ ki replicates for the ith individual
∗ Wij = Xi + Uij , i = 1, . . . , n and j = 1, . . . , ki, where E(Uij |Zi, Xi) = 0.
∗ W i· := k1i Pj Wij .
∗ Notation: µz is E(Z), Σxz is the covariance (matrix) between X and Z, etc.
• E(X|Z, W )
Σxx + Σuu/k Σxz −1 W − µw
≈ µx + (Σxx Σxz )
. (1)
Σtxz
Σzz
Z − µz
(best linear approximation = exact conditional expectation under joint normal-
ity).
∗ Σc uu
Pki
Pn
i=1 j=1 (Wij − W i· )(Wij − W i·)t
= Pn .
i=1 (ki − 1)
∗ Σc xx
n
X c )t − (n − 1)Σ
c
=
ki(W i· − µ
c )(W
w i· − µw uu /ν.
i=1
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 86
• An Empirical Version
• Summary
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 87
• Objective is to study how the statistic depends on the variance of the measure-
ment error
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 89
• Extrapolate to θ = −1
Illustration of SIMEX
1.2
1.0
0.8
Coefficient
0.6
Naive Estimate
0.4
0.2
0.0
Illustration of SIMEX
1.2
1.0
Coefficient 0.8
0.6
Naive Estimate
0.4
0.2
0.0
Figure 14: This shows what happens to your estimate when you have more error, but
you still ignore the error.
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 92
Illustration of SIMEX
1.2
1.0
0.8
Coefficient
0.6
Naive Estimate
0.4
0.2
0.0
Illustration of SIMEX
1.2
SIMEX Estimate
1.0
0.8
Coefficient
0.6
Naive Estimate
0.4
0.2
0.0
• Extrapolate to θ = −1
∗ Y = indicator of CHD
∗ Wk = SBP at Exam k, k = 1, 2
∗ X = “true” SBP
∗ Data, 1660 subjects:
(Yj , W1,j , W2,j ), j = 1, . . . , 1660
• Model Assumptions
Predictor
Measurement Error Slope
θ Variance Estimate
= (1 + θ)σu2 /2
−1 0 ?
0 σu2 /2 βbA
A SIMEX–type plot for the Framingham data, where the errors are not
Figure 18:
computer–generated.
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 99
∗ Y = indicator of CHD
∗ Age (at Exam 2)
∗ Smoking Status (at Exam 1)
∗ Serum Cholesterol (at Exam 3)
∗ Transformed SBP
WA = (W1 + W2) /2,
Wk = ln (SBP − 50) at Exam k
• Consider logistic regression of Y on Age, Smoke, Chol and SBP with transformed
SBP measured with error
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 104
• The plots on the following page illustrate the simulation extrapolation method
for estimating the parameters in the logistic regression model
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 105
Age Smoking
6.72 8.43
Coefficient (x 1e+2)
Coefficient (x 1e+1)
6.13 7.18
• • • 0.601
• • • 0.593
5.54 • • 5.93 • • • • • • • •
0.0554
0.0536
4.95 4.68
4.36 3.43
Cholesterol Log(SBP-50)
9.98 2.12
1.93
Coefficient (x 1e+3)
Coefficient (x 1e+0)
8.93 1.92
0.00787 1.71
7.87 • • • • • • • • 1.71 •
0.00782 •
•
•
6.82 1.50 •
•
•
•
5.76 1.29
Figure 20:
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 106
∗ The calculations, too involved to review here, are similar as well. See Chapter
4 of our book.
• In many cases, with decent coding, you can use the bootstrap to estimate
the variance of SIMEX.
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 107
A MIXED MODEL
• Data from the Framingham Heart Study
• There were m = 75 clusters (individuals) with most having n = 4 exams, each
taken 2 years apart.
• The variables were
∗ Y = evidence of LVH (left ventricular hypertrophy) diagnosed by ECG in
patients who developed coronary heart disease before or during the study
period
∗ W = log(SBP-50)
∗ Z = age, exam number, smoking status, body mass index.
∗ X = average log(SBP-50) over many applications within 6 months (say) of
each exam.
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 108
A MIXED MODEL
• We fit this as a logistic mixed model, with a random intercept for each
person having mean β0 and variance θ.
• We assumed that measurement error was independent at each visit.
Segment 5 (@ R.J. Carroll & D. Ruppert, 2002) 109
4.0
3.5
3.0 beta(SBP)
2.5
2.0
1.5 theta
1.0
SUMMARY
• Bootstrap-like method for estimating bias and variance due to measurement
error
• Functional method for classical measurement error
• Not model dependent
• Computer intensive
∗ Generate and analyze several pseudo data sets
• Approximate method like regression calibration
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 111
LINEAR REGRESSION
• Let’s remember what the linear model says.
Y = β0 + βxX + ²;
W = X + U;
U ∼ Normal(0, σu2 ).
∗ W1 = X + U1
∗ W2 = X + U2.
• If these are indeed replicates, then we can estimate σu2 via a components of
variance analysis.
• The third and least efficient method is to use Instrumental Vari-
ables, or IV’s
Y = β0 + βxX + ²;
W = X + U;
U ∼ Normal(0, σu2 ).
∗ T is independent of ²
∗ T is independent of U
∗ T is related to X.
∗ You only measure T to get information about measurement error: it is not
part of the model.
∗ In our parlance, T is a surrogate for X!
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 115
AN EXAMPLE
∗ Questionnaire
∗ Then one year later, the recalls were done fairly close together in time.
∗ Then 6 months later, the diaries were measured.
• One could think of the recalls as replicates, but some researchers have worried
that major correlations exist, i.e., they are not independent replicates.
• The 6–month gap with the recalls and the 18–month gap with the questionnaire
makes the diary records a good candidate for an instrument.
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 117
USING INSTRUMENTAL
VARIABLES:MOTIVATION
• In what follows, we will use underscores to denote which coefficients go where.
• This means
E(Y | T ) = βY |1T + βY |1T T
= βY |1X + βY |1X E(X | T )
= βY |1X + βY |1X E(W | T )
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 119
MOTIVATION
• Algebraically, this means that the slope Y on T is the product of the slope for
Y on X times the slope for W on T :
MOTIVATION
∗ Equivalently, it means
βY |1T
βY |1X = .
βW |1T
∗ If the instrument is very weak, the slope βW |1T will be near zero.
∗ This will make the IV estimate very unstable.
• It is generally far more efficient in practice to take replicates and
get a good estimate of the measurement error variance than it is to “hope and
pray” with an instrumental variable.
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 122
OTHER ALGORITHMS
• The book describes other algorithms which improve upon the simple algorithm,
in the sense of having smaller variation.
• The methods are described in the book, but are largely algebraic and difficult
to explain here.
• However, for most generalized linear models the two methods are fairly similar
in practice.
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 123
FIRST EXAMPLE
• WISH Data (Women’s Interview Study of Health).
∗ Regress W on T
∗ Form predicted values
∗ Regress Y on the predicted values.
• Dietary intake data have large error, and signals are difficult to find.
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 124
5.5
5.0
4.5
FFQ
4.0
3.5
3.0
Figure 22: Wish Data: Regression of FFQ (Y) on Mean of Recalls (W).
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 125
5.5
5.0
4.5
FFQ
4.0
3.5
3.0
Figure 23: Wish Data: Regression of FFQ (Y) on Mean of Diaries (T).
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 126
4.5
4.0
Recall Average
3.5
3.0
2.5
Figure 24: WISH Data: regression of mean of recalls (W) on mean of diaries (T)
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 127
5.0
4.5
FFQ
4.0
3.5
3.0
WISH Data: Regression of FFQ (Y) on the Predictions from the regression
Figure 25:
250
200
150
100
50
0
IV
200
150
100
50
0
Figure 26: Bootstrap sampling, comparison with SIMEX and Regression Calibration
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 129
250
200
150
100
50
0
0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4
betaxmom
IV
200
150
100
50
0
when the instrument is of lower quality and one one of the diaries is used.
Segment 6 (@ R.J. Carroll & D. Ruppert, 2002) 130
FURTHER ANALYSES
• The naive analysis has
∗ Slope = 0.4832
∗ OLS standard error = 0.0987
∗ Bootstrap standard error = 0.0946
• The instrumental variable analysis has
∗ Slope = 0.8556
∗ Bootstrap standard error = 0.1971
• For comparison purposes, the analysis which treats the 6 24–hour recalls
as independent replicates has
∗ Slope = 0.765
∗ Bootstrap standard error = 0.1596
• Simulations show that if the 24–hour recalls were really replicates, then the EIV
estimate is less variable than the IV estimate.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 131
• Comments on Computation
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 132
• Dose to the thyroid was measured by a complex modeling process (more later)
• If true dose in the log–scale is X, and other covariates are Z, fit a logistic
regression model:
· ¸
T
pr(Y = 1|X, Z) = H Z βz + log{1 + βx exp(X)} .
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 133
∗ Age of exposure
∗ Amount of milk drunk
∗ Milk producers
∗ I–131 (a radioisotope) deposition on the ground
∗ Physical transport models from milk and vegetables to the thyroid
• For purposes of today we are going to assume that the error are Berkson in
the log–scale:
Xi = Wi + Ubi.
Utah Study
Utah
Nevada
Arizona
1.5
SD log dose
1.0
0.5
0.0
-15 -10 -5 0 5
Log Dose
1.5
SD log dose
1.0
0.5
0.0
-15 -10 -5 0 5
Log Dose
Figure 29: Log(Dose) and estimated uncertainty in the Utah Data. Large black octo-
gons are the 19 cases of thyroid neoplasm. Note the neoplasm for a person with
no dose.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 138
1.5
Utah
Nevada
Arizona
SD log dose
1.0
0.5
0.0
-15 -10 -5 0 5
Log Dose
Figure 30: Log(Dose) and estimated uncertainty in the Utah Data for the thyroid
neoplasm cases, by state.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 139
• We propose that in the Berkson model, the only real available methods
for this complex, heteroscedastic nonlinear logistic model have to be based on
likelihood methods.
• Let’s see if we can understand what the likelihood is for this problem.
• The first step in any likelihood analysis is to write out the likelihood if there
were no measurement error.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 140
• As a generality, this is
x − w
fX|Z,W (x|z, w, A) = σb−1φ ;
σb
φ(z) = (2π)−1/2 exp(−z 2/2).
• This calculation is obviously dependent upon the problem, and can be more or
less difficult.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 142
fY |W,Z (y|w, z, Θ, A)
Z
= fY,X|W,Z (y, x|w, z, Θ, A)dx
Z
= fY |Z,X (y|z, x, Θ)fX|Z,W (x|z, w, A)dx.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 143
• The maximum likelihood estimate maximizes the loglikelihood of all the data.
n
X
L(Θ, A) = log fY |Z,W (Yi|Zi, Wi, Θ, A).
i=1
∗ In our case, the Utah study data files tells us the Berkson error variance for
each individual.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 145
fY |W,Z (y|w, z, Θ, A)
Z
= fY |Z,X (y|z, x, Θ)fX|Z,W (x|z, w, A)dx.
3
LR chi square
Naive
Berkson
gamma
Figure 31: Likelihood Ratio χ2 tests for naive and Berkson analyses. Note that the
dose effect is statistically significant for both, but that the estimate of γ is larger
for the naive than for the Berkson analysis. Very strange.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 147
• INFERENCE AS USUAL:
• Efficient
• Here is a q–q plot of the observed SBP’s (W ), along with a density estimate.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 151
* *
**
0.5
*
*
*
*****
****
*****
***
***
*****
*******
*
******
*
**
******
******
*******
**
**
********
*****
******
*******
******
**
********
********
******
0.0
diff
*******
**
**
*******
**
*****
******
*******
**
**
******
*
****
*******
*********
******
**
******
*
****
********
*****
*
****
*
*****
**
**
***
-0.5
*
*
-2 0 2
0.4
0.3
0.2
0.1
0.0
-4 -2 0 2 4
Figure 33: Kernel density estimate and best fitting normal density plot in Framingham
for log(SBP − 50)
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 153
W = X + U, σu2 = 0.01259.
X ∼ Normal(µx, σx2 ).
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 154
• We need a formula for the likelihood function, and for this we need a little theory.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 155
• fX|Z (x|z, α2) is the density of X given Z depending on the parameter A. This
is the model for the unobserved predictor. This density may be hard
to specifiy but it is needed. This is where model robustness becomes a big
issue.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 156
∗ We used starting values for β0, βz , βx, µx, σx2 from the naive analysis which
ignores measurement error.
∗ We will show you the profile loglikelihood functions for βx for both analyses.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 159
10
LR chi square
A NOTE ON COMPUTATION
• It is almost always better to standardize the covariates to have sample
mean zero and sample variance one.
• Especially in logistic regression, this improves the accuracy and stability of nu-
merical integration and likelihood maximization.
Segment 7 (@ R.J. Carroll & D. Ruppert, 2002) 161
A NOTE ON COMPUTATION
• Not all problems are amenable to numerical integration to com-
pute the log–likelihood
• For example, multiplicative measurement error will often make sense. These are
additive models in the log scale, e.g., the Utah data.
• Generally, the numerical issues are no more or less difficult for multiplicative
error.