Coleman 1995
Coleman 1995
Publication in late 1993 by the International Organization for Standardization (ISO) of the Guide to the Expres-
sion of Uncertainty in Measurement in the name of ISO and six other international organizations has, in everything
but name only, established a new international experimental uncertainty standard. In this article, an analysis of
the assumptions and approximations used in the development of the methods in the ISO guide is presented, and
a comparison of the resulting equation with previously published uncertainty analysis approaches is made. Also
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
discussed are the additional assumptions necessary to achieve the less complex "large sample" methodology that
is recommended in AIAA Standard S-071-1995, Assessment of Wind Tunnel Data Uncertainty, issued in 1995. It
is shown that these assumptions are actually less restrictive than those associated with some previously accepted
methodologies. The article concludes with a discussion of some practical aspects of implementing experimental
uncertainty analysis in engineering testing.
- 0(bias)-
a) Xtn
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
Fig. 1 Errors in the measurement of a variable X: a) two readings and where the function is continuous and has continuous derivatives in
b) infinite number of readings. the domain of interest. The situation is shown schematically in Fig. 2
for the kih set of measurements (x*, yk) that is used to determine rk.
thermistors. One might assume that the bias errors (the differences Here, pxk and €Xk are the bias and precision errors, respectively, in
between the actual, but unknown, R-T curves of the various thermis- the fcth measurement of x, with a similar convention for the errors
tors and the reference curve) belong to a Gaussian parent distribution in y and in r. Assume that the test instrumentation and/or apparatus
with a standard deviation b — 0.5 C. Then the interval defined by is changed for each measurement so that different values of f>Xk and
±B = ±2b = ±1.0 C would include about 95% of the possible Pyk will occur for each measurement. Therefore, the bias errors and
bias errors that could be realized from the parent distribution. (The precision errors will be random variables so that
bias limit is sometimes called the "systematic uncertainty.") xk = jctrue + .fix + ex (4)
As an estimator of the magnitude of the precision errors (the
width of the distribution of readings in Fig. Ib) a precision limit P (5)
= )>true + Pyk + ^
is defined.7 A 95% confidence estimate of P is interpreted to mean
that the ±P interval about a single reading of X, should cover [L Now approximate the function r in the DRE using a Taylor series
95 times out of 100. (The precision limit is sometimes called the expansion. Expanding to the general point rk from rtrue gives
"precision uncertainty.")
In nearly all experiments, the measured values of different vari- = ''true + —— (Xk ~ *true) + —— (yk ~ Jtrue) + (6)
ables are combined using a data reduction equation (DRE) to form
some desired result. A good example is the experimental determi- where R2 is the remainder term and where the partial derivatives
nation of drag coefficient of a particular model configuration in a are evaluated at (jctrue, ytrue)- Since the true values of x and y are un-
wind-tunnel test. Defining drag coefficient as known, an approximation is always introduced when the derivatives
are evaluated at some measured values (xk, yk).
(1) The remainder term has the form8
pV2A
~ "T~r ~ *true)
one can envision that errors in the values of the variables on the right- 2\\_dx2 dxdy
hand side of Eq. (1) will cause errors in the experimental result CD.
A more general representation of a data reduction equation is
— (yk- (7)
= r(XlfX2,...,Xj) (2)
where the partial derivatives are evaluated at (£, x) which is some-
where between (xk, yk) and (jctrue, ytrue). This term is usually assumed
where r is the experimental result determined from / measured
to be negligible, and so it is useful to consider the conditions under
variables X/. Each of the measured variables contains bias errors and
which this assumption might be reasonable. The factors (xk — jctrue)
precision errors. These errors in the measured values then propagate
and (yk — ytrue) are the total errors in x and y. If the derivatives are of
through the data reduction equation, thereby generating the bias
reasonable magnitude and the total errors in x and y are small, then
and precision errors in the experimental result r. In the following
R2, containing the squares of the errors, will approach zero more
section, a derivation of the equation describing the error propagation
quickly than will the first-order terms. Also, if r(jc, y) is a linear
is presented, with particular attention paid to the assumptions and
function, then the partial derivatives in Eq. (7) are identically zero
approximations that are made to obtain the final equation.
(as is /?2).
Neglecting R2, the expansion gives [taking rtrue to the left-hand
III. Derivation of the Uncertainty side (LHS)]
Propagation Equation
Rather than present the derivation for the case in which the result 3r 9r
——(Xk- *true) + —— (j* ~ (8)
is a function of many variables, the simpler case in which the result dx ay
is a function of only two variables will be considered first. The
This expression relates the total error 8 in the Ath determination of
expressions for the more general case will then be presented as the result r to the total errors in the measured variables and using
extensions of the two-variable case.
the notation
Suppose that the data reduction equation is
r = r(x, y) (3)
', = (9)
1890 COLEMAN AND STEELE
17 2.110 2.898
k=\ k=\ 18 2.101 2.878
19 2.093 2.861
-, N N . N
20 2.086 2.845
* tf E y 25 2.060 2.787
N E 30 2.042 2.750
60 2.000 2.660
00 1.960 2.576
e
«? = E +2 E +E > vr =
E/.,
(19)
/-i
where the v.y/ are the number of degrees of freedom associated with
(16) the Si and the v/,/ are the number of degrees of freedom to associate
with the hi.
where (&/) 2 is the estimate of the variance of the bias error dis- If an Si has been determined from Ni readings of X, taken
tribution of variable X/, etc. The derivation to this point has been over an appropriate interval, the number of degrees of freedom is
presented by the authors in Ref. 9 and in Appendix B of Ref. 7. given by
No assumptions about type(s) of error distributions are made to
obtain the preceding equation for uc. To obtain an uncertainty Ur v,t =N,-l (20)
COLEMAN AND STEELE 1891
For the number of degrees of freedom v/,/ to associate with a non- Approach of Coleman and Steele7'9
statistical estimate of £/, it is suggested in the ISO guide1 that one The authors, expanding on the ideas advanced by Kline and
might use the approximation McClintock10 and assuming Gaussian error distributions, proposed
viewing Eq. (14) as a propagation equation for 68% confidence in-
1 /Afr, tervals. A 95% coverage estimate of the uncertainty in the result was
(21) then proposed as that given by an equation similar to Eq. (23)
2\ bi
(28)
where the quantity in parentheses is the relative uncertainty of £/.
For example, if one thought that the estimate of &/ was reliable to with the bias limit of the result defined by
within ±25%, then j-i
B
=
(22)
Consideration of the 95% confidence t values in Table 1 reveals and the precision limit of the result given by
that the value of t approaches 2.0 (approximately) as the number of
degrees of freedom increases. We thus face the somewhat paradox- (30)
ical situation that as we have more information (vr increases), we
can take t equal to 2.0 and do not have to deal with Eq. (19), but for
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
the cases in which we have little information (vr small) we need to where pbik is the correlation coefficient appropriate for the bias errors
make the more detailed estimates required by Eq. (19). In Sec. V of in X/ and Xk, and psik is the correlation coefficient appropriate for
this article, we examine the assumptions required to discard Eq. (19) the precision errors in X, and X*. The precision limit of the variable
and to simply use t equal to 2 (for 95% confidence). X/ is given by
Pi=ttSi (31)
IV. Comparison with Published Approaches
Approach of Abernethy et al.5 where ^ is determined with v, = TV/ — 1 degrees of freedom.
An approach that was widely used in the 1970s and 1980s was Equations (29) and (30) were viewed as propagation equations
the URss, ^ADD technique formulated by Abernethy et al.5 and used for 95% confidence bias limits and precision limits, and thus this
in Refs. 3 and 4 and other Society of Automotive Engineers (SAE), approach avoided use of the Welch-Satterthwaite formula. Compar-
Instrument Society of America (ISA), JANNAF, National Research ison of uncertainty coverages for a range of sample sizes using this
Council (NRC), U.S. Air Force (USAF), NATO, and ISO standards approach and the Abernethy et al. approach have been previously
documents.5 According to Abernethy et al., presented.11 As was stated earlier in reference to the Abernethy et
al. approach, consideration of these expressions in the context of the
derivation presented in the previous section shows that they cannot
URSS = [B2r + (tSr)2] i (23) be justified on a rigorous basis.
Both the approach of Coleman and Steele7 and the Abernethy et al.
for a 95% confidence estimate and URSS approach5 (properly modified to account for correlated bias
effects) agree with the 95% confidence form of Eq. (18) for "large
= Br + (tSr) (24) sample sizes"—that is, Nt (and vr) large enough so that t can be taken
as 2.0. As will be discussed in Sec. V, an argument can be made that
for a 99% confidence estimate, where Br is given by this corresponds roughly to Nt > 10.
wants an estimate of the expected dispersion of results for a particu- little consideration in the engineering "measurement uncertainty"
lar experimental approach or process, then the systematic/precision literature. As stated in the ISO guide,1
categorization is useful—particularly when used in the "debugging"
phase of an experiment.7 This ... shows that the standard deviation of a statistically esti-
Considering the tradition in engineering of the bias/precision mated standard deviation is not negligible for practical values
(systematic/random) uncertainty categorization and its usefulness of n. One may therefore conclude that Type A evaluations
in engineering experimentation as mentioned earlier, this article's of standard uncertainty are not necessarily more reliable than
Type B evaluations, and that in many practical measurement
authors have chosen to retain that categorization while adopting the situations where the number of observations is limited, the
mathematical procedures of the ISO guide. This categorization is components obtained from Type B evaluations may be better
also used in the AGARD2 document, by the committee currently known than the components obtained from Type A evaluations.
revising Ref. 3, and in the AIAA standard.6
Considering the 95% confidence t table (Table 1), one can ob-
12
serve that for vr > 9 the values of t are within about 13% of the
Approach of NIST large sample t value of 2. This difference is relatively insignificant
Taylor and Kuyatt12 reported guidelines for the implementation compared with the uncertainties inherent in estimating the S/ and
of a National Institute of Standards and Technology (NIST) policy bj as discussed earlier. Therefore, for most engineering applications
that states the following: it is proposed that Gaussian error distributions and vr > 9 be as-
sumed so that t = 2 always. (This could be called the "large sample
assumption.") This assumption eliminates the need for evaluation
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
were randomly drawn from the specified parent population, and a where % is determined with TV/ — 1 degrees of freedom.
sample standard deviation was calculated using In cases in which all TV/ > 10, then the various (25/) in Eq. (34)
correspond to the precision limits P/ so that
j j-i j J
(35)
N-:
i = \ k = i+i 1= 1
where the mean value is defined as
(38)
(36)
where Pik equals 4(Sik).
If we define the bias limit (systematic uncertainty) of the result as
A check was then made to see if the ±(tS) and ±(25) intervals cen-
tered about the next (ninth) randomly chosen reading covered /x,
and the appropriate counter was incremented when the check was (39)
positive. This procedure was followed for 100,000 trials, and the
percent coverage was the percentage of the 100,000 trials in which
JJL was covered. An average, 5avg, of the 100,000 S was calculated, and the precision limit (precision uncertainty) of the result as
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
and the precision limit ratios (25avg/2o-) and (f Savg/2cr) were deter- j j-\ j
mined. This entire procedure was followed for N from 3 to 30, and
•P?+2YY
1
' ^ ' -1
e&P* (40)
the results are shown in Figs. 3 and 4.
i = l jfe=i+l
For N = 10, the percent coverage by 2S is 92.5% and the average
25" interval is about 0.98 of the 2a interval, whereas the percent then Eq. (38) can be written as
coverage by t S is 95% and the average t S interval is about 1.10 of the
2<r interval. As N increases, the percent coverage by 2S approaches (41)
95% and the average 25 interval approaches the 2a interval value.
Considering the previous discussion of the dispersion associated and Eqs. (39) and (40) can be viewed as propagation equations for
with determinations of S using N readings, it is proposed that the the bias limits and precision limits, respectively.
95% confidence precision limit for a variable X, be estimated as
Pi = *iSi N -< 10 Single Test
(37) When the result is determined from a single test, that is, at a given
Pi = 2St N > 10 test condition the result is determined once using Eq. (2)
100—• _ r/y v v \ fO\
and Eq. (40) is used to find the precision limit of the result as follows.
For the variables that are single readings, the various P, are the
precision limits determined from previous information or estimated
from the best available information. For the averaged variables when
N2 and N3 are equal to or greater than 10, P2 and P3 should be taken
as precision limits of means, (2S2)/(N2)1/2 and (2S3)/(W3)1/2, with
the various S calculated using Eq. (35). When N2 and N3 are less
than 10, it is the authors' recommendation that the precision limits
used in Eq. (40) for the averaged variables be taken as (P2)/(A^2)1/2
and (P 3 )/(N 3 ) 1/2 , where P2 and P3 are determined from previous
10 12 14 16 18 20 22 24 26 28 30 information, as is done for the single reading variables.
Number of Readings, N For tests in which multiple readings of all of the variables can
Fig. 4 Ratio of calculated to true precision limit for N readings of a be obtained over an appropriate period, the following method is
Gaussian-distributed variable. recommended.
1894 COLEMAN AND STEELE
Multiple Tests conducted, these estimates can be updated using the additional infor-
If a test is performed so that M multiple sets of measurements mation gained about the accuracy of the calibration standards, errors
(Xi, X2,..., Xj)k at the same test condition are obtained, then M associated with the calibration process and curvefit procedures, and
results can be determined using Eq. (2) and an average result r can perhaps analytical estimates of installation errors.
be determined using As Moffat14 suggests, there can be additional conceptual bias er-
rors resulting from not measuring the variable whose symbol appears
in the data reduction equation. An example would be a freestream
(43) velocity value measured at a particular axial position on a wind-
tunnel centerline and used as the freestream velocity at that cross
If the M sets of measurements were obtained over an appropriate section in determining CD, but there may be a cross-sectional gra-
time period, the precision limit that should be associated with a dient of velocity at that location causing the average value to be
single result would be different. Hence, the inclusion of an elemental bias term for the
conceptual error would be appropriate.
Pr = tSr (44) Examples of estimating elemental bias limits in engineering ex-
periments are presented in Refs. 2, 6,7, and 15, and the procedures
where t is determined with M — I degrees of freedom [and as in for estimating bias uncertainties assumed from a non-Gaussian dis-
Eq. (37) is taken as 2 for M > 10] and Sr is the standard deviation tribution are presented in Ref. 1.
of the sample of M results
Correlated Bias Limits
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
Correlated bias limits are those that are not independent of each
(45) other, typically a result of different measured variables sharing some
identical elemental error sources. It is not unusual for the uncertain-
ties in the results of experimental programs to be influenced by the
The precision limit that should be associated with the average result effects of correlated bias errors in the measurements of several of
is given by the variables. A typical example occurs when different variables
are measured using the same transducer, such as multiple pressures
Pr
(46) sequentially ported to and measured with the same transducer or
temperatures at different positions in a flow measured with a single
with Pr given by Eq. (44). The uncertainty that should be associated probe that is traversed across the flowfield. Obviously, the bias errors
with r is (using the large sample assumption) in the variables measured with the same transducer are not indepen-
dent of one another. Another common example occurs when differ-
(47)
ent variables are measured using different transducers all of which
have been calibrated against the same standard, a situation typical
with Br given by Eq. (39). of the electronically scanned pressure (ESP) measurement systems
in wide use in aerospace test facilities. In such a case, at least a part
Correlated Precision Uncertainties
of the bias error arising from the calibration procedure will be the
The Stk terms in Eq. (34) and the Pik terms in Eq. (38) take into same for each transducer, and thus some of the elemental bias error
account the possibility of precision errors in different variables be- contributions in the measurements of the variables will be correlated.
ing correlated. These terms have traditionally been neglected,3"5'7'10 The Bjk terms in Eqs. (34) or (39) must be approximated—there is
although precision errors in different variables caused by the same in general no way to obtain the data with which to make a statistical
uncontrolled factor(s) are certainly possible and can have a sub- estimate of the covariance of the bias errors in Xi and the bias errors
stantial impact on the value of the precision limit.13 In such cases, in Xj. The approximation of such terms was considered in detail
one would need to acquire sufficient data to allow a valid statisti- in Ref. 16, where it was shown that the approach that consistently
cal estimate of the precision covariance terms to be made if using gives the most satisfactory approximation for the correlated bias
Eqs. (34) or (38). Note, however, that the multiple tests approach us-
limits was
ing Eq. (47) implicitly includes the correlated error effect—a definite
advantage when multiple sets of measurements over an appropriate (49)
time period are available.
Estimating Bias Limits where L is the number of elemental systematic error sources that
Bias Limits of Individual Variables are common for measurements of variables Xt and Xk.
When attempting to estimate the bias limits Bt of the individual If, for example,
variables in Eqs. (34) or (39), one might separate the bias errors that
influence the measurement of a variable into different categories: (50)
calibration errors, data acquisition errors, data reduction errors, test and it is possible for portions of the bias limits and B2 to arise
technique errors, etc. Within each category, there may be several from the same source(s), then Eq. (39) gives
elemental sources of bias. For instance, if for the Jth variable, Xj,
there are M elemental bias errors identified as significant and whose f = Q(E{ -f 02B2 + 20102B12 (51)
bias limits are estimated as (#7)1, ( B j ) 2 , . . . , (Bj)M> then the bias
limit for the measurement of Xj is calculated as the root-sum-square For a case in which the measurements of X\ and X2 are each influ-
(rss) combination of the elemental limits enced by four elemental error sources and sources 2 and 3 are the
same for both X\ and X2, Eq. (48) gives
B,= (48) (52)
*=i
and
The elemental bias limits, (£/)*, must be estimated for each vari-
able Xi using the best information one has available at the time. In the = (B2)\ + (£2)2 + (B2)l + (#2)4 (53)
design phase of an experimental program, manufacturer's specifi-
cations, analytical estimates, and previous experience will typically whereas Eq. (49) gives
provide the basis for most of the estimates. As the experimental
program progresses, equipment is assembled, and calibrations are (54)
COLEMAN AND STEELE 1895
Elemental
this study has been accomplished, resources can be concentrated,
error sources if necessary, on obtaining detailed estimates of the uncertainties in
the most important variables.
3) A detailed uncertainty analysis7 should then be performed con-
sidering both systematic uncertainties and precision uncertainties.
In doing this, it is sometimes useful to visualize the experiment and
the propagation of uncertainties as shown schematically in Fig. 5.
For each X/, estimates of the bias limit and the precision limit are
usually made.
It is generally easiest to obtain an estimate of the bias limit for
Xi by estimating the bias limits of the significant elemental sources
and combining them using Eq. (48). To achieve this, the sources of
systematic uncertainty for each X/ should be identified by consider-
ing the potential elemental error sources affecting the determination
of that variable. Once the sources of uncertainty have been identi-
fied, their relative significance should be established. This is often
done using order of magnitude estimates of the sources. As a rule
of thumb for a given X,-, those uncertainty sources that are smaller
than one-fourth or one-fifth of the largest sources can usually be
Downloaded by BIBLIOTHEK DER TU MUENCHEN on August 12, 2013 | http://arc.aiaa.org | DOI: 10.2514/3.12742
1()
Kline, S. J., and McClintock, F. A., "Describing Uncertainties in Precision Errors on the Uncertainty of a Subsonic Venturi Calibration,"
Single-Sample Experiments," Mechanical Engineering, Vol. 75, Jan. 1953, AIAA Paper 95-0797, Jan. 1995.
I4
pp 3-8. Moffat, R. J., "Describing the Uncertainties in Experimental Results,"
n
Steele, W. G., Taylor, R. P., Burrell, R. E., and Coleman, H. W., "The Experimental Thermal and Fluid Science, Vol. 1, No. 1, 1988, pp. 3-17.
15
Use of Data from Previous Experience to Estimate the Precision Uncertainty Coleman, H. W., Hosni, M. H., Taylor, R. P., and Brown, G. B., "Using
of Small Sample Experiments," AIAA Journal, Vol. 31, No. 10, 1993, pp. Uncertainty Analysis in the Debugging and Qualification of a Turbulent Heat
1891-1896. Transfer Test Facility," Experimental Thermal and Fluid Science, Vol. 4,
12
Taylor, B. N., and Kuyatt, C. E., "Guidelines for Evaluating and Ex- No. 6,1991, pp. 673-683.
16
pressing the Uncertainty of NIST Measurement Results," National Inst. of Brown, K. K., Coleman, H. W., Steele, W. G., and Taylor, R. P., "Evalu-
Standards and Technology, NIST TN 1297, Gaithersburg, MD, Sept. 1994. ation of Correlated Bias Approximations in Experimental Uncertainty Anal-
13
Hudson, S. T., Bordelon, W., and Coleman, H. W., "Effect of Correlated ysis," AIAA Paper 94-0772, Jan. 1994.