Professional Documents
Culture Documents
Abstract
A procedure to compare the performance (precision and bias) of an alternative measurement method and a reference method
has been extensively described. It is based on ISO 5725-6 which has been adapted to the intralaboratory situation. This means
that the proposed approach does not evaluate the reproducibility, but considers the (operatorinstrumenttime)-different
intermediate precision and/or the time-different intermediate precision. A 4-factor nested design is used for the study. The
calculation of different variance estimates from the experimental data is carried out by ANOVA. The Satterthwaite
approximation is included to determine the number of degrees of freedom associated with the compound variances. Taken into
account the acceptable bias, the acceptable ratio between the precision parameters of the two methods, the signicance level
and the probability to wrongly accept an alternative method with an unacceptable performance, the formulae to determine
the number of measurements required for the comparison are given. For the evaluation of the bias, in addition to the point
hypothesis testing, the interval hypothesis testing is also included as an alternative. Two examples are given as an illustration
of the proposed approach. # 1999 Elsevier Science B.V. All rights reserved.
Keywords: Comparison; Alternative measurement method; Bias; Precision; Repeatability; Time-different intermediate precision;
(Operatorinstrumenttime)-different intermediate precision; Nested design; ANOVA; Satterthwaite approximation; Interval hypothesis
testing
1. Introduction
When a laboratory wants to replace an existing
analytical method by a new method (e.g. because
the latter is cheaper or easier to use) it has to show
that the new method performs at least as good as the
existing one. A comparison of the performance (precision and bias) of both methods has therefore to be
performed. One of the most advanced guidelines for
the comparison of two methods can be found in ISO
5725-6 [1]. However the ISO guideline is based on
*Corresponding author. Tel.: +32-2477-4737; fax: +32-24774735; e-mail: asmeyers@vub.vub.ac.be
0003-2670/99/$ see front matter # 1999 Elsevier Science B.V. All rights reserved.
PII: S 0 0 0 3 - 2 6 7 0 ( 9 9 ) 0 0 1 1 5 - 4
204
2. Methods
All symbols and abbreviations used in this paper are
dened in Table 1.
2.1. Experimental design
A 4-factor nested experimental design is used
[37]. This design is also one of the designs recommended by ISO [3]. The schematic layout of the
design is given in Fig. 1. The four factors represent
four sources of variation that contribute to the variability of the measurements within one laboratory. The
factors considered are operator, instrument, time, and
random error. The experimental approach can be
described as follows. For each analytical method,
the sample is analysed by m operators. Each operator
performs, on each of q instruments, n replicated
measurements on each of p different days. To avoid
an underestimation of the day effect, the set of p
different days during which the measurements are
performed on each of the q instruments must be
different, i.e. two instruments cannot be operated on
the same day.
205
Table 1
Definition of symbols and abbreviations applied in the document
d
D
E
FI(OIT)
FI(T)
Fr
FB ;A
FA ;B
I
m
M
MS
n
n
N
O
p
q
s
s2
tcal
t/2
t
UCL
y
y
yi
yij
yijk
yijkL
z/2
2
I(OIT)
I(T)
Absolute difference between the grand means obtained with two methods
Component of day effect in a test result
Random error component occurring in every test result
Calculated F-value obtained from the comparison of (operatorinstrumenttime)-different intermediate precision (variance)
Calculated F-value obtained from the comparison of time-different intermediate precision (variance)
Calculated F-value obtained from the comparison of repeatability variance
Value of the F-distribution with B degrees of freedom associated with the numerator and A degrees of freedom associated with
the denominator; represents the portion of the F-distribution to the right of the given F-value
Value of the F-distribution with A degrees of freedom associated with the numerator and B degrees of freedom associated with
the denominator; represents the portion of the F-distribution to the right of the given F-value
Component of instrumental effect in a test result
Number of operators
General mean (expectation) of the test results
Mean squares
Number of replicates performed on each day
Average number of replicates performed on each day
Total number of measurements
Component of operator effect in a test result
Number of days
Number of instruments
Estimate of
Estimate of 2
Calculated t-value obtained from the comparison of the means obtained with two methods
Two-sided tabulated t-value at significance level and degrees of freedom
One-sided tabulated t-value at significance level and degrees of freedom
Upper confidence limit
Test result
Grand mean of test results
Arithmetic mean of the test results obtained from the ith operator
Arithmetic mean of the test results obtained from the ith operator and the jth instrument
Arithmetic mean of the test results obtained from the ith operator, the jth instrument and the kth day
Particular test result related to the Lth replicate of the kth day, the jth instrument and the ith operator
Two-sided tabulated z-value of the standard normal distribution at significance level
Significance level (type I error probability)
Type II error probability
Detectable difference between the means obtained from the two methods
Numbers of degrees of freedom
Detectable ratio between the repeatability standard deviations of method B and method A
True value of a standard deviation
True value of variance
Detectable ratio between the square roots of the (operatorinstrumentday) mean squares (or the (operatorinstrumenttime)different intermediate precision (standard deviation)) of method B and method A
Detectable ratio between the square roots of the between-day mean squares (or the time-different intermediate precision
(standard deviation)) of method B and method A
206
Table 1 (Continued )
j
k
L
m
nijk
pij
qi
O
OID
r
(1)
Table 2
Calculation of the variance components (ANOVA table)
Source
Mean squares
Estimate of
pij
qi X
m X
X
Operatorinstrumentday
MSOID
n2OID
2r
i1 j1 k1
qi
m X
X
pij 1
i1 j1
pij
qi X
m X
X
Day
MSD
n2D
2r
pij 1
i1 j1
pij X
nijk
qi X
m X
X
yijkL yijk 2
Residual
MSE
i1 j1 k1 L1
pij
qi X
m X
X
2r
nijk 1
i1 j1 k1
0
B
B
Byijk
B
@
nijk
X
L1
pij
X
yijkL
nijk
; yij
nijk yijk
k1
pij
X
pij
qi X
m X
X
; y
i1 j1 k1
nijk
0
nijk yijk
BN
B
;
nB
B
@
k1
!1
1
pij
qi X
m X
X
2
nijk =N C
C
p
qi X
ij
m X
X
C
C
i1 j1 k1
C
C; N
n
total
number
of
measurements
ijk
q
m
i
C
C
XX
A
A
j1
i1
k1
pij 1
i1 j1
s2r MSE ;
pij
qi X
m X
X
nijk 1
i1 j1 k1
MSD MSE
if s2D < 0 set s2D 0
n
MSOID MSE
s2OID
if s2OID < 0 set s2OID 0
n
MSD
n 1MSE
s2IT s2D s2r
n
MSOID
n 1MSE
2
2
2
sIOIT sOID sr
n
s2D
pij
qi X
m X
X
yijk y2
s2yijk
i1 j1 k1
qi
m X
X
pij 1
MSOID
s2OID s2r =
n;
n
qi
m X
X
i1 j1 k1
qi
m X
X
pij 1
i1 j1
i1 j1
207
nijk is the number of replicates on the kth day performed on the jth instrument by the ith operator (L1, 2, . . ., nijk); pij the number of days performed on the jth instrument by the ith
operator (k1, 2, . . ., pij); qi the number of instruments performed by the ith operator (j1, 2, . . ., qi); m is the number of operators (i1, 2, . . ., m).
Source
Mean squares
208
Table 3
Calculation of the variance components in case of equal nijk and equal qi for all i1, 2, . . ., m, j1, 2, . . ., q and k1, 2, . . ., pij. Only pij that may be unequal for different operators
and instruments due to possible rejection of some discordant data (ANOVA table)
Estimate of
pij
q X
m X
X
n
yijk y2
i1 j1 k1
q
m X
X
MSOID
Operatorinstrumentday
2r n2OID
pij 1
i1 j1
Day
MSD
i1 j1 k1
q
m X
X
pij
i1 j1
pij X
q X
m X
n
X
Residual
MSE
2r n2OID
yijkL yijk 2
i1 j1 k1 L1
q
m X
X
n 1
2r
pij
i1 j1
0
B
B
Byijk
B
@
n
X
L1
pij
X
yijkL
; yij
yijk
k1
pij
pij X
q X
m X
n
X
; y
i1
yijkL C
C
j1 k1 L1
C
q
m X
C
X
A
n
p
ij
i1 j1
s2r MSE ;
n 1
q
m X
X
MSD MSE
if s2D < 0 set s2D 0
n
MSOID MSE
s2OID
if s2OID < 0 set s2OID 0
n
MSD n 1MSE
s2IT s2D s2r
n
MSOID n 1MSE
2
2
2
sIOIT sOID sr
n
s2D
pij
q X
m X
X
yijk y2
pij
i1 j1
s2yijk
i1 j1 k1
q
m X
X
pij 1
MSOID
s2OID s2r =n;
n
q
m X
X
pij 1
i1 j1
i1 j1
n is the number of replicates (L1, 2, . . ., n); pij the number of days performed on the jth instrument by the ith operator (k1, 2, . . ., pij); q the number of instruments (j1, 2, . . ., q);
m is the number of operators (i1, 2, . . ., m).
pij
q X
m X
X
n
yijk yij 2
the day means yijk and the grand mean y. This
might result in an underestimation of the effects of the
instrument and the operator since those parameters are
not changed for every yijk obtained. However, this is
the best possible approach to estimate the intermediate
precision s2IOIT with small numbers of operators and
instruments and although it might not adequately
reect the true precision it is useful for comparison
studies as long as the number of operators and instruments for the methods being compared are equal.
Considering the formulae to calculate the betweenday variance component s2D and the (operatorinstrumentday) variance component s2OID in
Table 3, negative values for those parameters can be
obtained. For example, if due to random effects MSD
is smaller than MSE, we will get a negative value for
s2D . In that case, the negative estimates of variance are
given the value 0. This is the usual practice which is
also considered by ISO [8] if a negative value for the
between-laboratory variance s2L is obtained. Another
approach to deal with negative variance estimates is
reported in [9]. It applies the method of pooling
minimal mean squares with predecessors.
2.4. Number of measurements
As mentioned earlier the probability to accept an
alternative method, which in fact is not appropriate
(-error) because it is not precise enough or too
much biased in comparison with the reference method,
can be controlled by determining the number of
measurements required to detect a certain bias as well
as a certain difference in precision (if it exists). This
implies that an acceptable difference between the
means of the two methods as well as an acceptable
ratio between the precision parameters of the two
methods have to be specied. The former is called
by ISO the detectable difference between the biases of
the two methods, , and is dened as the minimum
difference between the means of the two methods that
the experimenter wishes to detect with high probability. The latter is called by ISO the detectable ratio
between the precision parameters of the two methods.
It is dened as the minimum ratio of precision parameters that the experimenter wishes to detect with
high probability from the results obtained with the two
methods. In analogy with what is given in ISO, the
detectable ratio to be considered in the intralaboratory
209
situation are:
r
B for the comparison of repeatabilities;
r A
s
MSDB
for the comparison of timeIT
MSDA
IOIT
210
s
mA qA pA 1s2OIDA s2rA =nA mB qB pB 1s2OIDB s2rB =nB
1
1
t=2 t
;
mA qA pA mB qB pB
mA qA pA mB qB pB 2
where the subscript A and B refer to method A and
method B, respectively, t/2: two-sided tabulated
t-value at signicance level and degrees of freedom
mAqApAmBqBpB2, t,: one-sided tabulated
t-value at signicance level and degrees of freedom
mAqApAmBqBpB2.
This expression is based on the t-test for the comparison of two means and therefore assumes that the
precision of both methods are equal. This assumption
should be acceptable for an estimation of the optimal
number of measurements. If the precision of the
alternative method B is unknown which might often
be the case, it is substituted by the precision of the
reference method A.
s
mA qA pA 1s2OIDA s2rA =nA mB qB pB 1s2OIDA s2rA =nB
1
1
t=2 t
mA qA pA mB qB pB
mA qA pA mB qB pB 2
where is the acceptable difference between the
means, which one wants to detect with (1)100%
condence from a two-tailed t-test performed at the
signicance level . The t-distribution of the non-zero
mean difference is a non-central t-distribution. Therefore, instead of (t/2t), the non-centrality parameter
of the non-central t-distribution should be used. An
evaluation of the effect of approximating this by
means of the central t-distribution indicated that very
similar results are obtained. Therefore the central
t-distribution is used.
As indicated earlier, it is strongly recommended to
have the same numbers of operators (mAmB) and
instruments (qAqB) for both methods. If moreover,
the number of days as well as the number of replicates
are taken the same for both methods, i.e. pApB and
nAnB, Eq. (3) simplies to
s
2s2OIDA s2rA =nA
:
(4)
t=2 t
m A qA pA
Generally, the number of operators (mAmB) and
instruments (qAqB) will be xed by practical constraints. It is recommended that the number of repli-
(2)
(3)
(5)
(6)
211
and
Table 4
Values of ( A, B, , ) or I(T)( A, B, , ) or I(OIT)( A, B, , ) for (0.05, 0.05)
B
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
25
50
200
A
5
10
11
12
13
14
15
16
17
18
19
20
25
50
200
5.05
4.66
4.40
4.22
4.08
3.97
3.88
3.81
3.75
3.70
3.66
3.62
3.59
3.56
3.54
3.52
3.43
3.27
3.15
4.66
4.28
4.03
3.85
3.72
3.61
3.53
3.46
3.40
3.36
3.31
3.28
3.25
3.22
3.20
3.17
3.09
2.93
2.81
4.40
4.03
3.79
3.61
3.48
3.38
3.29
3.23
3.17
3.12
3.08
3.05
3.02
2.99
2.96
2.94
2.86
2.70
2.59
4.22
3.85
3.61
3.44
3.31
3.21
3.13
3.06
3.00
2.96
2.92
2.88
2.85
2.82
2.80
2.78
2.70
2.54
2.42
4.08
3.72
3.48
3.31
3.18
3.08
3.00
2.93
2.88
2.83
2.79
2.75
2.72
2.70
2.67
2.65
2.57
2.41
2.29
3.97
3.61
3.38
3.21
3.08
2.98
2.90
2.83
2.78
2.73
2.69
2.66
2.62
2.60
2.57
2.55
2.47
2.31
2.19
3.88
3.53
3.29
3.13
3.00
2.90
2.82
2.75
2.70
2.65
2.61
2.58
2.55
2.52
2.49
2.47
2.39
2.23
2.11
3.81
3.46
3.23
3.06
2.93
2.83
2.75
2.69
2.63
2.59
2.55
2.51
2.48
2.45
2.43
2.41
2.33
2.16
2.05
3.75
3.40
3.17
3.00
2.88
2.78
2.70
2.63
2.58
2.53
2.49
2.46
2.42
2.40
2.37
2.35
2.27
2.11
1.99
3.70
3.36
3.12
2.96
2.83
2.73
2.65
2.59
2.53
2.48
2.44
2.41
2.38
2.35
2.33
2.30
2.22
2.06
1.94
3.66
3.31
3.08
2.92
2.79
2.69
2.61
2.55
2.49
2.44
2.40
2.37
2.34
2.31
2.29
2.26
2.18
2.02
1.90
3.62
3.28
3.05
2.88
2.75
2.66
2.58
2.51
2.46
2.41
2.37
2.33
2.30
2.28
2.25
2.23
2.15
1.98
1.86
3.59
3.25
3.02
2.85
2.72
2.62
2.55
2.48
2.42
2.38
2.34
2.30
2.27
2.24
2.22
2.20
2.12
1.95
1.83
3.56
3.22
2.99
2.82
2.70
2.60
2.52
2.45
2.40
2.35
2.31
2.28
2.24
2.22
2.19
2.17
2.09
1.92
1.80
3.54
3.20
2.96
2.80
2.67
2.57
2.49
2.43
2.37
2.33
2.29
2.25
2.22
2.19
2.17
2.15
2.06
1.90
1.77
3.52
3.17
2.94
2.78
2.65
2.55
2.47
2.41
2.35
2.30
2.26
2.23
2.20
2.17
2.15
2.12
2.04
1.87
1.74
3.43
3.09
2.86
2.70
2.57
2.47
2.39
2.33
2.27
2.22
2.18
2.15
2.12
2.09
2.06
2.04
1.96
1.78
1.65
3.27
2.93
2.70
2.54
2.41
2.31
2.23
2.16
2.11
2.06
2.02
1.98
1.95
1.92
1.90
1.87
1.78
1.60
1.45
3.15
2.81
2.59
2.42
2.29
2.19
2.11
2.05
1.99
1.94
1.90
1.86
1.83
1.80
1.77
1.74
1.65
1.45
1.26
212
Table 5
Values of ( A, B, , ) or I(T)( A, B, , ) or I(OIT)( A, B, , ) for (0.05, 0.2)
B
A
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
25
50
200
10
11
12
13
14
15
16
17
18
19
20
25
50
200
5.22
4.76
4.51
4.35
4.24
4.15
4.09
4.04
4.00
3.97
3.94
3.92
3.90
3.88
3.87
3.86
3.84
3.83
3.79
3.71
3.65
4.41
3.98
3.74
3.59
3.49
3.41
3.35
3.30
3.26
3.23
3.21
3.18
3.17
3.15
3.13
3.12
3.11
3.10
3.06
2.98
2.92
4.00
3.59
3.35
3.21
3.10
3.03
2.97
2.92
2.88
2.85
2.83
2.80
2.79
2.77
2.75
2.74
2.73
2.72
2.68
2.60
2.54
3.76
3.35
3.12
2.97
2.87
2.79
2.74
2.69
2.65
2.62
2.60
2.57
2.55
2.54
2.52
2.51
2.50
2.49
2.45
2.37
2.31
3.60
3.19
2.96
2.82
2.71
2.64
2.58
2.53
2.50
2.47
2.44
2.42
2.40
2.38
2.37
2.35
2.34
2.33
2.29
2.21
2.15
3.48
3.08
2.85
2.70
2.60
2.53
2.47
2.42
2.38
2.35
2.33
2.30
2.28
2.27
2.25
2.24
2.23
2.22
2.17
2.09
2.03
3.39
2.99
2.77
2.62
2.52
2.44
2.38
2.34
2.30
2.27
2.24
2.22
2.20
2.18
2.17
2.15
2.14
2.13
2.09
2.00
1.94
3.32
2.92
2.70
2.55
2.45
2.38
2.32
2.27
2.23
2.20
2.17
2.15
2.13
2.11
2.10
2.08
2.07
2.06
2.02
1.93
1.87
3.27
2.87
2.65
2.50
2.40
2.32
2.26
2.22
2.18
2.15
2.12
2.10
2.08
2.06
2.04
2.03
2.02
2.01
1.96
1.88
1.81
3.23
2.83
2.60
2.46
2.36
2.28
2.22
2.17
2.14
2.10
2.08
2.05
2.03
2.01
2.00
1.98
1.97
1.96
1.92
1.83
1.76
3.19
2.79
2.57
2.42
2.32
2.24
2.18
2.14
2.10
2.07
2.04
2.02
1.99
1.98
1.96
1.95
1.93
1.92
1.88
1.79
1.72
3.16
2.76
2.54
2.39
2.29
2.21
2.15
2.11
2.07
2.03
2.01
1.98
1.96
1.94
1.93
1.91
1.90
1.89
1.85
1.76
1.69
3.13
2.74
2.51
2.37
2.26
2.19
2.13
2.08
2.04
2.01
1.98
1.96
1.94
1.92
1.90
1.89
1.87
1.86
1.82
1.73
1.66
3.11
2.71
2.49
2.34
2.24
2.16
2.10
2.06
2.02
1.98
1.96
1.93
1.91
1.89
1.88
1.86
1.85
1.84
1.79
1.70
1.63
3.09
2.69
2.47
2.32
2.22
2.14
2.08
2.04
2.00
1.96
1.94
1.91
1.89
1.87
1.86
1.84
1.83
1.82
1.77
1.68
1.60
3.07
2.68
2.45
2.31
2.20
2.13
2.07
2.02
1.98
1.95
1.92
1.89
1.87
1.85
1.84
1.82
1.81
1.80
1.75
1.66
1.58
3.05
2.66
2.44
2.29
2.19
2.11
2.05
2.00
1.96
1.93
1.90
1.88
1.86
1.84
1.82
1.81
1.79
1.78
1.73
1.64
1.56
3.04
2.65
2.42
2.28
2.17
2.10
2.04
1.99
1.95
1.91
1.89
1.86
1.84
1.82
1.80
1.79
1.78
1.76
1.72
1.62
1.55
2.99
2.60
2.37
2.22
2.12
2.04
1.98
1.93
1.89
1.86
1.83
1.81
1.78
1.76
1.75
1.73
1.72
1.71
1.66
1.56
1.48
2.89
2.49
2.27
2.12
2.02
1.94
1.88
1.83
1.78
1.75
1.72
1.69
1.67
1.65
1.63
1.62
1.60
1.59
1.54
1.43
1.33
2.81
2.42
2.20
2.05
1.94
1.86
1.80
1.75
1.70
1.67
1.64
1.61
1.59
1.56
1.55
1.53
1.51
1.50
1.44
1.32
1.19
s2IOIT ; s2IOIT
A
s2yijk ; s2yijk
A
yA ; yB
s2rB
s2rA
(11)
and
It follows that
MSD ns2IT ns2r s2r ns2IT n 1s2r :
Therefore provided that the repeatabilities of both
methods are equal 2rA 2rB and the number of
replicates per day for both methods is equal (nAnB),
the day mean squares MSD are considered instead of
s2IT . If nAnB, the equality of the repeatabilities for
both methods is rst to be tested H0 : 2rA 2rB ;
H1 : 2rA 6 2rB by means of a two-sided F-test. The
results obtained from the comparison of the repeatabilities in Section 2.5.1.1 cannot be used here, since
a one-sided F-test has been considered to test the
hypotheses: H0 : 2rB 2rA ; H1 : 2rB > 2rA . A
non-signicant test, which means that the repeatability
213
s21
s22
(12)
r 1 rA
and
r 1 rB
rB nB 1
rA nA 1
qB
mB X
X
i1 j1
qA
mA X
X
i1 j1
pijB
and
pijA :
FIT
:
MSDA
nA s2DA s2rA nA s2IT nA 1s2rA
A
(13)
214
degrees
of
freedom
of
the
numerator
IT
B
Pi1
Pq B
A
pijB 1 and the denominator ITA m
i1
Pj1
qA
j1 pijA 1, represents the portion of the Fdistribution to the right of the given value; 0.05.
If FIT FITB ;ITA , method B has worse timedifferent intermediate precision than method A at the
(1)100% (i.e. 95%) condence level and therefore
the time-different intermediate precision of the alternative method B is not acceptable.
If FIT FITB ;ITA , the time-different intermediate precision of the alternative method B is
acceptable, which means that it is at most a factor
IT worse than that of method A.
In the situation where the number of replicates per
day for both methods are not equal (nA6nB) or the
equality of the repeatabilities for both methods cannot
be assumed 2rA 6 2rB , the comparison of the timedifferent intermediate precisions cannot be performed
by Eq. (13) but it must be investigated through the
comparison of s2IT . Then FI(T) is calculated as follows:
FIT
s2ITB
s2ITA
(14)
MSDB =nB 2 =
Pm B Pq B
MSDA =nA 2 =
i1
FIOIT
MSOIDB
MSOIDA
!
2
2
nB s2OIDB s2rB nB sIOITB nB 1srB
:
nA s2OIDA sr2A nA s2IOIT nA 1s2rA
The comparisonPof FP
I(OIT) with FIOITB ;IOITA ,
qB
mB
where
IOIT
j1 pijB 1 and IOITA
i1
PmA PqA B
i1 j1 pijA 1 is then performed in analogy
with the comparison of the time-different intermediate
precision mentioned earlier when Eq. (13) is
considered.
In the situation where the number of replicates
per day for both methods are not equal (nA6nB)
or the equality of the repeatabilities of both
methods cannot be assumed 2rA 6 2rB , the comparison of (operatorinstrumenttime)-different intermediate precision must be investigated through
the comparison of s2IOIT . Then FI(OIT) is calculated
as follows:
s2IT 2
B
PmB PqB
s2IT 2
A
2
j1 pijA 1 nA 1MSEA =nA =nA 1
(17)
2
j1 pijB 1 nB 1MSEB =nB =nB 1
PmA PqA
i1
FIOIT
s2IOIT
s2IOIT
i1
j1
PmA PqA
i1
pijB
j1
pijA
(15)
:
(16)
(18)
IOITB
IOITA
215
s2IOIT 2
B
;
P
P B PqB
P
qB
B
MSOIDB =nB 2 = m
p
PmA PqA
i1
j1
s2IOIT 2
A
PmA PqA
i1
j1
pijA
(19)
;
(20)
or 2d 2sd :
216
s2p
Pm A Pq A
i1
s2y
ijk A
PmA PqA
i1
j1
pijA 2 =
Pm A Pq A
i1
j1
jyA yB j
;
sd
PmB PqB
i1
j1
pijB 1s2yijk
d
qA
mA X
X
i1 j1
qB
mB X
X
i1 j1
pijB 2;
(24)
ijk B
pijA
sd tPmA P
:
(25)
qA
mB PqB
p
ij
A
j1
j1 pijB
i1
i1
pijA 1 s2y
(21)
(23)
with
s2d 2
pijA 1s2yijk
j1
d
d
where
Pm B Pq B
i1
j1
P B PqB
pijB 2 = m
j1 pijB 1
i1
(26)
(27)
(28)
217
218
Table 6
Data obtained with method A (example 1)
Operator
Instrument 1
Instrument 2
Day
yi1k1
yi1k2
yi1k
1
2
3
4
5
97.13
101.23
97.13
97.17
96.82
98.81
100.68
96.63
95.82
97.46
97.970
100.955
96.880
96.495
97.140
1
2
3
4
5
100.71
101.26
98.49
97.06
101.85
99.37
103.78
100.87
98.92
99.77
100.040
102.520
99.680
97.990
100.810
Grand mean99.890
yi1
Day
yi2k1
yi2k2
yi2k
yi2
97.888
1
2
3
4
5
98.98
102.84
99.65
98.67
97.08
99.52
100.93
99.29
98.46
97.11
99.250
101.885
99.470
98.565
97.095
99.253
100.208
1
2
3
4
5
101.55
99.66
102.54
104.95
103.10
104.04
98.70
100.60
102.61
104.36
102.795
99.180
101.570
103.780
103.730
102.211
219
Table 7
Data obtained with method B (example 1)
Operator
Instrument 1
Day
Instrument 2
yi1k1
yi1k2
yi1k
yi1
Day
yi2k1
yi2k2
yi2k
yi2
1
2
3
4
5
97.86
93.73
98.12
101.45
99.68
98.40
95.43
98.42
99.61
99.54
98.130
94.580
98.270
100.530
99.610
98.224
1
2
3
4
5
100.86
105.23
100.66
98.44
100.75
103.98
102.35
99.98
97.32
101.58
102.420
103.790
100.320
97.880
101.165
101.115
1
2
3
4
5
101.61
99.54
101.45
104.38
103.42
103.85
97.34
102.28
103.84
104.58
102.730
98.440
101.865
104.110
104.000
102.229
1
2
3
4
5
104.73
102.94
105.64
103.24
103.78
107.77
103.58
107.14
101.28
101.73
106.250
103.260
106.390
102.260
102.755
104.183
1:4810
1:20:
1:2317
Fr
Table 8
Calculation of the variance estimates for method A (example 1) (ANOVA table)
Source
Mean squares
Estimate of
Operatorinstrumentday
MSOID10.5459
2rA nA 2OIDA
Day
MSD6.3345
r2A nA sDA 2
Residual
MSE1.2317
s2rA
s2rA 1:2317;
20
6:3345 1:2317
2
2:5514
sDA
2
10:5459
1:2317
4:6571
s2OIDA
2
2
2
2
sIT sDA srA 3:7831
19
220
Table 9
Calculation of the variance estimates for method B (example 1) (ANOVA table)
Source
Mean squares
Estimate of
Operatorinstrumentday
MSOID17.9587
2rB nB s2OIDB
Day
MSD9.7042
2rB nB s2DB
Residual
MSE1.4810
2rB
s2rB 1:4810;
20
9:7042 1:4810
2
sDB
4:1116
2
17:9587 1:4810
2
8:2389
sOIDB
2
2
2
2
sIT sDB srB 5:5926
MSDB 9:7042
1:53:
MSDA 6:3345
MSOIDB 17:9587
1:70:
MSOIDA 10:5459
19
sd 7:1262
0:8442:
20 20
s2p
sd
0:8442
221
s
2s2DA s2rA =nA
0:5 t=2 t
;
pA
s
20:08 0:023=2
:
0:5 t=2 t
pA
With pA10 and [2mAqApA2][2(1110)
2]18, (t/2t)3.835 and the right side of the
equation above equals 0.519; with pA11 and
[2mAqApA2][2(1111)2]20, (t/2t)
3.811 and the right side of the equation above equals
0.492. Hence pApB11. (The use of a constant multiplication factor equal to 4 would yield pApB12.)
(2) For the comparison of precision. From Table 4 it
can be seen that 3 or I(T)3 is given by A B10.
To compare repeatability standard deviations,
A m A qA p A
and B mB qB pB ;
so pA pB 10:
To compare between-day mean squares,
A mA qA pA 1 and B mB qB pB 1;
so pA pB 10 1 11:
(3) Conclusion. The minimum number of days
required (with two measurements per day) is 11.
39:845 39:486
2:544;
0:1411
222
Table 10
Data for the example 2
Day
1
2
3
4
5
6
7
8
9
10
11
Method A
Method B
y11k1
y11k2
y11k
y11k1
y11k2
y11k
39.68
39.08
40.39
39.87
39.70
39.93
39.78
39.92
40.34
40.12
39.43
39.77
39.38
40.33
39.98
39.95
39.95
39.97
40.20
39.89
40.26
39.54
39.725
39.230
40.360
39.925
39.825
39.940
39.875
40.060
40.115
40.190
39.485
39.29
39.51
39.45
39.59
39.41
39.45
39.55
39.29
39.82
39.44
39.45
39.36
39.38
39.49
39.51
39.41
39.54
39.55
39.36
39.87
39.45
39.53
39.325
39.445
39.740
39.550
39.410
39.495
39.550
39.325
39.845a
39.445
39.490
Grand mean39.884
a
Grand mean39.486
Straggler.
Table 11
Calculation of the variance estimates for method A (example 2) (ANOVA table)
Source
Mean squares
Estimate of
Day
MSD0.2050
2rA nA s2DA
Residual
MSE0.0239
2rA
s2rA 0:0239;
11
0:2050 0:0239
2
0:0906
sDA
2
2
2
2
sIT sDA srA 0:1145
10
Table 12
Calculation of the variance estimates for method B (example 2) (ANOVA table)
Source
Mean Squares
Estimate of
Day
MSD0.0397
2rB nB s2DB
Residual
MSE0.0024
2rB
s2rB 0:0024;
11
0:0397 0:0024
2
0:0187
sDB
2
2
2
2
sIT sDB srB 0:0211
10
0:0239
10:
0:0024
FIT
S2IT
0:0211
0:18:
0:1145
ITA
0:1055;
sd
11
11
0:01112
13:
d
0:1025=112 =10 0:0199=112 =10
0:02112
0:0397=22 =10 0:0024=22 =11
0:11452
0:2050=22 =10 0:0239=22 =11
tcal
sd
0:1055
0:1025
5:15:
F
0:0199
223
11;
12:
224
References
[1] International Standard, Accuracy (Trueness and Precision) of
Measurement methods and results, ISO 5725-6, Geneva,
1994.
[2] C. Hartmann, J. Smeyers-Verbeke, W. Penninckx, Y. Vander
Heyden, P. Vankeerberghen, D.L. Massart, Anal. Chem. 67
(1995) 4491.
[3] International Standard, Accuracy (Trueness and Precision) of
Measurement methods and results, ISO 5725-3, Geneva,
1994.
225