The winner of the quality assurance challenge 3 (published in issue 384/5) is:

Josef Dempír, Hlízov, Czech Republic

The award entitles the winner to select a Springer-Verlag book from our catalogue up to a value of € 75.- .

Our Congratulations!

Solution

The subject of the quality assurance challenge 3 was controlling limit values in production batches. In the example presented here the residual monomer concentration of styrene in polysterene used for pharmaceutical purposes was not to exceed a specified limit.

The task was to prove experimentally that the specified precision has been reached. Furthermore, the upper limit for batch release must be specified. Four standard samples were analyzed, and four replicate determinations were carried out for each sample. The results obtained are once again listed in Table 1.

Table 1 Data presented on the residual monomer concentration of styrene in polysterene in four replicate determinations for each of four standard samples

We assumed 0.480 m% for the upper limit of styrene monomer. Styrene concentration in polystyrene was determined by headspace gas chromatography. The specified standard deviation of the method determined at s=0.010 m% was obtained by a round robin analysis performed among eight laboratories, with each laboratory conducting five replicate measurements.

Before the precision can be calculated, the data sets must first be checked for outliers. Possible outliers are then to be eliminated.

Test for outlier

In the quality assurance challenge 2 (published in ABC issue 383/1) we used the test by Grubbs et al., but here Dixon's test [1] will be applied for the testing. Although both tests are appropriate, Dixon's test is one of the most popular tests for the detection of an outlier because it is easy to calculate [2]. It is based on a comparison of the difference between the suspect value x1 and its direct neighbour, with the overall range for n=3–7 – as in the present case – or a modified range for n>7. The test value, \( \widehat{{{M}}} \) is given by

$$ \widehat{{{M}}} = {\left| {\frac{{x_{1} - x_{2} }} {{x_{1} - x_{k} }}} \right|} $$
(1)

The calculated \( \widehat{{{M}}} \)-value is compared in the usual way with the critical value for testing outliers by Dixon at a chosen significance level. An outlier is detected if the calculated \( \widehat{{{M}}} \) exceeds the critical M (= 0.765) for n=4 at p=95%.

The results are listed in Table 2.

Table 2 Data for the outlier (OL) test by Dixon

Answer to question (a): the value 0.398 for the second standard is an outlier at the 95% level of significance; therefore, it is removed from the data set.

Note: The same results are obtained by the Grubb's test.

Homogeneity of the variances

For the calculation of the overall standard deviation of the data set of Table 1, it is necessary to determine whether the group variances are homogeneous. Tests appropriate for more than two variances are those of Bartlett [2] or Cochran [3]. The simple Cochran test is based on comparing smax 2 with the sum of all the variances, but it requires equal numbers of data in each data set. As this is not the case after having removed an outlier in the second data set, Bartlett's test is used here.

The test value \( \widehat{\chi }^{{{2}}} \) is calculated by Eq. (2).

$$ \widehat{\chi }^{{{2}}} = {{ 2}}{{.303}} \cdot {\left( {{{df}} \cdot {{lg s}}^{{{2}}} {{ - }}{\sum {{{df}}_{{{j}}} \cdot \lg {{ s}}^{{{2}}}_{{{j}}} } }} \right)} $$
(2)

The overall standard deviation ‘s’ is calculated by Eq. (3):

$$ {{s}} = {{ }}{\sqrt {\frac{{{\sum {{{SS}}_{{{j}}} } }}} {{{{df}}}}} } $$

SS j is the sum of squares in the set j:

$$SS_{j} = {\sum {{\left( {\overline{x} _{j} - x_{{ij}} } \right)}} }$$
(3)

The degree of freedom df is given by Eq. (4).

$$ \operatorname{df} = n - m $$
(4)

with the overall number of the value n and the number of the samples m.

sj and dfj are the standard deviation and the degree of freedom in each sample j, respectively.

The \( \widehat{\chi }^{2} \)-value calculated by Eq. (2) is compared to the critical of the chi-square distribution. If the \( \widehat{\chi }^{2} \)-value is smaller than the critical chi-square value we assume homogeneity of variances or – in other words – homoscedasticity.

The results of Barlett's test using the outlier-free data sets are listed in Table 3.

Table 3 Results of Barlett's test

Answer to question (b): The calculated chi-square value is smaller than the critical value at the significance level p=95%. The null hypothesis that the variances are homogeneous is accepted.

Test of the precision of the procedure

Once we have ensured the homogeneity of the group variances, based on the outlier-eliminated data sets we can show that the specified precision of s=0.010 m% is reached. From the variance in Table 3, a standard deviation of s=0.0145 m% is obtained for the analytical method.

Comparison of two variances is performed by means of an F-test. The test value \( \widehat{{{F}}} \) is given by Eq. (5)

$$ \widehat{F} = \frac{{s^{2}_{1} }} {{s^{2}_{2} }} $$
(5)

in which s1 2 is the variance of the laboratory and s2 2 is the specified value. The following test value is calculated to \( \widehat{{{F}}} = {{0}}{{.298}} \). This value is compared with the critical F-value at the chosen significance level p.

The specified standard deviation was determined by a round robin analysis performed among eight laboratories, each conducting five replicate measurements. Therefore, the degree of freedom df2 is 32 (df 2=40–5). With df 2=32 and df 1=11 (the degree of freedom of the laboratory) the critical F-value is 2.103 at the significance level p=95% (at simplest received from Excel with the function fx=FINV(5%; 11; 32)).

Answer to question (c): The \( \widehat{{{F}}} \)-value calculated is smaller than the critical F-value at the significance level p=95%. The precision of the analytical procedure of the laboratory does not differ from the specified value. The precision is O.K.

Note: Obviously, when introducing an analytical method in a laboratory, both precision and accuracy have to be documented. This is the subject for a further analytical challenge...

Maximum allowable mean of the analytical results for two replicate determinations

The maximum allowable mean of the analytical results is given by the difference between the specified critical value T0 and the one-sided confidence interval (CIone-sided) because for controlling of the limit value only the upper value is of interest [3]. This difference is calculated by Eq. (6).

$$ {{T}}_{{{0}}} {{ - }}C{{I}}_{{{{one - sided}}}} {\left( {{x}} \right)} = {{ T}}_{{{0}}} - {{ }}\frac{{{{s}} \cdot {{t}}{\left( {\overline{{{P}}} _{{{{one - sided}}}} {{;d f}}} \right)}}} {{{\sqrt {{n}} }}} $$
(6)

In our example, T0=0.480 m%, s is the standard deviation of the laboratory (s=0.0145 m% with df=11), and n=2 (two replicates in the analysis). The statistical factor for the one-sided test is again received from Excel with the function fx=TINV(0.10; 11) to 1.796 because \(t{\left( {\overline{P} _{{one - sided}} = 95\% ;df = 11} \right)} \approx t{\left( {P_{{two - sided}} = 90{\text{ }}\% ;{\text{ }}df = 11} \right)}{\text{ }}\).

Answer to question (d): With CIone-sided (x)=0.01839 m%, the maximum allowable mean of the analytical results is 0.4616 m% styrene monomer. This value must not be exceeded if the production batch is to pass the quality check.

Analysis of two production batches

In routine analysis, the determination of the monomer content of the batches should be carried out based on double determinations. Therefore, the allowed difference between a pair of individual values must be established. Pearson's criterion can be used for this [4].

According to Pearson, the maximum difference (xmax– xmin) of the two replicate determinations is given by Eq. (7):

$${\left( {x_{{max }} {\text{ }} - {\text{ }}x_{{min }} } \right)} = D{\left( {P;n_{j} } \right)} \cdot s$$
(7)

The Pearson factor D(P; nj) for two replicate determinations (n j=2) is 2.77 at the significance level p=95% [4].

With s=0.0145 m%, the critical difference is calculated to be 0.0401 m%.

The differences between the replicates for the two production batches are: batch 1: 0.01 m%, batch 2: 0.007 m%. In each batch, the difference is smaller than 0.0401 m%; averaging is allowed. The mean value of batch 1 \( {\left( {{{x}}_{{{1}}} = {{0}}{{.461 m\% }}} \right)} \) is smaller than the maximum allowable mean value 0.4616 m%, but \( {\left( {{{x}}_{{{2}}} = {{0}}{{.462 m\% }}} \right)} \) exceeds the critical value.

Answer to question (e): Batch 1 may be released for selling, but the release of batch 2 is not allowed.

Note: With the false two-sided statistic factor P (p two-sided = 95%; df = 2.201) the CI(x) calculated is 0.02254 m%, and the maximum allowable mean value is 0.457 m% monomer.

With this critical value also, the release of Batch 1 is not allowed. This result is false!