Introduction

Recently, the European Commission (EC) adopted a Recommendation on the definition of a nanomaterial (NM) (EC 2011). For legislative and policy purposes in the European Union, this Recommendation should be used as basis for determining whether a material should be considered as a “nanomaterial” (SCENIHR 2010). The implementation of this definition relies on the availability of validated particle counting/imaging methods (EFSA 2011; OECD 2010).

TEM has been found to be a key method for characterizing various types of nanomaterials with sizes between 10 nm and 1 μm, including monodisperse types of silica, gold, and polystyrene latex nanoparticles which can be used for calibration, validation, and/or performance evaluation of particle sizing instruments (Pyrz and Buttrey 2008; Ehara and Sakurai 2010; Bau et al. 2010). For reliable and accurate quantitative analysis of nanoparticles present in representative TEM micrographs, a validated method is required. To date, TEM methods have only been rarely validated. As a result, the majority of today’s nanomaterial characterization studies lack information that reflects the quality (i.e. measurement uncertainties) of the presented data. Reliably estimated measurement uncertainties are essential to underpin the credibility of experimental data and to compare TEM measurement results over time and space.

The capability of the TEM method to provide robust and accurate results of physical NM properties, such as particle size, shape, and surface topology depends on the type of material, on the preparation of a test specimen, and also on a number of image analysis parameters. A suitable sample preparation which does not change the primary particle size and shape is crucial to ensure that no additional size-fractionation occurs. The most relevant parameters are considered to be the sample preparation, the selection of particles on the micrographs, the number of measured particles, the data binning procedure, the applied magnification, and the calibration strategy of the microscope. Documentary standards ISO 13322 and ISO 9276 describe instructions for sample preparation, instrument operation, and image analysis (ISO 13322-1 2004; ISO 9276-3 2008; ISO 9276-1 1998). However, detailed procedures with respect to the choice of magnification, calibration, and data binning are not included in these standards.

In this paper, the use of TEM in combination with a systematic selection procedure for unbiased random image collection, semi-automatic image analysis, and data processing is evaluated for the measurement of selected physical properties of single primary particles. The accuracy of the TEM method is examined based on measurements performed on two colloidal silica-certified reference materials (CRMs). In addition to the uncertainties on the modal ECD of single primary particles, the uncertainties on the modal and median size, shape, and surface topology parameters are estimated.

Materials and methods

Reference materials

The accuracy of the presented TEM method is assessed using CRMs: ERM-FD100 and ERM-FD304. These CRMs consist of near-monodisperse, near-spherical nanoparticles suspended in an aqueous solution. The materials have been produced and supplied by the Institute for Reference Materials and Measurements (IRMM) of the Joint Research Centre (JRC) of the European Commission (Geel, Belgium). The CRMs have been certified in a transparent, detailed, and metrologically rigorous manner, and are primarily intended for quality control in daily laboratory practice, method development, and proficiency testing schemes.

These CRMs are certified for different equivalent diameters of the silica nanoparticles following multiple methods, including TEM and scanning electron microscopy (SEM). For ERM-FD100, the certified number-based modal diameter as established by TEM and SEM is 19.4 nm with a certified expanded uncertainty (coverage factor k = 2) of 1.3 nm (ISO 13322-1 2004; IRMM 2011). ERM-FD304 has certified values based on dynamic light scattering and centrifugal liquid sedimentation. However, a certified value for electron microscopy (SEM and TEM) is not assigned due to the low reproducibility obtained in the interlaboratory comparison part of the certification study (Franks et al. 2012). Consequently, only an indicative number-based modal diameter of 27.8 nm with an indicative expanded uncertainty (coverage factor k = 2) of 1.5 nm is available for ERM-FD304 (IRMM 2012).

Sample preparation and analysis

TEM specimens are prepared using the grid on drop method by bringing 10 μl of the diluted dispersion on pioloform- and carbon-coated, 400 mesh copper grids (Agar Scientific, Essex, England) that are pretreated with 1 % Alcian blue (Fluka, Buchs, Switzerland) to increase hydrophilicity as described in Mast and Demeestere (2009). ERM-FD100 is 1000 times diluted and ERM-FD304 is 100 times diluted in double distilled water. This water was home-made by twice distilling demineralized tap water.

The samples are imaged in bright field mode using a Tecnai Spirit TEM (FEI, Eindhoven, The Netherlands) with Biotwin lens configuration operating at 120 kV. Micrographs are recorded using a systematic selection procedure for unbiased random image collection as described in De Temmerman et al. (2012) using a 4*4 K CCD camera (Eagle, FEI). Briefly: micrographs are taken at positions pre-defined by the microscope stage and evenly distributed over the entire grid area. When the field of view was obscured, e.g. by a grid bar or an artifact, the stage was moved sideways to the nearest suitable field of view. The magnifications of 18,500 and 68,000 times are calibrated using the cross-grating method and the image shift method based on a 2,160 lines/mm optical diffraction-cross grating (Agar Scientific, Stansted, England). The calibration method is implemented following ASTM E766 guidelines (ASTM E766-98(2008)e1 2008) and by using the magnification calibration software which is integrated in the Tecnai user interface software (FEI).

Micrographs are analyzed using the iTEM software (Olympus, Münster, Germany). To prevent bias in the selection of the particles on the micrographs, and in line with (ISO 13322-1 2004), the particles on the lower and right hand side borders of the frame are omitted from analysis, as illustrated in Fig. 1.

Fig. 1
figure 1

Representative micrographs of ERM-FD100 and ERM-FD304 and illustration of the method of detection based on electron density and a measurement frame. The NM in the representative electron micrograps of ERM-FD100 (a) and ERM-FD304 (c) are detected, manually classified into single primary particles (green) and agglomerates (red) and false color-coded in the corresponding annotated images (b) and (d). The measurement frame (dashed red line) is placed at a distance from the top and left side which corresponds with 10 % of the width and height of the micrograph. The bottom and right side of the frame are set as rejected sides. Magnification 68,000, bar 100 nm. (Color figure online)

Validation methodology

The precision part of the quantitative TEM method validation studies is based on determining repeatability (within one day variability) and intermediate precision (day-to-day variability) for measurements on ERM-FD100 and ERM-FD304. One sample of ERM-FD100 and one sample of ERM-FD304 are analyzed on five consecutive days. On each day, three diluted aliquots are prepared from the original sample and three specimens are independently prepared from each aliquot (vial). From each specimen, TEM micrographs of 10 fields are recorded (De Temmerman et al. 2012) at two magnifications, namely 18,500 and 68,000 times, and analyzed (Appendix 1).

Identification of the measurands

Twenty-three measurands of size, shape, and surface topology are considered potentially relevant for the morphological characterization of the examined silica nanoparticles. Detailed information regarding these measurands can be found in De Temmerman et al. (2012). For each particle, these characteristic measurands are measured and the corresponding number-based distributions are obtained. The median values of the measurands are estimated for each sample and the distributions are constructed by binning the raw data on a linear abscissa. The width of the bins is manually altered until the number of non-empty bins is equal to the number of measurements in the largest bin (Fig. 2). The modes and full width at half maximum (FWHM) of the distributions of seven one-dimensional size measurands, namely ECD, Feret max, Feret mean, Feret min, diameter max, diameter mean, and diameter min, are determined for each dataset of 10 systematically selected TEM micrographs. This mode is defined as the position of the maximum of the log-normal function fitted to the distribution (Fig. 2). Iterative curve fitting of the log-normal distribution is performed using the Fityk software by selecting the Levenberg–Marquart algorithm (Wojdyr 2010). One-dimensional size measurements are compared by performing one-way analysis of variance and the Tukey test using sigmaplot (STATSCONsult, Drunen, The Netherlands) at a significant difference level of 0.05.

Fig. 2
figure 2

Illustration of the binning and log-normal fitting of the number-based particle size distributions of ERM-FD100 (a) and ERM-FD304 (b). A subdataset of 146 measurements is randomly selected at a magnification of 18,500 times for each CRM

Working range

The useful working range of the TEM is defined by the lower and upper size quantification limits. The lower size quantification limit is calculated based on the work of Merkus (2009), who showed that large systematic deviations in size measurements can be avoided if the particle area consists of at least hundred pixels. The upper size quantification limit is restricted by the field of view and is set to one tenth of the image size as proposed in ISO 13322-1 (2004). The calculated working range and the corresponding magnification, pixel size, and size of the field of view are summarized in Appendix 2 for the magnifications of 18,500 and 68,000 times.

A preliminary experiment demonstrated that ERM-FD100 and ERM-FD304 are monomodal. No particles with an area smaller than 52 nm2 are observed in the samples. Such an area corresponds to particles that contain 142 pixels at a magnification of 18,500 times and 2,000 pixels at a magnification of 68,000 times. For this reason, particles with an area smaller than 52 nm2 (corresponding with an ECD of about 8 nm) are considered as background signal and are omitted from analysis.

Analysis and selection of single primary particles

Single primary particles can be automatically selected in the datasets of ERM-FD100 and ERM-FD304 based on their morphology using the following approach. The morphology of the single primary particles is characterized in a preliminary experiment. In 10 micrographs all detected and measured particles are manually classified, either as single primary particles or as agglomerates. In the generated subdataset, which contains only the classified single primary particles, a correlation matrix of all 23 measurands is set up. Measurands that describe the morphology of the single primary particles and which have a low correlation (<0.5) with the ECD are selected (Appendix 3). The measurand showing the best separation between single and agglomerated primary particles is determined by performing linear discriminant analysis (LDA) on the full dataset.

The measurand with the lowest Type I and Type II classification errors (Appendix 4) is then considered for an automated classification in the other micrographs, resulting in a large dataset consisting of separated populations of single primary particles and agglomerates. The LDA calculations are programmed in the computational language “R” (R version 2.15.2, 2012) in RStudio (RStudio v 0.97, RStudio, Boston, MA, USA) which is available as freeware.

Robustness

The proposed method is inherently robust against “variation” introduced by the TEM operator because of a strong emphasis on the automation of the particle selection and image analysis aspects.

The robustness of the TEM method is assessed by varying the magnification and the number of analyzed particles which may influence the performance of the method. This magnification is varied between 18,500 and 68,000 times. The number of particles used to construct the number-based size distribution is varied by taking subdatasets. At a magnification of 68,000 times, the subdatasets contain between 3 and 335 particles and between 3 and 225 particles for ERM-FD100 and ERM-FD304, respectively. At a magnification of 18,500 times, the subdatasets contain between 3 and 4,050 particles and between 3 and 2,900 particles for ERM-FD100 and ERM-FD304, respectively. The effect of sample preparation is not included in the robustness part of the present validation study. Care should, therefore, be taken not to extrapolate the conclusions of this study to TEM methods following significantly different sample preparation approaches.

Precision

The intra-laboratory precision (composed of repeatability and intermediate precision) of the quantitative TEM method is assessed by measurements performed on ERM-FD100 and ERM-FD304 similar to the test design described in Braun et al. (2011a). Repeatability indicates the closeness between results of measurements, performed over a short period, using the same instrument and performed by the same operator. The relative repeatability uncertainty is calculated from the dataset using Eq. 1. The mean sum of squares is calculated using one-way analysis of variance (ANOVA), with SoftCRM, an undated version of the software described in Bonas et al. (2003).

$$u\left( r \right) = \frac{{\sqrt {{\text{MS}}_{\text{within}} } }}{{C_{\text{m}} }}$$
(1)

With u(r) being the relative repeatability uncertainty, MSwithin the mean of squares within the measurement days, and C m the mean measured value. The relative intermediate precision uncertainty (day-to-day variability) is determined with Eq. 2:

$$u\left( {\text{ip}} \right) = \frac{{\sqrt {\frac{{{\text{MS}}_{\text{between}} \text{ - }{\text{MS}}_{\text{within}} }}{{n_{\text{r}} }}} }}{{C_{\text{m}} }}$$
(2)

With u(ip) being the relative intermediate precision uncertainty, MSbetween the mean sum of squares between different days, and n r the number of measurement replicates per day.

The relative intra-laboratory precision uncertainty is then determined by combining the relative repeatability uncertainties and the relative intermediate precision (Eq. 3).

$$u\left( {\text{lab}} \right){ = }\sqrt {u^{2} \left( r \right){ + }u^{2} \left( {\text{ip}} \right)}^{ }$$
(3)

The intra-laboratory precision uncertainty summarizes the uncertainties related to the non-systematic variability in sample preparation, image acquisition, image analysis, and data analysis.

Uncertainty of the estimation of the mode

The uncertainty related to the estimation of the mode from the raw dataset, u(mode), is concealed in the repeatability and intermediate precision. Amonst all contributions to repeatability and intermediate precision, u(mode) is the one that depends most directly on the number of measured particles. It is, therefore, proposed to evaluate the u(mode) by separating the intra-laboratory precision uncertainty into a part which is dependent on the number of measured particles, u(mode), and a part which is independent on the number of measured particles, u’(r) and u’(ip) (Eq. 4).

$$u\left( {\text{lab}} \right) = \sqrt {u^{2} \left( {\text{mode}} \right) + u'^{2} \left( r \right) + u'^{2} \left( {\text{ip}} \right)}$$
(4.)

With u’(r) and u’(ip) being the uncertainties which do not depend on the number of measured particles. Subdatasets on a logarithmic scale (Fig. 3) are taken each day and for each repetition. For each subdataset, the laboratory uncertainty is determined. Linear regression is performed on the logarithmic plot of the laboratory uncertainty as a function of the sample size.

Fig. 3
figure 3

Correlation between the relative laboratory uncertainty and the number of measured particles per repetition. The linear regression between the log transformation of the relative laboratory uncertainty and the log transformation of the number of measured particles per repetition for ERM-FD100 and ERM-FD304 at magnifications of 18,500 (filled circle) and 68,000 times (unfilled circle) are given in (a) and (b), respectively

Calibration

The calibration of the microscope magnification using the cross-grating method and the image shift method is performed following ASTM E766 guidelines. The uncertainty associated with calibration, u(cal), is considered to be about 0.1 and 1.3 % for magnifications of 68,000 and 18,500 times, respectively.

Trueness

The results of a method are “true” if the method is free of systematic and significant bias. Whether a method produces significantly biased results can be assessed by comparing the results with reference values, for example by measuring one or more suitable CRMs as described in ERM application note 1 (Linsinger 2010). When the combined uncertainty of the measurement results and the certified value are larger than the absolute difference between the certified and the measured value (∆m), then it can be concluded that the measured value is not significantly different from the certified value. If the opposite is the case, then the method results are significantly biased and a correction of the results is preferred. Instead one can also choose to include the measured bias in the measurement uncertainty, especially when the bias value is not very well known.

This trueness assessment is not free of uncertainty itself, so even if the assessment indicates that the results are without significant bias, an uncertainty associated with the assessment of the trueness of the method must be taken into account. The trueness uncertainty, u(t), can be calculated by combining the uncertainty of the measurements on the CRMs, u(m), with the uncertainties of the certified values of the CRMs, u(CRM) (Eq. 5)

$$u (t ) { = }\sqrt {u^{ 2} (m )\;{ + }\;\frac{{\sum {u^{2} ({\text{CRM}})} }}{{n_{\text{CRM}}^{ 2} }}}$$
(5)

With \(\sum {u^{2} \left( {\text{CRM}} \right)}\)being the sum of the squares of the relative uncertainties of the certified values of the CRMs and n CRM the number of CRMs.

In validation studies, such as the study presented in this paper, the uncertainty u(m) of the results obtained on the CRMs is usually not a full measurement uncertainty, as it does not yet contain the u(t) contribution. Instead, u(m) contains repeatability and intermediate precision uncertainty contributions and can be estimated from Eq. 6

$$u(m){ = }\sqrt {\frac{{u^{ 2} \left( r \right)}}{{n_{\text{t}} }}{ + }\frac{{ u^{ 2} \left( {\text{ip}} \right)}}{{n_{\text{d}} }}}$$
(6)

With u(ip) being the relative intermediate precision uncertainty, u(r) the relative repeatability uncertainty, n d the number of test days, and n t the total number of measurement replicates. Note that the precision contributions to u(m) are different from those to u(lab) because the number of replicates and measuring days in the validation study is higher than during routine use of the method. Also, since two CRMs are tested, the measurement uncertainty of the technique is calculated from the average of the relative repeatability and relative intermediate precision uncertainties of measurements of the two CRMs.

Only for ECD the trueness uncertainties of TEM analyses can be estimated using the certified uncertainty of ERM-FD100 and the indicative uncertainty of ERM-FD304.

Combined and expanded measurement uncertainty

The uncertainty contributions explained above are to be combined in the method’s full uncertainty budget. The intra-laboratory precision uncertainty u(lab) is a type A uncertainty: it is derived from repeated testing and covers all sources of variation between analyses and the typical between-day variation. A type B uncertainty component (values taken from certificates, expert judgement, etc.) is the uncertainty of the certified values of the used CRMs, u(CRM), and the calibration uncertainty, u(cal). The trueness uncertainty u(t) is a mix of A and B type uncertainties (ISO/IEC GUIDE 98-3 2008).

If one assumes that all the uncertainty contributions of the quantitative TEM method are covered by the intra-laboratory precision uncertainty and the uncertainties for trueness and calibration, then the combined measurement uncertainty can be estimated from (Eq. 7):

$$u_{\text{c}} \left( x \right) = \sqrt {u^{ 2} \left( {\text{lab}} \right) + u^{ 2} \left( t \right) + u^{ 2} ( {\text{cal)}}^{ } }$$
(7)

The uncertainties are combined using the normal root-sum-square manner, resulting in the combined measurement uncertainty u c(x). By assuming that the combined uncertainty is normally distributed and a confidence level of approximately 95 % is required, and when the degrees of freedom of the individual uncertainty contributions permit, the combined uncertainty can be multiplied by a coverage factor (k) of 2 to obtain the expanded measurement uncertainty U(x) (ISO/IEC GUIDE 98-3 2008).

Results

Selection of single primary particles

The size, shape, and surface characteristics of single particles and agglomerates of ERM-FD100 and ERM-FD304 are bimodally distributed. This is illustrated in Appendix 5 for the sphericity measurand. The largest peak of the number-based distribution corresponds to the single primary particles and the second peak corresponds to the agglomerated particles.

In this validation study, only single primary particles are considered. As illustrated in Appendix 3, the aspect ratio, convexity, elongation, shape factor, and sphericity measurands have a low correlation (less than 0.5) with ECD. From these measurands, the sphericity allows to separate single primary particles from the agglomerates with the lowest type I and type II uncertainties as illustrated in Appendix 4. With this separation about 1 % of the single primary particles are removed from the datasets and about 1 and 5 % of the agglomerates are classified as single primary particles for ERM-FD100 and ERM-FD304, respectively.

The measured modal sphericities of the single primary particles of both ERM-FD100 and ERM-FD304 are about 0.8 and confirm that the particles of these materials are approximately spherical. Particles that have a sphericity lower than 0.4 for ERM-FD100 and lower than 0.5 for ERM-FD304 are classified as non-spherical particles (Appendix 5). Since the primary particles of ERM-FD304 are more homogeneous, a stricter threshold can be used for ERM-FD304 than for ERM-FD100. For ERM-FD100 the percentage of agglomerated particles is lower than that for ERM-FD304 (Table 1). The number-based size distributions of Feret max, Feret mean, Feret min, ECD, diameter max, diameter mean, and diameter min of the single primary particles are shown in Fig. 4 and Appendix 6 and the corresponding modes and FWHMs in Table 2 and Appendix 7 at magnifications of 18,500 and 68,000 times, respectively.

Table 1 Overview of the number of analyzed single and agglomerated particles in 150 micrographs for ERM-FD100 and ERM-FD304 at magnifications of 18,500 and 68,000 times
Fig. 4
figure 4

Number-based size distributions of ERM-FD100 (a) and ERM-FD304 (b) at a magnification of 18,500 times. The reported fittings are based on the mean values of the mode and FWHM in Table 2

Table 2 Primary particle mean mode and full width at half maximum (FWHM) of the number-based size distributions and the corresponding intra-laboratory uncertainty u(lab) for ERM-FD100 and ERM-FD304 at a magnification of 18,500 times

Precision

The number of particles per micrograph decreases with increased (squared) magnification: in 150 micrographs about 12–14 times more particles are measured at a magnification of 18,500 times than at a magnification of 68,000 times. For ERM-FD100 a total of 102,276 and 7,403 single particles and for ERM-FD304 a total of 60,772 and 5,141 single particles are analyzed at magnifications of 18,500 and 68,000 times, respectively (Table 1).

Figure 5 shows the individual components of the uncertainty of TEM measurement of the mode of ECD of ERM-FD100 and ERM-FD304. Table 2 and Appendix 7 indicate that for all seven size measurands and for both ERM-FD100 and ERM-FD304, the intra-laboratory precision (the combined repeatability and intermediate precision) is better at a magnification of 18,500 times than at a magnification of 68,000 times.

Fig. 5
figure 5

Uncertainties of the TEM measurement of the mode of the ECD of ERM-FD100 (a) and of ERM-FD304 (b) at magnification of 18,500 (black) and 68,000 times (white)

As illustrated in Fig. 3, there is a linear relationship between the log-transformed relative laboratory uncertainty of the mean mode, and the log-transformed of the number of measured particles. This linear relationship is shown in Eq. 8.

$${\text{Log(}}N ) \, = \, -\alpha { \log }\left( {u\left( {\text{lab}} \right)} \right) \,+ \,{\text{ log(}}\beta )$$
(8)

The experimentally determined values for α and β and the minimal sample size N that is required to have a relative laboratory uncertainty lower than 0.05 are presented in Table 3.

Table 3 Characteristics of the linear regression between the log transform of the number of measured particles and the log transform of the relative laboratory uncertainty

The EC definition is based on the true median size value of the particle size distribution (EC 2011). Therefore, in addition to the calculation of the intra-laboratory precision for the modes, the intra-laboratory precision is also calculated for the median values (Appendix 8 and 9). The intra-laboratory precision uncertainty is combined with the calibration uncertainty to calculate the combined uncertainty. Since, for two-dimensional size, shape, and surface topology measurands, no reference values are available yet, the mean median value with its standard deviation and its combined intra-laboratory precision can be calculated for these measurands, but trueness cannot be assessed. As illustrated in Appendix 10, the combined intra-laboratory precision uncertainties of the measured median values obtained for all 23 measurands showed to be better at a magnification of 18,500 times than at a magnification of 68,000.

Trueness and combined measurement uncertainty

Table 4 shows the uncertainty components of the modes of the ECD size distributions measured by TEM. The trueness uncertainty is the largest uncertainty factor of the TEM measurement of the mean modal ECD; it is larger than the combined intra-laboratory uncertainty obtained when measuring ERM-FD100 and ERM-FD304 (Table 3). The relative combined uncertainties range from 3.5 to 4.5 % for ERM-FD100 and from 2.5 to 4.0 % for ERM-FD304. A better estimation of the trueness uncertainty can be calculated by combining the relative measurement uncertainty with the relative uncertainties from the two CRMs. The relative measurement uncertainty is then calculated by taking the average relative repeatability uncertainty and average relative intermediate precision uncertainty. As illustrated in Table 5 the combined uncertainty of the TEM measurement of the mode of the ECD is then about 3 %.

Table 4 The absolute standard and expanded measurement uncertainties (nm) of modal ECD measurands obtained for ERM-FD100 and ERM-FD304 at magnifications of 18,500 and 68,000 times
Table 5 Relative uncertainties (in %) of the TEM measurement and trueness estimation of the mode of the ECD at magnifications of 18,500 and 68,000 times

The absolute difference between the mean modal ECD-certified values and the measured modal ECD result is smaller than the expanded uncertainty for ERM-FD100 and ERM-FD304 for both 18,500 and 68,000 times magnifications (Table 4). The modes of the diameter min and Feret min are not significantly different from the certified ECD value of ERM-FD100 and diameter mean and Feret mean are not significantly different from the certified ECD value of ERM-FD304 (data not shown). Pair-wise comparison within our experiments shows that the modes of Feret min, ECD, Feret mean, and Feret max distributions are increasing and significantly different. The modes of diameter min, ECD, diameter mean, and diameter max distributions are equally increasing and significantly different. The respective minimum, median, and maximum values are similar for all Feret and diameter measurands. The widths of the Feret max and diameter max distributions are significantly larger than the widths of the distributions of the other size measurands.

Discussion

Transmission electron microscopy is considered a key method for NM characterization because of its high resolution (Linsinger et al. 2012). In this paper, TEM analysis of colloidal silica CRMs is used to quantify the accuracy and precision of TEM size and shape measurements of near-monodisperse, near-spherical silica nanoparticles. An intra-laboratory validation study is reported for the TEM measurement of the modes of the ECD, diameter, and Feret single primary particle sizes and for the median values of 23 primary particle measurands, describing the size, shape, and surface properties of the colloidal silica materials.

The trueness uncertainty is the largest factor in the uncertainty budget of the measurement of the modal ECD of the silica materials. This trueness uncertainty consists mainly of the reported standard uncertainty of the certified value, which is 6–8 times larger than the measured within-lab standard uncertainty of the mean values of the modes. Consequently, when TEM results need to be compared between different laboratories, e.g. in the context of mutual control of material suppliers, customers, and regulators, or in the context of characterizing reference materials, insisting on the standardization of the TEM methodology of the contributing laboratories (Roebben et al. 2013) is more effective than increasing the number of analyzed particles and repetitions in each laboratory.

The sample preparation, sampling of the TEM grid surface, image analysis, and the separation of single and agglomerated primary particles are important elements of standardized, quantitative TEM analyses. The ISO standards for TEM analysis of nanoparticles (ISO 13322-1 2004; ISO 9276-6 2008; ISO 9276-2 2001) do not always cover these elements in detail, mainly because they are very material-dependent. In this work, the uncertainties associated with these elements are examined.

Sample preparation

The silica particles of the investigated CRMs are negatively charged at physiological pH (Braun et al. 2011b; Franks et al. 2012). Hence, attachment and an even distribution on the EM grids could be assured by introducing positive charges on the grid surface using Alcian blue pretreatment. The systematic selection procedure for unbiased random image collection is applied to take into account the variation due to selection of the micrographs. This variation is included in the repeatability and intermediate precision (Russ 2011).

Automation of image analysis

Automation of the image analysis process allows measuring multiple and arithmetically complex measurands, described in De Temmerman et al. (2012), on ten thousands of detected particles. This automation reduces the operator-induced bias, avoids the tedious repeated task of manual measurement, and assures a statistically relevant number of measurements (Pyrz and Buttrey 2008).

The separation of isolated and agglomerated primary particles strongly influences the precision of the measurement of all measurands. For agglomerated nanoparticles, (Russ 2011) and (Pons et al. 1999) suggest a separator filter-based approach. This approach allows including primary particles in agglomerates but it can also introduce important distortions in the shape and size of the detected particles. Since already more than 70 % of the measured particles of the materials tested in this study are isolated primary particles, this population is supposed to be representative for the large majority of primary particles in the sample. The applied linear discriminant analysis approach proves to be effective in objectively selecting the best measurand for omitting the agglomerated particles. Only one to five percent of the agglomerates are defined as primary particles. From the non-size-related measurands, the sphericity measurand is shown to be the most efficient measurand separating agglomerated primary particles from isolated primary particles.

Since the CRM samples contain much more elongated than circular agglomerates, this sphericity-based separation is prefered over the shape factor-based separation, suggested by NIST (Kaiser and Watters 2007). The latter efficiently removes circular agglomerates but it tends to fail in removing small elongated agglomerates. These two approaches can be combined and the biparametric separation can be better than the uniparametric separation (data not shown). In this work, it is not applied because it requires complex iterative data processing.

Choice of magnification

The selection of an appropriate magnification and the limitation of image processing steps by applying an N × N filter minimizes the repeatability and intermediate precision uncertainties (Pyrz and Buttrey 2008). At the lower magnification (18,500 times), the intra-laboratory uncertainty is lowest even though the individual particles are less accurately measured (Pyrz and Buttrey 2008). Indeed, more particles are detected per micrograph and the selection bias is lower than at the higher magnification of 68,000 times. At the magnification of 18,500 times, the lower intra-laboratory uncertainty is, however, compensated by the higher calibration uncertainty resulting in similar combined intra-laboratory uncertainties of the analyses at magnifications of 18,500 and 68,000 times. At the lower magnification small surface features could be underestimated. This should be taken into account for complex nanomaterials. However, in the examined silica CRM qualitative TEM analyses at high magnifications did not suggest the presence of such surface features.

The cross-grating calibration method used for magnifications up to 18,500 times is less effective than the image shift calibration method used for higher magnifications. Application of the latter method on the magnification of 18,500 times is expected to lower the calibration uncertainty systematically with 1.1 %, but requires adaptation of the applied calibration software (FEI 2012). Since the uncertainty associated with the calibration procedure is added as a Type B uncertainty and is not covered by the intra-laboratory uncertainty, such improvements in the calibration procedure can be included in the method validation dossier without repeating the validation experiment (ISO/IEC GUIDE 98-3 2008).

Number of particles to be counted

Our experimental data show an approximately linear relationship between the log-transformed intra-laboratory precision uncertainty and the log-transformed number of measured particles, for small numbers of particles (<200). This is in agreement with ISO (ISO 13322-1 2004) and the simulations of (Masuda and Gotoh 1999) who postulate that there is a linear relationship between the log-transformed number of measured particles and the log-transformed uncertainty. For larger particle numbers, this intra-laboratory precision uncertainty becomes independent of the number of measured particles and is mainly determined by a particle number-independent part of the repeatability and intermediate precision uncertainties.

To achieve a relative laboratory uncertainty of 5 % at a confidence level of 95 % on the estimation of the mode, between 50 and 200 particles have to be measured. These numbers are in the same order of magnitude as the numbers calculated following the method of (Masuda and Gotoh 1999) proposed by ISO 13322-1 (2004) when assuming that the size is log-normally distributed.

Data binning and fitting of the resulting distributions

The data binning procedure (weighing the number of non-empty bins to the number of measurements in the largest bin), followed by log-normal fitting allows precise estimation of the mode. This way, the uncertainty of the measurement of the mode (bin width) and the number of particles supporting the measurement of the mode (bin height) are better balanced, certainly for non-normal distributions, than with the Freedman–Diaconis rule, the Scott’s rule, and the Sturges rule, designed for normal distributed data (Freedman and Diaconis 1981; Scott 2009, 2010).

Log-normal fitting to determine the mode decreases the repeatability and intermediate precision uncertainties of the mean modal ECD more than the estimation of the modal ECD based on the selection of the bin containing most particles (data not shown).

Comparison of different measurands characterizing the particle distributions

For the measured silica materials (near-monodisperse, near-spherical, and non-agglomerated) the mean modal sizes and mean median sizes and their corresponding uncertainties are comparable and can be used in function of the measurement purpose. In the scope of the EC nanomaterial definition the most relevant measure is the Feret min, an estimate of the minimal size in one dimension. A material qualifies as a NM when the median Feret min dimension is smaller than 100 nm (Linsinger et al. 2012; EC 2011). For comparison with literature data (Braun et al. 2011b; Franks et al. 2012) and for validation purposes, the mean modal values are applied as more robust estimators for size, shape, and surface topology measures of materials such as the colloidal silica materials studied here. The modal values are less sensitive to outliers in the dataset than the median values (Braun et al. 2011b; Franks et al. 2012).

The semi-automatic analysis allows accessing multiple measurands and selecting the optimal measurand in function of a specific material or measurement purpose. The shape measurements confirmed that the CRMs are colloids of near-spherical nanoparticles. The measured modal ECD value is similar to the certified ECD values. The Feret and diameter sizes are significantly different from the ECD values of the CRMs such that a specific certification of these measurands appears necessary (Linsinger et al. 2012). The effects of TEM sample preparation, by the combined action of drying and high vacuum, can supposed to be low (Linsinger et al. 2012) for the examined materials since a good agreement between the ECD size measured by TEM and the equivalent spherical diameter size measured by other techniques, such as centrifugal liquid sedimentation and even dynamic light scattering, is observed (Braun et al. 2011b; Franks et al. 2012).

Validity range

The measurement uncertainties are determined for two silica NM and, since the particle detection is based on mass and thickness contrast, similar or better results are to be expected for similar particles, composed of heavier elements that can be more easily distinguished from the background. Earlier, this method allowed detecting and measuring colloidal silver (Klein et al. 2011) and gold NM (Van Doren et al. 2011).

Sensu stricto, the method is validated on near-monodisperse nanoparticles with a modal size in the range of 18–30 nm. In practice, the quantification limit of the applied microscope and camera configuration allows to detect and accurately measure particles with a diameter ranging from 10 to 400 pixels.

The determined measurement uncertainties are valid for near-spherical nanoparticles only. For nanoplates and nanofibres, like nanotubes, nanorods, and nanowires (ISO/TS 27687 2008), the size measurements depend on the orientation of the particles on the grid and, therefore, a separate validation is necessary.

Conclusions

This paper describes the validation of an in-house TEM method for the determination of the modal and median particle size, shape, and surface topology of near-monodisperse distributions of near-spherical colloidal silica primary particles. Uncertainty budgets for the TEM method are set up using data obtained on two CRMs. A high level of automation of the calibration, image acquisition, image analysis, and data analysis is obtained. This method proves to give precise and non-biased results for the modal particle ECD measurements of near-monodisperse populations of near-spherical nanoparticles in the size range between 18 and 30 nm. The expanded uncertainties on the modal ECDs are about 3 % and are mainly determined by the inter-laboratory component of the combined uncertainty of the reference materials. The expanded uncertainty can be applied to other colloidal near-spherical NM with another composition on the condition that the primary particles can be distinguished from the background, and have a near-monodisperse size distribution. The presented sample preparation, calibration, image acquisition, image analysis, and data analysis can serve as a guideline to set up further validation experiments for TEM analyses of these and other NM.