Introduction

Clay-containing materials are widely used for different purposes — from daily utensils, building materials and artistic applications to the production of cosmetics and pharmaceuticals. Knowledge of the quantitative chemical composition of clay-based materials is valuable in many fields, such as cultural heritage and archaeology [1,2,3,4,5,6,7], geology [8,9,10,11,12,13], engineering [14], pharmaceutics [15] and cosmetics [14].

This study focuses on the quantitative investigation of clay-containing materials from archaeological, cultural heritage and art objects (such as pottery, technical ceramics, sculptures and construction materials). Analysis of these clay-based objects can provide valuable information about preparation technologies from the past, the geographical, including the geological, origin of raw materials, and quality of the clay, with broader applications for authenticity studies. In industrial settings, quantitative data can be used for estimating the quality and purity of objects.

Clays and clay minerals are widely distributed and can be found in soils, natural sediments and rocks [8, 16, 17]. Natural clay contains fine-grained (i.e. < 2.0 μm) clay minerals (kaolinite, illites, smectite, chlorite, etc.), including other finely divided non-clay compounds like quartz, feldspars and also metal oxides/hydroxides [8, 18].

Clay is one of the earliest raw materials used in manufacturing ceramics, pottery and building materials [19]. Ceramics are made by mixing clay with water and various temper additives, shaped to the desired form, dried and lastly fired at high temperatures (i.e. 900 to 1600 °C). The same general process is also utilized for making pottery and bricks. The main difference is the type of clay material used and tempers added, which largely depend on manufacturing traditions and availability of different materials in the surrounding sediments, and the temperature reached during firing (usually below 1000 °C in historical samples). To improve the quality of ceramic vessels, tempers such as crushed rocks, stones, shells, different organic matter and grog (ground potsherds) are also added [4, 20,21,22]. Sand is combined with clay in some old brick preparations to reduce shrinkage. During firing, the clay materials go through significant mineralogical changes. The main mineral assemblages formed after firing depend on the original clay mineral type, the presence of other minerals, firing temperature and environment (i.e. reducing or oxidizing atmosphere) (see in the Supplementary information (SI) Fig. S1) [23,24,25,26,27,28,29].

The clay-containing materials or sediments are very complex mixtures, and investigation of their composition is challenging. For determining clay’s quantitative mineralogical composition, X-ray diffraction (XRD) is the de facto standard method due to the minerals’ regular and repeating crystal structure [30, 31]. Even though XRD is widely applied, it requires a rather large sample amount. Depending on sample nature, the minimum amounts for quantitative analysis, based on our experience, start from 0.5 to 1 g. It is possible to measure smaller sample quantities, but in both cases, the homogeneity and representativity of the sample must be ensured. This is problematic for archaeological and cultural heritage materials where only a minimal sample amount is usually available. Also, XRD has time-consuming sample preparation followed by interpretation of complex diffractograms with specific software that can be handled only by skilled specialists [32]. Furthermore, XRD cannot identify the amorphous phase, which is often present in natural samples as organic material or in ceramics as glass phase due to vitrification.

Because of these limitations in the XRD, other approaches for quantitative mineralogical analysis are sought. Fourier transform infrared (FT-IR) spectroscopy with attenuated total reflection (ATR) is a widely used technique for (mostly qualitative, less quantitative) analysis of cultural heritage and archaeological materials [33,34,35,36,37,38,39,40]. ATR-FT-IR spectroscopy (ATR-FT-IR) is a contact method that enables analysis of different types and states of samples (e.g. liquids, pastes, polymer films, fibres, solids and also powders) [41,42,43,44]. ATR-FT-IR has many advantages for clay analysis: it is a rapid, simple and relatively inexpensive technique that enables analysis with little or no sample preparation. It can work with very small samples (down to a few mg) and also non-destructively directly on the object. Furthermore, FT-IR offers highly characteristic information about clay sample composition and its mineral structure [45, 46]. This way it enables assigning samples to mineral families. More specifically, it is possible to determine isomorphic substitution and to discriminate between different forms of structural water and/or constitutional hydroxyl groups [45]. Compared to XRD, ATR-FT-IR can work with amorphous (non-crystalline), semi-crystalline and crystalline components.

Clays are complex mixtures, and in the ATR-FT-IR spectrum, the bands of some of their silicate minerals usually overlap strongly, thus making quantitative analysis challenging [47]. In this case, chemometric tools can be used. Partial least squares (PLS) is able to quantify a large number of components, whose bands can overlap severely in IR spectra, and spectral baselines can be variable, as long as there are at least minor differences in the ATR-FT-IR spectra of the components [48]. PLS is also good for being tolerant to noise in IR spectra, zero concentrations of components in some samples and some missing data in the model [49, 50]. So, coupled with PLS or similar chemometric methods, ATR-FT-IR is a useful technique for quantifying components from mixtures if a sufficient number of suitable calibration samples is available. Some quantitative studies of minerals from rocks (including shales) and soils have been published using the PLS regression method with ATR-FT-IR [42, 51,52,53,54]. However, mostly these reports are geology-related. Often, PLS combined with diffuse reflectance infrared Fourier transform spectroscopy (DRIFTS) has been used. For quantitative analysis, DRIFTS, compared to ATR-FT-IR, usually requires a somewhat larger sample amount (approx. 5–20 mg) and sample pre-treatment (e.g. preparation of KBr pellets) for more accurate results [32, 55,56,57]. Furthermore, for these investigations, the mid-IR region (4000–400 cm-1) has commonly been used. At the same time, clay components have characteristic absorption bands also in the far-IR region (below 400 cm-1), which are beneficial for the quantitative analysis [36, 41, 42]. To the best of our knowledge, there are no previous ATR-FT-IR-PLS–based complex quantitative investigations of main clay components in archaeological and cultural heritage samples without specific sample preparation and using an expanded wavenumber range (4000–225 cm-1).

In this study, a quantitative method for the determination of the mineralogical composition of clays in different archaeological and cultural heritage samples using ATR-FT-IR (in the mid-IR and far-IR regions) with PLS quantification is presented. The creation of the ATR-FT-IR-PLS model is described step by step. Altogether, 222 calibration and validation standards covering natural clay sources and various samples (including fired samples in different temperatures) from archaeological/cultural heritage artefacts are used (compositions determined with XRD). This is the largest calibration set used for creating an ATR-FT-IR-PLS method for clay minerals. The developed ATR-FT-IR-PLS method was applied to the 11 case-study samples to evaluate the efficiency and accuracy of this approach.

Materials and methods

Reference standards for classification and quantitative analysis

Clays are complex mixtures that can consist of a number of different components. For the quantitative analysis, it is impossible to find commercial reference standards (reference samples) with varying concentrations, where all the clay components have been characterised on a quantitative basis. In this work, we used real-life archaeological, cultural heritage, geological and other clay materials as calibration and validation standards for building the PLS model.

For the classification method, 28 individual monomineralic samples (standards) were used. Minerals for the classification were obtained from the collection of the University of Tartu Department of Geology.

For creating the quantitative PLS model, initially, 236 reference samples covering natural clay sources and various archaeological/cultural heritage objects were considered as reference standards. For the quantitative analysis, reference standards of different types, ages and origin were selected. The initial set contains 127 geological samples (natural sediments, soils and moraines) [9,10,11]; 83 archaeological and cultural heritage samples (pottery and ceramic fragments, bricks); 11 commercial plasters, pigments, clays, including 15 clay samples from a private collection. SI Table S1 lists all the different reference standards and their origin. In the process of creating the PLS model for quantification of clay components, out of the 236 initial reference standards, 14 were excluded (see section “Creating a PLS model for quantitative analysis of clay components” for the reasons). The remaining 222 were used as calibration and validation standards for creating the final PLS model (more described in section “Creating a PLS model for quantitative analysis of clay components”).

To obtain homogenous powders for XRD and ATR-FT-IR analysis, all the reference standards (amount ranged from 0.5 to more than 1g) were ground using ball mill (standards of geological origin) or agate mortar and pestle (standards of archaeological and cultural heritage origin).

Case-study samples

For the practical application, different archaeological and cultural heritage objects with variable clay components were analysed. Table 1 presents information about the analysed objects and the sample pieces taken from them. The photos of the artefacts and their samples are presented in the SI in Tables S2S5 and Figs. S2 and S3.

Table. 1 Information on the analysed case-study samples

For the ATR-FT-IR analysis from the archaeological objects (Egyptian clay vessel and potsherds from Narva Joaorg), it was possible to use only very small sample pieces (a few mg), and grinding was not carried out. Only samples from Tartu Cathedral and Tartu St John’s church were large enough (about 1 to 2.8 g) to be ground using agate mortar and pestle for obtaining a homogeneous mixture. For the comparison for Tartu Cathedral and Tartu St John’s Church samples (samples 7 to 11), XRD analysis was also carried out.

Measurements and processing of ATR-FT-IR spectra

All the ATR-FT-IR spectra were recorded using Thermo Scientific Nicolet 6700 FT-IR spectrometer with Smart Orbit diamond micro-ATR accessory. The FT-IR spectrometer has DLaTGS detector, CsI beamsplitter and Vectra Aluminium interferometer. The spectrometer is protected from atmospheric moisture by the constant flow of dry air.

For recording of an ATR-FT-IR spectrum, the powdered reference standard or piece of case-study sample was placed on the ATR crystal and pressed against the ATR crystal by a pressure applicator. The active sampling area of the micro-ATR crystal has a diameter of 1.5 mm. This small area is suitable for measuring powders as it enables applying high pressure for obtaining sufficient contact between the sample and the crystal, which is of key importance obtaining qualitative ATR-FT-IR spectra of powdered samples [44].

The FT-IR operating parameters were similar to the ones used earlier [36]: the resolution of 4 cm-1, number of scans were 128 and 256, wavenumber range at 4000–225 cm-1, zero filling factor was 0 and apodization window was Happ-Genzel. Thermo Electron’s OMNIC 9 software was used to record and process the IR spectra.

To obtain homogeneous results, for every individual monomineralic samples 2 to 4, clay-related reference samples 2 to 4 and case study samples 1 to 3 ATR-FT-IR spectra were recorded. After that, IR spectra were averaged, and from the monomineralic samples (28), clay-related reference samples (236) and case-study samples (11), statistical average ATR-FT-IR spectra were obtained.

For all the recorded ATR-FT-IR spectra, atmospheric suppression correction (removes CO2 and humidity absorptions from spectra) was applied (for that Thermo Fisher Scientific Inc. OMNIC Specta 2.0 was used). For some noisier IR spectra, the smoothing function of the OMNIC software (with the Savitzky-Golay algorithm) was used.

Analysis with X-ray diffraction (XRD)

XRD was used as the reference method for assigning the compositions of the reference standards, as it is the most reliable tool available for the quantitative determination of the mineralogical composition of materials. The XRD results of the mineralogical compositions of the reference standards used for calibration of the quantitative PLS model are shown in the SI in Table S6.

All the reference standards were measured in a similar way. Randomly oriented powdered standards (mostly over 1g) were measured on a Bruker D8 diffractometer using CuKα radiation and LynxEye positive sensitive detector in 3–70 2θ region. The quantitative mineral composition of the standards was interpreted and modelled using a Rietveld algorithm-based code Siroquant – 3 [59].

In this investigation, the XRD data of different geological samples that were previously published were also used to create the PLS model. XRD data of Estonian geological samples from Padu-Aru, Varja, Nudi and Lohu were obtained, together with XRD information on the crystallographic composition, from Liivamägi et al. (2014) [11] and Liivamägi et al. (2015) [10]. Some samples of African origin were also included, and their XRD data were obtained from Somelar et al. (2018) [9]. The rest of the reference standards were measured as part of this work. SI Table S1 lists all the reference standards.

Classification and quantitative analysis of clay components

Quantitative analysis of clay components was carried out using the PLS method. Classification was performed using discriminant analysis (DA) that is based on principal component analysis (PCA). For these chemometric techniques, Thermo Scientific TQ Analyst™ Professional Edition 9.0 software was used.

For creating DA classification and PLS models, ATR-FT-IR spectra of different reference standards were used. Even though all the reference standards were ground to homogenous powders and the measurements of the ATR-FT-IR spectra were made in a similar way, it is still difficult or impossible to ensure that the entry depth of beam is the same for every standard. In ATR-FT-IR, the depth of penetration of the IR radiation, and therefore the effective path length, depends on refractive index of standard sample, contact between crystal and sample, wavelength and also temperature [60]. So, in these spectra, effective path length in the samples can be different, but it is unknown how much it varies. To compensate for these differences, standard normal variate (SNV) correction was used for PLS and discriminant analysis. This correction algorithm improves the results if there is an interference that affects the signal proportionally to its magnitude (multiplicative influence). This can be due to the scattering of radiation or some other effect that is proportional to path length [48]. The SNV compensates for variations in sample thickness that can be due to particle size and scattering and also normalizes IR spectra to minimize the effects of scaling or offsets. The SNV correction eliminates the effects of scattering by normalizing the IR spectra of the reference standards individually so that chemical differences can be detected [48]. SNV is more often used for near-infrared (NIR) spectral data pre-processing before chemometric analysis [61]. However, our previous work [62] demonstrates that SNV can also be successfully used in the mid-IR region, and this work expands its usability in the far-IR region.

Creating DA classification model for individual minerals

DA classification enables estimating the compound classes based on similarities of minerals’ spectral features. This classification method compares the IR spectrum of an analysed unknown sample with the spectra of classes of known standard materials.

For the DA classification, 28 averaged ATR-FT-IR spectra of individual monomineralic samples were inserted into the TQ Analyst software, and wavenumber ranges of 3720–2900 cm-1 and 1810–230 cm-1 were selected.

Creating a PLS model for quantification of clay components

Initially, 236 averaged ATR-FT-IR spectra of clay-related reference standards with known compositions were used for creating the PLS model. The contents of the mineral components (g/100g) of the reference standards were found using XRD analysis, and these were inserted into the TQ Analyst software alongside recorded ATR-FT-IR spectra.

Based on the appearance of the absorption bands of the clay components (see Fig. 1, and in the SI Fig. S4) for the PLS analysis, the following wavenumber ranges were used:

  • 3720–3000 cm-1 (kaolinite, halloysite, chlorite, illite-smectite-mica and hydroxides have characteristic absorption bands in that range).

  • 1710–275 cm-1 (all the clay components, including carbonates and oxides have characteristic bands in this range).

Fig. 1
figure 1

Representative ATR-FT-IR spectra of individual minerals from each class

The software was allowed to choose the required number of factors, ranging from 8 to 10 for different minerals.

The PLS method was evaluated using the following parameters: root mean square error of calibration (RMSEC), root mean square error of prediction (RMSEP), squared correlation coefficient (R2), root mean square error of cross-validation (RMSECV) and modified Willmott performance index [63].

RMSEC, RMSEP and RMSECV describe the accuracy of the method, which means how similar the concentration found using the developed method is to the concentration found using the referents method and is found based on the following equation (1):

$$ RMSE=\sqrt{\left(\frac{1}{n}{\sum}_{i=1}^n{\left({\hat{y}}_i-{y}_i\right)}^2\right)\ } $$
(1)

In this equation, n stands for the number of either calibration standards (by RMSEC), validation standards (RMSEP) or an overall number of standards (RMSECV) of the method. The i represents the number of a reference standard, and \( {\hat{y}}_i \) the best estimate of the unknown value, which in this case is the concentration of the respective component in standard i found by XRD analysis. The yi is the concentration of the reference standard i found using the PLS method. In the case of RMSECV, the yi of every reference standard is found using all remaining standards as calibration standards. R2 describes the correlation between measured signal and component concentrations (found using XRD). TQ Analyst calculates R2 for calibration as well as validation standards.

The performance index shows the PLS method’s ability to predict the component concentrations in the validation standards. Performance index enables easy comparison between different methods [63].

For monitoring the correlation between RMSEC, RMSEP and the number of factors, the PRESS function was used. Also, TQ Analyst Spectrum Outlier function for spotting the deviating spectra was used. The function finds the most different data point and evaluates the significance of deviations using the Chauvenet’s test.

Results and discussion

Classification of the individual monomineralic samples

The first step in the quantitative method development was to evaluate ATR-FT-IR spectra of individual minerals commonly present in clays and observe how they can be differentiated and classified based on their spectral features. For that, DA classification was used. Based on the similarities of structures and observation of ATR-FT-IR spectra, initially, 28 individual monomineralic samples were grouped into seven classes (see Table 2). These classes were thereafter used for creating the DA classification model. ATR-FT-IR spectra of minerals from each class are presented in Fig. 1, and their complete interpretation can be found in the SI in Table S7.

Table. 2 The mineral classes used for the discriminant analysis classification

The 3D PCA graph with the classification results of the individual minerals is presented in Fig. 2. Altogether, 10 principal components (PCs) were obtained for discriminant analysis, and the variances (or variabilities) described by the first three principal components were PC1 36.1%, PC2 18.6% and PC3 11.9%, leading to the cumulative variance of only 66.6%. However, the cumulative variability described by all 10 PCs is 95.3 %, which is sufficient for our purposes.

Fig. 2
figure 2

Classification results of the individual minerals presented by the three principal components

The PCA graph in Fig. 2 shows that silica varieties, feldspars, carbonates and iron-oxyhydroxides are distributed into distinctly separate groups. However, the clay minerals, micas and amphiboles are grouped as one group. The structure of amphiboles compared to micas and clay minerals is slightly different. Amphiboles are inosilicates with a double-chain structure, and micas and clay minerals are phyllosilicates, but both groups contain a hydroxyl (OH) group.

Ten PCs were involved in the DA classification method. The results indicate that the minerals under question are different enough, and there are characteristic vibrations in the ATR-FT-IR spectra that the PLS algorithm can use for quantification. Also, these results suggest that for creating a reliable quantitative method, minerals grouped together (feldspars, clay minerals and micas, carbonates, etc.) should be handled as a group also in the PLS model.

Creating a PLS model for quantitative analysis of clay components

For creating an accurate ATR-FT-IR-PLS method, numerous calibration standards are needed. These must contain all the main components that are expected in the unknown sample, and their concentrations in the reference standards should vary independently. In this work, ATR-FT-IR spectra of 236 clay-related reference standards, with known contents of individual crystalline phases (components) distinguished with XRD analysis, were used. The XRD analysis detected altogether 36 crystalline phases from these standard samples. From those 36, the following 29 minerals (or mixtures of minerals) were explicitly used for creating the PLS model: quartz, cristobalite, K-feldspar, plagioclase, kaolinite, halloysite, illite, smectite, illite-smectite, mica, vermiculite, chlorite, chlorite-smectite, calcite, aragonite, dolomite, siderite, ankerite, hematite, goethite, amphibole, hydroxides (gibbsite, portlandite, brucite), apatite, Ti-oxides (anatase/rutile), pyroxene, Al-silicates (mullite/sillimanite), spinel. These mineral phases were chosen on the basis of two criteria: (1) the phase was detected in the composition of more than three reference standards, and (2) its content in at least one standard was above 1.5%. As a result, garnet, titanite, akermanite, pyrite and weddellite were left out of the model.

Examining the XRD results revealed that even the XRD analysis, which is commonly used for the quantification of a clay composition, is not always able to distinguish all the different components. This happens, for example, when the mineral phase with low content is overlapping with dominant mineral phases or if overlapping minerals are with similar structure and composition like micas, illite and illite-smectite.

Before creating the PLS model, the contents of the above-mentioned 29 individual crystalline phases of the reference standards were grouped based on the XRD results: minerals with similar structures and identical or similar chemical compositions were grouped into one group. For the grouping of the minerals, the results obtained by comparison of the ATR-FT-IR spectra of individual minerals, including their classification (see section “Classification of the individual monomineralic samples”) and also information obtained from the literature [18, 64], were taken into consideration. The minerals in the reference standards were divided into the following groups:

  1. 1.

    Silica varieties: quartz and cristobalite (SiO2 polymorphs). They have identical chemical compositions but different crystal structures.

  2. 2.

    Feldspars: plagioclase (Ca, NaAlSi2O8) and K-feldspar (KAlSi3O8). Feldspars are a group of tectosilicates (KAlSi3O8 – NaAlSi3O8 – CaAl2Si2O8) with very similar structure, chemical composition and physical properties. Feldspars are the most abundant minerals in Earth crust and are common in igneous, metamorphic and sedimentary rocks.

  3. 3.

    Clay minerals, group 1: kaolinite and halloysite (both [Al2Si2O5(OH)4]). These minerals have a similar chemical composition and crystal structure. The difference is in water content; in the halloysite, the unit layers are separated by a monolayer of water molecules.

  4. 4.

    Clay minerals, group 2: micas, illite, illite-smectite, smectite and vermiculite. The discrimination of those minerals using XRD is difficult due to overlapping peaks in diffractograms. The overlapping is caused by very similar chemical compositions and structures. To distinguish those minerals, the separation and analyses of clay minerals/clay fractions are needed.

  5. 5.

    Clay minerals, group 3: chlorite and chlorite-smectite. Unlike other clay minerals, the interlayer space of chlorite group minerals is filled by the brucite-like layer.

  6. 6.

    Carbonates: calcite (CaCO3), aragonite (CaCO3), dolomite (Ca,Mg(CO)2), siderite (FeCO3) and ankerite (Ca(Fe,Mg,Mn)(CO3)2). These minerals differ in structure, and some of them also in chemical composition.

  7. 7.

    Iron-oxyhydroxides: hematite (Fe2O3) and goethite [FeO(OH)].

The contents of the grouped components in the clays-related reference standards and their recorded ATR-FT-IR spectra were inserted into the TQ Analyst software as calibration or validation standards. The contents of the components not belonging to any of the above groups, such as amphibole, pyroxene, Al-silicates, spinel, apatite, hydroxides and Ti-oxides, were also added into the TQ Analyst software. These minerals were used in the PLS model as support components in order to improve its predictive capability for the main seven groups. With the XRD analysis, these components were not detected in all the standard samples, and the variability of their concentrations in samples was not high enough. For these reasons, it was not expected that the PLS model would reliably evaluate their contents in unknown samples. However, including them in the PLS model will enable quantifying the main 7 component classes in different reference standards more accurately (see Table 3).

Table. 3 Results of the ATR-FT-IR-PLS analysis of the main clay components

The PLS method was developed starting with the above-explained groups of reference standards. These standards occasionally contained also amorphous components that XRD analysis is not able to identify or other components that are not commonly found in natural clays. It was therefore expected that the set of reference standards contained samples whose compositions are not entirely correctly found using XRD or are not typical to most natural clays. To discover such samples, all the 236 reference standards were critically evaluated. At first, the TQ Analyst Spectrum Outlier diagnostic was used, which indicated that six reference standards failed the Chauvenet’s test and were considered outliers. Secondly, it was observed how well the standard sample data points were distributed in the calculated vs actual plots and the % difference plots (see in the SI Fig. S5). Calculated vs actual plot compares the concentration value calculated with the PLS model to the reference concentration value (found using XRD) for each component in the reference standards. The % difference plot shows the differences between the calculated and the actual concentration values relative to the actual values. Using these plots, we were able to detect eight outliers that were farther from the others in the calibration plots.

Thus, altogether 14 outliers were detected. All of them were somewhat different in their composition from the other reference standards (see in the SI Table S6). Observing the XRD results of these standards, it is not surprising that these were ruled out. From the 14 outlier standard samples, 8 (3, 12, 13, 14, 98, 183, 218, 230) contain essentially just one main component with very high concentration (between 65 and 90 g/100g), and the other 1 to 3 components have low concentrations. Three reference standards (136, 142, 144) contain a higher concentration of Al-silicates (25 to 45 g/100g) that is however in the model as a support component and is not present in most reference standards, also a quite high content of quartz (40 to 59 g/100g) and 1 or 2 components with lower concentrations. The remaining three reference standards (128, 160, 192) were left out because, under closer inspection, doubts arose about their homogeneity and, consequently, whether their IR spectra and XRD results refer to the same compositions. Based on these considerations, the ATR-FT-IR spectra of these 14 reference standards were excluded from the PLS model.

All in all, out of the initially considered 236 reference standards, 14 were excluded, as described above. From the remaining 222 reference standards, 202 were defined as calibration standards (training set) (see in the SI Table S6) and 20 were defined as validation standards (validation set). The main criteria for choosing validation standards were the presence and content ranges of the different components in the clay reference standards. Whenever possible, reference standards with extreme concentrations (i.e. standards in which the concentration of one of the components was higher or lower than in all remaining reference standards) were avoided. Also, each of the seven mineral groups of reference standards were observed separately to make sure that enough calibration and validation standards would be available for the quantification of the respective group of minerals. By the end of the process, 20 were defined as validation standards (see Table 4).

Table. 4 Comparison of actual (XRD values) and calculated (ATR-FT-IR with PLS) validation results using developed PLS method

The resulting PLS model can quantify the main seven classes of minerals in clay-related samples. Table 3 presents R2 for calibration and validation standards and RMSEC and RMSEP values, including performance index and number of factors results for each main clay components. The overall performance index of the PLS model (7 classes of main minerals including support components) is 82.6%. The average performance index of 7 main classes of minerals in clays is 84.8%.

For the seven groups of main components, the calibration R2 varied between 0.90 and 0.99 and validation R2 between 0.94 and 0.99. Table 3 indicates that RMSEC values of main components varied between 1.7 and 6.1 g/100g and RMSEP values between 0.9 and 5.1 g/100g. One would expect that RMSEP values are higher than RMSEC values. The reason for the opposite situation is that (1) the selection criteria for the validation standards in terms of composition were much more stringent, while for the calibration sets, much more relaxed selection criteria were applied (see above) and (2) the fact that in almost all calibration standards the content of at least some components was zero.

The fact that some found component concentrations are not within limits defined by RMSEP is not unexpected, since the probability of that is about 68% (assuming approximately normal distribution of deviations), and in addition to the uncertainty of ATR-FT-IR-PLS results, the uncertainties of XRD reference values also have an impact. The relative standard deviation of XRD Rietveld-based phase analysis of diffractograms using Siroquant-3 has been estimated 5 (for higher concentrations) to 10% (for lower concentrations), based on the investigation of mixtures of pure phases.

The performance and applicability of the developed PLS model can be evaluated by the results of validation standards. Table 4 shows how well a calibrated quantitative PLS method performs when quantifying the main clay components of the 20 validation standards. The performance index indicates how accurately the PLS method can quantify validation standards (see Table 3). In Table 4, the results of the validation standards show that carbonates have the most accurate results where differences between actual and calculated values vary between −1.7 and 0.7 g/100g. Also, the performance index of carbonates is the highest (96.4%) compared with the values of other components (see Table 3). This result is expected because the IR spectra of carbonates (see IR spectra in Fig. 1 and their complete interpretation in the SI in Table S7) are very different from other clay components. Comparing the results of three groups of clay minerals, group 1 (contains kaolinite and halloysite) has a better performance index (86.8%) than group 2 (contains summarized values of smectite, illite, illite-smectite, micas, vermiculite) and group 3 (chlorite and chlorite-smectite) which have performance indices 79.7% and 83.5%, respectively. The results presented in Table 4 indicate that the PLS method has quantified group 1 clay minerals better than group 2 clay minerals. As Fig. 1 shows, the ATR-FT-IR spectrum of kaolinite (a mineral in group 1) is different from that of illite and muscovite (clay minerals in group 2). Kaolinite has characteristic bands in the mid-IR ranges of 3686–3619 cm-1 and 1115–641 cm-1, as well as in the lower wavenumber region at 526–271 cm-1. IR spectra of illite and muscovite have broader and less characteristic bands. Clay minerals in group 2 contain components with very similar structure and chemical composition, and even XRD had difficulties in discriminating between them. Due to the similarities in the IR spectra, their absorption bands overlap extensively, and the PLS model has difficulties differentiating them. However, some uncertainty comes also from the XRD analysis, i.e. from the “actual” compositions. Figure 1 indicates that the IR spectra of chlorite (group 3 clay mineral) are somewhat different (there are characteristic bands in the region of 3700 to 3300 cm-1 and 1000–350 cm-1) from other clay minerals (groups 1 and 2) and the PLS model was able to quantify these components.

Table 4 reveals that some of the quantified components are absent in some of the validation standards. For these standards, ATR-FT-IR-PLS typically finds concentrations close to zero, which are quite reasonable. Importantly, when trying to detect components at zero concentration level and if the results are approximately symmetrically distributed (which is a reasonable assumption), then roughly half of the results should be positive, and half should be negative. Replacing negative results with zeroes (which is sometimes done) would introduce bias to the results. Right now, the average result of the 48 results where a component missing in the sample was quantified as −0.4 g/100g, indicating that positive and negative results essentially cancel and thus the bias is negligible. For this reason also, negative values are listed in the table.

Clays also contain some amount of non-clay minerals, such as quartz, feldspars (K-feldspar, plagioclase) and iron-oxyhydroxides (hematite and goethite). Figure 1 reveals that quartz, microcline (K-feldspar) and hematite all have absorption bands in the range of 1170 to 250 cm-1. This is the same range where all the clay components have absorption bands, causing considerable overlapping. PLS can quantify overlapping minerals as long as there are at least small differences in the spectra [36]. Looking closer, it can be seen that quartz has characteristic bands in the range of 796 to 261cm-1, microcline in the range of 769 to 278 cm-1 and for hematite could be useful bands at 428 and 382 cm-1. Based on their spectral features, PLS was able to adequately quantify these components (see Table 3 and Table 4).

In addition to the validation with the pre-defined validation standards, the leave-one-out cross-validation diagnostic in the TQ Analyst software was used to monitor how well the PLS method performs by quantifying each calibration standard out of 222 as if it were a validation standard (and leaving it out of the calibration set). For the seven groups of main components, the R2 varied between 0.86 and 0.99 and root mean square error of cross-validation (RMSECV) values varied between 2.4 and 7.4 g/100g (see Table 3).

Considering that for the calibration and validation standards real-life archaeological and geological reference standard samples were used, the obtained results are satisfactory to good. Quantitative determination of some components is possible while others are determined at a semiquantitative level.

Application of the developed method for analysis of case-study samples

The developed ATR-FT-IR-PLS method was applied for quantitative analysis of a selection of archaeological and cultural heritage samples (see Table 1) to evaluate its capabilities. All the ATR-FT-IR spectra of case-study samples were inserted into TQ Analyst software, and the PLS model calculated the quantitative contents of these samples. Table 5 presents all the results of the quantitative PLS analysis of 11 real-life samples.

Table. 5 Quantitative contents of clay components in the samples using the ATR-FT-IR-PLS methoda

The results presented in Table 5 demonstrate that the developed ATR-FT-IR-PLS method can quantify a group of minerals in the different kinds of clay samples, while in some cases the results are to be considered semiquantitative. It is instructive to look closer at the total quantified amounts listed in the table. These have been obtained as sums of the contents of the individual components. If all main components have been quantified, then, ideally, the sum should be exactly 100 g/100g. However, all individual amounts have uncertainties. They are quantified here as the RMSEP values, used as estimates of combined standard uncertainties (see the course in ref [65] for full details of uncertainty-related terminology). Consequently, the total quantified amount also has an uncertainty, found (again, at standard uncertainty level) as the square root of sums of squares of individual component RMSEP values (see Table 5). Looking at the values, it is remarkable how close most values are to 100%, demonstrating that in many cases, the approach performs better than the uncertainties imply. The differences of the total quantified amounts from 100 g/100g should be judged keeping in mind that standard uncertainty presents uncertainty at 68% coverage probability level and two times the standard uncertainty has coverage probability of 95%. One can see that all total quantified amounts are within the two times standard uncertainty region. If values are strongly below 100%, then the cause might also be that there is a non-negligible amount of unknown material that was not explicitly modelled and consequently was not quantified.

Because of the nature of the mathematical procedure, the absolute uncertainty estimates of the components’ contents (estimated via the RMSEP values) are constant over the whole content range. This means that the relative uncertainties increase sharply when the content of the respective component decreases. This makes sense— the lower the content of the component, the more disturbing are the spectral imperfections, components not accounted for and the measurement uncertainties of XRD analysis. For this reason, caution should be exercised with low content values. Specifically, if the content of a component is not higher than two times the standard uncertainty then it is not possible to state with confidence that the component is present in the sample. For example, in the majority of samples, it is not possible to state the presence of the first group of clay minerals (i.e. kaolinite).

The following shortly describes the results of each sample from archaeological and cultural heritage objects and the information obtained with the ATR-FT-IR-PLS method (see Table 5).

Quantitative analysis of Egyptian clay vessel with bird mummy

The photos of clay vessel with bird mummy and small analysed sample piece are presented in the SI in Table S2, with ATR-FT-IR spectrum provided in the SI Fig. S6.

The ATR-FT-IR-PLS method enabled quantifying most of the main components of the sample as a total of 82.6±8.3 g/100g. The rest of the content belong to components not explicitly modelled. PLS model quantified the contents of silica varieties (quartz and/or cristobalite), feldspars and group 2 clay minerals (smectite, illite, micas, illite-smectite, vermiculite) all over 20 g/100g. The red colour of the vessel was caused by hematite which was detected with the concentration of (8.4±1.9) g/100g. Based on these results, it is impossible to confirm the presence of kaolinite (clay minerals group 1) in the sample.

The composition of the bird mummy-urn is somewhat different from the rest of the case-study examples, all from Estonian archaeological sites (somewhat lower content of quartz, feldspars and absence of kaolinite) [66]. The results of vessel’s clay quantitative composition seem to correlate nicely with its function — an urn in an Egyptian cemetery context. It was meant as a burial container rather than a vessel for eating and drinking, and hence the firing and making the clay vessel strong and heat- and water-resistant were not a priority.

Quantitative analysis of archaeological samples from Narva Joaorg, Estonia

Using the developed ATR-FT-IR-PLS method, five archaeological pottery fragments from the different occupation phases covering a total of 6 millennia from Narva Joaorg (NE-Estonia) were analysed and compared. See photos of the fragments of archaeological pottery and analysed small samples in the SI in Table S3 and ATR-FT-IR spectra in Fig. S7.

These results are rather significant, especially considering that in the Narva Joaorg area the local clay sources are expected to have been used for pottery manufacture [67]. The compositions (see Table 5) of samples 3 (raw material clay coil found in Mesolithic context), 5 (fragment of textile ceramics) and 6 (Iron Age wheel-thrown specimen) are quite similar. The only significant differences can be observed in the content of quartz (about 36 g/100g) for sample 6 and the higher content of Fe-oxyhydroxides (7.5±1.9 g/100 g) for sample 3. Based on these quantitative results, samples 2 (fragment of the Narva Ware pottery) and 4 (fragment of Corded Ware pottery) differ more from the others. In sample 2, the contents of feldspars are about 52 g/100g, clay minerals 2 is significantly lower (11.2±5.1 g/100g) and Fe-oxyhydroxides (hematite) is quite high (about 10 g/100g). However, sample 4 has considerably higher group 2 clay minerals (smectite, illite, micas, illite-smectite), lower feldspar concentration (about 30.2 g/100g and 31.5 g/100g, respectively) and significantly high Fe-oxyhydroxides content (about 9 g/100g). Both samples 2 and 4 have higher contents of clay minerals 1 (kaolinite), around 7 g/100g, compared to other samples. Based on that, it can be supposed that samples 2 and 4 were fired at a lower temperature, not higher than 550 °C.

The different clay content of Narva Ware sherd (sample 2) is somewhat unexpected as it ought to resemble the raw material clay coil (sample 3). The difference may be due to the inhomogeneity of the analysed samples. However, the difference might also relate to the exploitation of different natural clay sources in the early pottery production phases compared to later periods (samples 3, 5, 6), potentially also different tempers added to the initial clay mixture or indeed potential temporal difference of these two artefacts. The considerably deviating composition of Corded Ware sherd (sample 4) in this context might, however, indicate that this vessel was not locally produced but rather imported. Indeed, the Corded Ware phenomenon as such in Europe has been widely related to human migrations (supported by aDNA studies [68]) and dispersal of several novel cultural traits from subsistence (arrival of domesticated species) to burial customs and artefacts (inc. pottery) [69].

Quantitative analysis of samples from the Tartu Cathedral, Estonia

Table S4 in the SI presents the photos of the analysed brick samples from Tartu Cathedral, and Fig. S8 shows their ATR-FT-IR spectra. For the comparison, XRD analyses for the brick samples were also carried out, and these data are presented in Table S8 in the SI.

The ATR-FTIR-PLS method enabled quantifying some of the main components of the two samples as a total of about (100±8) g/100g. Results of the quantitative analysis indicate that the composition of the bricks originating from the pillar and floor (samples 7 and 8) is expectedly quite similar. For both samples, the PLS model quantified the contents of silica varieties between (44.6±3.6) and (48.2±3.6) g/100g and feldspars between (30.5±4.0) and (35.7±4.1) g/100g. These values (especially quartz) are quite similar also with the XRD quantitative results. However, the ATR-FT-IR-PLS method could not adequately quantify clay minerals of groups 1 and 2 in these samples. When visually observing the powdered samples of the bricks (see in the ESI, Table S8), then sample 8 is more reddish than sample 7. The quantitative results support that PLS model quantified contents of iron-oxyhydroxides (in this case hematite that gives red colour) as (12.3±1.9) g/100g and (9.1±1.9) g/100g, respectively. XRD results also support these results. The results show that despite the overlapping of the bands and some noisiness in the IR spectra in the region of 600–225 cm-1, our ATR-FT-IR-PLS method can also quantify iron-oxyhydroxides (e.g. hematite) with reasonable accuracy.

Based on the results with ATR-FT-IR-PLS and XRD methods, it is possible to conclude that the firing temperatures were below 900–950°C (micas are still present) and the initial source of the studied Tartu Cathedral bricks is probably the same. Since brick kilns already existed during the Medieval Period in Tartu, it is possible that the clay used in manufacturing of the bricks was of local origin [70].

Quantitative analysis of samples from the St. John’s Church in Tartu, Estonia

Table S5 in the SI presents the photos of the analysed samples, and Fig. S9 their ATR-FT-IR spectra. Also, for these samples, XRD analyses were carried out, and obtained data are presented in Table S8 in the SI.

The developed ATR-FT-IR-PLS method enabled quantifying some of the main components of the three samples as a total of between (95.1±8.3) and (101.2±8.3) g/100g. PLS results show that the compositions of samples 9 (terracotta head sculpture) and 11 (brick) are more similar to each other compared to sample 10 (profile stone). Both samples 9 and 11 contain over 50 g/100g silica varieties (quartz), over 32 g/100g feldspars and under 6 g/100g of clay minerals and/or micas. There is a small difference in the composition of Fe-oxyhydroxides (reddish colour indicates the presence of hematite) — sample 9 has the content of (8.1±1.9) g/100g, and sample 11 has (5.6±1.9) g/100g. XRD results also support some of these results, indicating the same material source for those objects. XRD results also show that the ATR-FT-IR-PLS model quantified clay minerals of groups 1 and 2 incorrectly, and in this case, XRD quantified them correctly. However, in sample 10, the ATR-FT-IR-PLS quantified mineral content differs dramatically from XRD results — the contents of silica varieties and also feldspars both are around 36 to 38 g/100g, groups 1 and 2 clay minerals between 3 and 12 g/100g and Fe-oxyhydroxides (hematite) 13.7±1.9 g/100g. The dramatic difference is caused by the presence of an amorphous phase (glass) in sample 10. It is impossible to quantify the amorphous glass directly using the XRD. In this case, the XRD overestimates other mineral phases at the expense of the amorphous phase. The presence of the amorphous phase, as well as the detection of spinel with XRD, indicates that the firing temperature for the profile stone (sample 10) may have reached 900 to 950 °C.

In general, comparing the quantitative results of the samples from St. John’s Church and Tartu Cathedral, we can conclude that these are quite similar, and supposedly the clay and other raw materials could be from the same location and of the same origin.

Conclusions

In this study, a quantitative method for determining different minerals in clay samples using ATR-FTIR spectroscopy in the range of 3720 to 275 cm-1 with PLS data analysis is presented.

For creating the ATR-FT-IR-PLS method, 222 real-life archaeological, geological and commercial clay-related reference standard samples (the concentrations of components were found by XRD analysis) were selected after a careful evaluation for calibration and validation. These reference standards are unique, and this is the first time that these kinds of samples have been used for building the chemometric PLS method for the analysis of clays using FT-IR. As a result, the developed ATR-FT-IR-PLS method can quantify seven main classes of minerals (silica varieties, feldspars, three groups of clay minerals, carbonates, iron-oxyhydroxides) in different sizes and amounts of clay samples. The performance of the developed quantitative method is characterised as follows: the average performance index of 7 main classes of minerals is 84.8 %, R2 of the calibration and validation standards is ˃0.90, RMSEC values of main components varied between 1.7 and 6.1 g/100g and RMSEP values between 0.9 and 5.1 g/100g. We rate these figures of merit as satisfactory to good.

The ATR-FT-IR-PLS method was applied to the 11 (mostly very small) case-study samples from archaeological and cultural heritage objects, covering a large timespan and different geographical origins. The results indicate that the developed method can quantify tiny samples even if only one IR spectrum is available (however, higher uncertainty is expected, often leading to semiquantitative results). In general, the results of all the case-study samples show that the ATR-FT-IR-PLS method quantified quite well the higher contents of minerals like silica varieties (e.g. quartz), feldspars, for some samples also clay minerals 2 (illite, illite-smectite, micas) and surprisingly well also iron-oxyhydroxides. However, the quantified lower content values from the sample should be evaluated critically. Specifically, if the content of a mineral is not higher than two times the standard uncertainty, then it is not possible confidently to say that the component is present in the sample. In order to get adequate results, sample homogeneity is a key aspect. For the ATR-FT-IR-PLS model, the average spectrum from at least three IR spectra of the sample (if the sample size is large enough) is recommended.

Our results show that ATR-FT-IR with the PLS quantification method is a suitable alternative to the XRD if high accuracy is not needed (in some cases results are to be considered semiquantitative) and especially when the available sample amounts are very low. The developed method is quick and simple and does not have time-consuming sample preparation. However, the accuracy and ability to distinguish individual minerals of similar composition and/or structure are clearly lower in the case of ATR-FT-IR-PLS than in the case of XRD.

The developed quantitative ATR-FT-IR-PLS method can improve by adding new calibration standards into the model. Also, this serves as a baseline model for future research on clay components analysis using ATR-FT-IR spectra, particularly relevant for the analysis of heritage objects where small sample quantities are an essential aspect of the analysis.