Introduction

Wood is a complex biological material composed mainly of cellulose microfibrils within a hemicellulose–lignin matrix. Different wood species can have different physical and chemical properties, which affect their cost, application, and production. In addition, many tree species are on the official list of endangered plants, making it necessary to improve control over wood exploitation. Moreover, regardless of the species, woods can have different geographical origins or be submitted to different physical or chemical treatments. However, the visual identification of different woods is a very costly and difficult task, requiring highly qualified personnel. Therefore, the development of instrumental methods for the discrimination of woods is very important for wood trading and for environmental conservation (Carballo-Meilan et al. 2014; Francisco-Fernández et al. 2015; Nisgoski et al. 2017; Cui et al. 2019).

For the discrimination of wood species, several analytical techniques have been employed that use chemometrics for multivariate data analysis, such as mass spectrometry (Zhang et al. 2019), chemiresistor gas sensor array (Kalaw and Sevilla III 2018), laser-induced breakdown spectroscopy (LIBS) (Cui et al. 2019), thermogravimetric analysis (Francisco-Fernández et al. 2012, 2015), molecular fluorescence (Piuri and Scotti 2010; Oliveira et al. 2015), mid-infrared spectroscopy (Carballo-Meilan et al. 2014), and mainly near-infrared spectroscopy (Shou et al. 2014; Park et al. 2017; Kurata 2017; Nisgoski et al. 2017; Ramalho et al. 2018). Other works have also reported wood discrimination based on properties other than the species: based on the wood type and using near-infrared spectroscopy (Hayashi et al. 2003), Raman spectroscopy (Lavine et al. 2001), or thermogravimetric analysis (Francisco-Fernández et al. 2017); based on the geographical origin and using mass spectrometry (Horacek et al. 2009); based on the thermal treatment and using near-infrared spectroscopy (Bächle et al. 2012); and based on the wood preservative and using near-infrared spectroscopy (Bouslamti et al. 2013) or LIBS and X-ray fluorescence spectroscopy (Solo-Gabriele et al. 2004).

Near-infrared reflectance spectroscopy (NIRS) has been widely used for wood discrimination because it is a fast, noninvasive, and nondestructive instrumental technique and requires very few or no sample preparation. For the same reasons, the application of LIBS to wood discrimination has increased in recent years, although very few works have been published yet. Both instrumental techniques typically provide the different but complementary chemical information. While in NIRS, the sample is illuminated by an appropriate polychromatic light source, and the diffusely reflected light is collected and analyzed, containing information on molecular absorbance due to vibrational energy transitions, in LIBS, the sample is hit by a high-power pulsed and focused laser beam, and the resulting plasma radiation is collected and analyzed, containing information on atomic emissions due to electronic energy transitions (Gonzaga et al. 2017; Zhu et al. 2019). Recently, a work has reported on a successful combination of LIBS and NIRS data in the analysis of forage plants (Oliveira et al. 2019).

This work presents a new method for the discrimination of wood species by combining LIBS and NIRS and using chemometrics for data analysis. The method was evaluated for the discrimination of woods from Amburana cearensis, Copaifera lucens, Phyllocarpus riedelii, Cariniana legalis, Bowdichia virgilioides, and Aspidosperma pyricollum. The generated spectroscopic data were analyzed by principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA).

Materials and methods

Instrumentation

For the acquisition of the LIBS spectra, a home-made, compact, and low-cost instrument was employed. It has, among other optical components, a Standa STA-01-8 microchip laser (1053 nm wavelength, 600 µJ pulse energy, 470-ps pulse duration, and 100 Hz pulse repetition rate), a sample holder coupled to a Standa 8MT30-50 translation stage, and a B&W Tek Exemplar LS mini-spectrometer with a classical Czerny–Turner polychromator and a 2048 pixel non-gated, non-intensified, and non-cooled CCD sensor array (200–850 nm spectral range, 1.2 nm resolution). More details about the instrument were given in a previous work (Gonzaga et al. 2015).

For the acquisition of the NIRS spectra, a PerkinElmer Spectrum GX Fourier transform instrument was used, coupled to a Pike FlexIR module for operation in diffuse reflectance mode.

Samples

Forty-two wood samples (different pieces of wood) from six different species were analyzed using both instruments. The samples were randomly taken from different transversal regions of wood trunks, from the core up to near to the shell. A description of the samples is given in Table 1.

Table 1 Description of the wood samples analyzed

Experimental procedures

Prior to the analysis on each instrument, the samples were pretreated by polishing to remove any possible surface contaminations.

For the LIBS analysis, one spectrum was obtained for each sample, using a 200–750 nm spectral range, about 0.38 nm nominal resolution, 100 Hz laser pulse repetition rate, an average of four scans (each scan from a different micro-region of the sample surface, covering a circular region with about 1 cm diameter), and 4 s of integration time per scan (continuous integration of the radiation emitted by 400 plasmas). The translation stage was continuously displaced at 1.0 mm s−1 during the spectra acquisition to avoid plasma extinction (Lopez-Moreno et al. 2005).

For the NIRS analysis, one spectrum was obtained for each sample, using a 1100–2500 nm spectral range, about 1.30 nm nominal resolution, and an average of 16 scans (all scans from the same circular region, with about 1 cm diameter, of the sample surface).

Data pretreatment and chemometrics

First, the data from LIBS (42 samples and 1466 variables) and NIRS (42 samples and 1273 variables) were merged, by placing each NIRS spectrum right after the end of the respective LIBS spectrum for each sample, resulting in a data matrix with 42 rows (samples) and 2739 variables (wavelengths). Thereafter, the data were pretreated by standard normal variate (SNV) and Savitzky–Golay first derivative with smoothing (11 points data window and polynomial order 2) using CAMO The Unscrambler 9.7 software (Savitzky and Golay 1964). Although the pretreatment was carried out after merging the NIRS and LIBS data, the pretreatment procedures were applied to the NIRS and LIBS data separately. Additionally, the resulting NIRS data were also multiplied by 4.3. Then, an exploratory data analysis was carried out by PCA using CAMO The Unscrambler 9.7 software, and wood discrimination was evaluated by PLS-DA using Chemoface 1.61 software. In both cases, the data were additionally pretreated by mean centering. In PLS-DA, nine samples were randomly selected for the external validation, whereas the remaining 33 samples were used for calibration with full cross-validation.

Results and discussion

LIBS and NIRS spectra

Figure 1 shows the raw LIBS and NIRS spectra acquired for all wood samples. As can be seen, the LIBS spectra have thin peaks, which are typical for atomic spectroscopy, whereas the NIRS spectra have wide bands, which are typical for vibrational spectroscopy. As can also be observed, the spectra contain some noise (mainly LIBS), baseline variations, and effects from light scattering (common in the analysis of solid samples). Therefore, several pretreatment methods were tried to correct/minimize these problems in the spectra. The best results (best sample grouping and separation according to the wood species) were obtained using SNV (correction of effects from light scattering) and Savitzky–Golay first derivative with smoothing (correction of baseline variation and minimization of noise) for all spectra and additional multiplication of NIRS data by 4.3 (so that LIBS and NIRS data had similar final maximum intensities). Figure 2 shows all the spectra after the cited pretreatment. Zooming in the spectral regions of 240–300 nm (LIBS spectra) and 1900–2100 nm (NIRS spectra) is shown in Fig. 2b, which presents large intensity variations. Although it was possible to see the grouping of some spectra according to the wood species in Fig. 2b, clearly there was also the overlapping of some spectra from different species, making necessary the use of multivariate data analysis for improving the sample discrimination. The spectra were also additionally pretreated by mean centering, right before the PCA or PLS-DA, which also improved sample grouping and separation.

Fig. 1
figure 1

Raw LIBS (a) and NIRS (b) spectra for all wood samples

Fig. 2
figure 2

Pretreated LIBS (left) and NIRS (right) spectra for all wood samples: a all spectral range (numbers inside the plotting area according to Table 2), b zooming in 240–300 nm (LIBS) and 1900–2100 (NIRS)

Exploratory data analysis

As cited before, the data from LIBS and NIRS were placed together in a single 42 × 2739 matrix, even before the data pretreatment. PCA was applied to the data before and after the cited pretreatment to check for some sample grouping/separation according to the wood species and to evaluate the improvement in data quality resulting from the data pretreatment. In this case, for the PCA with the raw data, the NIRS data were previously multiplied by 136,294, so that LIBS and NIRS data had similar final maximum intensities. Therefore, Fig. 3 shows the PCA score plots in the first two principal components. Some sample groupings related to the wood species can be observed in the score plot resulting from the raw data (Fig. 3a), with no sample groups from a single species fully separated from the other samples. However, the score plot resulting from the pretreated data (Fig. 3b) shows a clear formation of sample groups according to the wood species, separated from each other, although with some minor group overlapping between some group boundaries (for CE-BV, CE-AC, and BV-AP species) and with a wide sample dispersion within some groups (for CL and PR species).

Fig. 3
figure 3

PCA score plots from all raw (a) and pretreated (b) data (explained variances of 89% and 85%, respectively)

For comparison purposes, PCA was also separately applied to the pretreated data from LIBS and NIRS to determine which analytical technique is responsible for the grouping and separation of which wood species. Thus, the PCA score plots in the first two principal components are shown in Fig. 4. As can be seen, the score plot from the LIBS data (Fig. 4a) shows a better sample grouping and separation, according to the wood species (compared to the score plot from the NIRS data in Fig. 4b), but with some group overlapping (mainly for CL-BV and CL-CE). The score plot from the NIRS data, although it has no sample group from a single species fully separated from the other samples, shows a full separation exactly between the sample groups where some group overlapping was observed at the score plot from the LIBS data. In other words, Fig. 4 shows that only a combination between the data from LIBS and NIRS can promote the best sample grouping and separation, according to the wood species, as shown in Fig. 3b.

Fig. 4
figure 4

PCA score plots from LIBS (a) and NIRS (b) pretreated data (explained variances of 84% and 96%, respectively)

The PCA loading plots from all the pretreated data are given in Fig. 5 for the first and second components. From the highest intensities observed in these plots, the more important variables could be selected related to the best sample grouping and separation shown in Fig. 3b. Therefore, ten variables were selected, related to five LIBS emission peaks and five NIRS absorption bands; the wavelengths were determined and are given in Table 2. Table 2 also shows an attempt for chemical assignment of these peaks/bands according to the literature. Mainly for the LIBS emission peaks, it was difficult to determine exactly which chemical element was related to each emission peak because of the relatively low spectral resolution of the LIBS instrument used in this work for atomic spectroscopy. However, in general, we can tell that different concentrations of mainly iron, copper, zinc, and/or sodium, and lignin, water, cellulose and/or hemicellulose among the samples were responsible for the sample grouping and separation according to the wood species.

Fig. 5
figure 5

PCA loading plots from all pretreated data for the first (a) and second (b) components

Table 2 Assignments of the spectral peaks (LIBS) and bands (NIRS) related to the species separation at PCA

Wood discrimination

After the exploratory data analysis by PCA, the pretreated data from LIBS and NIRS were employed to evaluate the possibility of sample discrimination according to the wood species using PLS-DA. Only nine samples (about 21%) were selected for the external validation because of the relatively low number of samples so that a representative number of samples was available in the calibration step for a good modeling of all wood species. Figure 6 shows the variation in the successful predictions of the wood species as a function of the number of latent variables (LV) employed in the calibration with full cross-validation. From this figure, and taking into account the compromise between the highest successful prediction and the lowest LV number, the authors decided to use 8 LV in the PLS-DA. Therefore, Table 3 shows the results of the PLS-DA for wood discrimination, obtained from the calibration with full cross-validation and from the external validation. As can be observed, three out of 33 samples were misclassified in the calibration (about 9%), whereas one out of nine samples was misclassified in the external validation (about 11%). All the misclassifications involved exactly the sample groups with some minor group overlapping between their boundaries (CE-AC and BV-AP) or the sample groups with a wide sample dispersion (PR and CL).

Fig. 6
figure 6

Effect of the number of latent variables employed in the calibration on the successful prediction of the wood species using full cross-validation

Table 3 Results of the calibration with full cross-validation and external validation using PLS-DA for the discrimination of wood species (eight latent variables)

Conclusion

A new method was proposed for the discrimination of wood species by using a home-made, compact, and low-cost LIBS instrument and a commercial Fourier transform NIRS instrument together with chemometrics. The method was evaluated in the analysis of 42 samples from six different wood species. The spectra from LIBS and NIRS were combined in a single matrix and pretreated to remove signal noise, baseline variations, and effects from light scattering and to equalize the maximum intensities between the spectra from both analytical techniques. The exploratory data analysis by PCA showed a clear formation of sample groups according to the wood species only when the data from both analytical techniques and the data pretreatment were used. The sample grouping was probably related to different concentrations of iron, copper, zinc, and/or sodium, in the case of LIBS data, and lignin, water, cellulose, and/or hemicellulose, in the case of NIRS data. The sample discrimination according to the wood species was proved possible by using PLS-DA, but with some minor misclassifications. The misclassifications were related to some minor group overlapping or to sample groups with a wide sample dispersion, as previously observed by PCA.