Introduction

Interest in the development of new and higher quality coffee beverages has led to the marketing of single origin and/or blended coffees that possess distinct organoleptic characteristics. An additional focus concerns coffees rich in bioactive compounds, especially in relation to the high intrinsic antioxidants content of this beverage (Carlsen et al. 2010). Antioxidant compounds have been extensively studied due to their potential beneficial health effects, since they help to protect against harmful reactive species (Lee et al. 2004). The most important coffee compounds, in terms of antioxidant properties, are phenolics, melanoidins, and caffeine. The first group includes hydroxycinnamic acids such as caffeic and ferulic acids (Gallardo et al. 2006), their esters, and quinic and chlorogenic acids (Moreira et al. 2005). As a result of their similar bioactivity and chemical properties, chlorogenic acids and hydroxycinnamic acids have been grouped together as CHAs (Budryn et al. 2015). Isoflavones are also included within the phenolics group but occur at lower concentrations in coffee beans, contributing less than 0.1% on a dry weight basis (dwb) (Alves et al. 2010). Overall, phenolic compounds comprise ca. 10% dwb of green coffee and decrease during the roasting process. Caffeine (Devasagayam et al. 1996), which shows no significant concentration change during roasting, accounts for ca. 1–2% dwb. Lastly, an additional antioxidant source has been attributed to melanoidins (Borrelli et al. 2002; Delgado-Andrade et al. 2005), which are formed in Maillard reactions during the roasting process and may be present at up to 25% dwb in roasted coffee.

Quantification of the total antioxidant capacity (TAC) in coffee has been performed using assays with 2,2′-azinobis (3-ethylbenzothiazoline-6-sulfonic acid) (ABTS) (Pérez-Martínez et al. 2010; Sánchez-González et al. 2005) and 2, 2-diphenyl-1-picrylhydrazyl (DPPH) (Delgado-Andrade et al. 2005). The determination of total phenolic compounds (TPCs) is typically performed by the Folin-Ciocalteu procedure (Pérez-Martínez et al. 2010). These methodologies require an extraction step prior to the analysis. Therefore, determination of the antioxidant profile (TAC and TPC) during the roasting process can only be performed off-line, based on discrete sampling. Antioxidants have been analyzed in studies of coffee brewing procedures (Pérez-Martínez et al. 2010; Sánchez-González et al. 2005; Vignoli et al. 2011), coffee varieties (Vignoli et al. 2011), degree of roast (Delgado-Andrade et al. 2005; Morais et al. 2009; Vignoli et al. 2011), and bioavailability of the compounds (Budryn et al. 2015).

The main characteristics of near-infrared spectroscopy (NIRS), namely fast response, no sample pre-treatment, and ease of handling, make this technique especially appealing for use in the coffee industry. NIRS has been used successfully for quantification of chemical compounds in coffee samples (Pizarro et al. 2004, 2007), discrimination between Arabica and Robusta species (Esteban-Díez et al. 2007), and prediction of sensory properties and degree of roast (Esteban-Díez et al. 2004; Ribeiro et al. 2011). Recently, in a pioneering approach, NIRS was employed for in situ prediction of titratable acidity (Santos et al. 2016a) and sucrose concentration (Santos et al. 2016b) during coffee roasting processes. The potential to accompany chemical reactions during coffee roasting enables more efficient implementation of corrective procedures, better understanding of the roasting process, and, most importantly, improvement of the general quality of the coffee product.

This work describes the use of NIRS with a diffuse reflectance probe for the real-time in situ monitoring of TAC and TPC during the roasting process. Prediction models for real-time quantification of antioxidant compounds are presented, and the influence of different roasting procedures and coffee varieties is discussed.

Experimental

Coffee Samples and Roasting Procedure

Four coffees from different origins were purchased locally for use in the experimental assays: two Arabica coffees originating from Brazil and Timor and two Robusta coffees originating from India and Uganda.

The coffee roasting was performed with a bench coffee roaster (Model KN-8828B-2K, Hottop USA) that featured a frontal glass observation window (through which the in-line NIR spectra were collected), a rotatory roasting drum for homogeneous roasting of the coffee batch, and a sampling door through which coffee sampling could be performed during the roasting process. The bench coffee roaster performs automatically the roasting process after previous setting of the heating power (that defines the temperature-increase/time ratio) and fan speed (which aids normalizing the roasting process). Two different roasting procedures were employed, designated as “fast roast” (FR) and “slow roast” (SR), based on a previous work (Santos et al. 2016a). In FR, coffee samples were heated continuously up to 225 °C during approximately 18 min; in SR, coffee samples were heated continuously up to 195 °C for approximately 25 min. For FR and SR modes, the heating power was set to 7 and 5, respectively, and the fan’s speed was set to 25% according to the roaster’s parameter scales. Batches of approximately 300 g of green coffee from the different origins were weighed out and roasted following the FR and SR procedures. Samples weighing approximately 5 g were collected every 3 min in order to comprise different roasting intensities during each roasting process. Sampling was performed by manually pulling the knob that opened the sampling door. The door was kept open for ca. 1 s during each sampling, avoiding significant perturbation of the roasting process. The collected samples were placed in borosilicate flasks. A total of 56 samples were collected (4 origins × (6 FR samples + 8 SR samples)). The experimental conditions selected focus on different raw materials and roasting policies, aiming at generating a fair number of distinct coffee samples for the predictive model construction.

Determination of Total Antioxidant Capacity and Total Phenolic Content

Reagents and Solutions

All reagents were analytical grade and were used without further purification. Ethanol (Panreac, 121086.1212, ≥99.8%) and water with resistivity >18 MΩ cm, obtained from a Milli-Q deionizer, were used to prepare the solutions.

For the TAC assays, ABTS (Fluka, 11557, ≥99%) and potassium persulfate (K2S2O8, Sigma-Aldrich, 21622-4, ≥99%) were prepared at concentrations of 14 and 4.9 mmol L−1, respectively. Equal volumes of these reagents were mixed for formation of the ABTS·+ radical. This solution was left overnight (>12 h), followed by preparation of standard solutions in ethanol/water medium (1:1, v/v) in order to achieve an absorbance value of 0.900 ± 0.020 (at 734 nm). Trolox (Sigma-Aldrich, 238813, ≥97%) was used as the standard in the determination of total antioxidant capacity. A stock solution was prepared daily at a concentration of 780 μmol L−1 in ethanol/water solution (1:1, v/v). Working standard solutions of Trolox were prepared by appropriate dilutions of the stock solution.

For the determination of TPC, Folin-Ciocalteu reagent (Sigma-Aldrich, F9252) and sodium carbonate (Fluka, 713510, ≥99%) were prepared in aqueous medium at concentrations of 3:10 (v/v) and 9% (w/v), respectively. Gallic acid (Fluka, 48630, ≥98%) was used as standard. An aqueous stock solution was prepared daily at a concentration of ca. 200 mg L−1. Working standard solutions of gallic acid were prepared by appropriate dilutions of the stock solution.

Antioxidant Compounds Extraction Procedure

Prior to extraction, the moisture contents of all the collected samples were determined according to the standard methodology (ISO 11294:1994). The extraction method used was as described previously (Páscoa et al. 2013): an accurately weighed out portion (ca. 50 mg) of each previously milled sample was extracted with 20.0 mL of ethanol/water solution (1:1, v/v) for 5 h, using an orbital shaker (Model 3005, GFL) at 300 rpm. The selected extraction solution composition ensured the solubilization of both lipophilic and hydrophilic compounds, as well as the interaction of antioxidants with radical species (Gökmen et al. 2009; Mussatto et al. 2011; Pérez-Jiménez and Torres 2011). This procedure was performed in duplicate for all the collected samples. The solutions were left to stand until a clear supernatant was obtained and were then decanted. Finally, appropriate dilutions of the clear ethanolic solutions were made prior to the determination of TAC and TPC.

Total Antioxidant Capacity of Roasted Coffee

The total antioxidant capacity of the collected coffee samples was determined according to the procedure described by Páscoa et al. (2013). Aliquots (150 μL) of coffee extract (previously diluted 40-fold) were mixed with 150 μL of ABTS·+ solution in 96-well microplates, using a multi-channel micropipette. The absorbance decrease was measured at 734 nm, every 30 min for 8 h, using an UV-Vis spectrophotometer (Synergy HT, Bio-Tek Instruments). The total antioxidant capacity was expressed as Trolox equivalent antioxidant capacity (in μmol of Trolox/kg of coffee sample dwb). A typical calibration curve obtained under the experimental conditions used could be described by the equation: ∆Abs734 nm = −0.015 + 0.960 CTrolox (R 2 = 0.998), obtained with Trolox in the concentration range from 5 to 50 μmol L−1. The intrinsic absorption of the coffee extracts was measured by adding extraction solution instead of ABTS·+ solution and was considered not significant because it was equivalent to less than 0.005 absorbance units. The measurement of antioxidant activity was performed in quadruplicate for each extract.

Total Phenolic Content of Roasted Coffee

The procedure used for the quantification of total phenolic compounds in the roasted coffee samples was performed as follows: 150 μL aliquots of previously 20-fold diluted coffee extract were treated with 50 μL of Folin-Ciocalteu solution (3:10, v/v), followed by addition of 100 μL of sodium carbonate solution (9%, w/v) in a 96-well microplate. The absorbance during the reaction was monitored at 760 nm at intervals of 10 min for 2 h (Páscoa et al. 2013). The intrinsic absorbance of the sample extracts was determined by mixing the extracts with 50 μL of HCl (0.6 M) and was found to be not significant. Similarly, the intrinsic absorbance of the reagent was measured by using the same volume of water, instead of the sample extract. The values obtained were considered in the subsequent calculations. The total phenolic content was expressed as gallic acid equivalent capacity (in g of gallic acid/kg of coffee sample dwb), according to the calibration curve described by the equation: A 760 nm = 0.072 + 0.065 C gallic acid (R 2 = 0.999), obtained with gallic acid at concentrations in the range 1–20 mg L−1. The measurement of the total phenolic content was performed in quadruplicate for each extract.

In-line NIR Spectra Acquisition

Near-infrared spectra were acquired in diffuse reflectance mode, using a Fourier transform NIR spectrometer (FTLA 2000, ABB, Quebec, Canada) controlled with Bomen-Grams software (version 7, ABB, QC, Canada). The spectrometer was connected to a diffuse reflectance probe and an indium-gallium-arsenide (InGaAs) detector. The probe (SabIR, ThermoNicolet, Madison, WI), featuring a sapphire window with a 0.20-cm2 illumination area, was placed outside the coffee roaster, facing the glass observation window at an approximate distance of 2 mm. In this way, the probe is as close as possible from the coffee sample under roasting without being significantly heated (Fig. 1).

Fig. 1
figure 1

Laboratory experimental setup for the real-time coffee roasting monitoring with near infrared spectroscopy: 1, diffuse reflectance probe; 2, frontal roaster glass window; 3, roaster’s drum; 4, sampling door; 5, samples/coffee beans; 6, roaster’s display; 7, sampling door knob

Each NIR spectrum was acquired using an average of 64 scans, resolution of 8 cm−1, and wavenumber interval from 10.000 to 4.000 cm−1. Under these conditions, approximately 30 s was required to record each spectrum. For each saved spectrum, a coffee sample was immediately collected to match that particular spectral analysis. A total of 56 spectra were used to develop chemometric models for antioxidants estimation. Before each roasting process, a background spectrum was obtained using a disk of PTFE (Teflon) reference material.

Multivariate Data Analysis

Exploratory Data Analysis

All chemometric calculations were performed using Matlab version 8.3 (MathWorks, Natick, MA, USA) and partial least squares (PLS) Toolbox version 7.5 (Eigenvector Research Inc., Wenatchee, WA, USA).

Before applying any chemometric model, all data were subjected to mean centering.

After collection all 56 NIR spectra, a principal component analysis (PCA) (Naes et al. 2004) model was constructed to identify patterns over the roasting process and/or any outliers. The squared prediction error and Hotelling’s T 2 statistics were evaluated to discard the presence of outliers. The criterion for detecting outliers was based on the comparison of vector lengths obtained for the normalized statistics divided by the 95% confidence limits (Naes et al. 2004).

PLS Modeling

For the PLS model optimization, the spectra were pre-processed exploiting Savitzky-Golay (SG) method (different parameters including first and second derivative were evaluated), standard normal variate (SNV) transformation, and multiplicative scatter correction, individually and combined. These pre-processing methods have been extensively used in the past for similar samples measured with NIR and modeling purposes (Santos et al. 2016b; Páscoa et al. 2013; Pizarro et al. 2004). Essentially, the selected methods remove baseline variations and correct for scale differences.

The PLS method with the PLS1 algorithm (Geladi and Kowalski 1986) was used to generate the predictive models for the parameters TAC and TPC, and each parameter was modeled individually. A set of 56 spectra was used to develop the PLS models for TPC and TAC determination. This set was divided randomly into two groups (ensuring an even distribution in terms of the values obtained by the reference procedures): one group (39 samples) was used for calibration and the other group, with the remaining 17 samples, was used for independent testing. The number of latent variables (LV) was defined with the calibration set, using the leave-one-out cross-validation method (Sarraguça and Lopes 2009). The optimization of the PLS models involved the combination of different spectral regions of the acquired NIR spectra (details in the “NIR spectral analysis” section). The accuracy of the models was evaluated in terms of the root mean square errors for calibration (RMSEC), cross-validation (RMSECV), and prediction (RMSEP) (Eq. 1).

$$ \mathrm{RMSE}\left(\mathrm{C},\mathrm{V},\mathrm{P}\right)=\sqrt{\frac{\sum_{i=1}^N{\left({\overbrace{y}}_i-{y}_i\right)}^2}{N}} $$
(1)

In Eq. 1, N is the number of samples, and yi and \( {\widehat{y}}_i \) are the experimental and NIR prediction values for sample i, obtained for the calibration (RMSEC), cross-validation (RMSECV), and prediction sets (RMSEP). Additional parameters used to evaluate the accuracy of the PLS models were the coefficient of determination (R 2 P) and the range error ratio (RER), calculated according to Eq. 2, both relative to the prediction set.

$$ RER=\left(\varDelta y\right)/\mathrm{RMSEP} $$
(2)

In Eq. 2, Δy is the difference between the maximum and minimum experimental values of TAC or TFC in the independent test set. RER values above 10 indicate good predictive models (De Girolamo et al. 2009). Using the concept of net analyte signal (NAS), the figures of merit for the multivariate PLS models (limit of detection (LOD), sensitivity (SEN), and selectivity (SEL)) were determined for each modeled parameter, employing a strategy similar to the univariate approach (Sarraguça and Lopes 2009). SEN and SEL were determined using the following equations:

$$ SEL=\left\Vert r\hat{\mkern6mu} *\right\Vert /\left\Vert r\right\Vert $$
(3)
$$ SEN=\left\Vert s\hat{\mkern6mu} *\right\Vert $$
(4)

In the previous equations, r is a sample spectra, r* is the NAS vector, and s* is the ratio between the NAS of any sample and the corresponding PLS prediction. Selectivity values were calculated by taking the average over all samples (SEL is multiplied by 100 and displayed as %).

Results and Discussion

Antioxidant Profile during Coffee Roasting

The ABTS colorimetric assay was employed as this methodology has been shown to be selective towards the main coffee compounds tentatively related to antioxidant activity, namely melanoidins (Delgado-Andrade et al. 2005; Pérez-Martínez et al. 2010), phenolic compounds (Andrade et al. 2012) and, particularly, within this last group, chlorogenic acids (Charurin et al. 2002; Pérez-Martínez et al. 2010). The results obtained are presented in Fig. 2.

Fig. 2
figure 2

TAC profiles for different coffee samples (Robusta India; Robusta Uganda; Arabica Brazil; Arabica Timor) and roasting modes (a slow roast; b fast roast)

The higher initial TAC values for the Robusta cultivar, compared to the Arabica species, may be due to the higher intrinsic content of chlorogenic acids in Robusta (ca. 11%, w/w) than in Arabica (8%, w/w) (Farah 2012). An additional justification is suggested by Vignoli et al. (2011) based on the higher content of polyphenols and caffeine in Robusta species. However, to the best of our knowledge, no previous modeling or experimental studies have investigated the reaction between ABTS and caffeine.

With respect to the changes in antioxidant levels during roasting, in both SR and FR assays, the Robusta samples showed constant and higher TAC values, compared to the increasing profile observed for the Arabica samples. Similar studies of antioxidant activity profiles based on ABTS assays have been performed, generally based on three measurements in order to consider light, medium, and dark roast samples. Vignoli et al. (2011) found that the antioxidant activity was not affected during roasting of coffee beans for 7–10 min at 214–225 °C. Delgado-Andrade et al. (2005) reported an increase in TAC from light roast to medium roast and leveling off from medium roast to dark roast. Sacchetti et al. (2009) roasted coffee samples for 10 min at 180 °C and found an increase up to medium roast, followed by a subsequent decrease. Nicoli et al. (1997) used 2,2′-azobis (2-amidinopropane) dihydrochloride as the peroxyl radical generator, instead of ABTS, in a roasting experiment with four samples obtained between 8 and 20 min (using a roasting temperature of ca. 200 °C), and found higher antioxidant concentrations in medium and dark roast Arabica samples.

There are several factors that make it difficult to directly compare the TAC profile results described in the literature. These include the intrinsic heterogeneity of coffee samples, the roasting procedure (which may be performed within 3 min (Del Castillo et al. 2002) or for up to ca. 20 min (Nicoli et al. 1997), at fixed or increasing temperature), and the analytical procedure employed for antioxidant activity measurement. In the last case, differences in extraction conditions, such as the medium used, temperature, and time, can lead to different extraction yields.

Possible explanations for the present findings include the existence of antioxidant compound precursors in Arabica beans and/or the different antioxidant activities of different compounds. The antioxidant activity profile during the roasting process reflects the sum of the contributions of all the antioxidant compounds present at a given moment during the roasting process. It is widely accepted that the concentration of chlorogenic acids decreases during the roasting process, while the concentration of melanoidins increases. The changes in the concentrations of these compounds during roasting could be largely responsible for the antioxidant profile.

The overall effect of using different roasting experimental conditions did not appear to be significant, since similar profiles were obtained for the Robusta and Arabica species in the FR and SR assays. In similar roasting assays used for in-line monitoring of sucrose (Santos et al. 2016b) and titratable acidity (Santos et al. 2016a), the influence of different roasting conditions was much more noticeable. In the present case, the evidence suggested that there were similar changes in the kinetics of the antioxidant compound degradation and formation reactions under the different roasting conditions used.

For both FR and SR procedures, the final TAC values for the Robusta and Arabica coffees were mostly similar (with the exception of the Timor Arabica sample in the SR assay), which could have been due to the typically similar melanoiidin concentrations in both species after roasting (Farah 2012). The TAC values obtained for the roasted coffees were within the range reported previously (Delgado-Andrade et al. 2005; Vignoli et al. 2011).

Folin-Ciocalteu Profile during Coffee Roasting

The widely used Folin-Ciocalteu assay for the determination of phenolic compounds was adopted. The profiles obtained in this procedure were similar to those previously observed in the ABTS radical scavenging assays (Fig. 3). This was expected, because the Folin-Ciocalteu reagent is selective to both chlorogenic acids and melanoidins (Pérez-Martínez et al. 2010), and the reaction involves similar chemical processes (based on electron transfer) as the ABTS radical scavenging technique (Huang et al. 2005).

Fig. 3
figure 3

TPC profiles for different coffee samples (Robusta India; Robusta Uganda; Arabica Brazil; Arabica Timor) and roasting modes (a slow roast; b fast roast)

For the Robusta samples, the TPC values remained approximately constant and were within the same range for the SR and FR roasting processes. For the Arabica samples, similar increases in TPC values were observed during the two roasting processes. However, these increases were lower than those observed in the TAC assays; this was the most notable difference between the ABTS and Folin-Ciocalteu results. The associations previously identified, concerning the initial and final TAC values and the typical intrinsic composition of green and roasted coffee, could also be applied to the results obtained using the Folin-Ciocalteu technique. The profiles observed for the Arabica samples showed that although the phenolic fraction was consumed during the roasting procedure, there was a smaller increase of the TPC values, compared to TAC, which could be explained by the lower selectivity of the Folin-Ciocalteu technique towards melanoidins (Pérez-Martínez et al. 2010). In addition, the profile differences could be related to the reaction yields for phenolics and melanoidins achieved with the Folin-Ciocalteu and ABTS methods (no specific data are available to support this possibility). Pérez-Martínez et al. (2010) reported that the Folin-Ciocalteu procedure may not be the most appropriate methodology for the measurement of phenolics in coffee samples and instead suggested the use of electron spin resonance (ESR) spectroscopy. The values obtained here for TPC in the roasted coffee were of the same magnitude as those found in other studies (Morais et al. 2009; Stelmach et al. 2015; Pérez-Martínez et al. 2010). However, different profiles have been described in the literature. Morais et al. (2009) roasted Robusta samples at 190 ± 10 °C for three different periods (6, 8, and 10 min) and observed decreases during the roasting processes. Vignoli et al. (2011), using both Arabica and Robusta samples, concluded that no clear trends could be found for TPC values during the roasting process.

As observed in the ABTS assays, there were no significant differences between FR and SR in terms of the phenolics profile. The same conclusions reached for the ABTS results could also be applied to the Folin-Ciocalteu assays.

NIR Spectral Analysis

The NIR spectra acquired during the roasting assays were similar to the spectra obtained in previous studies (Santos et al. 2016a; Santos et al. 2016b). The main changes in the spectra of the roasting process were an absorbance decrease in the water band region (5200–5000 cm−1) due to moisture loss and an absorbance increase in the combination band region (5000–4000 cm−1) (Fig. 4). A more in-depth discussion concerning the main wavelength intervals and their correlations with chemical and physical changes in the coffee during the roasting process has been provided elsewhere (Santos et al. 2016a).

Fig. 4
figure 4

Near-infrared spectra collected during a slow roast procedure of Robusta India coffee beans, indicating the selected five spectral regions: R1, 10,000–7300 cm−1; R2, 7299–6700 cm−1; R3, 6699–5350 cm−1; R4, 5349–5000 cm−1; and R5, 4999–4000 cm−1

The NIR spectral range was divided into five regions in order to identify those that showed the greatest association with the antioxidant properties. Region 1 (10,000–7300 cm−1) is mainly related to physical properties (bean size, shape and density); region 2 (7299–6700 cm−1) is mostly affected by the water content; region 3 (6699–5400 cm−1) essentially captures chemical information concerning C-H and S-H bonds; region 4 (5399–5000 cm−1) includes the first overtone and combination band regions and is most affected by RCO2N, CONH2, and H2O vibrations; and region 5 (4999–4000 cm−1) essentially contains chemical information related to C-H, N-H, and C-O vibrations.

The PCA developed model revealed that no outliers were present in the set of 56 spectra. Additionally, the model scores suggest the existence of four clusters each including a single coffee variety and origin, irrespective of the degree of roast (Supplementary Data, Fig. SD1). These results were similar to previous findings (Santos et al. 2016a).

PLS Model Testing

In order to optimize the PLS models, in terms of spectral regions, different combinations of the identified wavelength intervals were tested, and to each one, the referred pre-processing strategies were applied. Models for TAC and TPC were optimized based on a compromise between the numbers of LVs and RMSECV (Table 1). The best spectral window for both models was obtained using a combination of the R3 (6699–5400 cm−1) and R5 (4999–4000 cm−1) regions, pre-processed with SG using a 15-point filter width, a second-order polynomial, first derivative, and SNV transformation. During the roasting process, there was a general absorbance decrease in region R3 (especially within the interval 6300–6000 cm−1). In region R5, the absorbance was roughly constant within the interval 5000–4500 cm−1, while a significant absorbance increase occurred at 4300–4100 cm−1.

Table 1 Figures of merit for the PLS calibration models for the different chemical methods using the optimized spectral windows (6699–5400 and 4999–4000 cm−1)

Further evaluation of the PLS models was performed using the independent set. The parameters RMSEP, RER, and R P 2 were determined for the two PLS models (Table 1 and Fig. 5).

Fig. 5
figure 5

Comparison between wet antioxidant assay results and PLS cross-validation predictions for a TAC and b TPC: calibration set (filled circles); prediction set (open square)

Selectivity values around 12–20%, or the spectra fraction captured by the NAS vector unique to the analyte of interest, showed that a significant part of the signal was employed in model construction. The sensitivity can be understood as the slope of the calibration curve (Páscoa et al. 2013). The sensitivity values can be compared when the same analyte is being determined by different models. In this work, two different properties are being estimated and, therefore, the comparison between sensitivities does not provide a useful interpretation. The LOD values demonstrated that the developed technique was suitable for the real-time control of antioxidant properties.

The analysis to the regression coefficient vectors obtained for both PLS methods showed relevance of the identified regions but did not identify specific wave numbers (or more restricted wave number intervals) (Supplementary Data, Fig. SD2). In light of the present data and findings reported in the literature, an attempt was made to correlate the spectral changes with potential antioxidant compounds. The absorbance decrease in the R3 region (assigned to O-H and N-H (first overtone), C=O (second overtone), C-H, Ar-CH, and N-H bonds) could be attributed to a decrease in the concentration of aryl substituent compounds, notably phenolic compounds and chlorogenic acids (Ribeiro et al. 2011). The absorbance increase in the R5 region at 4300–4100 cm−1 (assigned to O-H and N-H combination bands, C-H and C-H combination bands, and C-H and C-C combination bands) could be attributed to the formation of melanoidins (a complex mixture consisting mainly of carbohydrates and amino acids formed during the roasting procedure, together with low levels of phenolic compounds and chlorogenic acids) (Preedy 2015). These correlations are in accordance with the main chemical drivers previously associated with the antioxidant profile.

Conclusions

The present work demonstrates the capacity of NIR spectroscopy for real-time evaluation of the antioxidant potential of coffee during the coffee bean roasting process. Different coffee species and origins were tested, considering the influence of the roasting process on the total phenolic content and the total antioxidant capacity.

PLS regression models for TAC and TPC yielded RER values above 10, indicative of acceptable predictive ability in practical applications. Prediction errors were of the same order of magnitude as for wet chemistry reference methodologies used to determine TAC and TPC. A significant part of the spectrum was employed for construction of the models (6699–5400 and 4999–4000 cm−1), resulting in selectivity values between 12 and 20%.

The values found for TAC and TPC in green coffee beans were in good agreement with previous reports. The profiles obtained here contribute to the current discussion, where it has been difficult to reach a consensus. The methodology developed is much faster and simpler to perform, compared to the usual batch methodologies, and offers a valuable analytical tool for the prediction of antioxidant activity in coffee samples or during the roasting procedure. It also assists in understanding the main chemical processes that govern the antioxidant activity. Additionally, there were no obvious effects of roasting speed on the antioxidant profiles of the coffees in the assays performed here.

The presented strategy is especially attractive for use in the coffee industry due to the easy setup (the roasting process can be monitored through the glass window of a roaster) and the intrinsic analytical advantages of NIRS, which is a non-destructive and non-invasive technique that is simple to operate.