Introduction

Coffee beans are the seeds of an evergreen shrub belonging to the Rubiaceae family and the genus Coffea. There are more than 100 species of coffee worldwide, but two species are of importance commercially, Coffea Arabica Linn. var. arabica, known in the trade as Arabica and Coffea canephora Pierre ex Froehner var. robusta, commercially called Robusta (Clarke and Macrae, 1985). Arabica is the most appreciated species worldwide, due to its sensory superiority, and represents more than 60% of global production. Its cultivation demands a relatively mild climatic conditions and a well-defined dry season (Toledo et al., 2016).

Robusta coffee grows at low altitudes, tolerates higher temperature and heavier rainfall, demands a higher soil humus content than Arabica and generally it is also much more resistant to diseases allowing mechanization of the cultivation technique. For these reasons the production of Robusta coffee is more economical (Clarke and Macrae, 1985; Colzi et al., 2017). Coffee beans contain several groups of bioactive compounds as alkaloids, phenolics and terpenoids. These compounds exhibit an antioxidant activity and coffee brew may be a good source of health-beneficial phytochemicals in diet (Lim et al., 2012). The condition of roasting process, temperature and time, are the most crucial for the content of bioactive compounds in coffee brew (Chung et al., 2013). In particular, high temperature and low water activity, facilitate molecular degradation in the roasting process (Herawati et al., 2019).

The volatile fraction of roasted coffee is one of the most important factors for determining coffee quality and it can be decisive at the time of purchase. The aroma of coffee is intrinsically related to the chemical composition of the bean, genetic strain, geographical location, climate, annual rainfall, agricultural practice and processing method applied (Toledo et al., 2016; Buffo and Cardelli-Freire, 2004). In order to characterize the species and the influence of geographical origin, some studies are focused on the identification of volatiles in green coffee (Flament, 2001), fingerprinting coffee flavor (Huang et al., 2007), discrimination of volatiles in defective coffee (Toci and Farah, 2008; Walker et al., 2019) and the effect of climatic conditions on the volatile compounds of green coffee (Bertrand et al., 2012).

Aroma volatiles produced during coffee bean roasting, have been previously reviewed (Grosch 2001; Toledo et al., 2016). Coffee volatiles are derived from numerous precursors found in the bean by chemical reactions, including Maillard reaction, Strecker degradation, degradation of trigonelline, chlorogenic acids, pigments, lipids, phenolic acids together with the breakdown of amino acids (Holscher and Steinhart, 1992; Ribeiro et al., 2009). Many studies have distinguished samples of roasted Arabica and Robusta coffee from different geographical origin (Colzi et al., 2017; Yener et al., 2014; Mathieu et al., 1996; Sanz et al., 2002; Rocha et al., 2004; Korhonovà et al., 2005). In most cases, emphasis was placed on geographical differentiation, but little can be concluded regarding the cause of differentiation. Some studies (Mondello et al., 2005; Cheong et al., 2013) indicated that it was very difficult to correlate the aromatic fraction present and the climatic conditions.

In order to satisfactorily analyze the aroma compounds of food, the analytical extraction methods applied are oriented to the sample manipulation and use of solvents reduction (Lee et al., 2013). Headspace techniques are the most suitable ones for the study of aroma compounds and dynamic sampling has been shown to be the most suitable as it does not introduce discrimination in the volatile components analyzed (Barcarolo et al., 1992; Clarke and Macrae, 1985). These methods have several advantages, such as being quicker, relatively simpler and highly reproducible. A particular dynamic headspace gas chromatography (DHS-GC) solvent-free device incorporated a reverse-flow step in order to avoid any contamination of the analytical gas chromatography column with gases or other substances that could not be correctly condensed (Barcarolo and Casson, 1997).

In some previous papers, the characterization of the volatile fraction of different matrices, such as olive oils (Procida et al., 2005, Procida et al., 2016), dairy products (Stefanon and Procida, 2004) and saffron (Procida et al., 2009) by headspace analyses of a number of samples, was reported in order to determine a relationship between compounds in the headspace and sensory evaluation.

In this paper, we have applied headspace analysis (Barcarolo and Casson, 1997) to a number of green and roasted Arabica and Robusta coffees samples. The aim of this preliminary study was to obtain flavor profile of the samples which vaporize at room temperature and could be correlated to natural olfactory perception. Qualitative and quantitative differences of the volatile fraction of the two coffee species, with the aim to identify possible markers that could discriminate the species of coffee analysed and highlight a correlation between the aromatic compounds of volatile pattern of single species and geographical origin, were investigated. For these purposes, the chemical composition of the volatile fraction of the analysed samples was explored to obtain a differentiation on the basis of chemometric tools.

Materials and methods

Sampling

Thirty-three green coffee samples (19 Arabica and 14 Robusta) from all the world were analysed. In particular as far as the Arabica is concerned, 5 samples came from Africa (Kenya, Cameroon, Uganda, Ethiopia and Burundi), 8 from Central America (Nicaragua, Cuba, Guatemala, Haiti, Costa Rica, Mexico, Santo Domingo and Puerto Rico) and 6 from South America (Venezuela, Peru, Ecuador, Colombia, Brazil Santos Florada and Fancy).

As far as Robusta are concerned, 8 samples came from Africa (Madagascar, Congo, Cameroon, Togo, Ivory Coast, Uganda, Angola and Central African Republic) and 6 from Asia (India Cherry and Parchment, Vietnam, Indonesia, Thailand and Java). All analysed samples of green coffee were subsequently roasted at 220 °C for about 15 min by a TT 7.5 roasting apparatus (Petroncini, Ferrara, Italy).

Headspace sampling

Three aliquots of 500 g each were ground for each sample considered. The aliquots to be analysed were taken from the whole mixture of the ground product (1.5 kg). 1 g of each sample, finely ground in a Retsch mill, was exactly weighed in a 10 ml vial then the vials were sealed with aluminium-rubber septum (Supelco, Bellefonte, PA, USA). 0.02 μl (17.8 μg) of the internal standard were added, either ethyl propionate (≥ 99.7%) to green coffee samples, or tetrahydrofuran (≥ 99.9%: Sigma St. Louis, USA) to roasted coffee; the internal standard was added to the sample for injection through the rubber septum by Hamilton 7001KH syringe (1 μl). Vials with samples were conditioned at 25 °C for 30 min before analysis, then stripping was carried out for 150 s on green samples and 90 s on roasted coffees. Stripping was realised with helium at a rate of 10 ml/min. During sampling volatile components are transferred into a capillary tube (0.53 mm I.D.) that was inside a cryogenic trap cooled by liquid nitrogen at − 110 °C: the trap is connected by an on-column mode to a capillary gas chromatograph (Carlo Erba GC 8000; Carlo Erba, Milan, Italy). The cryogenic trap, which was represented by a fused silica capillary tube, did not show activated adsorbent or porous polymers. This trap allows the acquisition of an aroma profile similar to natural olfactory perception without artefacts or problems related to competition between volatiles and incomplete or irreversible adsorption.

GC–MS analysis

At the end of the sampling time, desorption of volatile components took place by rapid heating of the cryogenic trap to 240 °C in 5 s and then by transferring volatiles to the analytical capillary column in 15 s. The analytical column used was a capillary fused-silica column 50 m x 0.32 mm I.D., coated with PS 264 (7% diphenyl/93% dimethylsiloxane), 3 μm film thickness (Mega, Milan, Italy). The GC system was coupled directly to a MD 800 mass spectrometer (Carlo Erba, Milan, Italy). Gas chromatographic conditions were the following: oven initial temperature 40 °C, held for 6 min, then programmed to 180 °C at a rate of 5 °C/min, then 5 min at 180 °C, then at 7 °C/min to 200 °C, held 2 min, and finally at 7 °C/min to 240 °C with 10 min of final isotherm. The transfer line temperature was kept at 250 °C.

The mass spectrometer scanned from 29 to m/z 300 at 0.5 s cycle time. The ion source was set at 180 °C and spectra were obtained by electron impact (70 eV).

The tentative identification of compounds was carried out through a study of the MS spectra and comparison with members of the NBS library.

Quantitative evaluation was carried out by using internal standard method: we set the response factor as unity for each substance so it was possible to give quantitative data as internal standard equivalent and to compare the content of each component in analysed samples.

Statistical analysis

The mean levels of each compound in Arabica and Robusta samples were compared using the Mann–Whitney test for the comparison of two groups. Bonferroni adjustment was applied to take into account multiplicity of the test. Only comparisons with adjusted p value < 0.05 were considered as statistically significant.

To identify the components of the volatile fraction that best discriminate between the two species the multivariate analysis technique named Classification Tree was applied. Classification tree is a method for supervised learning, i.e. to develop a classification rule based on data where true classification is known (in our case, the species) and information is available on potential predictors (the volatile compounds). With this technique, data are recursively partitioned into two groups based on the value of the single variable that, at that level, best discriminates among the considered classes. The result of the analysis is usually represented using a binary tree, where the leaves represent the classes and the branches represent the variables. In this way a classification rule is developed, that can then be applied to new samples where information about classification is not known. The power of the rule is measured in terms of misclassification error, i.e. the percentage of observations not correctly classified. Obviously, the classification ability of the technique increases with the complexity of the classification rule (i.e. the number of features used and/or levels of the tree), however results obtained with complex classification rules are hardly reproducible. For this reason, techniques based on the use of a validation set of data or on cross-validation (a leave one out method to evaluate the prediction error) have been developed to find a good compromise between the complexity of the rule and the misclassification error. Given the size of our sample, it was not possible to divide it in two parts, the training and the validation set, and then we made use of cross-validation to ‘prune’ a complex tree and find the optimal balance.

Classification tree methodology has been chosen because of its ability to handle a very large number of predictors and because it can detect complex interactions among them.

All the analyses were separately done for green and roasted coffee using the statistical package R ver. 3.4.2 (R. Core Team, 2015).

Results and discussion

In aroma research, headspace techniques are the most suitable for the determination of chemical components. The dynamic sampling was shown to be the most suitable since it does not introduce discrimination and enables nearly all of the volatile components to be analyzed. The particular head space configuration enables helium flow inversion through the Y press fit system during the sampling step and inhibits entry into the column of substances not cryofocused that exit through vapour exit device. This was performed with the aim of avoiding any contamination of the analytical column with incondensable gases, whose presence could produce a very broad peak and consequently lead to a poor chromatographic resolution (Barcarolo and Casson, 1997). Stripping was carried out at room temperature; it is thus possible to obtain an aroma profile as much as possible near to the natural olfactory perception (Hinshaw, 1988).

Analysis of the volatile fraction of green coffees allowed the identification of 68 compounds. These can be grouped into different classes, including alcohols, aldehydes, ketones, furans, sulfur compounds, esters, hydrocarbons, pyrroles, pyrazines and terpenes.

It is well known that in the volatile fraction of green samples several substances usually present in roasted product, such as furans that take origin from sucrose degradation, pyrazines and pyrroles from protein degradation phenomena, and pyridines, from trigonelline degradation, were identified. Degradation phenomena may occur for the main components of the bean during the steps of processing and storage of the product (Flament, 2001).

Our sampling technique allowed the detection of nine alcohols, namely methanol, ethanol, 2-propanol, isobutyalcohol, butanol, 3-methyl-1-butanol, 2-methyl-1-butanol, n-hexanol and 2-heptanol. As concerns the characterization of the species analyzed, the concentration of low molecular weight compounds, as methanol, ethanol, 2-propanol, isobuthylalcohol and butanol, seems homogeneous in the samples of Arabica and Robusta coffees. The two species significantly differ for the content of n-hexanol and 2-heptanol (Fig. 1). The content of hexanol varies from 4.69 and 47.61 μg/kg in Arabica coffee and in a range between 1.75 and 14.13 μg/kg in Robusta coffee samples analyzed: on the contrary, the concentration of 2-heptanol varies from 0.3 and 9.71 μg/kg in Arabica and ranged between 5.03 and 185.3 μg/kg in Robusta (Table 1). The higher lipoxygenase activity for linolenic acid than linoleic acid supports the biogenesis of more of C6 volatile compounds (Kalua et al., 2007).

Fig. 1
figure 1

Green coffee: boxplot of concentration of components with significant difference between the two species (p-values based on Mann–Whitney test)

Table 1 Significant volatile compounds (μg/kg) in the examined samples of the green Arabica and Robusta coffees

Seventeen aldehydes were identified in the green samples analyzed. Observing the data, the concentration of ethanal in Arabica ranged between 389.5 and 5723 μg/kg while hexanal, ranged between 350.7 and 118,516 μg/kg appears higher than in Robusta samples, ranged from 468.3 to 2646 μg/kg and from 460.3 to 3282 μg/kg, respectively (Table 1). The higher presence of ethanal in Arabica may be associated with the sensory quality of the samples, in particular with the fruity attribute, while hexanal may be indicator of body and bitterness (Bertrand et al., 2012). A significative difference was observed for isobutanal, whose concentration ranged between 178.9 and 2252 μg/kg in Arabica samples and between 344.9 and 2958 μg/kg in Robusta samples (Fig. 1).

Besides, our sampling technique allowed the detection of nine ketones, namely acetone, butan-2,3-dione, butan-2-one, 2,3-pentadione, 3-hydroxy-2-butanone, 3-methyl-2-pentanone, 2-heptanone, 6-methyl-5-hepten-2-one and 3-octen-2-one. From a quantitative point of view, only 2-heptanone shows a significant difference, with values that range between 1.28 and 12.11 μg/kg for Arabica and 4.6 μg/kg and 31.5 μg/kg for Robusta coffees (Fig. 1). This is probably correlated to a reduced seasonal temperature variation during seed grown and with high elevation and cool climate of regions of coffee produced (Bertrand et al., 2012).

Among furans, namely furan, 2-methyl-furan, 2-ethylfuran, 4-methyl-2,3-dihydrofuran, 2-methyl tetrahydrofuran-3-one, furfural and pentylfuran, only 4-methyl-2,3-dihydrofuran shows a significant difference between the two species, with values that range between 0.9 and 14.3 μg/kg for Arabica and 4.3 μg/kg and 77.3 μg/kg for Robusta coffees (Fig. 1).

The analysis of green coffee samples allowed the identification of four pyrazines, namely methylpyrazine, 2,6-dimethylpyrazine, ethylpyrazine and 2,3-dimethylpyrazine. The concentration of ethylpyrazine in Robusta varies from 4.33 to 29.36 μg/kg and is significantly greater respect to Arabica, ranged between 1.42 and 12.49 μg/kg (Fig. 1).

In the volatile fraction of coffee samples, four sulfur compounds were identified, namely 2-thiapropane, carbon disulphide, dimethylsulfone and thiophene. While the concentration of 2-thiopropane and carbon disulphide appear homogeneous in the coffee samples analyzed, dimethylsulfone is only present in Robusta species from 0.18 to 7.36 μg/kg. Besides, the concentration of thiophene in Robusta varies from 4.12 to 19.4 μg/kg, while in Arabica from 3.2 to 8.24 μg/kg (Fig. 1). The concentration of thiophene appears three-fold present in green Robusta coffee rather than in Arabica coffee samples. Sulfur-containing compounds, mainly obtained from methionine, cysteine and cystine via Strecker degradation, are extremely influential on the sensory profile of coffee: due to their extremely low odour thresholds they have a great olfactory impact. Their identification and quantification is crucial for the assessment of coffee sensory quality (Sunarharum et al., 2014; Dulsat-Serra et al., 2016).

Lastly, methylacetate and amylformate show a significant difference between the two species. Methylacetate ranges from 2.33 to 79.13 μg/kg in Arabica samples, while from 10.42 to 186.1 μg/kg in Robusta samples (Table 1); amylformate ranges between 15.48 and 151.8 μg/kg in Arabica samples, and between 6.37 and 54.51 μg/kg in Robusta samples (Fig. 1).

Data were analyzed using classification trees to evaluate the discrimination ability of the compounds that we considered. The classification tree gives a rule to classify the different samples as Arabica or Robusta based on their characteristics. In our case, we obtained the following rule: classify as robusta if the value of 2-heptanol is higher than 9.8 μg/kg or if the value of n-hexanol is lower than 5.2 μg/kg.

The classification rule based on the concentration of 2-heptanol and n-hexanol correctly classifies 18 out of the 19 Arabica samples (1 is indeed classified as Robusta), while all the Robusta samples are correctly classified. The classification rule that we have found is indeed very simple (only two features are involved) and powerful, with just 1 (3% of the total sample) misclassification error. Indeed, other rules with the same characteristics can be found:

  • classify as Robusta if ethylpyrazine is greater than or equal to 13 μg/kg or if dimethyl sulfone is greater than or equal to 1.4 μg/kg (only one Robusta sample classified as Arabica);

  • classify as Robusta if 2-heptanone is greater than or equal to 6.5 μg/kg or if n-hexanol is lower than 4.3 μg/kg (only one Arabica sample classified as Robusta);

  • classify as Arabica if n-hexanol is greater than or equal to 16 μg/kg or if furfural is greater than or equal to 28 μg/kg (only 1 Robusta sample classified as Arabica);

  • classify as Arabica if amylformate is greater than or equal to 30 μg/kg or if diacetyl is lower than 105 μg/kg (only 1 Robusta sample classified as Arabica);

The four classification rules so identified are equivalent because they share the same degree of complexity and have the same classification error. It can be noticed that most of the variables involved were already identified with the simple comparison of the two species: indeed, also the boxplots in Fig. 1 show the high degree of separation between the two species when considering specific compounds. However, some of the substances used for classification purposes (furfural, diacetyl and dimethyl sulphone) were not significantly different in the two groups.

We applied the same procedure also taking into account the geographical origin of the samples: in this way, we created six classes, four for the Arabica samples (origin: Africa, Caribbean, Central America, South America) and two for the Robusta samples (origin: Africa and Asia).

We then evaluated the discrimination ability of the considered substances by using a classification tree. The result has been obtained from a more complex tree that has been subsequently pruned to find the best balance between complexity and misclassification error.

The resulting classification rule is in this case more complex due to the more articulated nature of the classification. The origin of Robusta samples is identified by 4-methyl-2.3-dihydrofuran (Africa if it is greater than 14 μg/kg) or n-hexanol (Asia if it is lower than 4.3 μg/kg), the Arabica samples have 4-methyl-2.3-dihydrofuran lower than 14 μg/kg and n-hexanol greater than 4.3 μg/kg and can be further classified by origin according to limonene (greater than 9.6 μg/kg for Africa samples) or to n-nonanal (lower than 8 μg/kg for Caribbean, between 8 and 22 μg/kg for South America, greater than 22 μg/kg for Central America).

The classification rule misclassifies 5 samples (15% of the entire sample): an Arabica sample from Africa was classified as coming from the Caribbean, and Arabica sample coming from the Caribbean and another one coming from South America were classified as coming from Central America, a Robusta sample coming from Asia was classified as an Arabica coming from South America and last a Robusta sample coming from Asia was classified as coming from Africa. The classification performance, even if lower than in the case of the species, seems to be more than adequate.

As far as the volatile fraction of roasted coffees, the analytical technique employed allowed the identification of 84 compounds, grouped into different classes, including furans, pyrazines, pyrrole, sulfur compounds, aldehyde, ketones, alcohols and esters.

The comparison of the two species in terms of the detected compounds identified eight statistically significant differences (Fig. 2). Of these, 4 refer to ketones: acetone ranges from 11,734 to 28,873 μg/kg for Arabica samples, while in Robusta samples ranges from 8359 to 15,792 μg/kg; diacetyl ranges from 3235 to 8818 μg/kg for Arabica samples and from 1959 and 4316 μg/kg for Robusta samples; 2-butanone ranges from 4996 to 50,711 μg/kg for Arabica samples and from 512.2 to 6530 μg/kg for Robusta samples; 2,3-pentanedione ranges from 3087 to 8853 μg/kg for Arabica samples and from 341.4 to 4701 μg/kg for Robusta samples (Table 2). Two esters, namely methylformate and methylbutyrate show significant differences between the two species, as well as one aldehyde (isobutanal) and one alchool (2-methyl-3-buten-2-ol).

Fig. 2
figure 2

Roasted coffee: boxplot of concentration of components with significant difference between the two species (p-values based on Mann–Whitney test)

Table 2 Significant volatile compounds (μg/kg) in the examined samples of the roasted Arabica and Robusta coffees

When applying classification trees considering only the species, four equivalent solutions are found, that give a perfect classification of the samples (no misclassification error) with very simple classification rules:

  • Classify as Arabica if 2-methyl-3-buten-2-ol is greater than 179.09 μg/kg or if acetone is greater than 15,914 μg/kg, otherwise classify as Robusta;

  • Classify as Arabica if methylbutyrate is lower than 11.59 μg/kg or if pyridine is greater than or equal to 11,655 μg/kg, otherwise classify as Robusta;

  • Classify as Arabica if diacetyl is greater than or equal to 3777 μg/kg and 2,3-dimethylpyrazine is lower than 726.2 μg/kg or if diacetyl is lower than 3777 μg/kg and propylformate is greater than 38.21 μg/kg, otherwise classify as Robusta;

  • Classify as Arabica if acetone is greater than or equal to 11,656 μg/kg and 2,3-pentanedione is greater than or equal to 3083 μg/kg and 3-hexanone is lower 1066 μg/kg, otherwise classify as Robusta.

Again, some of the variables involved in the classification rules do not significantly differ in the two species but give an important contribution when considered in conjunction with other variables.

The classification tree derived when jointly considering species and geographical origin again gives a more complex classification rule, that involves several substances. The origin of Robusta samples is identified by 2-butanone (Africa if it is lower than 4913 μg/kg) or methylbutyrate (Asia if it is greater than 26 μg/kg), the Arabica samples have 2-butanone greater than 4913 μg/kg and methylbutyrate lower than 26 μg/kg and can be further classified by origin according to methanol (greater than 3498 μg/kg for South America samples) or to ethylformate (lower than 167 μg/kg for Africa and greater than 167 μg/kg for Central America).

The quality of the classification is lower than in the previous cases: indeed, three Arabica samples from the Caribbean and one from South America are classified as coming from Africa, one Arabica sample coming from Africa is classified as coming from South America, one Arabica sample from the Caribbean and one Robusta sample from Africa are classified as Robusta coming from Asia and two Robusta samples from Asia are classified as coming from Africa. In total 9 samples out of 33 (27% of the entire dataset) are misclassified.

The analytical technique used to characterize the volatile fraction of coffee samples, has several advantages, being quicker, simpler, highly reproducible and yielding ‘true’ aroma profile, very similar to natural olfactory perception. Furthermore, there is little chance of artefact formation, competition phenomena, selective, incomplete or irreversible adsorption and incomplete desorption.

Chemometric evaluation of the data, shows for green beans, as well as for roasted ones, that the two species are very well characterized by few substances. This is quite evident when considering each substance separately (in some cases the separation of the two species is very strong) and even more striking when considering jointly all the substances using classification trees. Indeed, very simple classification rules are able to classify correctly nearly all the observed samples. The same is true when also considering geographical origin, at least for green coffee, while the misclassification error is much higher when considering roasted coffee.

The results are then quite strong and really promising, a validation on a different set of samples would be advisable.