Introduction

Taste and aroma are key sensory attributes for making coffee one of the most consumed beverages in the world. These attributes are formed from chemical compounds (precursors) present in green coffee beans [1, 2]. These chemical compounds are influenced by several factors, such as genetic origin [3], local production [4, 5] and agricultural practices [6]. In addition, harvesting and post-harvest processes (PHP) have a crucial effect on beverage quality [7].

As with any other fruit, the best quality of the coffee fruit is reached when it is fully ripe [8, 9]. The flowering of the coffee plant occurs several times in the same crop, and it is common to find fruits in different stages of maturation at harvesting time. In general, harvesting occurs when more than 80% of the fruits have an intense red husk [10].

The harvested coffee is subjected to various post-harvested processes (PHP). Usually, dry and wet processes are used to process the harvested coffee manually or mechanically. These processes are distinguished primarily by presence or absence of husk and mucilage of the coffee beans during the drying. In the dry process, the harvested coffees are washed in water tanks, and without removing the husk and mucilage, they are spread directly on a patio or on suspended plastic screens for drying [1] Coffee produced in this way is known as natural coffee (CN).

In the wet process, the harvested coffee is washed and peeled in machines and can follow two paths before spreading for drying. These processes are differentiated by the presence of mucilage on the parchment. When the husked coffee is sprinkled on the patio without previous removal of the mucilage, the coffee obtained is called peeled or semi-dry coffee (CD). However, when the mucilage is completely removed in the fermentation tank before drying, the coffee is named pulped coffee (CP) [7, 11].

The choice of process depends primarily on available structures, but often, it is determined by the difficulty of drying the coffee in the climatic conditions of the producing region. Economic profits or incentives to improve beverage quality can also determine the choice of the PHP [12].

The PHP interferes in the composition of the coffee beans and strongly impacts the attributes of the coffee beverage. The involvement of the husk and mucilage in the metabolic events that occur during PHP has recently been studied. Germination inhibitors in the coffee pulp were identified, and germination started when the pulp was removed before drying the coffee [1, 2]. The removal of the husk and pulp from the coffee in the CD process enabled germination as long as there was water available in the coffee beans. However, the presence of inhibitors in the CN process blocked the germination, and metabolic activities developed slowly [2].

When the germination metabolism is active, the mobilization and transformation of the carbohydrate, lipid and protein polymers begin, and the resulting products from these reactions become the aroma precursors. Among the main coffee aroma precursors are amino acids and soluble sugars, which are released into the endosperm during germination. The presence of sugars [13, 14] and lipids [12] were identified in PHP and were associated with a pleasant aroma, and the taste of the roasted coffee was connected with the beverage quality.

The amount and chemical nature of the precursors are fundamental for the formation of the sensory attributes of the coffee beverage [5, 7, 8]. The characteristics of the roasted bean, such as volume expansion, surface color [15] and acidity of the beverage, are associated with coffee bean formation and maturation [16, 17] and are indicators of the quality of the harvested and processed coffee beans.

The primary sensory attributes of coffee beverage are quantified by descriptive techniques where the attributes are individually evaluated [18, 19] and the beverage receives an overall quality score [4].

The definition of the attributes of the coffee beverage is a difficult and complex task due to the interactions between aroma and flavor precursor compounds during the roasting [16]. The presence of different flavor precursors and flavors in the PHP processes determine the aromatic profile of the beverage [20]. The CN coffee beverage has a strong coffee aroma, moderate acidity, intense body and pleasant sweetness. On the other hand, in the CD coffee beverage are present a weak coffee aroma and body of medium intensity, and the acidity and sweetness are lower than in the CN coffee beverage [21].

Currently, electronic devices (electronic nose and tongue) have been constructed to mimic human activities in food quality assessments, such as control of coffee roasted coffee [22, 23]. However, the use of these devices for the sensorial evaluation of coffee is still limited to identifying the presence of coffee beverage defects [24]. But, when the purpose of the study is to describe the qualitative and quantitative aspects of the sensory attributes of a product it is recommended to apply a descriptive sensory method [25].

Chemometric techniques for data reduction are useful for classifying and identifying the quality and characteristics of a product [26]. Neural network and PLS-DA techniques were applied to classify the genetic and geographic origins of the coffee [5, 27] and support vector machines were used to identify geographical classification of green arabica coffee [28].

A successful description of PHP applied coffee is based on data sets that combine results from various analytical techniques [19]. In such cases, chemometric techniques are needed to determine the most important characteristics that need to be measured to indicate the influence of the PHP. Exploratory multivariate techniques, such as principal component analysis (PCA) and multiple factor analysis (MFA), have been commonly used to evaluate the performance of these groups of variables [29, 30]. PCA is used to reduce the dimensionality of a data set, and MFA enables the simultaneous analysis of multiple data sets, structured in groups, in which each group of variables is balanced. MFA has been used to describe coffees from different regions [19, 31], cultivars of tomatoes [32] and grapes varieties [33].

The objective of this study was to evaluate the influence of PHP in the physico-chemical characteristics of green and roasted coffee beans and the sensory attributes of natural (CN) and semi-dry (CD) coffees. An MFA was used to evaluate the influence of the groups of variables on the quality of the coffee produced in each process. The main innovation of this study was that the preparation of the coffee samples was undertaken by the producers who participated in the coffee quality competition.

Materials and methods

Coffee samples

A total of 169 coffee samples were collected in the coffee region of Paraná-Brazil from the 2004 crop (79 samples) and 2005 crop (90 samples). The samples were processed by the CN process (56 samples in 2004 and 64 samples in 2005) and by the CD process (23 samples in 2004 and 26 samples in 2005). The results were presented according to their process and the respective year of harvesting, e.g., CN04, CN05, CD04 and CD05.

All of the procedures of PHP were performed by the producers on their properties with the available infrastructures (Fig. 1). During these harvesting years, producers presented their samples in the coffee quality competition promoted by an official organization of the local coffee industry. The members of this committee were responsible for removal of the husk, parchment and defective beans. Coffee beans were stored in paper bags in a dry place (25 ± 2 °C) after passing through a # 16 coffee sieve.

Fig. 1
figure 1

Schematic plan of the post-harvest processes practiced by coffee producers

Chemical compounds

5-O-Caffeoylquinic acid (5-CQA), caffeine, glucose, trigonelline and gallic acid were purchased from Sigma-Aldrich. Potassium sulphate, copper sulphate, sodium hydroxide and petroleum ether were purchased from Quimex (São Paulo, Brazil). Sulfuric acid, zinc sulfate, sodium carbonate, anhydrous disodium phosphate, sodium and potassium tartrate, anhydrous sodium sulfate and sodium arsenate were purchased from Merck. HPLC grade acetonitrile (J.T. Baker, State of Mexico, Mexico), methanol (Nuclear, São Paulo, Brazil), acetic acid (Merck, Darmstadt, Germany) were also used.

Physico-chemical analysis of green coffee beans

For the physico-chemical analysis the green coffee beans were frozen with liquid nitrogen and ground (0.5 mm particles) in a disc mill (PERTEN 3600). The ground samples were stored at − 18 °C.

The respective moisture and proteins of the green coffee beans was determined, as described by the AOAC [34]. Phenolic compounds were determined using gallic acid by the Folin Ciocalteau reagent [34]. Lipids were extracted with petroleum ether for 16 h after the ground green coffee (2 g) was oven dried at 105 °C for 3 h [34]. Trigonelline, caffeine and 5-caffeoylquinic acid (5-CQA) were determined according Kitzberger et al. [35] and the methodology described in Scholz et al. [31] was used to determine sugar determinations (reducing and total sugars) and the results were expressed in g 100 g−1 of coffee. Total chlorogenic acids were evaluated according to Clifford and Wight [36]. Ground coffee was stirred with 70% isopropyl alcohol and the absorbance was read at 406 nm. A solution of 5-CQA (20–100 µg mL−1) was used as standard. The apparent density and titratable acidity the green coffee were determined according to Franca et al. [9] and Ramalakshmi et al. [17] respectively.

The physico-chemical determinations were performed in two replicates, and the results were expressed on a dry basis. This group of physico-chemical variables was called FGV.

Physico-chemical analysis of roasted beans

The roasted beans’ total weight loss after roasting [15], roasted coffee density and volume expansion [9] were determined. Titratable acidity, pH, and solids totals were measured for the beverage [3]. The color of roasted and ground coffee was measured with a colorimeter (Byk Gardner, model PCB-6805) previously calibrated with white and black standards and configured for D65 and a 10° angle illuminant. This group of roasted coffee variables was named FGT.

Beverage preparation and sensory analysis

Green coffee was subjected to a medium roasting process (Roaster Rod-Bel) until the desired weight was achieved [8]. Samples were tested no later than two days after roasting. The beverage for descriptive sensory analysis was prepared with roasted and ground coffee beans (70 g L−1 of boiling water). After a 1-min extraction, the beverage was filtered (Mellita filter paper) directly into a thermo flask. The beverage was served in disposable cups (30 mL at 50–60 °C) to 8 trained tasters in individual cabins with natural light no more than 20 min after the preparation.

Four coffee beverages were presented in a randomized complete block design in each session. The intensities of sensory attributes, including turbidity, coffee aroma, green aroma, sweet aroma, body, sweet taste, acid taste, grassy taste, bitter taste and astringency, were evaluated using a 10 cm scale, and the minimum and maximum attribute intensities are anchored at the ends of the scale [16]. This group of sensory attributes variables was identified as SENS.

Statistical analyses

Analysis of variance was applied to the physico-chemical of green and roasted coffees and sensory attributes variables. A Tukey test was applied for mean comparisons.

The data groups of physico-chemical characteristics of the green and roasted coffee beans and sensory attributes variables were subjected to an MFA. This technique consists of two steps. In the first step, the multiple correspondence analysis (MCA) were applied to the tables of the groups of qualitative variables. In the second step principal component analysis was applied (PPCA) to the groups of quantitative variables. PPAC results were used for global analysis that simultaneously enables the groups of variables and the samples to be associated. RV coefficient is used to evaluate the link between groups of variables. This coefficient is obtained from the linear correlation coefficients between two groups of variables. Values above 0.8 indicate that the global factor represents the trend in the individual analysis of each group of variables. Further information on multiple factor analysis can be found in Pagès [30] and Kennedy and Heymann [37].

The statistical analysis were performed using a Statistical XLSTAT- Statistical Data Analysis for Excel software.

Results and discussion

Physico-chemical characteristics and sensory attributes of natural (CN) and semi-dry (CD) coffees

The coffee samples prepared by the producers were evaluated by innumerable variables that represent the quality of PHP. The proteins, lipids, total sugars, sucrose, chlorogenic acids, phenolic compounds, trigonelline contents, green coffee density and titratable acidity of the CN and CD coffees produced during the 2 years presented significant differences. Caffeine and 5-CQA contents did not show significant differences between the processes over the 2 years of production (Table 1).

Table 1 Mean values of green and roasted coffee physico-chemical characteristics (g 100 g−1) and sensory attributes of natural coffee (CN) and semi-dry coffee (CD) processes

The roasting of the green coffee was controlled visually by brown color development. CN and CD coffees showed significantly different total weight loss to achieve a similar color. The instrumental color of the roasted coffee showed no significant differences in the luminosity (L *) between the coffee produced by different processes and in 2 consecutive years.

Comparing the total weight loss and the luminosity of the coffees from different processes, the CN05 coffees presented greater weight loss to reach the same luminosity as other coffees (Table 1). The pH of the beverage was similar in the coffees of the CN and CD processes and between the 2 years. CD05 coffees presented the lowest titratable acidity and a higher solids value than the other coffees (Table 1).

In the sensory evaluation, coffee and sweet aromas and sweet and acid tastes presented significant differences between the years and processes. The other sensory attributes were considered to be similar by the tasters (Table 1).

Relationship between groups of variables in a MFA

When the groups of variables are evaluated independently, the resulting information regarding the processes is partial, due to the predominance of one group of variables over the others [30]. MFA is a technique that enables the simultaneous analysis of the groups of variables acquired on the same set of samples without the predominance of a group of variables in the description of the samples because the influence is balanced.

In MFA applied to PHP, the first eigenvalue of FGV (7.06) of the PPCA is greater than the eigenvalue of FGT (5.72) and SENS (3.37). The balance of the variables corrected the dominant effect of the FGV group in the construction of the first axis of MFA. The successive eigenvalues of each group of variables decreased rapidly. The first two dimensions of the MFA comprise 87.51% of variance (Table 2).

Table 2 Eigenvalues (AV) and cumulative variance (VA %) of the partial PPCA in an MFA

MFA is the transposition of the axes of PPCA into a global plan. The quality of this transposition can be evaluated by the contribution to the formation of factors and the RV coefficient. The F1 was formed with the contribution of the FGV (33.46%) and SENS (31%) groups and F2 was formed with the contribution of the FGT group (37.73%). These contributions indicate that the information contained in each group of variables was transposed into MFA. The RV coefficient between MFA and FGV (RV = 0.96), FGT (RV = 0.98) and SENS (RV = 0.94) were above 0.80. These values indicate that the data contained in the individual PCA were transposed to MFA without significant loss of information, so that the MFA translates the trend in the tables of each group of variables [37]. The difference between two groups of variables was sufficiently important to justify the use of MFA to highlight the common and specific characteristics of each group of variables [30]. The FGV was correlated with FGT (RV = 0.95), while the SENS showed correlation with FGT (RV = 0.89). These values suggest that roasted coffee characteristics depend on green coffee physico-chemical characteristics that ultimately determine the sensory attributes of the beverage [3, 5].

Description of natural (CN) and semi-dry coffee (CD) processes

The qualitative variables (CN and CD processes) were analyzed by correspondence analysis (MCA). The effectiveness of the MCA is evaluated by the contribution of the group of variables in the formation of the factors. In this case, CN04, CD04 and CD05 processes were associated with F1 with contributions of 28.96%, 20.75% and 31.95%, respectively, and CN04 process showed contribution of 43.74% to F2.

The relationship between the PCA projections of the individual groups and those projections resulting from MFA (centroid) for the evaluated processes and years is shown in Fig. 2. The distance between the centroid and PPAC points indicates the influence that individual groups of variables (FQV, FQT and SENS) would exert on the descriptions of the CD and CN coffees if they were used separately.

Fig. 2
figure 2

Projections of the natural (CN) and semi-dry (CD) processes on the space formed by F1 and F2 of PPCA and MFA

The largest divergences between the descriptions with the individual group of variables occur only in coffees from the CD process, suggesting that descriptions with the SENS variables would diverge from the descriptions with the FGV and FGT variables. These divergences were corrected when the MFA was applied and thus enabled the description of coffees with all variables simultaneously.

MFA showed that coffee CD04, CD05 and CN04 were associated with the of FGV and SENS groups. Conversely, CN05 coffee is related to the FGT group due to the higher contributions of this group of variables to the F1 factor formation (Table 2).

The projection of the variables (Fig. 3a) and the CN and CD processes (Fig. 3b) in the space formed by F1 and F2 of the MFA allows for a description of the relation between the variables and the factors, and an association between the variables and the processes.

Fig. 3
figure 3

Representation of MFA of variables (a) and processes (b) projected on the space of F1 and F2

The variables of the individual group and their relationships to F1 and F2 of MFA were shown in Table 3. The variables forming the MFA factors were sufficiently different to promote separations between the coffees produced in the 2 years and those submitted to one of the PHP.

Table 3 Variables forming F1 and F2 factors of MFA

Coffees produced in the 2004 crop and in the 2005 crop were separated by F1; coffee from the 2004 crop was associated with F1 negative, while the coffees from the 2005 crop were associated with F positive, regardless of the process to which they were submitted. Conversely, the CN and CD processes were associated with F2 negative and F2 positive, respectively.

Annual coffee harvests were submitted to the CN or CD process. In the MFA, the separation of the annual crop occurred due to the F1 forming variables, demonstrating the influence of environmental conditions on the content of the compounds and the intensity of the sensory attributes associated with F1.

Coffees from the 2005 crop presented higher sugar (TS, Sac and RS) and proteins contents than those harvested in the 2004 harvest (Fig. 3a, b), regardless of the process used. In addition, the coffees of the 2005 harvest presented a lower 5-CQA and trigonelline contents. These results suggest that these compounds were strongly influenced by environmental conditions. Chemical composition of the coffee beans, especially sugars, are related to fruit development [38, 39] and depends on the local environmental conditions of the year of production [40].

Immature coffees have incomplete cell wall structures, and the gas formed during the roasting expands the cell due to a lack of cell wall resistance, reducing the density of the roasted coffee beans [15]. Additionally, immature green coffee beans have a high content of 5-CQA [9].

Coffee harvested from the 2005 crop reached a better maturation than in the 2004 crop because it presented lower a content of 5-CQA and roasted coffee beans presented a darker coloration (low L* value), higher acidity (low pH) and lower volume expansion, suggesting complete coffee bean maturation [8, 9].

The beverages of the coffee from the 2005 crop were associated with intense turbidity, high acid taste and astringency, and full-bodied texture. These attributes were accompanied by a low intensity of bitterness and sweet taste and aroma (Table 3). Conversely, coffee beverages from the 2004 crop had a high intensity of sweet aroma and sweet and bitter tastes.

Therefore, the results of coffee harvested in the 2 years point to the strong effect of environmental conditions on these characteristics and the evaluated sensory attributes.

Coffee from the CN and CD processes and each year of production were separated. CN processes were associated with F2 positive and CD processes  with F2 negative, regardless of the crop year (Fig. 3b). Considering the variables forming the factor F2 positive, the green coffee beans of the CN processes (CN04 and CN05) presented high lipids, caffeine, phenolic compounds, chlorogenic acids contents and titratable acidity compared to the CD processes. The roasted coffee beans of the CN processes showed a greater weight loss to reach the desired color and showed a higher Brix value.

The green coffee beans of the CD processes (CD04 and CD05) presented a high density of green and roasted coffee beans and a high acidity in the beverage. The primary attributes of CD coffees were high coffee and grassy aromas, low body and grassy taste.

The characteristics of green and roasted coffee beans and sensory attributes found in coffees from PHP can be attributed to the germination metabolism and water stress that occur during these PHP [39,40,41,42]. In the CN, the presence of husk inhibits the germination and the macromolecules of the endosperm remain nearly intact in the end of process. During roasting, these compounds were less reactive, and, consequently the formation of volatile compounds responsible for aroma and taste were low.

In addition, larger molecules (polymers), such as phenolic compounds, have been observed to be responsible for the green aroma and taste found in the beverages [8, 9]. Astringency caused by phenolic compounds can be reduced when the polymers were degraded in the germination [43, 44]. In the CN coffee, these compounds have been altered and the astringency can still be perceived.

In the CD coffee, the removal of the husk and residues of the pulp activate the germination that makes the degradation of the polymers possible. Studies have shown that CD coffees have a lower amount of water-soluble galactomannan than the CN, which was associated with cell wall degradation [14].

Such compounds as lipids [45] and proteins were also hydrolyzed in germination [46]. All of these compounds result in small molecules that during the roasting can form a greater number of volatile compounds [47]. The precursors present in the green coffee beans in each PHP are different, and different aromatic compounds are formed during the roasting. The combination of these volatile compounds will result in the specific aromatic profile and sensory attributes of the CD and CN processes.

Conclusions

The innovative evaluation of the coffees processed by producers in 2 consecutive years showed the influence of the PHP on the physico-chemical characteristics of the green and roasted coffee beans and on the sensory attributes. The differences found between the processes were maintained in the 2 years indicating the efficiency of the evaluated variables and the use of MFA in this separation. The application of the MFA showed that the description of the characteristics of the processes depends on the set of variables and the process. The greatest divergence of descriptions was observed when CD coffee was described by sensory variables. Additionally, this study also showed that sucrose, proteins and 5-CQA contents were associated with the year of production, whereas lipids, phenolic compounds, caffeine and chlorogenic acids contents were associated with the processes. It was also noted that the same attributes occurred in the description of both processes, suggesting that they depend on the content of precursors in the green coffee beans. Interestingly, the coffee samples prepared by the producers presented similar results to the laboratory samples, indicating that the reactions were determined by the process conditions, regardless of the amount processed. Finally, the formation of the precursors was influenced by the factors linked to peeling and drying of green coffee. These factors are the primary critical points that must be controlled to assure the success of each PHP.