Selective efficiency of genome-wide selection in Coffea canephora breeding

Alkimim, Emilly Ruas; Caixeta, Eveline Teixeira; Sousa, Tiago Vieira; Resende, Marcos Deon Vilela; da Silva, Felipe Lopes; Sakiyama, Ney Sussumu; Zambolim, Laércio

doi:10.1007/s11295-020-01433-3

Selective efficiency of genome-wide selection in Coffea canephora breeding

Original Article
Published: 20 April 2020

Volume 16, article number 41, (2020)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Tree Genetics & Genomes Aims and scope Submit manuscript

Selective efficiency of genome-wide selection in Coffea canephora breeding

Download PDF

Emilly Ruas Alkimim¹,
Eveline Teixeira Caixeta²,
Tiago Vieira Sousa³,
Marcos Deon Vilela Resende⁴,
Felipe Lopes da Silva⁵,
Ney Sussumu Sakiyama⁵ &
…
Laércio Zambolim⁶

1018 Accesses
27 Citations
2 Altmetric
Explore all metrics

Abstract

Over the years, breeding programs have sought efficient strategies to select genotypes with superior performance. Genome-wide selection (GWS) emerged in 2001, aiming to increase efficiency and accelerate selection gain. This technique is considered essential in the breeding of perennial species, such as Coffea canephora, mainly due to the potential to increase the gain per unit of time. Thus, this study aimed to apply the GWS principle, evaluate the efficiency of this technique in C. canephora population using SNP molecular markers, and evaluate eight main phenotypic traits. A total of 165 genotypes were evaluated, being 51 of varietal group of Conilon, 32 of Robusta, and 82 intervarietal hybrids. Through the sequencing of the RAPiD Genomics company, 18,111 SNP markers were identified, of which 14,429 were used after quality analysis. All traits showed good predictive capacity, except for fruit maturation time, fruit size, and yield per plant. The lower values of genomic heritability found for these traits may justify the low values of predictive capacity obtained. The accuracy values estimated were considered as moderate to high, ranging from 67 to 82%. By shortening the cycle time from 6 to 3 years, GWS provided selective efficiency ranging from 22 to 146%. Results revealed that GWS provides higher gains per unit of time. Therefore, GWS proved to be a useful and promising tool for the breeding of C. canephora for accurately predicting the individuals’ genotypes, shortening the time required to complete the selection cycle and providing gains in selective efficiency per unit of time.

Markers, Maps, and Marker-Assisted Selection

Efficiency of genomic selection for breeding population design and phenotype prediction in tomato

Article 14 September 2016

Genotyping-by-sequencing and multilocation evaluation of two interspecific backcross populations identify QTLs for yield-related traits in pigeonpea

Article 16 December 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

For a long time, genetic breeding programs selected the best individuals using morphological markers. These markers are influenced by the environment and have low selection gain (Toppa and Jadoski 2013). Technology has improved the use of molecular markers, allowing significant progress in the selection of superior individuals by detecting DNA polymorphism (Sousa et al. 2017; Alkimim et al. 2017). In the early 1990s, marker-assisted selection (MAS) was proposed, based on the existence of linkage disequilibrium between DNA markers and genes of interest (Lande and Thompson 1990). When comparing MAS for oligogenic traits with phenotypic selection, the selective efficiency increases and the time required to carry out selection shortens, among other benefits (Noir et al. 2003; Lopez et al. 2013; Romero et al. 2014; Alkimim et al. 2017). Also, this technique enables preventive breeding by allowing the selection of genotypes carrying genes of interest in regions where the pathogen is absent (Alkimim et al. 2017). However, the MAS technique has proven to be more efficient for monogenic or oligogenic traits with high heritability (Asins et al. 2012; Kemper and Goddard 2012). Agronomic traits, in general, are governed by several genes, compromising the efficiency of MAS. Therefore, a new selection method, known as genome-wide selection (GWS), was developed (Meuwissen et al. 2001). GWS emphasizes the simultaneous prediction of the genetic effects of hundreds or thousands of markers that densely cover the genome. Thus, all quantitative trait loci (QTL) of a quantitative trait are expected to be in linkage disequilibrium with at least some of the markers (Grattapaglia and Resende 2011; Valente et al. 2016). GWS stands out for promoting high selective accuracy and for not requiring the knowledge of the prior location (maps) of the QTL in the chromosomes (Meuwissen 2007; Jannink et al. 2010; de Almeida et al. 2016).

In the GWS approach, genomic estimated breeding value (GEBV) can be predicted by different statistical methodologies, including GBLUP (genomic best linear unbiased prediction) (VanRaden 2008). In this method, the GEBV values are predicted using the kinship matrix, estimated from the information of molecular markers, known as the genomic kinship matrix (G). The GBLUP prediction uses much more information on parentage than phenotypic selection, which is based on pedigree (through the parentage matrix A). Then genomic heritability and accuracy of genomic selection can sometimes be higher than those parameters from phenotypic selection. And this can be explained by the many more genetic relationship in the G (the genomic relationship matrix) than in A (the genetic relationship matrix based on genealogy). This increase in the amount of information by using the genomic matrix G can, sometimes, lead to better and more precise estimations and predictions. For two populations and its hybrid population, the genetic variance and heritability are defined at the interpopulation level (Bernardo, 2010; Resende, 2015).

The GBLUP is advantageous for its simplicity and for the shorter computational time required (Heslot et al. 2015). This method is mostly recommended for polygenic traits, which are governed by several genes of minor effect (VanRaden 2008). GBLUP is suitable for the analysis of continuous traits or outcomes. For non-normally distributed traits such as those evaluated by a score scale, GBLUP can be used with the technique called generalized linear model. The results may not differ so much from those got by using the standard procedure of linear mixed model. This is in line with theory, which preconizes that the higher the number of score scale classes, the smaller the benefit from using the generalized linear model technique (Sousa et al. 2019).

The molecular markers SNPs (single nucleotide polymorphism), used in GWS studies, stand out for being the most common type of polymorphism in genomes, for the possibility of automation, and for being codominant and biallelic (Resende et al. 2008; Liao and Lee 2010). Recently, the development of next-generation sequencing (NGS) platforms has facilitated the discovery of SNPs, decreasing data point costs. With the identification of SNP markers widely distributed in the species genome, GWS has become a reality, allowing significant gains for several breeding programs (Goddard and Hayes 2007; Meuwissen 2007; Carvalho and Silva 2010; Resende et al. 2012; Fritsche-Neto et al. 2012; Van Eenennaam et al. 2014; Zhao et al. 2015; Sousa et al. 2019).

Although significant, the number of reports regarding GWS in the genus Coffea, even for species of commercial importance, such as Coffea canephora and Coffea arabica, is still low (Ferrão et al. 2017, 2019; Sousa et al. 2019). In a recent study with two populations of a recurrent selection of C. canephora, genotyping by sequencing (GBS) showed good potential to be used in coffee breeding programs (Ferrão et al. 2017). C. canephora is characterized as allogamous, diploid (2n=2x=22), with gametophytic self-incompatibility (Leroy et al. 2005). The species stands out for its rusticity, high yield potential, higher soluble solids content, and genetic resistance to coffee leaf rust, caused by the fungus Hemileia vastatrix (Zambolim 2016).

This study aimed to apply the GWS principle and evaluate its efficiency in the prediction of genomic-genetic value and in the shortening of the selective cycle in C. canephora population, through the RAPiD Genomics sequencing company, by building specific probes in coding and non-coding regions.

Material and methods

Genetic material

The population consisted of clones of the Conilon and Robusta varietal groups and intervarietal hybrids originated from crosses between these groups. The Conilon genetic material was obtained from the Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural (Incaper), and the Robusta material was obtained from the Centro Agronómico Tropical de Investigación y Enseñanza (CATIE). This population composes the breeding program of the Empresa de Pesquisa Agropecuária de Minas Gerais (Epamig), in partnership with the Universidade Federal de Viçosa (UFV) and the Empresa Brasileira de Pesquisa Agropecuária—Café (Embrapa Café), located in Oratórios/MG and Viçosa/MG.

The Conilon and Robusta varietal groups consisted of 51 and 32 genotypes (Table 1), respectively. Also, 82 intervarietal hybrids were obtained by artificial crosses between five genotypes of the Conilon group (male parents) and five genotypes of the Robusta group (female parents), evaluated in the interpopulational partial diallel (Table 2).

Table 1 Coffea canephora genotypes used in the genome-wide selection study

Full size table

Table 2 Intervarietal hybrids and description of their crosses, used in the genome-wide selection study

Full size table

For the non-crossed genotypes, information came only from the parents of Conilon and Robusta in the experiment and from their parentage with the crossed parents.

Phenotypic evaluations

The experiment was established in an incomplete block design with up to 35 replicates and single tree plots. It included hybrids and parents. Phenotypic evaluations were carried out for eight traits during three consecutive years (2014–2016). Five categorical traits and three continuous traits were evaluated. Evaluations were performed at the time of physiological maturity of the coffee fruits.

The categorical traits evaluated were as follows: vegetative vigor (Vig), field evaluation of rust incidence (Rus) and cercosporiosis incidence (Cer), fruit maturation time (Mat), and fruit size (FS). The vegetative vigor was evaluated by the general appearance of the plant, by observing plant leaf development, leaf color, nutritional status, and health of coffee plants. A score scale ranging from 1 to 10 was used, where 1 was attributed to totally depleted plants and 10 was assigned to highly vigorous plants. Rust incidence and cercosporiosis were evaluated by a score scale ranging from 1 to 5, where 1 was attributed to genotypes with no symptoms of the pathogen, and 5 was assigned to highly susceptible genotypes, cercosporiosis. Fruit maturation time was classified as early, intermediate, and late, with scores ranging from 1 to 3, respectively. Fruit size was classified as small, medium, and large, with scores from 1 to 3, respectively.

The continuous traits evaluated were as follows: plant height (PH), diameter of the canopy projection (DC), and yield in liters per plant (Y). Plant height (cm) was determined by measuring the most developed orthotropic branch, from the ground to the last apical point of the coffee plant, using a measurement tape fixed to a wooden rod. The diameter of the canopy projection was determined in centimeters (cm), using a ruler perpendicular to the planting row. The yield per coffee plant was evaluated by harvesting all the fruits in a genotype and measuring the total volume in liters of freshly harvested coffee.

Analysis of phenotypic data

The phenotypes were corrected for environmental effects of years and blocks using the Selegen REML/BLUP software (de Resende 2016). The model used was as follows: y=Xu+Za+Wc+Qs+Sb+e, where y is the data vector; u is the vector of year-mean effects (assumed as fixed) added to the overall mean; c is the vector of specific combining ability effects between the Conilon and Robusta parents (assumed as random and distributed as N~I$ {\sigma}_c^2 $); a is the vector of additive genetic effects of individuals (assumed as random and distributed as N~A$ {\sigma}_a^2 $); s is the vector of permanent effects of individuals (assumed as random and distributed as N~I$ {\sigma}_s^2 $); b is the vector of permanent environment effects of blocks (assumed as random and distributed as N~I$ {\sigma}_b^2 $); and e is the residual vector (assumed as random and distributed as N~I$ {\sigma}_e^2 $). All the effects were assumed as uncorrelated. Uppercase represent the incidence matrices for these effects. The corrected phenotypes were given by corrected phenotypes were given by y∗=y−Xu^−Sb^ and are called deregressed phenotypes, which enter in the genomic analyses (Garrick et al. 2009; de Andrade et al. 2019).

The selective accuracy was obtained by the equation r_yy = (1 − PEV/$ {\sigma}_a^2 $)^1/2, where $ {\sigma}_a^2 $ is the additive genetic variation between individuals under evaluation and PEV is the variance of the prediction error, given by PEV = $ {C}_i^{22}{\sigma}_e^2 $, where $ {C}_i^{22} $ is the ith element of the inverse diagonal of the matrix of the coefficients of the mixed model equations, and $ {\sigma}_e^2 $ is the residual variance.

According to the model y=Xu+Za+Wc+Qs+Sb+e, the individual heritability was estimated by the following: $ {h}^2={\sigma}_a^2/\left({\sigma}_a^2+{\sigma}_c^2+{\sigma}_s^2+{\sigma}_b^2+{\sigma}_e^2\right) $, where $ {\sigma}_j^2 $ is the variance component associated to the j effect.

Genomic DNA extraction, identification, and quality analysis of SNP markers

Young and fully expanded leaves of the 165 coffee trees under study were collected, and the genomic DNA was extracted using the methodology described by Diniz et al. (2005). The DNA concentration was verified in NanoDrop 2000, and its quality was evaluated in 1% agarose gel. The DNA concentration of the samples was standardized and sent to RAPiD Genomics, located in Florida, USA, for the construction of probes, sequencing, and identification of SNP molecular markers.

To identify SNP markers and coffee genotyping, 10,000 probes were selected from 40,000 polymorphic probes (Resende et al. 2016), and 18,111 SNP markers were identified. The probes were constructed from reference sequences. One of the databases was the Brazilian Genome Coffee Project, which contains over 200,000 ESTs (expressed sequence tags), corresponding to about 33,000 transcribed genes, known as Unigenes (Vieira et al. 2006). Another one was the reference genome of the C. canephora species, containing a total of 25,574 genes (Denoeud et al. 2014). Using these reference sequences, specific probes were obtained so that the whole genome was covered, considering both coding and non-coding regions. With these probes, the coffee genotypes were sequenced using the Illumina platform, and the SNP markers were identified using the methodology developed by the company RAPiD Genomics (Resende et al. 2016), developed for humans (Gnirke et al. 2009), and adapted to plants (Neves et al. 2013, 2014). This technology uses a method of genotyping-by-sequencing of specific regions of the genome. Details of the construction of the probes and identification of the SNP markers can be obtained in the study carried out by Alkimim et al. (2018). The SNPs set was subject to analysis of quality implemented in the Rbio software (Bhering 2017). Quality control of SNPs was carried out by the MAF (minor allele frequency—higher than or equal to 5%) and/or call rate (CR—higher than or equal to 90%). The critical level for the MAF parameter was obtained by the equation $ \mathrm{MAF}=\frac{1}{\sqrt{2N}} $, where N refers to the total number of genotypes evaluated (de Resende et al. 2017).

Prediction using the GBLUP model

Analyses were carried out using the GBLUP method via RKHS (Reproducing Kernel Hilbert Spaces) (Gianola 2006), with a Bayesian algorithm, via R environment, in the BGLR package (Resende 2008; Perez and De Los Campos 2014). RKHS accounts for the genetic effects using the Gaussian kernel matrix (K). K = exp. (− hD / median(D)), where h is the reduction coefficient to K values, h is equal to 1, and D is the Euclidean distance of codified markers matrix. A total of 100,000 Markov Chain Monte Carlo (MCMC) iterations were used, with a burn-in of the first 2000 MCMC iterations and a sampling interval (thinning) of 10.

The general mixed linear model (de Resende 2007, 2015; VanRaden 2008) was adjusted to estimate the additive genetic effects of the individuals: y* = Xm + Zg + e, where y* is the vector of corrected phenotypic observations, m is the vector of fixed effects (general mean), g is the vector of random effects of the additive genomic effects of the individuals (assumed distributed as N~G$ {\sigma}_g^2 $), and e refers to the vector of random residuals. Uppercase letters represent the incidence matrices for these effects. The genomic mixed model equations for the prediction of g using the GBLUP method are given by the following:

$$ \left[\begin{array}{cc}X^{\prime }X& X^{\prime }Z\\ {}Z^{\prime }X& {Z}^{\prime }Z+G\frac{\sigma_e^2}{\sigma_g^2}\end{array}\right]\left[\begin{array}{c}\hat{m}\\ {}\hat{g}\end{array}\right]=\left[\begin{array}{c}X{\prime}_{y\ast}\\ {}Z{\prime}_{y\ast}\end{array}\right] $$

The genomic relationship matrix G comes from a incidence matrix M which contains the values 0, 1, and 2 for the number of alleles of the marker (or the so-called QTL) in a diploid individual.

The component M_ij refers to the element i of the row j of the matrix M, referring to individual j. G is a function of MM′ (VanRaden 2008). The genomic heritability was computed as $ {h}_a^2=\frac{\sigma_g^2}{\left({\sigma}_g^2+{\sigma}_e^2\right)} $, where $ {\sigma}_g^2 $ is the additive genomic variance and $ {\sigma}_e^2 $is the residual variance.

Cross-validation

The cross-validation method K-fold was used, considering k=11 folds. The set of observations of 165 genotypes was, randomly, divided into groups. In the process of analysis, 150 genotypes were used as training population, and the group of 15 genotypes (remaining of original population of 165 individuals) was used as the validation population. This procedure was repeated 11 times (k=11) so that all groups of excluded genotypes were used in the validation.

Predictive capacity, prediction, and accuracy bias of GWS

The predictive capacity and the prediction bias are practical measures of the capacity of a method in predicting with accuracy and not with bias. The predictive capacity (r_gy) is determined by the correlation between the predicted genomic values and the observed phenotypic values, which are equivalent to the GWS predictive capacity to estimate the phenotypes. The prediction bias (b) is determined by the coefficient of regression of the predicted genomic values on the phenotypic values (de Resende et al. 2012; Pértile et al. 2016). The accuracy was determined by the estimator r_gg=$ {r}_{\mathrm{gy}}/\sqrt{h^2} $, where r_gy is the prediction ability of GWS, and h² is the individual heritability (Borém and Fritsche-Neto 2013).

Estimate of the number of QTL (n _QTL) and number of individuals (Ni) to obtain desired accuracy

The estimate of the number of QTLs that control each trait was calculated by the expression $ {n}_{\mathrm{QTL}}=\frac{\left(1-{r}_{\mathrm{gg}}^2\right)N{h}^2}{r_{\mathrm{gg}}^2} $, where r_gg is equivalent to the GWS accuracy, N refers to the number of individuals in the population, and h² is the individual heritability (de Resende et al. 2014).

The estimate of the number of individuals (Ni) that should be evaluated in order to obtain desired accuracy was calculated by the expression $ \mathrm{Ni}=\frac{r_{\mathrm{gg}}^2{n}_{\mathrm{QTL}}}{\left(1-{r}_{\mathrm{gg}}^2\right){h}^2} $, where r_gg is equivalent to the accuracy of GWS, n_QTL is the number of QTLs that control each trait, and h²is the individual heritability (de Resende et al. 2014).

Efficiency of GWS

The selective efficiency of GWS compared with the selection based only on 6-year phenotypes was calculated using the expression $ \mathrm{Ef}=\frac{r_{\mathrm{gy}}{L}_{\mathrm{f}}}{r_{\mathrm{yy}}{L}_{\mathrm{GWS}}} $, where r_gy is the predictive capacity of GWS, r_yy is the accuracy of the selection based on phenotypes, L_fis the mean time required for the selection cycle based on phenotypes, and L_GWS is the mean time required for the selection cycle based on GWS (de Resende et al. 2012).

Results

Analysis of phenotypic data

Phenotype data were corrected for environmental effects of years and blocks. The values of selective accuracy (r_yy) were estimated from the phenotypic evaluations (Table 3).

Table 3 Genome-wide selection (GWS) and estimates of selective accuracy based on phenotypic data obtained by mixed model analysis (REML/BLUP) for eight morphoagronomic traits in a breeding population of Coffea canephora

Full size table

No satisfactory predictive capacity was verified for the traits fruit maturation time, fruit size, and yield per plant. Therefore, the accuracy values of these traits were not estimated. In addition, for the trait cercosporiosis incidence, selective accuracy was not estimated since the value of broad-sense heritability was 0, based on the phenotypic data.

In general, the evaluated traits showed r_yy of high magnitude. Values ranged from 39% for rust incidence to 67% for the plant height.

Analysis of quality of SNP markers

With the sequencing of the 165 genotypes using 10,000 probes (previously selected and distributed throughout the genome), 18,111 SNPs were identified. After the quality analyses, carried out in the Rbio software (Bhering 2017), 14,429 SNP markers were obtained. The initial set of SNP markers reduced by 20.33% (Fig. 1). The number of SNPs per chromosome, after the quality analyses, ranged from 4 to 2163. The highest number of SNPs was observed on chromosomes 0 and 2 (Fig. 1). We made available a file with 14,429 SPNs used in the genetic analyzes and their respective positions in the genome (Online Resource 1).

Genomic heritability, predictive capacity, prediction, and accuracy bias of GWS

Estimates of genomic heritability values, predictive capacity of GWS, prediction bias, and accuracy based on the phenotype data are shown in Table 3.

The estimated genomic heritability values ($ {h}_a^2 $) ranged from 0.15 for the trait yield per plant (Y) to 0.53 for the trait diameter of the canopy projection (DC). Despite the considerable $ {h}_a^2 $values obtained for fruit maturation time (0.21), fruit size (0.21), and yield per plant (0.15), their predictive capacity was low.

Regarding predictive capacity of GWS (r_gy), the traits Vig (0.44), Rus (0.48), Cer (0.54), PH (0.41), and DC (0.58) stood out for their high estimate. This confirms that, in general, the r_gy values were higher for traits that had the highest $ {h}_a^2 $ values.

The prediction bias (b) resulted in values close to 1.0 for the traits vegetative vigor, rust incidence, cercosporiosis incidence, plant height, diameter of the canopy projection, fruit maturation time, and fruit size. The trait yield per plant showed no prediction bias close to 1.0. In addition, this trait had the lowest estimate value of genomic heritability.

The accuracy of GWS (r_gg) for the traits fruit maturation time, fruit size, and plant yield was not satisfactory, and therefore, their accuracy was not estimated.

The estimates of the accuracy values were obtained for the other traits, ranging from 67% (Vig) to 82% (Rus). der_gg values were moderate (68%) to high (79%), even for plant height and rust incidence, which had low $ {h}_a^2 $ values (0.36 and 0.37, respectively).

Estimate of the number of QTL (n _QTL) and number of individuals (Ni) to obtain desired accuracy

The estimated number of QTLs controlling each trait ranged from 35 (Cer) to 87 (Vig). In addition, the lowest values of accuracy of GWS, 67% (Vig) and 68% (PH), were obtained for the traits that had the highest number of QTLs (Table 4).

Table 4 Number of QTLs that control the trait (n_QTL) and number of individuals (Ni) required to achieve desired accuracy of GWS (r_ggd) in a breeding population of Coffea canephora for Vig, Rus, Cer, PH, and DC traits

Full size table

Table 4 shows the estimated number of individuals (Ni) that should be evaluated to achieve desired accuracy (r_ggd). The values of desired accuracy used were of 0.50, 0.60, 0.70, 0.80, and 0.90 to estimate the number of individuals. This calculation considered estimates of the genomic heritability values ($ {h}_a^2 $), shown in Table 3, and the estimated number of QTLs (n_QTL) controlling each trait, shown in Table 4. To obtain desired accuracy of 70%, which is considered of high magnitude (de Resende and Duarte 2007), 194 individuals need to be evaluated for the trait vegetative vigor, 96 for coffee rust, 78 for cercosporiosis incidence, 184 for plant height, and 89 for diameter of the canopy projection. For all traits evaluated, the higher the accuracy desired, the larger was the number of individuals to be analyzed.

Efficiency of GWS

Figure 2 shows the efficiency of the GWS with the decrease of the selective cycle in relation to the selection based only on 6 years of phenotypic data, for all traits that had good predictive capacity, except for cercosporiosis incidence (Cer). Thus, the GWS efficiency was estimated for the traits vegetative vigor, rust incidence, plant height, and diameter of the canopy projection.

Figure 2 shows an increase in selective efficiency by using GWS, even for vegetative vigor and plant height, which had high estimates of accuracy from selection based on phenotypic data (60 and 67%, respectively) (Table 2). Even the traits with low $ {h}_a^2 $ values, plant height (0.36), and rust incidence (0.37) showed efficiency gains with GWS. With the decrease of the selective cycle from 6 to 3 years, GWS was higher (ranging from 22 to 146%) for all traits.

Discussion

Analysis of phenotypic data

Selective accuracy (r_yy) was estimated by the REML/BLUP method (de Resende 2016). Selective accuracy reflects the quality of the information and procedures used to predict the genetic values of the individuals (Sousa et al. 2019).

In general, the evaluated traits had high magnitude r_yy, ranging from 39% (R) to 67% (PH). Therefore, the higher the value of the selective accuracy, the higher is the confidence in the evaluation and the predicted genetic value of an individual (Sousa et al. 2019).

Quality analysis of the SNP markers

Quality analyses revealed 14,429 SNP molecular markers. Quality evaluations allow identifying markers with ideal quality criteria (Sant’Ana et al. 2018). In addition, this evaluations are advantageous for they remove poor quality markers prior to the statistical analyses, consequently decreasing the occurrence of false-positive (type I error) and false-negative (type II error) (Anderson et al. 2010). High marker density is essential for capturing genes with lower and higher effect and, consequently, increasing the probability of explaining most of the genetic variation of the study trait (Resende et al. 2008; Resende et al. 2016). Valente et al. (2016) found that the use of higher marker densities is required to obtain prediction accuracy of high magnitude (>70%, according to de Resende and Duarte 2007).

A greater number of SNPs were identified on chromosomes 0 and 2. However, chromosome 0 is not a true chromosome, but a set of unsorted sequence scaffolds of C. canephora. This result, number of SNPs identified on chromosome 2, may be due to the length of chromosome 2 in the genome of C. canephora (Denoeud et al. 2014).

Genomic heritability, predictive capacity, prediction bias, and GWS accuracy

The results of estimated values of genomic heritability (0.15 for yield per plant at 0.53 for diameter of the canopy projection) indicate the inheritable capacity of each trait. Despite the considerable $ {h}_a^2 $ values, given the genetic complexity of the traits, obtained for the fruit maturation time (0.21), fruit size (0.21), and yield per plant (0.15), their predictive capacity was low. Estimates of predictive capacity are expected to be lower for traits with low heritability (Legarra et al. 2008). The fact that these three traits had the lowest $ {h}_a^2 $ values justifies their low predictive capacity values.

Good predictive capacity (r_gy) was recorded for the traits Vig (0.44), Rus (0.48), Cer (0.54), PH (0.41), and DC (0.58), indicating the capacity to anticipate phenotypes for these traits. All of them are lower than 58%. Although 58% is not a high (above 70%) magnitude, it can be possible do have genetic gain from genomic selection and it can be higher per unit of time than that from phenotypic selection. These data show that, in general, the r_gy values were higher for the traits that had the highest $ {h}_a^2 $values. A GWS study with cashew tree (Anacardium occidentale) also revealed a response of the predictive capacity in function of the heritability (Cavalcanti et al. 2012).

The prediction bias (b) was close to 1.0 for vegetative vigor, rust incidence, cercosporosis, plant height, diameter of the canopy projection, fruit maturation time, and fruit size. Predictive bias close to 1.0 indicates that the prediction was non-biased and, therefore, is effective in predicting the real magnitudes of the differences between the individuals evaluated (Resende et al. 2012). The yield per plan trait had a biased prediction. This trait had also the lowest estimate value of genomic heritability, which may justify the observed bias. In addition, traits governed by larger numbers of genes require populations with larger sample sizes.

The accuracy can be classified as very high (>90%); high (70–90%); moderate (50–70%), and low (<50%) (de Resende and Duarte 2007; Rabier et al. 2016). The traits fruit maturation time, fruit size, and yield per plant showed unsatisfactory predictive capacity. Therefore, their accuracy values were not estimated. The estimates of the accuracy values were obtained for the other traits, ranging from 67% (Vig) to 82% (R). The r_gg values were moderate (68%) to high (79%), even for plant height and rust incidence, which had low $ {\mathrm{h}}_{\mathrm{a}}^2 $ values (0.36 and 0.37, respectively). These results confirm the efficiency of GWS in the selection of traits with low heritability and agree with other studies (Legarra et al. 2008; Zhang et al. 2010).

Estimate of the number of QTLs (n _QTL) and number of individuals (Ni) to obtain desired accuracy

The estimated number of QTLs controlling each trait ranged from 35 (Cer) to 87 (Vig). These results show the quantitative nature of the traits evaluated in this study. In addition, traits with the highest number of QTLs were those with the lowest values of accuracy of GWS, 67% (Vig) and 68% (PH). A study carried out with oil palm (Elaeis guineensis Jacq.) revealed that the accuracy of GWS is inversely proportional to the number of QTLs that control the traits (Wong and Bernardo 2008). This is expected because traits governed by a larger number of QTLs are more complex. Also, in polygenic traits, in general, each gene has a little contribution to the manifestation of the trait. However, even in a scenario under low heritability and a higher number of QTLs, results of our GWS work were promising. This event probably occurred due to the set of SNPs selected by the quality criteria. Another fact is that the SNPs are widely distributed along the genome of this species. These results lead us to highlight the importance of implementing the GWS technique in the Brazilian coffee breeding program. This can be gradually incorporated, obeying the pace and conditions of each program, as an auxiliary tool in the practical conduction of improvement programs, in order to obtain quick and accurate gains.

The estimate of the number of individuals required to achieve desired accuracy was obtained considering values of desired accuracy of 0.50, 0.60, 0.70, 0.80, and 0.90. To obtain desired accuracy (r_ggd) of 70% (high magnitude) (de Resende and Duarte 2007), 194 individuals would have to be evaluated for vegetative vigor, 96 for coffee rust incidence, 78 for cercosporiosis incidence, 184 for plant height, and 89 for diameter of the canopy projection. Thus, this study evaluated more individuals than necessary to achieve accuracy of 70% for most of the traits (Rus, Cer, and DC). In addition, this study revealed that the higher the desired accuracy, the larger is the number of individuals to be analyzed.

Efficiency of GWS

The y-axis values shown in Fig. 2 mean the ratio of gain, per unit of time, between genomic selection, and phenotypic selection. When higher than 1, it indicates that genomic selection will provide superior gain. For example, for a value of 1.25, the gain is 25%. Genomic prediction is an additional information on the genetic value of the individual and as such can allow an earlier selection with precision and so can enable reduction of the number of harvests below four (it is well-known that coffee trials require, on average, four harvests for an efficient selection).

In perennial species, such as C. canephora, one of the advantages of genomic selection is the shortening of the selection cycle in order to practice early selection (Castro et al. 2016). In our work, the selective efficiency increased when using GWS, even for vegetative vigor and plant height, which showed high estimates of accuracy based only on phenotype data (60 and 67%, respectively). This increase in efficiency is due to the reduction in the time required to complete a selective cycle using GWS. Thus, reducing the cycle from 6 to 3 years increased the selective efficiency of the GWS for all the traits throughout the reduction of the cycle. Therefore, even when the accuracy of genomic selection has the same magnitude as that obtained with selection based on phenotypic data, GWS will provide higher genetic gains due to the shorter selection cycle (Gois et al. 2016).

With the decrease of the selective cycle from 6 to 3 years, GWS was more efficient (ranging from 22 to 146%) for all traits. Thus, the reduction in the time required to complete a selective cycle is significant when using GWS. Genomic prediction and selection can be performed at the seedling stage, and therefore, GWS has a higher efficiency per unit of time (Resende et al. 2012; Gois et al. 2016). Similar results were observed in other studies. In a study evaluating the selective efficiency of GWS with a 50% decrease in the selective cycle of citrus, GWS was superior for all the traits evaluated (ranging from 31 to 160%) (Gois et al. 2016). A study with oil palm (Elaeis guineensis Jacq.) revealed a reduction in the selection cycle from 19 to 6 years when using GWS (Wong and Bernardo 2008). In a study with maize (Zea mays L.), the use of GWS significantly increased the selective accuracy and the genetic gains per unit of time (Fritsche-Neto et al. 2012). In other studies, genomic selection also showed great potential in increasing breeding efficiency by simulated results (Resende et al. 2008; Valente et al. 2016).

GWS provided gains in efficiency even for traits with low $ {h}_a^2 $, plant height (0.36), and rust incidence (0.37). These data confirm the importance of GWS in the selection of low-heritability traits, a fact that is also observed in other studies (Resende et al. 2012; Gois et al. 2016).

Conclusion

Results reveal that genome-wide selection is useful for C. canephora breeding since it accurately predicts the phenotypes of individuals. This fact leads to a significant reduction in the time required to complete the selection cycle, providing gains in selective efficiency per unit of time. In addition, these results can be used as the basis for further studies on the genus Coffea and on perennial species with genetic similarity.

References

Alkimim ER, Caixeta ET, Sousa TV, Pereira AA, Oliveira ACB, Zambolim L, Sakiyama NS (2017) Marker-assisted selection provides arabica coffee with genes from other Coffea species targeting on multiple resistance to rust and coffee berry disease. Mol Breed 37:6–10. https://doi.org/10.1007/s11032-016-0609-1
Article CAS Google Scholar
Alkimim ER, Caixeta ET, Sousa TV, Silva FL, Sakiyama NS, Zambolim L (2018) High-throughput targeted genotyping using next-generation sequencing applied in Coffea canephora breeding. Euphytica 214:50–18. https://doi.org/10.1007/s10681-018-2126-2
Article CAS Google Scholar
Anderson CA, Pettersson FH, Clarke GM, Cardon LR, Morris AP, Zondervan KT (2010) Data quality control in genetic case-control association studies. Nat Protoc 5:1564–1573. https://doi.org/10.1038/nprot.2010.116
Article CAS PubMed PubMed Central Google Scholar
Asins MJ, Fernandez-Ribacoba J, Bernet GP et al (2012) The position of the major QTL for Citrus tristeza virus resistance is conserved among Citrus grandis, C. aurantium and Poncirus trifoliata. Mol Breed 29:575–587. https://doi.org/10.1007/s11032-011-9574-x
Article Google Scholar
Bhering LL (2017) Rbio: a tool for biometric and statistical analysis using the R platform. Crop Breed Appl Biotechnol 17:187–190. https://doi.org/10.1590/1984-70332017v17n2s29
Article Google Scholar
Bernardo R (2010) Breeding for quantitative traits in plants. Stemma Press, Woodbury
Borém A, Fritsche-Neto R (2013) Biotecnologia Aplicada ao Melhoramento de Plantas
Carvalho MC d CG d, Silva DCGD (2010) Sequenciamento de DNA de nova geração e suas aplicações na genômica de plantas. Ciência Rural 40:735–744. https://doi.org/10.1590/S0103-84782010000300040
Article Google Scholar
Castro CA d O, Resende RT, Bhering LL, Cruz CD (2016) Brief history of Eucalyptus breeding in Brazil under perspective of biometric advances. Ciência Rural 46:1585–1593. https://doi.org/10.1590/0103-8478cr20150645
Article CAS Google Scholar
Cavalcanti JJV, de Resende MDV, dos Santos FHC, Pinheiro CR (2012) Predição simultânea dos efeitos de marcadores moleculares e seleção genômica ampla em cajueiro. Rev Bras Frutic 34:840–846. https://doi.org/10.1590/S0100-29452012000300025
Article Google Scholar
de Almeida ÍF, Cruz CD, de Resende MDV (2016) Validação e correção de fenótipos na seleção genômica ampla. Pesqui Agropecuária Bras 51:1973–1982. https://doi.org/10.1590/s0100-204x2016001200008
Article Google Scholar
de Andrade LRB, e Sousa MB, Oliveira EJ et al (2019) Cassava yield traits predicted by genomic selection methods. PLoS One 14:e0224920. https://doi.org/10.1371/journal.pone.0224920
Article CAS PubMed PubMed Central Google Scholar
de Resende MDV (2007) Matemática e Estatística na Análise de Experimentos e no Melhoramento Genético, 1st edn, Colombo
de Resende MDV (2016) Software Selegen-REML/BLUP: a useful tool for plant breeding. Crop Breed Appl Biotechnol 16:330–339. https://doi.org/10.1590/1984-70332016v16n4a49
Article Google Scholar
de Resende MDV, Duarte JB (2007) Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesqui Agropecuária Trop 37:182–194
Google Scholar
de Resende MDV, e Silva FF, Lopes PS, Azevedo CF (2012) Seleção Genômica Ampla (GWS) via Modelos Mistos (REML/BLUP), Inferência Bayesiana (MCMC), Regressão Aleatória Multivariada e Estatística. Espacial, Viçosa
Google Scholar
de Resende MDV, e Silva FF, Azevedo CF (2014) Estatística Matemática, Biométrica e Computacional: Modelos Mistos, Multivariados, Categorias e Generalizados (REML/BLUP), Inferência Bayesiana, Regressão Aleatória, Seleção Genômica, QTI-GWAS, Estatística Espacial e Temporal, Competição, Sobrevivência, 1st edn. Suprema, Viçosa
Google Scholar
Resende MDV de, e Silva FF, Ferreira C, et al (2017) Atualidades da biometria no melhoramento de plantas perenes. In: Ludke WH, Andrade ACB, Volpato L, et al. (eds) Desafios Biométricos No Melhoramento Genético, 1^a edição. Viçosa, p 166
Denoeud F, Carretero-Paulet L, Dereeper A et al (2014) The coffee genome provides insight into the convergent evolution of caffeine biosynthesis. Science 345(80):1181–1184. https://doi.org/10.1126/science.1255274
Article CAS PubMed Google Scholar
Diniz LEC, Sakiyama NS, Lashermes P et al (2005) Analysis of AFLP markers associated to the Mex-1 resistance locus in Icatu progenies. Crop Breed Appl Biotechnol 5:387–393. https://doi.org/10.12702/1984-7033.v05n04a03
Article CAS Google Scholar
Ferrão LFV, Ferrão RG, Ferrão MAG et al (2017) A mixed model to multiple harvest-location trials applied to genomic prediction in Coffea canephora. Tree Genet Genomes 13:95–13. https://doi.org/10.1007/s11295-017-1171-7
Article Google Scholar
Ferrão LFV, Ferrão RG, Ferrão MAG, Fonseca A, Carbonetto P, Stephens M, Garcia AAF (2019) Accurate genomic prediction of Coffea canephora in multiple environments using whole-genome statistical models. Heredity (Edinb) 122:261–275. https://doi.org/10.1038/s41437-018-0105-y
Article CAS Google Scholar
Fritsche-Neto R, Resende MDV, Miranda GV, DoVale JC (2012) Seleção genômica ampla e novos métodos de melhoramento do milho. Rev Ceres 59:794–802. https://doi.org/10.1590/S0034-737X2012000600009
Article Google Scholar
Garrick DJ, Taylor JF, Fernando RL (2009) Deregressing estimated breeding values and weighting information for genomic regression analyses. Genet Sel Evol 41:55. https://doi.org/10.1186/1297-9686-41-55
Article PubMed PubMed Central Google Scholar
Gianola D (2006) Genomic-assisted prediction of genetic value with semiparametric procedures. Genetics 173:1761–1776. https://doi.org/10.1534/genetics.105.049510
Article CAS PubMed PubMed Central Google Scholar
Gnirke A, Melnikov A, Maguire J, Rogov P, LeProust E, Brockman W, Fennell T, Giannoukos G, Fisher S, Russ C, Gabriel S, Jaffe DB, Lander ES, Nusbaum C (2009) Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat Biotechnol 27:182–189. https://doi.org/10.1038/nbt.1523
Article CAS PubMed PubMed Central Google Scholar
Goddard ME, Hayes BJ (2007) Genomic selection. J Anim Breed Genet 124:323–330. https://doi.org/10.1111/j.1439-0388.2007.00702.x
Article CAS PubMed Google Scholar
Gois IB, Borém A, Cristofani-Yaly M et al (2016) Genome wide selection in Citrus breeding. Genet Mol Res 15:1–14. https://doi.org/10.4238/gmr15048863
Article Google Scholar
Grattapaglia D, Resende MDV (2011) Genomic selection in forest tree breeding. Tree Genet Genomes 7:241–255. https://doi.org/10.1007/s11295-010-0328-4
Article Google Scholar
Heslot N, Jannink J-L, Sorrells ME (2015) Perspectives for genomic selection applications and research in plants. Crop Sci 55:1–12. https://doi.org/10.2135/cropsci2014.03.0249
Article Google Scholar
Jannink J-L, Lorenz AJ, Iwata H (2010) Genomic selection in plant breeding: from theory to practice. Brief Funct Genomics 9:166–177. https://doi.org/10.1093/bfgp/elq001
Article CAS PubMed Google Scholar
Kemper KE, Goddard ME (2012) Understanding and predicting complex traits: knowledge from cattle. Hum Mol Genet 21:R45–R51. https://doi.org/10.1093/hmg/dds332
Article CAS PubMed Google Scholar
Lande R, Thompson R (1990) Efficiency of marker-assisted selection in the improvement of quantitative traits. Genetics 124:743–756
Article CAS Google Scholar
Legarra A, Robert-Granie C, Manfredi E, Elsen J-M (2008) Performance of genomic selection in mice. Genetics 180:611–618. https://doi.org/10.1534/genetics.108.088575
Article PubMed PubMed Central Google Scholar
Leroy T, Marraccini P, Dufour M, Montagnon C, Lashermes P, Sabau X, Ferreira LP, Jourdan I, Pot D, Andrade AC, Glaszmann JC, Vieira LG, Piffanelli P (2005) Construction and characterization of a Coffea canephora BAC library to study the organization of sucrose biosynthesis genes. Theor Appl Genet 111:1032–1041. https://doi.org/10.1007/s00122-005-0018-z
Article CAS PubMed Google Scholar
Liao P-Y, Lee KH (2010) From SNPs to functional polymorphism: the insight into biotechnology applications. Biochem Eng J 49:149–158. https://doi.org/10.1016/j.bej.2009.12.021
Article CAS Google Scholar
Lopez GAG, McCouch SR, Moncada MDP (2013) A genetic map of an interspecific diploid pseudo testcross population of coffee. Euphytica 192:305–323. https://doi.org/10.1007/s10681-013-0926-y
Article CAS Google Scholar
Meuwissen T (2007) Genomic selection: marker assisted selection on a genome wide scale. J Anim Breed Genet 124:321–322. https://doi.org/10.1111/j.1439-0388.2007.00708.x
Article PubMed Google Scholar
Meuwissen T, Hayes B, Goddard M (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829 11290733
Article CAS Google Scholar
Neves LG, Davis JM, Barbazuk WB, Kirst M (2013) Whole-exome targeted sequencing of the uncharacterized pine genome. Plant J 75:146–156. https://doi.org/10.1111/tpj.12193
Article CAS PubMed Google Scholar
Neves LG, Davis JM, Barbazuk WB, Kirst M (2014) A high-density gene map of loblolly pine (Pinus taeda L.) based on exome sequence capture genotyping. G3-Genes Genomes Genet 4:29–37. https://doi.org/10.1534/g3.113.008714
Noir S, Anthony F, Bertrand B et al (2003) Identification of a major gene (Mex-1) from Coffea canephora conferring resistance to Meloidogyne exigua in Coffea arabica. Plant Pathol 52:97–103. https://doi.org/10.1046/j.1365-3059.2003.00795.x
Article CAS Google Scholar
Perez P, De Los Campos G (2014) Genome-wide regression and prediction with the BGLR statistical package. Genetics 198:483–495. https://doi.org/10.1534/genetics.114.164442
Article PubMed PubMed Central Google Scholar
Pértile SFN, e Silva FF, Salvian M, Mourão GB (2016) Seleção e associação genômica ampla para o melhoramento genético animal com uso do método ssGBLUP. Pesqui Agropecuária Bras 51:1729–1736. https://doi.org/10.1590/s0100-204x2016001000004
Article Google Scholar
Rabier C-E, Barre P, Asp T, Charmet G, Mangin B (2016) On the accuracy of genomic selection. PLoS One 11:e0156086. https://doi.org/10.1371/journal.pone.0156086
Article CAS PubMed PubMed Central Google Scholar
Resende MDV (2008) Genômica Quantitativa e Seleção no Melhoramento de Plantas Perenes e Animais, Colombo: E
Resende MDV, Lopes PS, Silva RL, Pires IE (2008) Seleção genômica ampla (GWS) e maximização da eficiência do melhoramento genético. Pesqui Florest Bras 56:63–77
Google Scholar
Resende MDV, Resende MFR, Sansaloni CP et al (2012) Genomic selection for growth and wood quality in Eucalyptus: capturing the missing heritability and accelerating breeding for complex traits in forest trees. New Phytol 194:116–128. https://doi.org/10.1111/j.1469-8137.2011.04038.x
Article PubMed Google Scholar
Resende M, Caixeta E, Alkimim ER et al (2016) High-throughput targeted genotyping of Coffea arabica and Coffea canephora using next generation sequencing, San Diego, p 1
Resende MDV, Ramalho MAP, Guilherme SR, de F.B. Abreu Â (2015) Multigeneration Index in the Within-Progenies Bulk Method for Breeding of Self-pollinated Plants. Crop Sci 55:1202–1211. https://doi.org/10.2135/cropsci2014.08.0580.z
Romero G, Vásquez LM, Lashermes P, Herrera JC (2014) Identification of a major QTL for adult plant resistance to coffee leaf rust (Hemileia vastatrix) in the natural Timor hybrid (Coffea arabica x C. canephora). Plant Breed 133:121–129. https://doi.org/10.1111/pbr.12127
Article CAS Google Scholar
Sant’Ana GC, Pereira LFP, Pot D et al (2018) Genome-wide association study reveals candidate genes influencing lipids and diterpenes contents in Coffea arabica L. Sci Rep 8:465. https://doi.org/10.1038/s41598-017-18800-1
Article CAS PubMed PubMed Central Google Scholar
Sousa TV, Caixeta ET, Alkimim ER, Oliveira ACB, Pereira AA, Zambolim L, Sakiyama NS (2017) Molecular markers useful to discriminate Coffea arabica cultivars with high genetic similarity. Euphytica 213:1–15. https://doi.org/10.1007/s10681-017-1865-9
Article CAS Google Scholar
Sousa TV, Caixeta ET, Alkimim ER et al (2019) Early selection enabled by the implementation of genomic selection in Coffea arabica breeding. Front Plant Sci:9. https://doi.org/10.3389/fpls.2018.01934
Toppa EVB, Jadoski CJ (2013) O uso dos marcadores moleculares no melhoramento genético de plantas. Sci Agrar Parana 12:1–5. https://doi.org/10.18188/1983-1471
Article Google Scholar
Valente MSF, Viana JMS, de Resende MDV et al (2016) Seleção genômica para melhoramento vegetal com diferentes estruturas populacionais. Pesqui Agropecuária Bras 51:1857–1867. https://doi.org/10.1590/s0100-204x2016001100008
Article Google Scholar
Van Eenennaam AL, Weigel KA, Young AE et al (2014) Applied animal genomics: results from the field. Annu Rev Anim Biosci 2:105–139. https://doi.org/10.1146/annurev-animal-022513-114119
Article PubMed Google Scholar
VanRaden PM (2008) Efficient methods to compute genomic predictions. J Dairy Sci 91:4414–4423. https://doi.org/10.3168/jds.2007-0980
Article CAS PubMed Google Scholar
Vieira LGE, Andrade AC, Colombo CA et al (2006) Brazilian coffee genome project: an EST-based genomic resource. Brazilian J Plant Physiol 18:95–108. https://doi.org/10.1590/S1677-04202006000100008
Article CAS Google Scholar
Wong CK, Bernardo R (2008) Genomewide selection in oil palm: increasing selection gain per unit time and cost with small populations. Theor Appl Genet 116:815–824. https://doi.org/10.1007/s00122-008-0715-5
Article CAS PubMed Google Scholar
Zambolim L (2016) Current status and management of coffee leaf rust in Brazil. Trop Plant Pathol 41:1–8. https://doi.org/10.1007/s40858-016-0065-9
Article Google Scholar
Zhang Z, Liu J, Ding X et al (2010) Best linear unbiased prediction of genomic breeding values using a trait-specific marker-derived relationship matrix. PLoS One 5:e12648. https://doi.org/10.1371/journal.pone.0012648
Article CAS PubMed PubMed Central Google Scholar
Zhao Y, Mette MF, Reif JC (2015) Genomic selection in hybrid breeding. Plant Breed 134:1–10. https://doi.org/10.1111/pbr.12231
Article Google Scholar

Download references

Data archiving statement

The SNP data was included in the Supplementary Materials.

Funding

This work was financially supported by the Brazilian Coffee Research and Development Consortium (Consórcio Brasileiro de Pesquisa e Desenvolvimento do Café—CBP&D/Café), by the Foundation for Research Support of the state of Minas Gerais (FAPEMIG), by the National Council of Scientific and Technological Development (CNPq), by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001, and by the National Institutes of Science and Technology of Coffee (INCT/Café).

Author information

Authors and Affiliations

Universidade Federal do Triângulo Mineiro, Iturama, MG, Brazil
Emilly Ruas Alkimim
Empresa Brasileira de Pesquisa Agropecuária - Embrapa Café, Bioagro, BioCafé, Universidade Federal de Viçosa, Vicosa, MG, 36570-000, Brazil
Eveline Teixeira Caixeta
Instituto Federal Goiano, Urutai, MG, Brazil
Tiago Vieira Sousa
Empresa Brasileira de Pesquisa Agropecuária - Embrapa Café, Vicosa, MG, Brazil
Marcos Deon Vilela Resende
Departamento de Fitotecnia, Universidade Federal de Viçosa, Vicosa, MG, Brazil
Felipe Lopes da Silva & Ney Sussumu Sakiyama
Bioagro, Universidade Federal de Viçosa, Vicosa, MG, Brazil
Laércio Zambolim

Authors

Emilly Ruas Alkimim
View author publications
You can also search for this author in PubMed Google Scholar
Eveline Teixeira Caixeta
View author publications
You can also search for this author in PubMed Google Scholar
Tiago Vieira Sousa
View author publications
You can also search for this author in PubMed Google Scholar
Marcos Deon Vilela Resende
View author publications
You can also search for this author in PubMed Google Scholar
Felipe Lopes da Silva
View author publications
You can also search for this author in PubMed Google Scholar
Ney Sussumu Sakiyama
View author publications
You can also search for this author in PubMed Google Scholar
Laércio Zambolim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eveline Teixeira Caixeta.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Communicated by F. P. Guerra

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1

(XLSX 7697 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Alkimim, E.R., Caixeta, E.T., Sousa, T.V. et al. Selective efficiency of genome-wide selection in Coffea canephora breeding. Tree Genetics & Genomes 16, 41 (2020). https://doi.org/10.1007/s11295-020-01433-3

Download citation

Received: 12 December 2019
Revised: 29 March 2020
Accepted: 31 March 2020
Published: 20 April 2020
DOI: https://doi.org/10.1007/s11295-020-01433-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Selective efficiency of genome-wide selection in Coffea canephora breeding

Abstract

Similar content being viewed by others

Markers, Maps, and Marker-Assisted Selection

Efficiency of genomic selection for breeding population design and phenotype prediction in tomato

Genotyping-by-sequencing and multilocation evaluation of two interspecific backcross populations identify QTLs for yield-related traits in pigeonpea

Introduction

Material and methods

Genetic material

Phenotypic evaluations

Analysis of phenotypic data

Genomic DNA extraction, identification, and quality analysis of SNP markers

Prediction using the GBLUP model

Cross-validation

Predictive capacity, prediction, and accuracy bias of GWS

Estimate of the number of QTL (n QTL) and number of individuals (Ni) to obtain desired accuracy

Efficiency of GWS

Results

Analysis of phenotypic data

Analysis of quality of SNP markers

Genomic heritability, predictive capacity, prediction, and accuracy bias of GWS

Estimate of the number of QTL (n QTL) and number of individuals (Ni) to obtain desired accuracy

Efficiency of GWS

Discussion

Analysis of phenotypic data

Quality analysis of the SNP markers

Genomic heritability, predictive capacity, prediction bias, and GWS accuracy

Estimate of the number of QTLs (n QTL) and number of individuals (Ni) to obtain desired accuracy

Efficiency of GWS

Conclusion

References

Data archiving statement

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Electronic supplementary material

ESM 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Estimate of the number of QTL (n _QTL) and number of individuals (Ni) to obtain desired accuracy

Estimate of the number of QTL (n _QTL) and number of individuals (Ni) to obtain desired accuracy

Estimate of the number of QTLs (n _QTL) and number of individuals (Ni) to obtain desired accuracy