Introduction

Rice (Oryza sativa L.) is a prominent cereal crop and major staple food all around the world. Among various agronomic traits of rice, grain shape is a key component, which not only determines grain weight and yield, but also affects the end-use quality of rice (Huang et al. 2013). In breeding programs, grain shape characters including grain length (GL), width (GW), thickness (GT) and length-to-width ratio (LWR) are importantly considered for the evaluation of grain weight and final yield. Therefore, elucidating the genetic basis of grain shape would be of great interest for breeding cultivars with high yield and quality.

During the past two decades, great efforts have been made to understand the genetic basis of grain shape. It is well-evident that the four components of grain shape are typical quantitative traits controlled by polygenes (Xing and Zhang 2010; Zuo and Li 2014). To date, a huge number of quantitative trait loci (QTLs) regulating GL, GW, GT, and LWR have been reported, which were distributed over all the 12 chromosomes. Among those, a few QTLs such as GS2/GL2 (Hu et al. 2015; Che et al. 2015; Duan et al. 2015), GS3 (Fan et al. 2006), GL3/GL3.1 (Qi et al. 2012; Zhang et al. 2012), GLW7 (Si et al. 2016), GW2 (Song et al. 2007), qSW5/GW5 (Shomura et al. 2008; Weng et al. 2008), GS5 (Li et al. 2011), GW6a (Song et al., 2014), TGW6 (Ishimaru et al. 2013), GW8 (Wang et al. 2012a), and GS7/GW7 (Wang et al. 2015a, b) have been cloned, which have greatly increased our knowledge regarding the genetic basis of grain shape. However, more attention should be paid to mining novel QTL/genes to further expand the understanding of the regulatory mechanisms of grain shape.

Rice cultivars with large grain shape are valuable germplasm resources, which contain novel alleles having great contribution to grain shape. For instance, GW2, a major QTL conferring grain width and weight, was isolated from WY3, a japonica variety with a KGW value of 41.9 ± 1.3 g (Song et al. 2007). GL3/GL3.1, a major QTL for grain length and weight, was identified in WY3, and N411, a japonica variety with a KGW value of 72.13 ± 2.32 g (Qi et al. 2012; Zhang et al. 2012). Likewise, GS2/GL2, a major QTL for GL, GW, GT and KGW, was isolated from three indica cultivars (BDL, RW11, Judali) with extra large grain shape (Hu et al. 2015; Che et al. 2015; Duan et al. 2015). Therefore, identifying and isolating novel alleles for grain shape from cultivars with large grain shape would not only help in elucidating the genetic basis and molecular mechanisms of grain shape, but also provide valuable gene resource for rice breeding programs.

In this study, a recombinant inbred line (RIL) population derived from two indica cultivars, Zhenshan 97B (ZS97) and Nanyangzhan (NYZ) was developed, and used to dissect the genetic basis of the extremely large grain shape in NYZ. QTLs for GL, GW, GT, LWR and KGW were mapped, and two QTLs conferring GT (qGT3.1) and GL (qGL3.4) were validated in the near-isogenic lines (NILs) background, respectively. These results would facilitate the future breeding of high-yielding and good-quality rice varieties.

Materials and methods

Population development and field experiments

A population of 190 RILs was developed from a cross between two indica varieties viz., Zhenshan 97B (ZS97) and Nanyangzhan (NYZ), followed by a series of self-crosses using the single-seed decent method. NYZ has extra large grains, while ZS97 possesses small grains. The RILs F7 and F8 were planted at the experimental farm of Huazhong Agricultural University in Lingshui (Hainan) during 2001 growing seasons, and Wuhan (Hubei) during 2002 growing seasons.

The NILs for qGT3.1 and qGL3.4 were constructed by backcrossing the selected F9 lines with the recurrent parent, ZS97, for four times, followed by a self-cross. Two pairs of NILs, NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS, and NIL-qGL3.4 NYZ and NIL-qGL3.4 ZS, were produced. NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS were the NILs carrying homozygous qGT3.1 regions from NYZ and ZS, respectively, while NIL-qGL3.4 NYZ and NIL-qGL3.4 ZS were the NILs carrying homozygous qGL3.4 regions from NYZ and ZS, respectively. The two pairs of NILs were planted at the experimental farm of Huazhong Agricultural University in Wuhan (Hubei) during 2011 growing seasons.

Trait evaluation

After harvesting, paddy rice was air-dried and stored at room temperature for at least 3 months before testing. Fully filled grains were used for measuring the GL, GW, GT and KGW. GL and GW were measured as described in Tan et al. (2000). LWR was calculated as the ratio of GL to GW. GT was determined for each grain individually using an electronic digital caliper (Guanglu Measuring Instrument Co. Ltd., China) with a precision of 0.1 mm, and thirty grain values were averaged and used for data analysis. KGW was calculated based on 400 grains and replicated for three times.

Molecular marker development and analysis

For genotyping of the RIL population, simple sequence repeat (SSR) markers were identified from the Gramene database (www.gramene.org).

DNA was extracted according to the CTAB method with minor modification (Murray and Thompson 1980). Marker analysis was performed according to Sun et al. (2013).

Construction of genetic linkage map

A total of 574 SSR markers were used to screen for the polymorphic markers between two parents (ZS97 and NYZ) and 185 markers were identified. Among those, 106 markers distributed evenly over all 12 rice chromosomes were employed to screen the RIL F7 lines. A linkage map was constructed using the program Mapmaker/Exp 3.0 (Lincoln et al., 1992) with the Kosambi mapping function.

Data analysis

Correlation analysis was performed using the data analysis module in Microsoft Office Excel 2007. QTL analysis was performed by composite interval mapping (CIM) using the software package QTLCartographer V2.5 (Wang et al. 2012b) with a logarithm of odds (LOD) threshold of 2.5.

Results

Phenotypic variation and correlation in the RIL population

A significant difference was observed between the two parents, ZS97 and NYZ, regarding all the five studied traits (Fig. 1; Table 1). Averaged across 2 years, the large-grain parent (NYZ) had a GL value of 12.1 mm, GW of 3.6 mm, LWR of 3.3 and KGW of 40.1 g. In contrast, the small-grain parent (ZS97) had an average value of 7.9 mm for GL, 3.3 mm for GW, 2.3 for LWR and 25.6 g for KGW. The GT values of NYZ and ZS97 were 2.54 mm and 2.09 mm, respectively, in 2002. All the five traits showed continuous variation, followed the normal distribution and displayed transgressive segregation in the RIL population during 2 years.

Fig. 1
figure 1

Frequency distribution of RIL population for GL, GW, LWR, KGW and GT during 2001 and 2002 growing seasons. y01, year 2001; y02, year 2002

Table 1 Descriptive statistics of GL, GW, LWR, KGW and GT in two parents (ZS97 and NYZ) and RIL population during 2001 and 2002 growing seasons

All the four grain shape factors, GL, GW, LWR and GT, were significantly and positively correlated with KGW in both years (Table 2). The highest correlation coefficient was observed between GL and KGW, with a value of 0.641 in 2001 and 0.695 in 2002. GT was also positively correlated with GL (r = 0.410) and GW (r = 0.328) in year 2002, while no correlation was observed between GL and GW.

Table 2 Correlations of GL, GW, LWR, KGW and GT in 2001 and 2002 growing seasons

QTLs identified in the RIL population

GL

A total of 17 QTLs for GL were detected in 2 years (Table 3; Fig. 2). The phenotypic variance explained by each QTL ranged from 2.53 to 33.04%. Two QTLs (qGL3.2 and qGL3.3) located between MRG2204 and RM411 on chromosome 3 accounted for 33.04% and 31.81% of GL variation in 2001 and in 2002, respectively. Three QTLs viz., qGL3.1, qGL3.4 and qGL10, were repeatedly detected, and explained 8.55%, 11.32% and 3.35% variation in 2001, and 11.24%, 11.50% and 8.58% variation in 2002, respectively. Two QTLs (qGL11.1 and qGL11.2) located at adjacent regions on chromosome 11 accounted for 4.00% of the GL variation in both years. The remaining 10 QTLs were identified only in a single year, and explained less than 10% variation.

Table 3 QTLs identified for GL, GW, LWR, KGW and GT in the RIL population during 2001 and 2002 growing seasons
Fig. 2
figure 2

Distribution of putative QTLs for GL, GW, LWR, KGW and GT identified in the RIL population during 2001 and 2002 on the linkage map. GL01, QTLs for GL in 2001; GL02, QTLs for GL in 2002; GW01, QTLs for GW in 2001; GW02, QTLs for GW in 2002; LWR01, QTLs for LWR in 2001; LWR02, QTLs for LWR in 2002; KGW01, QTLs for KGW in 2001; KGW02, QTLs for KGW in 2002; GT02, QTLs for GT in 2002

GW

In total, 12 QTLs for GW were detected in 2 years (Table 3; Fig. 2), and the variation explained by each QTL ranged from 4.36% to 22.05%. The QTL qGW2.4, located between RM183 and RM526 on chromosome 2, was repeatedly detected in both years and explained 22.05% and 11.6% of GW variation in 2001 and 2002, respectively. qGW2.2, the other repeatedly detected QTL on chromosome 2, explained 4.71% variation in 2001 and 6.79% in 2002. Two other QTLs (qGW2.3 and qGW2.6) located on chromosome 2 explained 11.7% and 10.6% variation in 2001 and 2002, respectively. qGW4, the QTL flanked by RM317 and RM255 on chromosome 4, and qGW6.1, the QTL flanked by RM585 and RM557 on chromosome 6, accounted for 14.17% and 13.57% variation in 2001, respectively. The remaining 6 QTLs were detected only in a single year, and accounted for less than 10% variation.

LWR

In total, eight QTLs were detected for LWR in 2 years (Table 3; Fig.2). Among those, qLWR3.2, located between MRG2204 and RM156 on chromosome 3, explained 27.85% of phenotypic variation in 2001 and 33.19% in 2002. The other three repeatedly detected QTLs, qLWR2.1, qLWR3.3 and qLWR5, explained 10.35%, 4.40% and 8.44% variation in 2001, and 7.68%, 7.29% and 8.11% variation in 2002, respectively. The QTL flanked by RM517 and RM473D on chromosome 3, qLWR3.1, accounted for 10.33% variation in 2001. The remaining 3 minor QTLs explained less than 10% variation, and were only detected in 2001.

KGW

Totally, 11 QTLs for KGW were detected in 2 years (Table 3; Fig. 2). Among those, qKGW3.1, located between MRG2204 and RM156 on chromosome 3, explained 21.43% of KGW variation in 2001 and 17.80% in 2002. The remaining QTLs were only identified in a single year, and the variance explained by each QTL ranged from 3.58% to 13.66%.

GT

Five QTLs for GT were detected in year 2002 (Table 3; Fig. 2), which were distributed on chromosomes 3, 4 and 9. The QTL qGT3.1, located between RM545 and RM517 on chromosome 3, explained 16.94% of phenotypic variation. The variation explained by each of the other four QTLs was less than 10%.

Validation of qGT3.1 and qGL3.4 in the NIL background

Among the five studied traits, GT received less attention in previous studies. Therefore, in order to validate the genetic effect of QTLs conferring GT, the NIL of qGT3.1 was constructed, which showed the largest effect in the RIL population. An F9 line carrying homozygous genotypes from NYZ at markers RM545 and RM517 was backcrossed with the recurrent parent ZS97 for four times, and a BC4F1 line was produced. The genetic background of the BC4F1 line was evaluated with 92 markers evenly covering the whole genome. The result revealed that the line carried two other heterozygous regions not conferring GT on the short arms of chromosomes 1 and 10, except for the target region (data not shown). Thus, the line was self-crossed to produce the two homozygous NILs, NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS, to evaluate the genetic effect of qGT3.1. As shown in Fig. 3(a), significant difference in grain thickness was observed between the two NILs. Compared to NIL-qGT3.1 ZS, NIL-qGT3.1 NYZ displayed an increased value by 0.12 mm in grain thickness.

Fig. 3
figure 3

Grain thickness of NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS (a) and grain length of NIL-qGL3.4 NYZ and NIL-qGL3.4 ZS (b). *** significant at P < 0.001

Of the four QTLs conferring grain length on chromosome 3, the QTL located on the terminal region of the long arm, qGL3.4, was selected to validate its effect in the NIL background. The development process of the two homozygous NILs, NIL-qGL3.4 NYZ and NIL-qGL3.4 ZS, was similar to that of NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS, except that the genotype of qGL3.4 was determined by RM570 and RM442. As shown in Fig. 3(b), significant difference in grain length existed between the two NILs. Compared to NIL-qGL3.4 ZS, NIL-qGL3.4 NYZ displayed an increased value by 0.47 mm in grain length.

Discussion

QTLs for grain shape and KGW identified in the RIL population

In this study, a total of 53 QTLs for grain shape and KGW were identified from the RIL population derived from the two parents (NYZ and ZS97) differing substantially in their grain shape. It was clearly observed that QTLs for GW were mainly located on chromosome 2, while QTLs for GL were mainly detected on chromosome 3 (Table 3, Fig. 2). Among those, the region flanked by MRG2204 and RM411 on chromosome 3 was responsible for GL and KGW in both years and for GW and GT in 2002 only (Fig. 2). GS3, a major QTL for GL and KGW and a minor QTL for GW and GT, showed the same location (Fan et al. 2006). Therefore, GS3 was likely to be the candidate gene underlying the region. The region flanked by RM322 and RM521 on chromosome 2 was responsible for GL in 2001, GW in 2002 and LWR in both years (Fig. 2). Previously, a major QTL for GW, GW2, was observed in this region by Song et al. (2007). Thus, GW2 was likely to be the candidate gene for GW. In addition, the region flanked by RM183 and RM526 on chromosome 2, and the region flanked by RM574 and RM169 on chromosome 5, showed the same locations with the cloned GS2/GL2 and SW5/GW5, respectively (Shomura et al. 2008; Weng et al. 2008; Hu et al. 2015; Che et al. 2015; Duan et al. 2015). The remaining QTLs were detected in a single year and no cloned genes for grain shape were co-located.

Accumulation of QTLs showing positive effects contributing to the large grain shape

Rice cultivars with extra large grain shape are valuable resources, which are believed to carry a few novel and major genes for grain shape. Several studies have been conducted to dissect the genetic basis of large grain shape, in which two genes conferring GL, GS3 and GL3/GL3.1 (Kato et al. 2011; Zhang et al. 2011; Qi et al. 2012; Zhang et al. 2012; Zhang et al. 2013), two genes conferring GW, GW2 and qSW5/GW5 (Song et al. 2007; Zhang et al. 2011; Ying et al. 2012; Zhang et al. 2013; Gao et al. 2015b), and a pleiotropic gene conferring GL, GW and GT, GS2/GL2 (Hu et al. 2015; Che et al. 2015; Duan et al. 2015), were frequently detected. In this study, NYZ contributed to 45 of the total 54 QTLs showing positive effects identified in the RIL population (Table 3). Among those, qGL3.2, qGL3.3, qGW2.1 and qGW2.4 showed the same locations with GS3, GW2 and GS2/GL2, and all the four alleles from NYZ increased the value of GL and GW. In addition, a total of 17 QTLs were detected for GL, and 14 alleles from NYZ showed positive additive effects. Among those, the alleles from NYZ of all the four QTLs for GL on chromosome 3 showed positive effects, and accounted for more that 50% of the variation in 2 years (Table 3). Therefore, the large grain shape of NYZ might be the combined influence of numbers of QTLs showing positive effects.

Validation of qGT3.1and qGL3.4 in the background of ZS97

GT, an important determinant of grain shape, has received less attention in the past compared with the rest of the three determinants. A few studies have been performed to dissect the genetic basis of GT (Zhang et al. 2011, Ying et al. 2012, Liu et al. 2015). However, only one QTL (qGT8) was finely mapped to a region of 11.2 kb, which contained GW8, a gene conferring grain width and yield (Gao et al., 2015a). In addition, GS3 and GS2/GL2, two major genes conferring GL, also affect the GW and GT (Fan et al. 2006, Hu et al. 2015). In the present study, five QTLs were detected for GT (Table 3). Among those, qGT3.1, located between RM545 and RM517 on chromosome 3, accounted for the highest (16.94%) phenotypic variation (Table 3). In order to validate the effect of qGT3.1, two homozygous NILs, NIL-qGT3.1 NYZ and NIL-qGT3.1 ZS were developed with the background of ZS97. Results showed that NIL-qGT3.1 NYZ displayed an increased value by 0.12 mm in grain thickness than NIL-qGT3.1 ZS , demonstrating that the qGT3.1 region from NYZ could improve grain thickness (Fig. 3).

Until now, several genes conferring GL have been cloned, of which two major genes, GS3 and GL3/GL3.1 are on chromosome 3 (Fan et al. 2006, Qi et al. 2012, Zhang et al. 2012). Among the 17 QTLs for GL, all the 4 accounting for over 10% of the variation were distributed on chromosome 3 (Table 3, Fig. 2). qGL3.2 and qGL3.3 were located to the surrounding regions of GS3, while qGL3.1 was mapped to the adjacent region of qGL3.2. Only qGL3.4 was away from the two cloned genes and on the terminal region of chromosome 3; thus, it was selected to validate its effect under the background of ZS97. Results showed that the GL value of NIL-qGL3.4 NYZ was 0.47 mm longer than that of NIL-qGT3.1 ZS , demonstrating that the qGT3.1 region from NYZ did, indeed, have a positive effect on grain length (Fig. 3).

Breeding rice cultivars with desirable grain shape, good quality and high yield

The Asian cultivated rice consists of two major subspecies, indica and japonica. Typical indica cultivars have long and slender grains, while typical japonica cultivars have short and round grains, which are desired by different people in different regions or countries (Unnevehr 1992, Juliano and Villareal 1993). Many rice cultivars cannot meet a balance among grain shape, quality and yield. For example, NYZ, the parent with large grain in this study, showed high chalkiness and low yield (data not shown). Nevertheless, approved rice cultivars in China show moderate grain, better quality and higher yield. Therefore, breeding rice cultivars with a combination of desirable grain shape, good quality and high yield is of great challenge and significance Although the genetic basis and molecular mechanism underlying grain shape is not quite clear, a few genes have been cloned and many QTLs have been identified and even finely mapped. In this study, 5 QTLs for GL (qGL3.1, qGL3.2, qGL3.3, qGL3.4 and qGL10), 2 QTLs for GW (qGW2.2 and qGW2.4), 4 QTLs for LWR (qLWR2.1, qLWR3.2, qLWR3.3 and qLWR5) and 1 QTL for KGW (qKGW3.1) were stably detected in 2 years, or accounted for over 20% of the variation (Table 3). In addition, qGT3.1 and qGL3.4 were further validated in the NIL background. With so many cloned genes and identified QTLs for grain shape, selective combinations of some would produce cultivars with good combinations of desirable grain shape, good quality and high yield.