Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Resequencing and variation identification of whole genome of the japonica rice variety "Longdao24" with high yield

  • Shukun Jiang ,

    Roles Writing – original draft, Writing – review & editing

    sk.jiang@hotmail.com

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Shichen Sun,

    Roles Data curation, Formal analysis

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Liangming Bai,

    Roles Conceptualization, Methodology

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Guohua Ding,

    Roles Data curation, Resources

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Tongtong Wang,

    Roles Investigation, Validation

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Tianshu Xia,

    Roles Formal analysis, Visualization

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Hui Jiang,

    Roles Data curation, Formal analysis

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Xijuan Zhang,

    Roles Data curation, Formal analysis, Project administration, Visualization

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

  • Fengming Zhang

    Roles Formal analysis, Visualization

    Affiliation Cultivation and Farming Research Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, P.R.China

Abstract

Japonica rice mainly distributes in north of China, which accounts for more than half of the total japonica rice cultivated area of China. High yield, good grain quality and early heading date were the main breeding traits and commercial property in this region. We performed re-sequencing and genome wide variation analysis of one typical northern japonica rice variety Longdao24 and its parents (Longdao5 and Jigeng83) using the Illumina sequencing technology. 53.17 G clean bases were generated and more than 96.8% of the reads were mapped to the genomic reference sequence. An overall average effective depth of 43.67 × coverage was achieved. We identified 420,475 SNPs, 95,624 InDels, and 14,112 SVs in Longdao24 genome with the genomic sequence of the japonica cultivar Nipponbare as reference. We identified 361,117 SNPs and 81,488 InDels between Longdao24 genome and Longdao5 genome. We also detected 428,908 SNPs and 97,209 InDels between Longdao24 genome and Jigeng83 genome. Twenty-two yield related genes, twenty-two grain quality related genes and thirty-nine heading date genes were analyzed in Longdao24. The alleles of Gn1a, EP3, SCM2, Wx, ALK, OsLF and Hd17 came from the female parent Longdao5. The other alleles of qGW8, SSIVa, SBE3, SSIIIb, SSIIc, DTH2, Ehd3 and OsMADS56 came from the male parent Jigeng83. These results will help us to research the genetics basis of yield, grain quality and early heading date in northern rice of China.

Introduction

Given continuing population growth and increasing competition for arable land between food and energy crops, the next century may witness serious global food shortage problems. Thus, there is a need for an increase in rice yield, because rice is the world’s most important staple food crop. To substantially increase rice yield, Japan initiated a super high-yielding rice breeding program in 1981 that was targeted to raise rice yield 50% within 15 years. However, this target has not yet been realized [1, 2]. Next, the International Rice Research Institute (IRRI) launched a new plant-type breeding program to develop super rice in 1989, with the target of developing a super rice. However, this target was not reached either [13]. In order to meet the food demand required by the Chinese people in the 21st century, a program to breed super rice through combining morphological improvement and the utilization of inter-subspecific heterosis was set up by the Ministry of Agriculture and the Ministry of Science and Technology in 1996 and 1997, respectively [1, 2, 4]. Today, nearly 80 super rice varieties have been released and some of them show high grain yields of 12–21 t/hm2 in field experiments [4]. The core of super rice breeding is an effective use of germ plasma resources and favorable genes. For example, most of the northern japonica super rice varieties have DEP1 gene, which controlled large erect panicle [5, 6]. Although a large progress about the genetic basis for yield, grain quality and heading date of super rice had achieved [57], but the molecular mechanism is still poorly understood.

To meet the global food demands, achievements in molecular genetics of complex traits in rice will be important for increasing yield in the post-genomics era [8]. The draft genomic sequences of two rice subspecies, japonica (Nipponbare) and indica (93–11), were released [9, 10]; and later, the final genome sequence of Nipponbare was completed by International Rice Genome Sequencing Project [11]. These achievements provided a high quality reference genome for re-sequencing rice genomes using high-throughput sequencing technologies [12]. The widely used high-throughput sequencing technologies are also known as next-generation sequencing (NGS) technology. The NGS technology has been widely used in rice genomics and molecular breeding studies [13]. These characteristics have enabled researchers to perform accurate genetic polymorphism analysis and discovered important cloned genes of rice variety [14]. Several super rice or its parents including Liangyoupei 9 [15], Shanyou 63 [16] and Shennong 265 [17] has been re-sequenced. However the Heilongjiang super rice varieties such as Longdao 5, Songgeng 9, Longgeng 31 and so on still not be researched. In this study, we re-sequenced a new bred elite japonica rice variety, Longdao24, with high quality, super high yield and early heading date in Heilongjiang province. Our results provided several candidates account for the super high yield, good quality and early heading date of rice in Heilongjaing province. The results therefore lay the groundwork for long-term efforts to uncover important genes and for future variety improvement.

Materials and methods

Plant materials

Longdao24 (Fig 1) was selected from the cross between two super japonica rice Longdao5 and Jigeng83. Longdao5 is a leading cultivar widely cultivated in the south and east part of Heilongjiang province and has high resistance to disease and super high yield. Jigeng83 is a leading cultivar widely cultivated in the middle part of Jilin province and has super high yield.

DNA isolation and genome sequencing

Genomic DNA was prepared from bulk harvested leaves of a single young plant of Longdao24, Longdao5 and Jigeng83 using a modified CTAB method [18]. Whole genome re-sequencing of the grape lines was performed on an Illumina HiSeq4000™ by Biomarker Technologies (Beijing, China). The procedure was performed as the standard Illumina protocol, including sample preparation and sequencing as follows: quantified DNA was extracted and DNA fragments were obtained using the ultrasonic wave technique, which was purified using the QIA quick PCR kit. End repair was performed with poly-A on the 3´ ends, then the adaptors were ligated, clusters were generated, agarose gel electrophoresis was used to select fragments, and PCR amplification was performed. Sequencing was performed by establishing a library with Illumina HiSeq4000™. The short reads were aligned using the Burrows–Wheeler transformation [19]. To ensure high quality, Low-quality reads (<20), reads with adaptor sequence and duplicated reads were filtered, and the remaining high-quality data were used in the mapping.

Mapping of reads to the reference

The clean sequencing reads were aligned to the temperate japonica Nipponbare reference genome—the unified-build release Os-Nipponbare-Reference-IRGSP-1.0 [20], using the BWA software under the default parameters with a small modification. The alignment results were then merged and indexed as BAM files [21]. Average sequencing depth and coverage were calculated using the alignment results. The mapped reads were then used to detect SNPs, InDels and SVs polymorphisms.

Detection of SNPs, InDels and SVs polymorphisms

Firstly, we used GATK tools software [22] to detect SNPs and InDels. Secondly, SVs were detected using BreakdancerMax.pl software [23] with its default parameters. In our result, the types of SVs include insertion (INS), deletion (DEL), inter chromosomal translocation (CTX), deletion including insertion (IDE), intra chromosomal translocation (ITX) and inversion (INV). And in our analysis, InDels were defined as the insertion or deletion the length of which was from 1 to 10 bp.

Annotation of SNPs, InDels and SVs

The annotation of SNPs and Indels were performed by SnpEff software [24]. The localization of SNPs, InDels and SVs were based on the annotation of reference genome databases. The polymorphisms in the gene region and other genome regions were annotated as genic and intergenic. The SNPs, InDels and SVs were classified according to their localization. SNPs in the CDS were further separated into synonymous and non-synonymous. The functional annotation data were achieved by blast each gene including non-synonymous SNPs or SVs to NR [25], SwissProt [25], GO [26], COG [27] and KEGG [28] database.

Results

Genome sequencing of Longdao24 and its parents

Genome sequencing of Longdao24, Longdao5 and Jigeng83 yielded 64.19, 53.93 and 70.60 million reads, most of them being 300 bp paired end reads. After cleaning, 19.17 G (Longdao24), 16.07 G (Longdao5) and 21.11 G (Jigeng83) clean bases were generated and 96.80%, 99.27% and 98.99% of the reads were mapped to the genomic reference sequence. The overall effective depth coverage of Longdao24, Longdao5 and Jigeng83 was 47 ×, 32 × and 52 ×, respectively. The 1 ×, 5 × and 10 × coverage ratio of Longdao24 to the reference genome was 97.87%, 96.11% and 95.08%, respectively. The 1 ×, 5 × and 10 × coverage ratio of Longdao5 to the reference genome was 97.21%, 95.44% and 93.19%, respectively. The 1 ×, 5 × and 10 × coverage ratio of Jigeng83 to the reference genome was 98.40%, 97.32% and 96.63%, respectively. The state information of Longdao24 and its parent were showed in Table 1.

thumbnail
Table 1. The re-sequencing state information of Longdao24 and its parent.

https://doi.org/10.1371/journal.pone.0181037.t001

Frequency distributions of different variation

We then examined genome-wide variations. 420,475 SNPs, 95,624 InDels, and 14,112 SVs were yielded from the Longdao24 genome. The frequency distributions of SNPs and InDels on Longdao24 genome were showed in Fig 2 and Fig 3. We detected ten abundant SNPs high region on chr 2(33–35 Mb), chr 4(6.5–12 Mb), chr 5(19-22Mb), chr 6(10-22Mb, 27-31Mb), chr 8(21.5–23 Mb), chr 10(2–6 Mb, 15-16Mb, 17–21 Mb) and chr12 (10–15 Mb). Moreover, the most abundant InDels region were identified on chr 2(33–35 Mb), chr 5(19-22Mb), chr 6(10-22Mb, 27-31Mb), chr 8(21.5–23 Mb) and chr 10(2–6 Mb, 15-16Mb, 17–21 Mb). Several chromosomal loci in the SNP high region were also found covered by the InDel high regions (Figs 2 and 3), which might suggest that those regions were the most polymorphic. 361,117 SNPs and 81,488 InDels were detected between the Longdao24 genome and Longdao5 genome. 428,908 SNPs and 97,209 InDels were identified between the Longdao24 genome and Jigeng83 genome. Frequency distributions of all different variation of Longdao24 genome was showed on Fig 4 via circos software.

thumbnail
Fig 2. The frequencies of SNPs for Longdao24 at a 100 kb sliding window along each chromosome.

https://doi.org/10.1371/journal.pone.0181037.g002

thumbnail
Fig 3. The frequencies of Indels for Longdao24 at a 100 kb sliding window along each chromosome.

https://doi.org/10.1371/journal.pone.0181037.g003

thumbnail
Fig 4. Summary of different variation between Longdao24 and reference variety Nipponbare.

From the outside to the inside in turn is chromosome location, distribution of SNP density, distribution of InDel density, Copy number variation (CNV), Structure Variation (Insertion (INS), Deletion (DEL), Inversion (INV), Intra-chromosomal Translocation (ITX, red line) and Inter-chromosomal Translocation (CTX, green line)).

https://doi.org/10.1371/journal.pone.0181037.g004

Characteristics of SNPs, InDels and SVs

22,250, 65,858, 137,848 and 107,288 SNPs were found in intergenic region, intron region, upstream region and downstream region respectively. 9,054 SNPs were located in the 5’UTR and 3’UTR region. 1,958 SNPs were detected near the splice site. A total of 76,214 SNPs were located in the CDS regions, among which 31,308 were synonymous coding sequences (Syn CDS) and 42,115 were non-synonymous coding sequences (Nonsyn CDS) (Fig 5A).

thumbnail
Fig 5. Summary of SNPs and Small Indels detected in Longdao24.

https://doi.org/10.1371/journal.pone.0181037.g005

5,623, 17,609, 34,717 and 27,132 InDels were found in intergenic region, intron region, upstream region and downstream region respectively. 4,465 InDels were located in the 5’UTR and 3’UTR region. 426 InDels were detected near the splice site. A total of 5,602 InDels were located in the CDS regions, among which 3,401 caused frame shift and 2,092 caused codon change (Fig 5B).

A total of 14,112 Structure Variation (SVs) were detected in Longdao24 including 7,795 Insertion (INS), 3,273 Deletion (DEL), 217 Inversion (INV), 317 Intra-chromosomal Translocation (ITX) and 2396 Inter-chromosomal Translocation (CTX). A total of 11,285 SVs were located in the CDS regions, among which 2,969 were detected in exon, 1,697 were located in intron and 6,619 were located in intergenic region. And 2,677 Copy number variation (CNVs) were also detected in Longdao24.

Variation annotation and gene categories

16,831 variations including 10,615 non-synonymous SNP mutations, 3,727 frame shifts and 2,489 SVs caused by indels, transversions, and transitions in 12,772 genes might influence the expression of the relevant protein. All tested and functional annotated genes were classified into gene ontology (GO) categories (Fig 6), which mainly divided the genes into three categories: cellular components (20 groups), molecular function (17 groups), and biological process (17 groups). Classification of gene variations compared with COG database by blast showed that most variations (1,935) were identified in replication, recombination and repair progress. Compared with the reference genome, most variations was screened in genetic information processing and metabolism.

thumbnail
Fig 6. GO classify for differential UniGenes in Longdao24.

https://doi.org/10.1371/journal.pone.0181037.g006

Variation analysis on high yield genes

We investigated 22 cloned yield related genes [7] which might explain the high yield of Longdao24. A large number of SNPs and Indels (Table 2) were only detected in the CDS sequence of 4 genes related to high yield among Longdao24, Longdao5 and Jigeng83, such as Gn1a (related to spikelet number), EP3 (related to erect panicle and spikelet number), SCM2 (related to lodging resistance and spikelet number) and qGW8 (related to grain size and grain quality). Four non-synonymous SNPs and a codon insertion InDel were identified in Gn1a locus. Two non-synonymous SNPs and a codon change + deletion InDel were identified in SCM2 locus. Three non-synonymous SNPs and a codon deletion InDel were identified in qGW8 locus. One frame shift InDel were identified in EP3 locus. The allele of Gn1a, DEP3 and SCM2 were come from the female parent Longdao5. The allele of qGW8 were come from the male parent Jigeng83.

thumbnail
Table 2. Variations of 22 high yield related genes in Longdao 24.

https://doi.org/10.1371/journal.pone.0181037.t002

Variation analysis on starch quality genes

We investigated 22 cloned grain quality related genes [29] in Longdao24 (Table 3). A total of fourteen SNPs and one Indels were detected in the CDS sequence of seven grain quality genes among Longdao24, Longdao5 and Jigeng83. Eleven SNPs and one Indels were detected in GBSSII (granule-bound starch synthase II), SSIIIb (Soluble starch synthase IIIb) and SSIIc (Soluble starch synthase IIc) locus between Longdao24 and Nipponbare. The GBSSII in Longdao24 had only one SNP. The SSIIIb and SSIIc of Longdao24 had six SNPs plus one Indel and four SNPs, respectively. The other three SNPs were detected in SSIVa and SBE3 locus among Longdao24, Longdao5 and Jigeng83. The allele of GBSSII locus was same among Longdao24, Longdao5 and Jigeng83, but different from Nipponbare. The other four allele of SSIVa, SBE3, SSIIIb and SSIIc were come from the male parent Jigeng83.

thumbnail
Table 3. Variations of 22 grain quality related genes in Longdao 24.

https://doi.org/10.1371/journal.pone.0181037.t003

Variation analysis on heading date genes

Eight (DTH2, OsLF, Hd17, Hd3a, Hd1, Hd2, Ehd3 and OsMADS56) from 39 heading date genes had 24 variations (19 SNPs and 5 Indels) among Longdao24, Longdao5 and Jigeng83 (Table 4). The only one non-synonymous SNP in DTH2 allele from Longdao24 was same as Nipponbare and Jigeng83, but different from Longdao5. Hd1 allele in Longdao24 had 6 non-synonymous SNPs, one frame shift Indel and three long base pare insertion (36 bp, 69 bp and 51 bp). Hd2, Ehd3 and OsMADS56 in Longdao24 all be identified with 3 non-synonymous SNPs. OsLF in Longdao24 all had one non-synonymous SNP and one Indel. Only one non-synonymous SNP was detected in Hd3a and Hd17. The SNPs and Indels in Hd3a, Hd2 and Hd1 locus were detected between Nipponbare with the other three varieties. The allele of OsLF were come from the female parent Longdao5. The other fourth allele of DTH2, Hd17, Ehd3 and OsMADS56 were come from the male parent Jigeng83.

thumbnail
Table 4. Variations of 39 heading date related genes in Longdao 24.

https://doi.org/10.1371/journal.pone.0181037.t004

Development DNA markers for providing evidence of polymorphisms

Although the accuracy of deep sequencing has now been greatly improved, it still needs to be confirmed by other methods for the variations of interest. Therefore, we selected one InDel and one SNP polymorphisms to developed DNA markers (Table 5). We designed the InDel marker according the 9 bp deletion in SCM2 locus. A complete matching genotypes was observed in Longdao24 and its parents. Longdao24 and Longdao5 was found having this 9-bp deletion compared to Jigeng83 (Fig 7A). We also developed a CAPS marker to provide the SNP (2235191) in Hd17 locus. A complete matching genotypes was also observed in Longdao24 and its parents (Fig 7B).

thumbnail
Fig 7. Development DNA markers for providing evidence of polymorphisms.

https://doi.org/10.1371/journal.pone.0181037.g007

thumbnail
Table 5. The primers used for detection the polymorphisms in SCM2 and Hd17.

https://doi.org/10.1371/journal.pone.0181037.t005

Discussion

Breeding value of the re-sequencing data of super high yield rice variety

Marker-assisted selection (MAS) is expected to provide higher efficiency, reduced cost and shorter duration of the breeding scheme, compared with conventional methods. But most important complex traits in rice, including yield, quality and stress tolerance, are controlled by quantitative trait loci (QTLs). Isolating and characterizing genes involved in QTLs has been the key step for MAS. Whole genome re-sequencing technology has provided the possibility of genome wide variation analysis in rice breeding varieties. Longdao24 had been used widely as breeding parent, because of its high yield, good quality, early heading date and strong stress tolerance. Genome sequencing of Longdao24 with 47 × coverage effective depth and 95.08% 10 × coverage ratio was achieved. A total of 420,475 SNPs, 95,624 InDels, and 14,112 SVs were yielded from the Longdao24 genome. We identified 361,117 SNPs and 81,488 InDels between Longdao24 genome and Longdao5 genome. We also detected 428,908 SNPs and 97,209 InDels between Longdao24 genome and Jigeng83 genome. These different variations will be used for developing new molecular markers for genetics research and MAS.

The genetic basic of high yield in Longdao24

Developments of whole genome re-sequencing technology have provided new tools for discovering and tagging novel useful alleles for improving target traits and for manipulating those genes in rice breeding program through MAS. We identified four yield related genes (Gn1a, EP3, SCM2 and qGW8) having genetic variations in Longdao24. The first one is Gn1a, which controlled cytokinin accumulation in inflorescence meristems and increases the number of reproductive organs, resulting in enhanced grain yield [30]. Comparison of the DNA sequences between the cultivars revealed several nucleotide changes, including a 6-bp insertion in the first exon, and four nucleotide changes resulting in amino acid variation in the first and fourth exons of the Longdao24 and Longdao5 allele (Fig 8A). The Gn1a allele from Longdao24 and Longdao5 was similar to Habataki, which has been reported having the increasing Gn1a allele. The second one is EP3, which controlled panicle architecture and enhance the grain yield in rice [31]. Comparison of the DNA sequences between the cultivars revealed a 2-bp deletion in the first exon of the Longdao24 and Longdao5 allele (Fig 8B). The EP3 allele from Longdao24 and Longdao5 was a novel allele. The third one is SCM2, which showed enhanced culm strength and increased spikelet number [32]. Comparison of the DNA sequences between the cultivars revealed several nucleotide changes, including a 9-bp deletion in the second exon, and two nucleotide changes resulting in amino acid variation in the first exons of the Longdao24 and Longdao5 allele (Fig 8C). The SCM2 allele from Longdao24 and Longdao5 was same as Habataki, which has been reported having the increasing SCM2 allele. The last one is qGW8, which showed enhanced culm strength and increased spikelet number [33]. Comparison of the DNA sequences between the cultivars revealed several nucleotide changes, including a 10-bp insertion in the first exon, a 3-bp deletion in the first exon, and three nucleotide changes resulting in amino acid variation in the first and third exons of the Longdao24 and Jigeng83 allele (Fig 8D). The qGW8 allele from Longdao24 and Longdao5 was same as HJX74, which has been reported having the increasing yield haplotype. So, the super high yield formed by the large half erect panicle and lodging resistance of Longdao24 would be caused by these four increasing alleles of high yield genes including Gn1a, EP3, SCM2 and qGW8.

thumbnail
Fig 8. The yield related genes (Gn1a, EP3, SCM2 and qGW8) having genetic variations in Longdao24.

https://doi.org/10.1371/journal.pone.0181037.g008

The genetic basic of early heading in Longdao24

Heading date is a major determinant of rice adaptability to regional and environmental conditions. It has been a major selecting target in rice breeding programs. To elucidate the key heading date gene in Longdao24 for adaptation to the northern limit of rice cultivation Heilongjiang Province (43°26’-53°33’), we analyzed 39 cloned heading date related genes. DTH2, OsLF, Hd17, Hd3a, Hd1, Hd2, Ehd3 and OsMADS56 had variations among Nipponbare, Longdao24, Longdao5 and Jigeng83. The alleles of Hd3a, Hd1 and Hd2 in Longdao24, Longdao5 and Jigeng83 were different from that in Nipponbare. The might be the major reason for these three varieties adaptation to the northern limit of rice cultivation. Hd1 and Hd3a are the two key genes controlled heading date in rice [34]. Comparison of the DNA sequences of Hd3a between the cultivars revealed only one nucleotide changes resulting in amino acid variation in the fourth exon of Longdao24, Longdao5 and Jigeng83 allele (Fig 9A). This Hd3a allele was a novel allele, which was not be reported before. The only one SNP caused an amino acid change at the carboxyl end of the predicted protein: the Asn (AAG) in Nipponbare was changed to Lys (AAC). The Hd1 allele in Longdao24, Longdao5 and Jigeng83 was also a novel allele. Four InDels and six SNPs was identified by comparison of the DNA sequences of Hd1 between the cultivars. Three long insertion and four nucleotide changes resulting in amino acid variation were identified in the first exon; and a 2-bp insertion and two nucleotide changes resulting in amino acid variation were identified in the second exon (Fig 9B). The Hd2 allele in Longdao24, Longdao5 and Jigeng83 all had the three nucleotide changes resulting in amino acid variation in the first, fourth and eighth exons (Fig 9C). These variations were found in many European and Asian rice cultivars, which flower extremely early under natural long-day conditions [35]. The DTH2 allele in Longdao24 was same as Jigeng83 (Fig 9D). This SNP in the third exon was reported as a key functional differences in the rice landraces from northern limit of rice cultivation in the world [36]. We also identified the allele of Hd17, Ehd3 and OsMADS56 in Longdao24 was came from Jigeng83 (Table 4). But only allele of OsLF was detected from Longdao5 (Table 4). All these information showed that the Hd3a, Hd1 and Hd2 might be the major reason for the varieties adaptation to the northern rice cultivation region. The DTH2, OsLF, Hd17, Ehd3 and OsMADS56 might formed different varieties adaptation to different region in northern rice cultivation location.

thumbnail
Fig 9. The heading related genes (Hd3a, Hd1, Hd2 and DTH2) having genetic variations in Longdao24.

https://doi.org/10.1371/journal.pone.0181037.g009

The genetic basic of grain quality in Longdao24

Starch is the major component of rice grain, which is mainly composed of amylose and amylopectin. The starch synthesis pathway is an ideal system for examining the evolution of biochemical pathways. Over 20 genes involved in the rice starch synthesis pathway have been identified so far [37]. We investigated 22 cloned starch synthesis related genes in Longdao24 (Table 3). We only identified variations in SSIVa, SBE3, SSIIIb and SSIIc between the parents of Longdao24. All four allele of SSIVa, SBE3, SSIIIb and SSIIc were come from the male parent Jigeng83. These genes represented starch synthases (SSIVa, SSIIIb and SSIIc) and branching enzymes (SBE3). These enzymes are involved in amylopectin biosynthesis in rice endosperm. They had low to medium effects on variation in starch trait [38]. So we need do more works to get enough data, for example polymorphisms or expression levels of these genes, to explore the quality genetic basis of Longdao24.

References

  1. 1. Chen WF, Xu ZJ, Zhang LB. Theories and Methods of Breeding Japonica Rice for Super High Yield. Journal of Shenyang Agricultural University. 2005; 1: 3–8. (in Chinese)
  2. 2. Yuan LP. Development of Super Hybrid Rice for Food Security in China. Engineering. 2015; 1: 13–14.
  3. 3. IRRI. IRRI redesigns rice plant to yield more grain. IRRI reporter. 1994; 4: 1.
  4. 4. Cao LY, Zhan XD, Chen SG, Feng Y, Wu WM, Shen XH, et al. Breeding Methodology and Practice of Super Rice in China. Rice Science, 2010; 2: 87–93.
  5. 5. Huang X, Qian Q, Liu Z, Sun H, He S, Luo D. Natural variation at the DEP1 locus enhances grain yield in rice. Nature Genetics. 2009; 4:494–497.
  6. 6. Wang J, Nakazaki T, Chen S, Chen W, Saito H, Tsukiyama T. Identification and characterization of the erect-pose panicle gene EP conferring high grain yield in rice (Oryza sativa L.). Theoretical and Applied Genetics. 2009; 119: 85–91. pmid:19407986
  7. 7. Rao YC, Li YY, Qian Q. Recent progress on molecular breeding of rice in China. Plant Cell Rep. 2014; 33: 551–564. pmid:24442397
  8. 8. Miura K, Ashikari M, Matsuoka M. The role of QTLs in breeding of high-yielding rice. Trends Plant Sci. 2011; 6: 319–326.
  9. 9. Yu J, Hu SN, Wang J, Wong GK, Li SG, Liu B. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002; 296: 79–92. pmid:11935017
  10. 10. Goff SA, Ricke D, Lan TH, Presting G, Wang R L, Dunn M. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science. 2002; 296: 92–100. pmid:11935018
  11. 11. International Rice Genome Sequencing Project. The map-based sequence of the rice genome, Nature. 2005; 436: 793–800. pmid:16100779
  12. 12. Gao Q, Yue GD, Li WQ, Wang JY, Xu JH, Yin Y. Recent progress using high-throughput sequencing technologies in plant molecular breeding. J Integr Plant Biol. 2012; 4: 215–227.
  13. 13. Guo LB, Gao ZY, Qian Q. Application of resequencing to rice genomics, functional genomics and evolutionary analysis. Rice. 2014; 7: 4. pmid:25006357
  14. 14. Han B, Huang XH. Sequencing-based genome-wide association study in rice. Curr Opin Plant Biol. 2013; 16: 133–138. pmid:23562592
  15. 15. Gao ZY, Zhao SC, He WM, Guo LB, Peng YL, Wang JJ. Dissecting yield-associated loci in super hybrid rice by resequencing recombinant inbred lines and improving parental genome sequences. Proc Natl Acad Sci U S A. 2013; 35: 14492–14497.
  16. 16. Yu HH, Xie WB, Wang J, Xing YZ, Xu CG, Li XH. Grains in QTL detection using an ultra-high density SNP map based on population sequencing relative to traditional RFLP/SSR markers. PLoS One. 2011; 3: e17595.
  17. 17. Li JY, Wang J, Zeigler RS. The 3,000 rice genomes project: new opportunities and challenges for future rice research. Giga Science. 2014; 3: 8. pmid:24872878
  18. 18. Jiang SK, Zhong M, Xu ZJ. Classification of rice cultivars with RAPD molecular markers. J Shenyang Agric Univ. 2006; 37: 639–641. (in Chinese)
  19. 19. Li H., Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010; 26: 589–595. pmid:20080505
  20. 20. Kawahara Y, De la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S. Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice. 2013; 6:4. pmid:24280374
  21. 21. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N. 1000 Genome Project Data Processing Subgroup: The Sequence Alignment/Map format and SAM tools. Bioinformatics. 2009; 25: 2078–2079. pmid:19505943
  22. 22. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20:1297–303. pmid:20644199
  23. 23. Chen K, Wallis JW, McLellan MD, Larson DE, Kalicki JM, Pohl CS, et al. BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods. 2009; 9: 677–681.
  24. 24. Cingolani P, Platts A, Wangle L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012; 2: 80–92.
  25. 25. Deng YY, Li JQ, Wu SF, Zhu YP, Chen YW, He FC. Integrated nr Database in Protein Annotation System and Its Localization. Computer Engineering. 2006; 5: 71–74.
  26. 26. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 1: 25–29.
  27. 27. Tatusov RL, Galperin MY, Natale D A, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2010; 1: 33–36.
  28. 28. Kanehisa M., Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004; 32: D277–D280. pmid:14681412
  29. 29. Tian ZX, Qian Q, Liu QQ, Yan MX, Liu XF, Yan CJ. Allelic diversities in rice starch biosynthesis lead to a diverse array of rice eating and cooking qualities. Proc Natl Acad Sci U S A. 2009; 51: 21760–21765.
  30. 30. Ashikari M., Sakakibara H, Lin S, Yamamoto T, Takashi T, Nishimura A, et al. Cytokinin oxidase regulates rice grain production. Science. 2005; 309(5735): 741–745. pmid:15976269
  31. 31. Li M, Tang D, Wang K, Wu X, Lu L, Yu H. Mutations in the F-box gene LARGER PANICLE improve the panicle architecture and enhance the grain yield in rice. Plant Biotechnol J. 2011; 9(9):1002–1013. pmid:21447055
  32. 32. Ookawa T, Hobo T, Yano M, Murata K, Ando T, Miura H. New approach for rice improvement using a pleiotropic QTL gene for lodging resistance and yield, Nat Commun. 2010; 1: 132. pmid:21119645
  33. 33. Wang S, Wu K, Yuan Q, Liu X, Liu Z, Lin X. Control of grain size, shape and quality by OsSPL16 in rice. Nat Genet. 2012; 44(8):950–954. pmid:22729225
  34. 34. Takahashi Y, Teshima KM, Yokoi S, Innan H, Shimamoto K. Variations in Hd1 proteins, Hd3a promoters, and Ehd1 expression levels contribute to diversity of flowering time in cultivated rice. Proc Natl Acad Sci U S A. 2009; 106(11): 4555–4560. pmid:19246394
  35. 35. Koo BH, Yoo SC, Park JW, Kwon CT, Lee BD, An G. Natural variation in OsPRR37 regulates heading date and contributes to rice cultivation at a wide range of latitudes. Mol Plant. 2013; 6(6): 1877–1888. pmid:23713079
  36. 36. Wu W, Zheng XM, Lu G, Zhong Z, Gao H, Chen L. Association of functional nucleotide polymorphisms at DTH2 with the northward expansion of rice cultivation in Asia. Proc Natl Acad Sci U S A. 2013; 110(8): 2775–2780. pmid:23388640
  37. 37. Yu G, Olsen KM, Schaal BA. Molecular evolution of the endosperm starch synthesis pathway genes in rice (Oryza sativa L.) and its wild ancestor, O. rufipogon L. Mol Biol Evol. 2011; 28(1): 659–671. pmid:20829346
  38. 38. Kharabian-Masouleh A, Waters DL, Reinke RF, Ward R, Henry RJ. SNP in starch biosynthesis genes associated with nutritional and functional properties of rice. Nature Sci Rep. 2012; 2: 557.