Somaclonal variation generally occurs in plants regenerated from tissue culture. However, fundamental issues regarding molecular characteristics, mutation rates and mutation spectra of plant somatic variation as well as their phenotypic relevance have been addressed only recently. Moreover, these studies have reported highly discrepant results in different plant species and even in the same plant genotype.
We investigated heritable genomic variation induced by tissue culture in rice by whole genome re-sequencing of an extensively selfed somaclonal line (TC-reg-2008) and its wild type (WT) donor (cv. Hitomebore). We computed the overall mutation rate, single nucleotide polymorphisms (SNPs), small scale insertions/deletions (Indels) and mobilization of transposable elements (TEs). We assessed chromosomal distribution of the various types of genomic variations, tested correlations between SNPs and Indels, and examined concomitancy between TE activity and its cytosine methylation states. We also performed gene ontology (GO) analysis of genes containing nonsynonymous mutations and large-effect mutations, and assayed effects of the genomic variations on phenotypes under both normal growing condition and several abiotic stresses. We found that heritable somaclonal genomic variation occurred extensively in rice. The genomic variations distributed non-randomly across each of the 12 rice chromosomes, and affected a large number of functional genes. The phenotypic penetrance of the genomic variations was condition-dependent.
Tissue culture is a potent means to generate heritable genetic variations in rice, which bear distinct difference at least in space (chromosomal distribution) from those occurred under natural settings. Our findings have provided new information regarding the mutation rate and spectrum as well as chromosomal distribution pattern of somaclonal variation in rice. Our data also suggest that rice possesses a strong capacity to canalize genetic variations under normal growing conditions to maintain phenotypic robustness, which however can be released by certain abiotic stresses to generate variable phenotypes.
Citation: Zhang D, Wang Z, Wang N, Gao Y, Liu Y, Wu Y, et al. (2014) Tissue Culture-Induced Heritable Genomic Variation in Rice, and Their Phenotypic Implications. PLoS ONE9(5): e96879. https://doi.org/10.1371/journal.pone.0096879
Editor: Khalil Kashkush, Ben-Gurion University, Israel
Received: February 27, 2014; Accepted: April 12, 2014; Published: May 7, 2014
Copyright: © 2014 Zhang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the State Key Basic Research and Development Plan of China (2013CBA01404) and the Program for Introducing Talents to Universities (#B07017). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
It has been well-established that plants possess the ability to regenerate from totipotent, differentiated somatic cells via in vitro tissue culture. However, this process can be associated with a variety of genetic and epigenetic instabilities, and some of which can be translated into heritable phenotypic variations, a phenomenon has been collectively termed somaclonal variation . Since the first report of somaclonal variation in regenerated plants from sugarcane tissue culture , numerous cases of somaclonal variation involving a large number of plant species have been reported . However, insights into the molecular nature, mutation rate and spectrum of somaclonal variation at the molecular level (nucleotide sequence changes) have been gained only recently thanks to the revolutionary breakthroughs of new analytical tools, especially the next generation sequencing (NGS)-based genomic analysis.
Hitherto, three whole genome sequencing-based studies on plant somaclonal variation have been published, one in Arabidopsis  and two in rice , . The Arabidopsis study revealed that regenerated plants (regenerants) displayed a generally elevated genome-wide DNA sequence mutation rate from naturally occurred mutations, and which bore distinct molecular mutation spectra, with base substitutions being the primary type of genomic changes . Unexpectedly, no transpositional reactivation of transposable elements (TEs) was identified in the regenerants of Arabidopsis , suggesting that TE transpositional activity, a widely believed causal factor for tissue culture-induced genetic variations , –, is not underpinning somaclonal variations in Arabidopsis. In contrast, both nucleotide sequence changes and TE transpositions were detected in rice regenerants , . In aspect of TE transpositional activity, however, the two rice studies have obtained sharply discrepant results , . While at least 13 TE families were transposed causing 34 new insertions were reported by Sabot et al. , only a single TE (Tos17) was shown to be active, and which caused only 10 new insertions in the study by Miyao et al. . Causes for the discrepancy remained elusive, but genetic context effect can be ruled out as both studies used the same rice genotype, Nipponbare. Clearly, additional studies with more robust analysis and independent validation are needed to reveal possible rules of the molecular nature and mutation spectra in general and TE mobility in particular for a deeper understanding of plant somatic variation, a phenomenon bears broad implications in germplasm in vitro conservation and propagation, germplasm innovation and biotechnology-facilitated crop improvement .
In this study, we conducted whole genome re-sequencing of a stabilized rice line, TC-reg-2008, which was originated from tissue culture of a japonica rice cultivar Hitomebore, and had been selfed for eight successive generations. We compared the whole genome of TC-reg-2008 with its original wild-type (WT) donor at single-base resolution from a genome-wide scale. We showed that extensive and heritable genomic variations including single nucleotide polymorphisms (SNPs), small-scale insertions and deletions (Indels), as well as transpositional reactivation of three TEs occurred in TC-reg-2008 in spite of the fact that this line showed no appreciable phenotypic variations compared to its WT donor under normal growing conditions. Our data have provided additional insights into the molecular features of somaclonal variation in rice and indicated their phenotypic canalization under normal conditions.
Materials and Methods
Plant Materials and Genomic DNA Extraction
The TC-reg-2008 and wild-type rice (Oryza sativa ssp. japonica cv. Hitomebore) were used for this study. Callus of Hitomebore were induced from germinating seeds on Murashige-Skoog solid medium containing 2 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D) at 26±1°C in darkness for 1 month, then transferred into N6 liquid medium containing 1 mg/L 2,4-D and cultured at 26±1°C in darkness for five months. Then, embryogenic calli were subcultured on N6 medium with 0.3% casamino acid, 0.1% proline and 1 mg/L 2,4-D at 26±1°C under a 14 hrs photoperiod for plant regeneration, followed by rooting in the regeneration medium containing 0.1 mg/L 6 benzyladenine (6-BA) and 0.01 mg/L naphtaleneacetic acid (NAA). Finally, regenerated plantlets in healthy conditions were transferred to soil and grown in green house. Genomic DNA from leaf tissues of uncultured wild-type (WT) and TC-reg-2008 (selfed for eight consecutive generations) was isolated by a modified CTAB method and purified by phenol extractions. Genomic DNA was used for whole genome re-sequencing, Southern blotting, locus-specific PCR amplifications and bisulfite sequencing.
Whole Genome Sequencing, Building of WT Reference and Identification of SNPs and Indels
Purified high quality DNA of both samples was prepared for whole genome re-sequencing. Library construction, cluster generation and Hiseq2000 sequencing were carried out with standard protocols. The 90 nt paired-end reads were yielded. Raw data were cleaned by removing adaptor contamination and low quality reads before further analysis. To identify genomic variations in TC-reg-2008, a whole genome WT reference sequence was generated by mapping-based method . This was accomplished by mapping the WT reads against the standard Nipponbare reference genome (MSU 7.0 http://rice.plantbiology.msu.edu/index.shtml)  using Burrows Wheeler Aligner (BWA) . The reads which were uniquely mapped and the mapping quality scores were higher than 30 (Phred scale) were used to produce a new theoretical reference genome named WT Reference for rice cv. Hitomebore, in which the sequence variations between WT and Nipponbare were replaced with WT of Hitomebore by a stringent custom Perl scripts. Thereafter, the reads of TC-reg-2008 were aligned to the Hitomebore WT Reference to identify SNPs and Indels using SAMtools (v0.1.5c)  with the following parameters: Phred score ≥30, coverage ≥10 and ≤100.
To explore the genome-wide patterns of genomic variation in the TC-reg-2008 genome relative to that of WT, a 100 kilobase pair (kb) sliding window along each of the 12 rice chromosomes was applied to calculate DNA polymorphisms using Perl scripts. Moreover, based on the Generic Feature Format Version 3 (GFF3) files of the annotated Nipponbare reference genome, all detected SNPs were annotated and calculated as genic or intergenic. SNPs in genic regions were classified as coding sequences (CDS), untranslated regions (UTR), and introns. According to the amino acid substitutions, SNPs in CDS were divided into two types, synonymous and nonsynonymous. The nonsynonymous and large-effect mutations were further analyzed for gene ontology (GO) enrichment using the AgriGO Singular Enrichment Analysis tool .
Detection and Validation of New Insertions of Transposable Elements (TEs)
To investigate whole genome TE mobilization events, we used a modified paired-end method reported previously . All known TE sequences were compiled to establish a TE library. A new TE reference was first generated by removing the known TEs from Nipponbare sequence (MSU 7.0 http://rice.plantbiology.msu.edu/index.shtml) , and then each end of TC-reg-2008 and its WT (Hitomebore) raw data were aligned against to this new TE reference and the TE library using BWA . After that, the reads characterized by one end being highly matching to the TEs library and the other one mapping uniquely to this new TE reference were selected using SAMtools followed by perl scripts as candidate TEs. Furthermore, these candidate TEs were filtered and those uniquely appeared in TC-reg-2008 were retained, which were newly inserted TE events in TC-reg-2008.
The TE mobilization was verified by Southern blotting following rational reported previously . Specifically, genomic DNA (∼10 µg per sample) was digested by Xbal (New England Biolabs Inc.), which was suitable for assessing copy number of the analyzed TEs as there were no restriction sites or only one site on one side of the probe region. Restricted DNA was run through 1% agarose gels and transferred onto Hybond N+ nylon membranes purchased from Amersham Pharmacia Biotech by the alkaline transfer method recommended by the supplier. Three pairs of primers were designed (see Table S1) to amplify the fragments from the WT genomic DNA used as hybridization probes. Authenticity of the probe fragments was verified by Sanger-sequencing, and labeled with fluorescein-11-dUTP by the Gene Images random prime-labeling module (Amersham Pharmacia Biotech). Hybridization signal was detected by the Gene Images CDP-Star detection module (Amersham Pharmacia Biotech) after washing at a stringency of 0.2× SSC, 0.1% SDS for 2×50 min. The blots were exposed to X-ray films for chemiluminecence signal detection. The insertion events by Tos17 (the most active TE) in TC-reg-2008, were also investigated by locus-specific PCR amplifications and cloning/sequencing. Specifically, the five insertions by retrotransposon Tos17 were analyzed by locus-specific PCR amplifications with primers targeting both the TE terminals and their immediate flanks (Table S2). The PCR reactions were performed under conditions of 10 min at 94°C; 35 cycles of 30 sec at 94°C, 45 sec at 58°C, 1.5 min at 72°C; a final extension for 10 min at 72°C. PCR products were visualized by ethidium bromide staining after electrophoresis through 2% agarose gels.
Bisulfite Sequencing of Original Tos17 Loci
The protocol was basically as reported , with minor modifications. Specifically, genomic DNA (∼0.5 µg) was treated by EZ DNA Methylation- Gold KIT (Zymo Research, http://zymosearch.com), and then amplified with designed PCR primers followed by nested PCR for Tos17 on chromosome 10 and general PCR for Tos17 on chromosome 7 (Table S3). The sequences were analyzed by the Kismeth program (http://katahdin.mssm.edu/kismeth/revpage.pl) and the Kismeth generated data were presented by histograms.
Abiotic Stress Treatments
Uniform and healthy seeds of TC-reg-2008 and WT were selected for abiotic stress treatments, including 150 mM NaCl (salinity), 0.25 mM CuSO4 (heavy metal), 0.25 mM HgCl2 (heavy metal) and 1 mM sodium nitroprusside (nitric oxide). The seeds were disinfected and thoroughly rinsed, and then germinated in petri-dishes with sterile water at 28°C in the darkness for 2 days. The seeds (30 each for mock and treatment) were then transferred to an illumination incubator (30°C for18 hrs during the day and 25°C for 6 hrs during the night), and the stressed groups were watered with Hoagland solution containing four kinds of abiotic stress while the control group was watered with only Hoagland once each day for a week. Notably, the seeds for sodium nitroprusside stress were placed on an individual illumination incubator because sodium nitroprusside could release NO. All statistical analyses were done by R (v3.0.2) (http://www.r-project.org/).
Tissue Culture Induced DNA Polymorphisms and Base Substitution Patterns
A tissue cultured-originated, and extensively selfed (for 8 successive generations) rice line named TC-reg-2008 was re-sequenced by the Illumina Higseq 2000 platform along with its WT donor (cv. Hitomebore). A salient feature of this line was that it did not show any discernible variation in phenotypes compared with its WT under normal controlled or paddy-field growing conditions, but showed clear phenotypic variation under several abiotic stresses. Therefore, all genomic changes occurred to this line should reflect heritable changes whose biological effects were efficiently canalized under normal conditions but which were released (penetrated) under certain stress conditions.
After removing contaminations and low quality reads, 12.4 G clean data were yielded in total, which were comprised of 318 million 90 nt paired-end reads. The genomic re-sequencing data were mapped against the standard rice genotype Nipponbare reference genome (MSU7.0 http://rice.plantbiology.msu.edu/index.shtml) . Approximately 94% re-sequencing reads pairs of TC-reg-2008 and its WT (cv. Hitomebore) were successfully mapped to the Nipponbare reference genome, and of which more than 75% reads were uniquely mapped (Table S4). On average, the effective sequencing depths of unique mapped reads were 15.4× and 14.7× coverage for TC-reg-2008 and WT, respectively.
We compared the DNA polymorphisms including single nucleotide polymorphisms (SNPs) and small scale (≤2 bp) insertions or deletions (Indels) between the WT genome and that of TC-reg-2008 using SAMtools (v0.1.5c) . A total of 54,268 DNA polymorphisms were detected, which included 37,332 SNPs and 16,936 small-scale Indels, both of which were mapped throughout the rice genome. The average mutation rate was 5.0×10−5 base substitutions per site across all the chromosomes. Based on the distribution of DNA polymorphisms, we found that both SNPs and Indels were non-randomly occurred across each of the 12 rice chromosomes (Table 1). Interestingly, the number of SNPs was significantly correlated with the number of Indels both at the whole genome level and for each of the 12 rice chromosomes (Figure S1). This suggests that probably a common mutational mechanism underlies the two kinds of genomic changes, or one kind mutation has increased propensity for the other to occur. The average density of DNA polymorphisms (defined as the average number of DNA polymorphisms observed for each 100 kb interval along each of all the 12 rice chromosomes) was 14.54, including 10 SNPs, 1.93 insertions and 2.6 deletions in TC-reg-2008 relative to WT (Table 1). The highest and lowest numbers of DNA polymorphisms per 100 kb were located to chromosome 11 (24.67) and 5 (3.5), respectively (Table 1). Within each chromosome, the peaks of SNPs and Indels exhibited the same trend (Figure S2), consistent with the overall strong correlation between the two kinds of changes (Figure S1). To test whether and to what extent the distribution of tissue culture-induced genomic variations would overlap with the genomic regions known as hyper-mutable in the course of rice evolution and domestication , we did a comparison and found that only two concentrated areas of DNA polymorphisms were located to these hyper-mutable genomic regions (Figure S2), suggesting possible different mutagenic mechanisms governing tissue culture-induced mutagenesis from that underlying natural genetic variations.
The identified SNPs were categorized and analyzed for their base substitution patterns. The average ratio of transitions (C/T or A/G) to transversions (C/G, T/A, A/C or G/T) in TC-reg-2008 vs. WT was 2.37 (Figure 1), which ranged from 1.70 for chromosome 4 to 2.76 for chromosome 6 (Table S5). The analysis of transitions and transversions suggested that C/T transition was the most frequently occurred base substitutions (Figure 1), which was consistent with previous results in both Arabidopsis  and rice . Changes from A to G and A to C were also observed with high frequencies. However, the average ratio of transitions vs. transversions we detected (2.37) was substantially greater than those reported previously in both Arabidopsis  and rice , which were 0.92 and 1.1, respectively.
Molecular Characteristics of the Tissue Culture Induced SNPs, Gene Ontology (GO) of the Affected Genes, and Evidence for Canalization
The majority of detected SNPs between TC-reg-2008 and WT were located in inter-genic regions (64%), while only 13,433 (36%) occurred in genic regions, consistent with possible selection constraint for the later. Of genic SNPs, 1,221 (9%), 6,877 (51%) and 5,335 (∼40%) were located in untranslated regions (UTR), introns, and gene-coding regions (CDS), respectively (Figure 2A). The SNPs located in CDS were further classified into nonsynonymous SNPs (totaling 3,431) and synonymous SNPs (totaling 1,972) (Figure 2A). Furthermore, 1,888 genes were found to contain more than one nonsynonymous SNP (Table S6). The average frequency of synonymous and nonsynonymous SNPs was significantly different among the 12 rice chromosomes (Figure 2B, right). There were distinctively more CDS SNPs of both synonymous and nonsynonymous types in chromosome 1, 4, 8 and 11 than the rest eight chromosomes (Figure 1B, right). We further analyzed large-effect SNPs in TC-reg-2008, which by definition, were those that contained substitutions that introduced premature termination codons (Premature, 132), eliminated translation initiation sites (ATG change, 120), and replaced nonsense by sense codons (Stop change, 27) (Table S7). We found that the average frequency of these large-effect SNPs was also non-randomly distributed among the 12 rice chromosomes, with>50% being mapped to the same four chromosomes, 1, 4, 8 and 11 (Figure 1B, left).
(A) Number and percentile of homozygous SNPs between TC-reg-2008 and WT were located in genic and intergenic regions, and then SNPs located in the genetic regions were further annotated. (B) The average frequency distributions of synonymous, nonsynonymous and large effect SNPs on each of the 12 rice chromosomes. The x-axis represents the average frequency (×10−5) of synonymous, nonsynonymous and large effect SNPs on each chromosome. (C) Function assignment of nonsynonymous and large-effect SNPs related to gene categories. The black horizontal bars denote the number of nonsynonymous SNPs occurred to genes enriched in biological process. The grey horizontal bars indicate the number of nonsynonymous SNPs occurred to genes enriched in molecular function. The white horizontal bars indicate the number of large effect SNPs occurred to genes enriched in molecular function.
Based on Gene Ontology annotations, the 1,888 genes containing nonsynonymous SNPs could be classified into different categories. All mutations were enriched in two broad categories: biological process and molecular function. Regardless of broad categories, the three most enriched specific gene categories were “response to stress” (153), “nucleotide binding” (142) and “protein modification process” (118) (Figure 2C). Moreover, GO analysis for the large-effect mutations showed that they were enriched in the broad category of molecular function, which included enzymes and proteins known to participate in transferase activity (21), nucleotide binding (19) and kinase activity (15) (Figure 2C), underscoring functional consequences of these mutations.
The foregoing results indicated that extensive and trangenerationally inherited genomic variations occurred in TC-reg-2008 as a result of tissue culture (Table 1). Moreover, a large number of the mutations affected genes with important functions (Figure 2C). Unexpectedly, in spite of these biologically meaningful genetic mutations, TC-reg-2008 did not show appreciable phenotypic differences from its WT donor (cv. Hitomebore) under normal controlled condition or paddy-field trails (data not shown). Although a robust explanation for this conundrum is gene redundancy given that many genes in the rice genome, as in those of other higher eukaryotes, exist as gene families, we suspected that this could be also due to the well-known phenomenon of canalization or developmental robustness –. If this was the reason or one of the reasons, then phenotypic variability might be released in TC-reg-2008 under some stress conditions whereby canalization was perturbed . To test this possibility, we subjected TC-reg-2008 together with WT to three kinds of abiotic stresses including salt, heavy metals (CuSO4 and HgCl2) and overdose nitric oxide (Figure S3A), and measured three phenotypic traits, shoot-length, root-length and total seedling fresh weight. We found that for each trait, TC-reg-2008 showed significant difference (inferior performance) from WT at least in one of the stress conditions (Figure S3B, C and D). These results suggest that the cryptic effects on phenotypes by the somaclonal genomic variations are due at least in part to canalization.
Tissue Culture Induced TE Mobility and its Concomitancy with Cytosine Methylation Dynamics
Transposable elements (TEs) are major constituents of plant genomes, which are usually epigenetically silenced by DNA methylation . For example, >50% cytosines of rice TEs were methylated . Under certain circumstances, some of the epigenetically silenced TEs can be reactivated to transpose and may lead to a variety of mutations by insertion disruption of gene function and/or re-modulating expression of TE-adjacent genes . Tissue culture induced TE activity has been widely reported in various plant species , –. We therefore analyzed TE mobility in TC-reg-2008 by pairs of reads in which one read was mapped to a TE and the other was uniquely mapped on the rice genome (rationale detailed in Materials and methods). We detected seven new insertions caused by three retrotransposons inTC-reg-2008 (Figure 3A). Notably, all seven insertions occurred at genic regions, and of which five occurred in exons of five different genes, while one in intron of one gene and one in 3′UTR of another gene (Figure 3B). To independently validate these retrotranspositional events, we performed Southern blotting to test for copy number variation of the TEs, and/or by locus-specific PCR amplifications to test for new insertions by the most active retrotransposon, Tos17 (Figure 3C). The retrotranspositional events by all three TEs were validated by Southern blotting, which showed increase of copy numbers consistent with retrotranspositions (Figure 3D). The five insertions by Tos17 were also validated by locus-specific PCR amplifications (Figure 3E). Moreover, Sanger-sequencing of the specifically amplified TE-flank junctions were found to contain the canonical target site duplications (TSDs) for Tos17 (Figure 3C). Furthermore, three forward insertions and two reverse insertions were identified by Sanger-sequencing (Figure 3C).
(A) Three active retrotransposons and seven insertions detected by paired-end mapping based on genome re-sequencing. (B) Insertion positions of all the activated retrotransposons and the function analysis of affected genes in TC-reg-2008. (C) Location of primers designed for locus specific-PCR and the junctions of the Tos17 insertion sites detected by Sanger-sequencing. P1/P4, P1/P2 and P3/P4 were three pairs of primers for amplifying the insertions of Tos17. The grey rectangles stand for LTRs of Tos17. The target site duplications of Tos17 were boxed in red based on Sanger-sequencing. (D) Independent validation of the mobilization events by Southern blotting. a. Tos17 probe, b. RN_21–12 probe, c. Osr6 probe. W1 and W2 stand for individual plants of wild type (WT), and R1 and R2 stand for individual plants of TC-reg-2008. (E) Five insertions by retrotransposon Tos17 were validated by locus-specific PCR. Lanes 1, 2 and 3 represent the PCR amplicons by primers P1/P4, P1/P2 and P3/P4, respectively.
TE activity is often causally linked to compromised repressive epigenetic states in which cytosine DNA methylation is a critical component. Prior studies have indicated extensive and transgenerational epigenetic remodeling associated with plant tissue culture , , . To test if mobility of TEs detected in TC-reg-2008 occurred concomitantly with changes of cytosine methylation, we analyzed cytosine methylation patterns of the two original copies of Tos17, which showed the highest mobility in TC-reg-2008, by bisulfite sequencing (Figure 4; Figure S4). Specifically, we inspected the 5′ LTR along with a portion of its immediate upstream flank and the 3′LTR along with a portion of its upstream contiguous body region as well as downstream flanking region for each of the two Tos17 original copies located on chromosomes 7 and 10, respectively, in TC-reg-2008 and WT (Figure 4A). We obtained the following results (Figure 4B, C; Figure S4A, B): (i) for both Tos17 copies, the upstream flanks adjacent to the 5′LTRs were non-methylated in WT and TC-reg-2008; (ii) in WT, the 5′LTRs of both copies were moderately methylated at CG sites, but CHG and CHH sites showed clear differences between the two copies, with the copy located on chromosome 10 being non- or only slightly methylated while both sites of the copy located on chromosome 7 being methylated to similar extents as CG sites; (iii) in TC-reg-2008, both copies showed significant hypermethylation of all sequence contexts, CG, CHG and CHH; (iv) in both WT and TC-reg-2008, the 3′LTRs and their immediate upstream body-regions of both copies were heavily methylated at CG sites and moderately methylated at CHG and CHH sites, and in no case substantial difference was detected between the two genotypes for these regions of either copy; (v) in both WT and TC-reg-2008, the downstream flanks of both copies of Tos17 showed heavy methylation at CG sites but little methylation at the CHG and CHH sites, and no difference existed between the two genotypes. Combined, it was evident that both original copies of Tos17 had undergone cytosine methylation remodeling during and after the process of tissue culture; the hypermethylation detected primarily in their 5′LTRs (which are promoters for retrotransposon transcriptional expression) in the stabilized line TC-reg-2008 might reflect reinforced silencing via the RNA-directed DNA methylation (RdDM) pathway which has been shown as responsible for repressive control of Tos17 –.
(A) The schematic diagram of Tos17 structure. The regions subjected to bisulfite sequencing were delineated. (B) Methylation changes of each sequence context in TC-reg-2008 (red bars) vs. WT (black bars) in the Tos17 copy located on chromosome 10. (C) Methylation changes of each sequence context in TC-reg-2008 (red bars) vs. WT (black bars) in the Tos17 copy located on chromosome 7.
In this study, genome-wide mutation rates, types and spectra induced by tissue culture in rice were analyzed in detail based on whole genome re-sequencing in a stabilized rice line (TC-reg-2008) derived from tissue culture followed by extensive selfing for eight successive generations. Our results are substantially different from excising results reported by previous studies in many aspects. We detected a mutation rate of 5.0×10−5 base substitutions per site, which is 21-fold and 29-fold higher than those reported in regenerants of Arabidopsis and a different genotypes of rice (Nipponbare), respectively , . Moreover, a total of 16,936 small-scale Indels were detected in TC-reg-2008, which is dramatically higher than that in A. thaliana (only 21 were detected) , while Indels were not analyzed in the previous study in the other rice genotype Nipponbare . Furthermore, seven mobilization events by three retrotransposons were identified and verified in this study, which were different from both previous studies in rice, as one of which showed only one TE (Tos17) being activated while the other showed 17 TEs being transposed . No TE activity was found in regenerants of Arabidopsis . Another notable difference between the tissue culture induced mutations we detected and those reported previously is random vs. biased chromosomal distribution of the mutations. We showed that both SNPs and small-scale Indels were non-randomly distributed across each of the12 rice chromosomes, and the same distribution trend was found between SNPs and Indels. In contrast, in the Arabidopsis study both SNPs (single base substitutions) and Indels were found to map evenly across the chromosomes . In the previous rice study, SNPs were found to distribute randomly across the rice chromosomes while Indels were not analyzed . The pattern of base substitution can be transitions (Ts) or transversions (Tv), and both types of substitution were detected in TC-reg-2008. The rate of transition to transversion we detected was 2.37, which was also different from those reported in the Arabidopsis and rice (cv. Nipponbare) regenerants , . We note however that the within transition and transversion biases, i.e., G:C > A:T, were similar to many previous reports on naturally occurred mutations –, possibly due to the highly conserved mechanism of deamination of methylated cytosines in all higher eukaryotes studied .
Taken together, it is clear that the types, rates, and spectra of tissue cultured induced mutagenesis are highly variable, which may vary across species, genotypes and conditions of tissue culture, or simply being fortuitous. Although we have shown previously based on DNA marker analysis that relative to epigenetic alteration, genetic mutation is the major type of molecular changes associated with rice tissue culture , both previous studies , ,  and our data here showed that at least TE mobilization occurs concomitantly with DNA methylation dynamics, consistent with their repressive control by this epigenetic marker , , . The involvement of epigenetic mechanisms lends support to the fortuitous aspect of tissue culture induced mutagenesis, as the later is known to occur stochastically .
Domestication-related regions of rice have been identified in previous studies, and which are genetically labile and prone to mutations under natural circumstances . Here we show that the hyper-mutagenic genomic regions as a result of tissue culture showed little coincidence with these hyper-mutable regions under nature conditions. This suggests that the mutagenic mechanisms underlying natural mutation and tissue culture-induced mutagenesis might be different. Of course, we cannot rule out the possibility that it was long-term selection that makes the difference, as the somaclonal line (TC-reg-2008) we used has undergone only eight generations without purposeful selection. Still, we consider that the issue merits further investigation to unveil possible novel aspect of practical implications of tissue culture in crop improvements.
Most organisms have the ability to stabilize their phenotypes although they harbor abundant cryptic genetic variations via the mechanism of canalization , . Nevertheless, the hidden genetic variations can be penetrated under certain stressful environments . We have shown here that the large amount of homozygous genetic variations including those apparently altered functionality of important enzymes or proteins manifested no phenotypic effects in the tissue culture-derived line (TC-reg-2008) compared to WT under normal growing conditions. If without further investigation, gene redundancy might be considered as a plausible explanation for this phenomenon. However, we demonstrated that significant phenotypic variations were observed between TC-reg-2008 and WT when the plants were grown under several abiotic stress conditions. This suggests that gene functional redundancy cannot be the only cause for the cryptic nature of the genetic variations. Instead, perturbation of canalization by the stresses, and hence penetrance of the genetic variations should be at least partially responsible for the phenomenon.
Correlation of SNPs and Indels per 100 kb interval on the whole genome level and for each of the 12 rice chromosomes. The X- and Y-axes stand for the number of SNPs per 100 kb window and the number of Indels per 100 kb window in TC-reg-2008, respectively.
Distribution of DNA polymorphisms within 100 kb window across each chromosome. The X-axis represents the physical distance along each chromosome, splitting into 100-kb sliding windows. The Y-axis indicates the number of SNPs, insertions or deletions. The black dot in each chromosome stands for centromere. The two boxed regions represent the domestication related regions identified inTC-reg-2008.
Phenotypic variations induced by different abiotic stresses between TC-reg2008 and its WT. (A) Photographed phenotypes in different stress conditions. (a)–(e) are control, salt, CuSO4, HgCl2 and overdose NO, respectively. Scale bars are 3 cm. (B)–(D) Tabulated results of shoot length, root length and fresh weight between TC-reg-2008 and its WT under different stresses. The black and grey vertical bars denote WT and TC-reg-2008, respectively. **indicate statistical significance at the 0.01 statistical level (One-way ANOVA).
Bisulfite sequencing-based cytosine methylation maps of the two Tos17 original copies located on chromosome 10 and chromosome 7, respectively. (A) All three types of cytosines, CG (red circles), CHG (blue circles) and CHH (green circles) generated by bisulfite sequencing for (a) the upstream flank and 5′LTR of Tos17 and (b) the 3′LTR along with a portion of its upstream contiguous body region and downstream flank of Tos17 on chromosome 10 for both WT and TC-reg-2008. (B) The same as above for (a) the upstream flank and 5′LTR of Tos17 and (b) the 3′LTR along with a portion of its upstream contiguous body region and downstream flank of Tos17 on chromosome 7 for both WT and TC-reg-2008.
A list of primers used for amplifying the Southern blotting probes.
A list of primers used for locus-specific PCR.
A list of primers used for bisulfite sequencing.
Number of Illumina GA reads and coverage of the Nipponbare genome.
Classification of base substitution mutations detected for each chromosome.
Conceived and designed the experiments: DZ CMX BL. Performed the experiments: DZ ZHW NNW YL YW YB ZBZ YG. Analyzed the data: DZ ZHW YG YZD XFO CMX. Contributed reagents/materials/analysis tools: XYL XFO BL. Wrote the paper: DZ CMX BL.
- 1. Larkin PJ, Scowcroft W (1981) Somaclonal variation–a novel source of variability from cell cultures for plant improvement. Theoretical and Applied Genetics 60: 197–214.
- 2. Heinz D, Mee GW (1971) Morphologic, cytogenetic, and enzymatic variation in Saccharum species hybrid clones derived from callus tissue. American journal of botany: 257–262.
- 3. Neelakandan AK, Wang K (2012) Recent progress in the understanding of tissue culture-induced genome level changes in plants and potential applications. Plant cell reports 31: 597–620.
- 4. Jiang C, Mithani A, Gan X, Belfield EJ, Klingler JP, et al. (2011) Regenerant arabidopsis lineages display a distinct genome-wide spectrum of mutations conferring variant phenotypes. Current Biology 21: 1385–1390.
- 5. Sabot F, Picault N, El-Baidouri M, Llauro C, Chaparro C, et al. (2011) Transpositional landscape of the rice genome revealed by paired-end mapping of high-throughput re-sequencing data. The Plant Journal 66: 241–246.
- 6. Miyao A, Nakagome M, Ohnuma T, Yamagata H, Kanamori H, et al. (2012) Molecular spectrum of somaclonal variation in regenerated rice revealed by whole-genome sequencing. Plant and Cell Physiology 53: 256–264.
- 7. Hirochika H, Sugimoto K, Otsuki Y, Tsugawa H, Kanda M (1996) Retrotransposons of rice involved in mutations induced by tissue culture. Proceedings of the National Academy of Sciences 93: 7783–7788.
- 8. Wang Z-H, Zhang D, Bai Y, Zhang Y-H, Liu Y, et al. (2013) Genomewide Variation in an Introgression Line of Rice-Zizania Revealed by Whole-Genome re-Sequencing. PloS one 8: e74479.
- 9. Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, et al. (2013) Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. Rice 6: 1–10.
- 10. Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25: 1754–1760.
- 11. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, et al. (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25: 2078–2079.
- 12. Du Z, Zhou X, Ling Y, Zhang Z, Su Z (2010) agriGO: a GO analysis toolkit for the agricultural community. Nucleic acids research 38: W64–W70.
- 13. Liu ZL, Han FP, Tan M, Shan XH, Dong YZ, et al. (2004) Activation of a rice endogenous retrotransposon Tos17 in tissue culture is accompanied by cytosine demethylation and causes heritable alteration in methylation pattern of flanking genomic regions. Theoretical and Applied Genetics 109: 200–209.
- 14. Ngezahayo F, Xu C, Wang H, Jiang L, Pang J, et al. (2009) Tissue culture-induced transpositional activity of mPing is correlated with cytosine methylation in rice. BMC plant biology 9: 91.
- 15. Huang X, Kurata N, Wei X, Wang Z-X, Wang A, et al. (2012) A map of rice genome variation reveals the origin of cultivated rice. Nature 490: 497–501.
- 16. Waddington CH (1942) Canalization of development and the inheritance of acquired characters. Nature 150: 563–565.
- 17. Waddington CH, Kacser H (1957) The strategy of genes.
- 18. Wagner GP, Booth G, Bagheri-Chaichian H (1997) A population genetic theory of canalization. Evolution: 329–347.
- 19. Wagner A (2000) Robustness against mutations in genetic networks of yeast. Nature genetics 24: 355–361.
- 20. Wagner A (2005) Robustness and evolvability in living systems: Princeton University Press Princeton.
- 21. Hermisson J, Wagner GP (2004) The population genetic theory of hidden variation and genetic robustness. Genetics 168: 2271–2284.
- 22. Lisch D (2013) How important are transposons for plant evolution? Nature Reviews Genetics 14: 49–61.
- 23. Kashkush K, Khasdan V (2007) Large-scale survey of cytosine methylation of retrotransposons and the impact of readout transcription from long terminal repeats on expression of adjacent rice genes. Genetics 177: 1975–1985.
- 24. Kashkush K, Feldman M, Levy AA (2003) Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nature genetics 33: 102–106.
- 25. Peschke VM, Phillips RL, Gengenbach BG (1987) Discovery of transposable element activity among progeny of tissue culture–derived maize plants. Science 238: 804–807.
- 26. Planckaert F, Walbot V (1989) Molecular and genetic characterization of Mu transposable elements in Zea mays: behavior in callus culture and regenerated plants. Genetics 123: 567.
- 27. Brettell RI, Dennis ES (1991) Reactivation of a silent Ac following tissue culture is associated with heritable alterations in its methylation pattern. Molecular and General Genetics MGG 229: 365–372.
- 28. Peschke V, Phillips R (1991) Activation of the maize transposable element Suppressor-mutator (Spm) in tissue culture. Theoretical and applied genetics 81: 90–97.
- 29. Pouteau S, Huttner E, Grandbastien M, Caboche M (1991) Specific expression of the tobacco Tnt1 retrotransposon in protoplasts. The EMBO journal 10: 1911.
- 30. Hirochika H (1993) Activation of tobacco retrotransposons during tissue culture. The EMBO journal 12: 2521.
- 31. Jiang N, Bao Z, Zhang X, Hirochika H, Eddy SR, et al. (2003) An active DNA transposon family in rice. Nature 421: 163–167.
- 32. Kikuchi K, Terauchi K, Wada M, Hirano H-Y (2003) The plant MITE mPing is mobilized in anther culture. Nature 421: 167–170.
- 33. Komatsu M, Shimamoto K, Kyozuka J (2003) Two-step regulation and continuous retrotransposition of the rice LINE-type retrotransposon Karma. The Plant Cell Online 15: 1934–1944.
- 34. Alves E, Ballesteros I, Linacero R, Vazquez A (2005) RYS1, a foldback transposon, is activated by tissue culture and shows preferential insertion points into the rye genome. Theoretical and Applied Genetics 111: 431–436.
- 35. Bureau T, Barret P, Brinkman M, Beckert M (2006) A sequence related to rice Pong transposable element displays transcriptional activation by in vitro culture and reveals somaclonal variations in maize. Genome 49: 1399–1407.
- 36. Huang J, Zhang K, Shen Y, Huang Z, Li M, et al. (2009) Identification of a high frequency transposon induced by tissue culture, nDaiZ, a member of the hAT family in rice. Genomics 93: 274–281.
- 37. Picault N, Chaparro C, Piegu B, Stenger W, Formey D, et al. (2009) Identification of an active LTR retrotransposon in rice. The Plant Journal 58: 754–765.
- 38. Rhee Y, Lin H, Buell R, Childs K, Kaeppler S (2009) A c2 allele of maize identified in regenerant-derived progeny from tissue culture results from insertion of a novel transposon. Maydica 54: 429.
- 39. Stroud H, Ding B, Simon SA, Feng S, Bellizzi M, et al. (2013) Plants regenerated from tissue culture contain stable epigenome changes in rice. eLife 2013.
- 40. Herr A, Jensen M, Dalmay T, Baulcombe D (2005) RNA polymerase IV directs silencing of endogenous DNA. Science 308: 118–120.
- 41. Cheng C, Daigen M, Hirochika H (2006) Epigenetic regulation of the rice retrotransposon Tos17. Molecular Genetics and Genomics 276: 378–390.
- 42. Huettel B, Kanno T, Daxinger L, Aufsatz W, Matzke AJ, et al. (2006) Endogenous targets of RNA-directed DNA methylation and Pol IV in Arabidopsis. The EMBO journal 25: 2828–2836.
- 43. Law JA, Jacobsen SE (2010) Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nature Reviews Genetics 11: 204–220.
- 44. Nuthikattu S, McCue AD, Panda K, Fultz D, DeFraia C, et al. (2013) The initiation of epigenetic silencing of active transposable elements is triggered by RDR6 and 21–22 nucleotide small interfering RNAs. Plant Physiology 162: 116–131.
- 45. Batley J, Barker G, O’Sullivan H, Edwards KJ, Edwards D (2003) Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data. Plant physiology 132: 84–91.
- 46. Lijavetzky D, Cabezas JA, Ibáñez A, Rodríguez V, Martínez-Zapater JM (2007) High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology. BMC genomics 8: 424.
- 47. Ossowski S, Schneeberger K, Lucas-Lledó JI, Warthmann N, Clark RM, et al. (2010) The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327: 92–94.
- 48. Wang X, Wu R, Lin X, Bai Y, Song C, et al. (2013) Tissue culture-induced genetic and epigenetic alterations in rice pure-lines, F1 hybrids and polyploids. BMC plant biology 13: 77.
- 49. Becker C, Hagmann J, Müller J, Koenig D, Stegle O, et al. (2011) Spontaneous epigenetic variation in the Arabidopsis thaliana methylome. Nature 480: 245–249.