Figures
Abstract
In woody crop plants, the oligosaccharide components of the cell wall are essential for important traits such as bioenergy content, growth, and structural wood properties. UDP-glucuronate decarboxylase (UXS) is a key enzyme in the synthesis of UDP-xylose for the formation of xylans during cell wall biosynthesis. Here, we isolated a multigene family of seven members (PtUXS1-7) encoding UXS from Populus tomentosa, the first investigation of UXSs in a tree species. Analysis of gene structure and phylogeny showed that the PtUXS family could be divided into three groups (PtUXS1/4, PtUXS2/5, and PtUXS3/6/7), consistent with the tissue-specific expression patterns of each PtUXS. We further evaluated the functional consequences of nucleotide polymorphisms in PtUXS1. In total, 243 single-nucleotide polymorphisms (SNPs) were identified, with a high frequency of SNPs (1/18 bp) and nucleotide diversity (πT = 0.01033, θw = 0.01280). Linkage disequilibrium (LD) analysis showed that LD did not extend over the entire gene (r2<0.1, P<0.001, within 700 bp). SNP- and haplotype-based association analysis showed that nine SNPs (Q <0.10) and 12 haplotypes (P<0.05) were significantly associated with growth and wood property traits in the association population (426 individuals), with 2.70% to 12.37% of the phenotypic variation explained. Four significant single-marker associations (Q <0.10) were validated in a linkage mapping population of 1200 individuals. Also, RNA transcript accumulation varies among genotypic classes of SNP10 was further confirmed in the association population. This is the first comprehensive study of the UXS gene family in woody plants, and lays the foundation for genetic improvements of wood properties and growth in trees using genetic engineering or marker-assisted breeding.
Citation: Du Q, Pan W, Tian J, Li B, Zhang D (2013) The UDP-Glucuronate Decarboxylase Gene Family in Populus: Structure, Expression, and Association Genetics. PLoS ONE 8(4): e60880. https://doi.org/10.1371/journal.pone.0060880
Editor: Pär K. Ingvarsson, University of Umeå, Sweden
Received: December 17, 2012; Accepted: March 4, 2013; Published: April 16, 2013
Copyright: © 2013 Du et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Specific Program in Graduate Science and Technology Innovation of Beijing Forestry University (No. BLYJ201201), the State Key Basic Research Program of China (No. 2012CB114506), and the Project of the National Natural Science Foundation of China (Nos. 31170622, 30872042). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
With the rapid increases in global industrialization, economic development, and human populations, the world faces potentially serious energy shortages and environmental problems [1]. Forests represent approximately 27% of the world’s land area, and wood is a major renewable resource for timber, paper and emerging bioenergy industries [2]. Therefore, a fundamental understanding of cellulose biosynthesis may enable us to enhance carbon sequestration and meet greater demands for biofuels [3]. Among forest trees, poplar is emerging as a model woody crop because it has several key advantages over other trees, including flexibility of harvest time, substantial carbon allocation to stems, rapid growth, high biomass, minimal requirements for cultivation and lower amounts of fermentation-inhibiting extractives, resulting in higher biomass conversion efficiency [3], [4]. Based on these natural characteristics and the substantial genetic diversity within Populus, the development of fast-growing, high-yield poplars, with improved wood quality has the potential to enable sustainable forest development, allowing both industrial and environmental improvements.
Wood (secondary xylem) is produce by cell division, cell expansion (elongation and radial enlargement), cell wall thickening (involving cellulose, hemicelluloses, cell wall proteins, and lignin biosynthesis and deposition), programmed cell death, and heart wood formation [5]. The secondary walls are composed of cellulose, lignin and hemicelluloses, including xylans and glucomannans. Cellulose and lignin provide mechanical strength to the secondary walls, and hemicelluloses form cross-links among cellulose microfibrils, which are thought to be important for cell wall assembly. In the wood of dicot species, xylan is the second most abundant polysaccharide after cellulose, and UDP-xylose (UDP-Xyl) is a nucleotide sugar required for xylan synthesis [6], [7]. In plants, the biosynthesis of UDP-Xyl is catalyzed by different membrane-bound and soluble UDP-glucuronic acid decarboxylase (UXS) isozymes, which irreversibly convert UDP-GlcA (UDP- glucuronic acid) to UDP-Xyl. Thus, UXS represents a key enzyme for partitioning glycosyl residues between the hexosyl and pentosyl residues. In addition, because of its central role in sugar nucleotide interconversion, UXS is likely ubiquitous among plants and a target for regulatory control during cell wall biosynthesis [8]. The first UXS gene was identified from Cryptococcus neoformans by bioinformatics methods [6]. Subsequently, UXS genes have been cloned from only a few plants; for example, the Arabidopsis thaliana UXS family contains six members grouped into three classes based on their genomic structure and subcellular localization [9]. Rice also has six UXS genes dispersed throughout the genome, and these were classified into three types by phylogenetic analysis [10]. In the Poaceae, such as barley (Hordeum vulgare), analysis of transcript levels of the UXS members reveals that they likely have specific functions in cell wall formation during plant development [11]. In tobacco (Nicotiana acum), antisense downregulation of UDP-glucuronate decarboxylase leads to high glucose-to-xylose ratios in xylem walls due to fewer xylose-containing polymers. Such plants also have altered vascular organization and reduced xylans in their secondary walls [12]. Semiquantitative real time PCR analysis in cotton (Gossypium hirsutum) showed that GhUXSs transcripts were preferentially expressed during fiber development, from elongation through secondary cell wall synthesis [8]. These studies on non-woody species show that UXS family members are expressed throughout plant growth and development as they influence cell wall structure. Therefore, it is crucial to enhance our understanding of the role of UXSs in regulating growth and wood fiber properties in forest tree species.
The complex biological characteristics and long generation intervals of trees hinder the improvement of wood quality through conventional breeding methods. Given these constraints, traditional breeding of forest trees can be enhanced by marker-assisted selection (MAS), with advantages including reduced breeding cycle time, reduced cost of field testing, and increased efficiency and precision of selection [13], [14]. In this way, the selection of target traits can be achieved indirectly using molecular markers that are closely linked to underlying genes. Advances in high-throughput technologies for sequencing and genotyping and new genomic resources have enabled genome-wide examination of the number and effect of candidate genes related to traits of interest, through complex trait dissection using linkage disequilibrium (LD) mapping [15]–[18]. In recent years, SNP-based association genetics and LD mapping have enabled new MAS strategies in forest trees [19], [20]. In particular, candidate gene-based association approaches have been particularly useful to identify alleles associated with growth and wood properties in several tree species, such as conifers [21]–[25] and Eucalyptus [26]–[29]. In recent years, as the genome of Populus trichocarpa has been completely sequenced, poplar is increasingly considered as a model tree for genome-wide identification and characterization of gene families involved in growth and development [30]. For example, a set of candidate gene SNP associations was identified with chemical wood properties in Populus trichocarpa [31] and Populus nigra [32].
In this study, we used poplar as a model to first address the significance of UXS function and multiplicity in trees. We report the identification and characterization of the UXS gene family members, from the economically important tree Populus tomentosa [3], [33]. Transcript profiling revealed that the UXS genes may play important roles in wood formation. Furthermore, we used association tests to examine the allelic effects of natural variation in PtUXS1 on growth and wood-property traits and validated a set of allelic effects by LD mapping to identify useful alleles located within functional genes controlling phenotypic traits.
Results
Isolation of Seven Distinct cDNAs Clones from P. tomentosa
We used reverse transcription (RT)-PCR to isolate seven full-length cDNAs from a cDNA library prepared from the mature xylem zone of P. tomentosa. The seven cDNA clones PtUXS1-7 (GenBank Accession No. KC311162 - KC311168) were 1129 bp to 1800 bp in length, with open reading frames encoding polypeptides of 343 to 443 amino acid residues (Table 1), and 5′UTR and 3′UTR sequences that varied from 47 bp to 618 bp and 34 bp to 374 bp, respectively. Nucleotide sequences comparison of PtUXS1-7 cDNAs with known full-length Arabidopsis UXS cDNA sequences showed that PtUXS1 and PtUXS4 were 69.8% and 68.4% identical to AtUXS1; PtUXS2 and PtUXS5 were 71.4% and 67.9% identical to AtUXS2; PtUXS3, PtUXS6 and PtUXS7 were 78.3%, 70.7% and 69.9% identical to AtUXS5. In addition, the corresponding estimated molecular masses and isoelectric points (pI) ranged from 38.5 kD to 49.7 kD and 6.73 to 9.42, respectively (Table 1).
PtUXSs contain all of the conserved features of the UXS family. For example, all PtUXS family members have several sequence motifs, including an N-terminal GxxGxxG sequence that is characteristic of an ADP-binding βαβαβ-fold associated with NAD (P)-binding proteins [9] (Figure S1). The seven PtUXSs can be classified into two groups. Class I includes PtUXS1, PtUXS2, PtUXS4 and PtUXS5, which have the same amino acid residues-GGAGFVG (Figure S1). Class II includes PtUXS3, PtUXS6 and PtUXS7, which also have the same amino acid residues-GGAGFIG (Figure S1). As reported for Arabidopsis, rice and cotton UXSs, the PtUXS family contains a characteristic and highly conserved Ser, Tyr, and Lys triad, of which, Lys and Tyr are in the YxxxK motif. Although the core catalytic domain of the PtUXS was conserved, variable regions were identified in the N and C termini. In addition, analysis of the seven PtUXSs using PSORT program (http://www.psort.org/) indicated that PtUXS1, PtUXS2, PtUXS4 and PtUXS5 have a transmembrane domain (at residues 49–65, 45–61, 46–62 and 45–61, respectively) in the N-terminal region (Figure S1).
Genomic Organization of the PtUXS Family
To examine changes in intron/exon structure during evolution, we compared the full-length genomic sequences of the PtUXS family (GenBank Accession No. KC311169 and KC311156 - KC311161) and determined the intron/exon organization of each gene (Table 1 and Figure 1). The number (5 to 11) and length of introns (78 bp to 1797 bp) varied (Figure 1). All introns start with 5′G–T and end with 3′A–G among all the PtUXS family members and are in accordance with the GT–AG rule for a splicing site. Although strong conservation in the coding sequences and positions of exon/intron boundaries was detected in all PtUXS genes, the sizes and sequences of the introns among the seven PtUXS genes were found to be significantly divergent. Three patterns of intron-exon structures of the PtUXS genes were identified and designated I, II and III (Figure 1). Pattern I (PtUXS1 and PtUXS4) includes a large intron 3 of 1656 bp and 1797 bp, and the other five small introns that vary from 88 to 327 bp and 95 to 361 bp in length, and 90.0% identity of the cDNA sequences (Table 2). Pattern II (PtUXS2 and PtUXS5) includes two small introns (introns 2 and 3) and three medium introns (introns 1, 4, and 5), with lengths ranging from 100 to 700 bp and 103 to 487 bp; these two genes have a high cDNA sequence identity of 90.8% (Figure 1 and Table 2). Pattern III, which includes PtUXS3, PtUXS6 and PtUXS7, had 11 introns in the encoding regions and the positions and lengths of them were similar (Figure 1). However, the structures of PtUXS6 and PtUXS7 had more identity, and contain small introns in the 5′UTR comprising 568 bp and 174 bp at −14 bp upstream of the ATG initiation codon (Figure 1).
Three patterns of intron-exon structures of the PtUXS genes were identified and designated I, II and III. Exons and untranslated regions (UTRs) are shown as green and blue boxes, respectively, and the lines between boxes indicate introns.
Evolution of UXS Genes in Angiosperms
To clarify the evolutionary relationship between the PtUXS genes and other angiosperm UXS genes, a neighbor-joining (NJ) tree was constructed with 19 complete amino acid sequences of UXS from P. tomentosa, A. thaliana and Oryza sativa (Figure 2). The phylogenetic dendrogram formed three well-defined subgroups (Classes I, II, and III), consistent with the intron-exon structure. Class I contained PtUXS2/5, AtUXS2/4 and OsUXS2/5/6, Class II consisted of PtUXS1/4, AtUXS1 and OsUXS1/4, and PtUXS3/6/7, AtUXS3/5/6 and OsUXS3 formed the third sub-group (Class III). The amino acid sequence similarity found between PtUXS1 and PtUXS4 was 92.7% and the similarity between PtUXS2 and PtUXS5 was 91.8% (Figure 2 and Table 2). Although PtUXS3, PtUXS6 and PtUXS7 were classified into the same sub-group, PtUXS6 and PtUXS7 were more closely related to each other than they were to PtUXS3. Also, their sequence similarity at the protein level was 94.8% (Figure 2 and Table 2). From the results of phylogenetic analysis, we inferred that the UXS family members split off before the species diverged.
Protein sequences of Arabidopsis UXS family members (AtUXSs) and rice UXS members (OsUXSs) were obtained from the (National Center for Biotechnology Information; http://blast.ncbi.nlm.nih.gov/Blast.cgi ).
Transcript Profiling in Different Tissues and Organs
To determine the spatial expression patterns of the PtUXS members, real-time quantitative PCR was used to measure transcript abundance in different tissues and organs (Figure 3). PtUXS family members were differentially expressed in the tissues and organs tested and exhibited different expression patterns. All the family members except PtUXS3 were most abundantly expressed in mature leaf, and PtUXS3 had the highest expression levels in immature xylem of stem, but was lowest in mature leaf (Figure 3). PtUXS1 and PtUXS4 had similar expression patterns, and they were both moderately expressed in the mature xylem and bark (Figure 3). However, PtUXS1 was expressed at higher levels than PtUXS4 in all tissues; for example, expression of PtUXS1 in mature leaf was almost eight-fold higher than that of PtUXS4 (Figure 3). PtUXS2 and PtUXS5 were both expressed at the highest levels in the mature leaf and showed lower expression levels in the mature xylem (Figure 3). In addition, PtUXS6 and PtUXS7 were most abundantly expressed in the mature leaf, and they also had moderate expression levels in the mature xylem and apical shoot meristem, but showed the lowest expression in the immature xylem of stem (Figure 3). Therefore, the transcript profiles of these PtUXS genes appeared to be consistent with their genomic structure and phylogenetic relationships. Also all the PtUXSs appear to be involved in the development of various tissues and organs of poplar, but at different expression levels and with different tissue expression profiles.
Transcript levels were measured in different tissues, as indicated, by quantitative RT-PCR and is normalized to expression of Actin. Error bars represent+SD.
Nucleotide Diversity of PtUXS1 in Natural Populations
To characterize the intraspecific molecular evolution of the poplar UXS genes, we first obtained genomic sequence of PtUXS1 from 44 unrelated individuals in a discovery population that represents almost the entire natural range of P. tomentosa. An approximately 4374 bp genomic region of PtUXS1, including 133 bp of 5′UTR, 1293 bp of coding regions, 2574 bp of intron, and 374 bp of 3′UTR, was amplified and sequenced. Table 3 summarizes the statistical analysis of nucleotide polymorphisms (excluding indels) over different regions of PtUXS1. Across the samples, 243 SNPs were detected in PtUXS1, at a high frequency, 1/18 bp (Table 3). The SNP frequencies in the different gene regions were: 1/19 bp in the 5′UTR, 1/21 bp in exons, 1/17 bp in introns, and 1/15 bp in the 3′UTR. The lowest level of nucleotide polymorphism was found in the coding region, suggesting that the region is conserved relative to the other regions under selective pressure. In the coding sequence, 34 of the 62 SNPs located in the exons of PtUXS1 led to nonsynonymous changes (including 32 missense and 2 nonsense mutations) to the amino acid sequence (Table 3). The other 28 SNPs produced no changes to the amino acid sequence, and were categorized as synonymous mutations. 209 SNPs were categorized as totally silent in the whole gene (Table 3). In total, 82 of the 243 SNPs (34%) were considered common (frequency >0.10). Generally speaking, the PtUXS1 locus has high nucleotide diversity, where πT = 0.01033 and θw = 0.01280 (Table 3). More specifically, estimates of nucleotide diversity (πT) for the different gene regions ranged from 0.00225 (exon 3) to 0.02391 (intron 4) with θw ranging from 0.00497 (Exon 6) to 0.02407 (Intron 4). Within coding regions, the value of non-synonymous nucleotide substitutions (πnonsyn) was markedly lower than πsyn, with a πnonsyn/πsyn ratio of 0.17, suggesting that diversity at the non-synonymous sites of exon regions resulted from strong purifying selection (Table 3).
Linkage Disequilibrium
The decay of LD within PtUXS1 was shown by a plot of r2 against distance in base pairs between SNPs (Figure 4). In the P. tomentosa population, the level of LD decayed rapidly, with r2 values declining to 0.1 within 700 bp, indicating that LD did not extend over the entire gene region. The low LD observed in this study suggested that the resolution of associations between the marker and trait will be high. Using genotype data of 82 SNPs from 426 individuals in the association population, the analysis of LD displayed six high-LD distinct haplotype blocks within PtUXS1 (r2>0.75), including SNP 4–7, SNP 9–16, SNP 18–20, SNP 26–28, SNP 31–34 and SNP 62–64 (Figure S2).
Pairwise correlations between SNPs are plotted against the physical distance between the SNPs in base pairs. The straight line describes the least squares fit of r2 (Er2) to its expectation. Linkage disequilibrium has largely decayed within 700 bp.
Single Marker-trait and Haplotype-based Associations
Single-marker associations between 82 SNPs and 10 growth and wood quality traits were conducted using the mixed linear model (MLM). In total, 25 significant associations representing 16 SNP loci were identified at the threshold of P<0.05 (Table S1). However, correction for multiple testing using the FDR method resulted in only 9 significant associations (Q <0.10, Table 4). Table 2 lists highly significant associations identified with seven traits, including holocellulose content, α-cellulose content, fiber length, fiber width, microfibril angle, the diameter at breast height (D) and stem volume (V). These markers explained a small proportion of the phenotypic variance, with individual effects ranging from 2.70% to 12.37% (Table 4). Of these markers, SNP2 from 5′UTR and SNP22 from intron 2 both showed significant association with holocellulose content (Table 4). The non-synonymous marker SNP6 in exon 1, which results in an encoded amino acid change from Tyr to His, associated significantly with multiple traits, i.e., fiber width, V and D. Also, SNP10 in exon 1, a synonymous mutation, associated with α-cellulose content (Table 4). Of the remaining noncoding markers, SNP27 and SNP56 were both closely associated with fiber length, and SNP27 explained the highest proportion of the phenotypic variance (12.37%); also, SNP 68 was significantly associated with microfibril angle (Table 4). We calculated the gene actions for each significant marker-trait association. One of the nine marker-trait associations showed evidence of overdominance (|d/a| >1.25), and the remaining eight associations were split between modes of gene action that were codominant (|d/a| ≤0.5, 2), and partially to fully dominant (0.50< |d/a| <1.25, 6) (Table 5).
Using the haplotype trend regression method, 12 common haplotypes (frequency >1%) were found to be significantly associated with growth and wood-quality traits (Table 6). Of these, one haplotype from SNP 1–3 and two haplotypes from SNP 21–23 showed genetic associations with holocellulose content; two haplotypes from SNP 27–29 and one haplotype from SNP 56–58 were associated with fiber length; two haplotypes were associated with fiber width, and one haplotype each with α-cellulose content, microfibril angle, D and V traits were observed in the association population (Table 6). The proportion of phenotypic variation explained by these haplotypes varied from 3.00% to 8.82%, and eight single-marker associations (Q <0.05), strongly supporting the haplotype-based associations for these traits (les 4 and 6).
Confirmation of Association Studies in a Linkage Mapping Population
All 16 significant SNP markers (P<0.05; Table S1) identified in the discovery population were present in accordance with Mendelian expectations (P≥0.01), and no novel allele was discovered in the validation population. Therefore, single-marker association analysis (160; 16 SNPs ×10 traits) was conducted in the validation population. We first observed five marker-trait associations (P<0.05; Table 4), and subsequent multiple testing correction of P-values reduced the list of significant associations to four (Q <0.10; Table 4), with the percentages of phenotypic variation explained ranging from 3.07% to 5.63%. In the validation population, markers SNP2 and SNP10 were significantly associated with holocellulose and α-cellulose content, respectively, and explained 4.03% and 3.98% of the phenotypic variation (Table 4). In both the fiber width and D traits, the same significant marker SNP6 was observed, and they explained 5.63% and 3.07% of the phenotypic variance (Table 4). The mean phenotypic values among different genotypes in the four SNP markers showed significant differences, and the allelic effect of each marker was consistent in both association and validation populations (Figure 5).
(a–b) The nonsynonymous marker (SNP6) in exon1 of PtUXS1 significantly associated with fiber width and D in discovery and validation populations, and shows patterns of gene action consistent with dominant effects. The T allele at SNP6 causes a Tyr to His amino acid substitution. (c) SNP10 in exon 1, a synonymous mutation, was associated with α-cellulose content in both populations, illustrating the pattern of gene action consistent with additive effects. (d) SNP2 from the 5′UTR of PtUXS1 showed significant association with holocellulose content in both populations. The differences in holocellulose content among the three genotypes of this marker indicate that patterns of gene action are consistent with additive effects.
Transcript Analysis of SNP Genotypes
To determine whether these significant allelic SNPs affect the PtUXS1 RNA transcript accumulation, transcript levels were compared among the different genotypic classes for seven significant SNPs (Q <0.10, Table 4) identified in association population using RT–qPCR with gene-specific primers. Measurement of differential expression across three or two genotypic classes (10 trees for each genotype) for each of the seven SNPs, indicated that SNP10 exhibited significant differences in the RNA transcript levels among the three genotypes in the association population (Figure 6). For the marker SNP10 (exonic), the highest relative expression levels of mRNA products were found in the GG group (0.7841), followed by the CG group (0.7025), and the transcript levels of the CC group were lowest (0.3566).
The error bars represent+SD.
Discussion
Structure and Evolution of the UXS Family in Populus
Members of the UXS gene family have been found (based on EST daases) in monocots and dicots. However, the UXS gene family members cloned in poplar in our study are the first identified in a forest tree species. The UXS gene family is not restricted to higher plants because it was also identified from green alga (Chlamydomonas reinhardtii), human (Homo sapiens), rat (Rattus norvegicus), Drosophila melanogaster, and bacterial genomic daases [9], indicating that UXS proteins are evolutionarily conserved. In this study, we conducted a thorough analysis of the structure and evolution of the UXS family in the model tree Populus (Figure 1 and 2). The gene structure of the family is similar to UXS gene families reported in other plants [34]. The UXS genes share similarity with dehydratases, dehydrogenases, and epimerases. They all contain GxxGxxG NAD+ binding motifs and conserved Ser, Tyr and Lys amino acid residues that are believed to be located in the catalytic site. Previous reports on UDP-GlcA decarboxylase activities indicated that the subcellular localization of the enzymes from different sources varied, with some UDP-GlcA-DC isoforms cytosolic, and other isoforms membrane-bound [9]. Three of the six Arabidopsis UXS isoforms (AtUXS3, 5, 6) are predicted to be cytosolic (based on their sequence similarity to AtUXS3) and the other three (AtUXS1, 2, 4) are likely to reside in the endomembrane system (based on their similarity to AtUXS2) [35]. On this basis, we inferred that PtUXS1, PtUXS2, PtUXS4 and PtUXS5 reside in the endomembrane system, and the other three isoforms are cytosolic. Extensive sequence conservation across a broad range of plant taxa suggests that the UXS protein may have an essential function in growth and development in plant.
Analysis of the occurrence of UXS family members in complete genomes contributes to our knowledge of the origin and evolution of the plant UDP-glucuronate decarboxylase. In this study, we analyzed the evolution of the UXS family by classifying the family members of A. thaliana, O. sativa and P. tomentosa. This phylogenetic analysis shows the UXS family may split off before the species diverged; suggesting that all plant UXS family may have originated from ancestral types. Within the later gene duplication of UXSs within classes I, II, and III, this event occurred after the divergence of monocots and dicots. And then, the third subclass appears to be the recently split after the divergence of the woody plants (Populus) and the herbaceous (Arabidopsis).
In the PtUXS gene family, PtUXS6 and PtUXS7 have a 5′UTR intron of 568 bp and 174 bp, respectively (Figure 1). 5′UTR introns located close to the initiating ATG codon are thought to play an important role in gene expression from transcription to translation [36], [37]. For example, the rice rubi3 promoter with 5′UTR intron conferred approximately 20-fold higher GUS expression than an intron-less version in transient assays in bombarded rice suspension cells [38]. In Arabiopsis EF1α-A3, the presence of a 5′UTR intron affects gene expression and the size of the 5′UTR intron influences the level of gene expression [39]. Thus, we speculated that the 5′UTR intron in PtUXS6/7 may influence gene expression and regulation for the synthesis of UDP-xylose in Populus. The gene structure of the PtUXS family provides an important beginning to enable future exploration of the mechanisms of the evolution of gene function for each member and may help genetic engineers to regulate growth and development in trees for sustainable production of wood biomass in the future.
UXS Families are Differentially Expressed in Populus
The significant divergence in the genomic structure of the poplar UXS genes, including the 5′UTR structure, suggested that these genes may differ in their expression levels or functions. In this study, the next step to understanding the respective functions of the poplar UXS family members was investigating their expression profiles. They appeared to be expressed throughout all stages of plant growth and development. Meanwhile, the tissue-specific expression pattern of each UXS family member provides a platform for understanding the functional roles of putative orthologs from different species. Among them, PtUXS3 was predominantly expressed in the immature xylem of stem, and the other members, except PtUXS2 and PtUXS5, also had moderate transcript levels in the mature xylem (Figure 3), suggesting that UXSs may be associated with wood cell wall biosynthesis. Previous studies showed that UXS catalyzes an irreversible reaction from UDP-GlcA to UDP-Xyl, which is subsequently converted to UDP-Ara by UDP-Xyl epimerase. Thus, UXS plays a central role in producing these pentose sugars in higher plants [10], [11]. Dalessandro and Northcote [40] studied the activity of the UXS enzyme in secondary cell wall synthesis in trees, and found that the activity of UXS enzyme improved substantially at the beginning of the stage at which the cambium formed immature xylem, whereas the activity decreased after the mature xylem formed. Wheatley et al [41] detected the activity of UXS in the cambium and differentiating vascular tissue of tobacco by immuno-hybridization. In barley, comparisons of transcriptional activities of the genes in various barley tissues showed that HvUXS1 mRNA was relatively abundant in stems and the maturation zone of roots [11].
Moreover, PtUXS1, PtUXS2 and PtUXS4-7 all had the highest transcript levels in the mature leaf (Figure 3), suggesting that they were related to secondary meolites in synthesizing the cell wall. A similar phenomenon was also reported in the GT1 (glycosyltransferase 1) family of P. trichocarpa, which likely results from the fact that plants rely on enzymes to assimilate the products of photosynthesis into sugars and starch, synthesize cell wall biopolymers, and create various glycosylated compounds [42]. Similarly,the CIP7 gene identified in seedlings and adult leaves of A. thaliana has also been shown to contribute to photosynthesis in the woody tissues or leaves of Corymbia citriodora subsp. variegata, and fixes CO2 to maintain stem internal CO2 produced by respiration and also contributes to plant growth [43]–[46]. Therefore, cell wall biosynthesis is coordinated with several other biological processes, and the PtUXS genes in these shared pathways often are functional homologs but from a different phylogenetic division [47]. Comparing the gene structure and tissue specific expression of PtUXS family members showed that their expression profiles are congruent with the evolutionary relationships based on protein sequences. These findings suggest that the most closely related UXS genes have similar expression patterns, whereas the more distant sub-groups have less similar patterns (Figure 1, 2 and 3). Thus, systematic tissue- and organ-specific expression studies of each UXS member are still needed to obtain a complete overview for the entire family.
Linkage Disequilibrium Tests and Detection of Associations in P. tomentosa
LD-based association mapping plays an important role in increasing the resolution of marker-trait associations compared with traditional linkage mapping. Tree species are ideal for association mapping as they are predominantly outcrossing, have long recombination histories, and have large, effective, relatively unstructured populations, resulting in high levels of nucleotide diversity and low LD [19], [48]. Understanding the patterns of LD in the species is an important prerequisite for association mapping, because choosing genome-wide or candidate-gene-based associations depends on the patterns of LD decay in the species. In this study, LD declined rapidly within 700 bp in PtUXS1 (R2<0.1, P<0.001, Figure 4), which is consistent with the results of limited LD for candidate genes in other tree species, such as loblolly pine (Pinus taeda L) [22], [49], Scots Pine (Pinus sylvestris) [50], Douglas fir [51], and Eucalyptus nitens [26], [27]. In Populus, a rapid decay of LD occurs within just 300–1,700 bp in candidate genes among related species of Populus, based on SNP markers [16], [20], [31]. Therefore, candidate-gene-based LD mapping seems to be particularly useful in marker-assisted selection (MAS) breeding programs for trees. However, Slavov et al [52] found a slow decay of LD in the P. trichocarpa genome-wide level, with r2 dropping below 0.2 within 3–6 kb, suggesting that genome-wide association studies may be more feasible in Populus than previously assumed. It should be noted that this study was not specifically designed to address LD in the genome, but rather within these specific genes. Further study of LD decay on a genome-wide level in trees remains to be conducted.
In this study, comparison of single-marker and haplotype-based associations (Table 4 and 6), demonstrated that the effect of the haplotype is mainly derived from an individual significant marker, suggesting that haplotype analysis may not be more powerful than single marker analysis in this low LD tree species. UXS enzymes play essential roles in the synthesis of hemicelluloses, glycoproteins and oligosaccharides. They are related to fiber formation and cross-linking polysaccharides that are synthesized during fiber elongation and secondary wall formation in plants [12], [53]. In wood, the hemicelluloses account for about 25% of the dry weight, and this implies that the UXS genes may have associations with wood fiber traits, of which little is known in tree species. This study identified significant associations between markers within candidate genes and growth and wood fiber traits.
Holocellulose is a combination of cellulose (a glucan polymer) and hemicellulose (mixtures of polysaccharides), accounting for nearly 80% of secondary xylem tissue and affecting mechanical strength [1], [5]. In this study, SNP2, located in the 5′UTR of PtUXS1, had a significant association (Q <0.10) with holocellulose content, with the same effects of genotype in both discovery and validation populations (table 4 and Figure 5). Also, the patterns of gene action are consistent with additive gene effects (Table 5). This finding suggests that SNP2 was a true positive in the linkage population, and it may be a functional polymorphism that controls holocellulose content. SNPs in 5′UTRs could affect phenotypic traits because 5′UTRs play crucial roles in the regulation of gene expression, especially at the transcriptional level [54], [55]. Sequences in the 5′ flanking region can affect mRNA sility, translational efficiency, or subcellular localization [56], [57]. SNP markers in the 5′UTR that significantly affect phenotypic traits in association studies have also been reported elsewhere. For example, Miyamoto et al [58] detected a significant SNP association in the 5′UTR of GDF5 with hip osteoarthritis in two independent Japanese populations. Guerra et al [32] conducted association genetic studies of chemical wood properties in black poplar (P. nigra) and found that two highly significant SNP markers from the 5′UTR of TUB15, which encodes a β-tubulin, were associated with lignin content. The UXS gene family has been studied extensively in Arabidopsis, Oryza and Gossypium. For example, AtUXS1, the Arabidopsis ortholog of PtUXS1, encodes a UDP-GlcA decarboxylase, which converts UDP-GlcA to UDP-Xyl, and thereby regulates the synthesis of hemicellulose [59], [60]. In addition, GhUXSs are preferentially expressed during secondary cell wall synthesis [61]–[63] and antisense downregulation of UXSs may alter vascular organization and reduce xylans in cotton secondary cell walls [8].
For associations with α-cellulose content, we identified the marker SNP10, which is a synonymous mutation in exon 1 of PtUXS1, and found that its mode of gene action was consistent with additive effects (Table 5 and Figure 5). Since it is commonly believed that association studies with candidate genes should be preferentially conducted with functional SNPs, the identification of nucleotide substitutions associated with functional changes should have important implications for the design and interpretation of related association studies [64]. This conjecture was supported by the significant expression differences among three genotype classes of SNP10 in association population (Figure 6). Thumma et al [27] discovered a synonymous exonic SNP of EniCOBL4A associated with cellulose content and kraft pulp yield. Dillon et al [25] found a synonymous SNP in the second exon of an actin family member (ACT7), which was associated with cellulosic pulp yield. In addition, Kien et al [65] found that SNPs that affect the function or amount of actin may affect the amount or distribution of cellulose in the cell wall. In this study, although SNP10 is a synonymous variant positioned within the exon, it does not overlap with known regulatory motifs at the DNA sequence level or functional domains of the translated protein. Hence, the detailed functional effect of the marker in this gene must be further tested via other molecular approaches.
Our discovery that a non-synonymous exonic SNP (SNP6) in exon 1 of PtUXS1 was significantly associated with both fiber width and diameter at breast height (D) may represent pleiotropic effects of PtUXS1 [66]. A similar pleiotropic phenomenon has been identified in previous studies [17], [23]. In the discovery population, the differences in fiber width for the SNP6 marker were significant among three genotypes (Figure 5) and demonstrated a model of gene action consistent with dominant effects (Table 5). The T allele is the minor allele of this non-synonymous marker, and it represented a missense mutation that causes a Tyr→His amino acid substitution. The genotypic effects of SNP6 on diameter at breast height (D) were similar to that of association with fiber width (Figure 5). The results strongly suggest that SNP6 may be a functional polymorphism involved in the control of both fiber width and D. A similar study in maize identified a nonsynonymous SNP in the first exon of C4H1 associated with forage quality traits [67]. The association between SNP6 and fiber width was consistent with findings that GhUXS is a key enzyme in determining the quality and integrity of cotton fibers, which are generated during a longer period of cellulose synthesis [68], and the association with D demonstrated that UXSs can accelerate the growth and development of plants. Undesirable negative correlations between wood quality and growth were not observed (data not shown), indicating the potential to break negative correlations by selecting for individual SNPs in breeding programs [24], [69].
In conclusion, in combination with previous reports, these association results indicate that PtUXS1 affects fiber formation, growth and development, and wood quality of P. tomentosa. Hence, PtUXS1 is an important candidate gene for future tree-breeding programs. In addition, associations with several SNP markers were detected in the linkage population. This validation has become the gold standard for assessing statistical results from association studies with large numbers of independent tests [70]. The validation can assist in ‘ruling in’ associations, and provided additional evidence for SNP effects [25]. In this case, the SNP markers identified in both discovery and validation populations of this study can be applied to breeding programs to improve the quality and quantity of wood products.
Materials and Methods
Plant Materials and Phenotypic Data
Discovery population: In 1982, 1047 native individuals collected from the entire natural distribution region of P. tomentosa were used to eslish a clonal arboretum, using a randomized complete block design with three replications, at Guan Xian County of Shandong Province from root segments [33]. In this study, the association population (discovery population) consisted of 426 unrelated individuals representing almost the whole geographic distribution of P. tomentosa (180 from the Southern region, 86 from the Northwestern region, and 160 from the Northeastern region) were used for the initial SNP association analysis. In addition, a panel of 44 unrelated individuals (15 from the southern region, 15 from the northwestern region, and 14 from the northeastern region) was sequenced to identify SNPs within PtUXS1.
Validation population: In this study, to confirm the association results using LD mapping, a validation population consisted of 1200 hybrid individuals were randomly selected from 5,000 F1 progeny eslished by controlled crossing between two elite poplar parents, clone “YX01” (P. alba × P. glandulosa) as the female and clone “LM 50” (P. tomentosa) as the male; these two species are members of the section Populus. The progeny were grown in 2008 in the Xiao Tangshan horticultural fields of Beijing Forestry University, Beijing, China (40°2′N, 115°50′E) using a randomized complete block design with three replications.
This study was carried out in strict accordance with the recommendations in the Guide for Observational and field studies. All necessary permits were obtained for the described field studies. The sampling of all individuals of P. tomentosa was approved by the Youhui Zhang, director of National Garden of P. tomentosa.
Phenotypic data: For all individuals in these two populations, ten traits were measured using the methods described previously [71], including lignin content, holocellulose content, alpha-cellulose content, fiber length, fiber width, microfibril angle, tree height (H), tree diameter at breast height (D), stem volume (V) and tree height/tree diameter (H/D). Analysis of variance (ANOVA) and phenotypic correlations for these ten traits in these two populations have been reported by Du et al [71] and Tian et al [69].
Isolation of PtUXS cDNAs
The P. tomentosa stem mature xylem cDNA library was constructed using the Superscript λ System (Life Technologies). The cDNA library was generated as part of our large-scale effort to identify genes expressed predominantly in the mature xylem of P. tomentosa stems. The constructed cDNA library consisted of 5.0×106 pfu with an insert size of 1.0–4.0 kb. Random end-sequencing of 10,000 cDNA clones and comparison with all available Arabidopsis UXS sequences revealed that seven EST sequences were highly similar to AtUXSs. Then, BLAST analysis of the seven EST sequences at JGI Daase (http://genome.jgi-psf.org/Poptr1/Poptr1.home.html) was used to detect seven full-length cDNAs of UXS from P. trichocarpa. Gene specific primers were designed based on the full-length cDNAs of P. trichocarpa; finally, seven full-length UXS cDNAs were identified from P. tomentosa and named PtUXS1-7 cDNAs.
DNA Extraction and Identification of UXS Genomic DNA
Total genomic DNA was extracted from fresh young leaves of each P. tomentosa individual using the Plant DNeasy kit (Qiagen China, Shanghai), following the manufacturer’s protocol. The primer sets used for the amplification of UXSs were designed based on the sequenced cDNAs of PtUXS1-7. PCR was performed in a final reaction volume of 25 µl containing 20 ng genomic DNA, 0.8 U Taq DNA polymerase (Promega), 50 ng forward primer, 50 ng reverse primer, 1× PCR buffer (Promega), and 0.2 mM each dNTP (Promega). PCR conditions were as follows: 96°C for 5 min, and 30 cycles of 95°C denaturation for 30 s, 56°C annealing for 30 s, and 72°C extension for 1 min, with a final extension at 72°C for 5 min. The PCR products were finally separated by capillary electrophoresis using an ABI3730×l DNA Analyzer (Applied Biosystems, Carlsbad, CA, USA), after confirmation of PCR amplification on a 1.5% agarose gel. The analysis of polymorphic loci was performed with GeneMapper v4.0 software (Applied Biosystems) using the LIZ 600 size standard (Applied Biosystems).
RNA Extraction, cDNA Synthesis, and Tissue-specific Expression Analysis of PtUXSs
For RNA extraction, fresh tissue samples of root, leaf, and apex were collected from 1-year-old P. tomentosa clone “LM 50”. The wood-forming tissues of upright stems, including, developing and mature xylem tissues, were collected by scraping the thin (approximately 1.0 mm) and the deep layer on the exposed xylem surface at breast height; The other wood forming tissues including phloem and cambium, were collected as described [72]. All tissues were immediately frozen in liquid nitrogen and stored at –80°C.
Total RNA was extracted from various tissues using the Plant Qiagen RNAeasy kit (Qiagen China, Shanghai) according to the manufacturer’s instructions. Additional on-column DNase digestions were performed three times during the RNA purification using the RNase-Free DNase Set (Qiagen). RNA was then quantified and reverse transcribed into cDNA using the SuperScript First-Strand Synthesis system and the supplied polythymine primers (Invitrogen) [33].
Real-time quantitative PCR was performed on a DNA Engine Opticon 2 machine (MJ Research) using the LightCycler-FastStar DNA master SYBR Green I kit (Roche). The PtUXSs-specific and internal control (Actin) primer pairs (Table S2) were designed using Primer Express 3.0 software (Applied Biosystems). The PCR program included an initial denaturation at 94°C for 5 min, and 40 cycles of 30 s at 94°C, 30 s at 58°C, and 30 s at 72°C, and a final melt-curve of 70–95°C. The specificity of the amplified fragments was checked by the melting curve. All reactions were carried out in triplicate, and the data were analyzed using the Opticon Monitor Analysis Software 3.1 tool.
Phylogenetic Analysis
To analyze the phylogenetic relationships between PtUXSs and the UXS genes from other species, the amino acid sequences of UXS family members, including those from dicotyledons and monocotyledons, were identified by searching public daases available at NCBI (http://www.ncbi.nlm.nih.gov). Phylogenetic and molecular evolutionary analyses were conducted using MEGA version 4.0, and the neighbor-joining method was used to build phylogenetic trees [73]. Bootstrap analysis was performed using 1,000 replicates.
SNP Discovery and Genotyping
Of the seven UXS genes identified in the P.tomentosa, we selected PtUXS1 to explore the pattern of nucleotide diversity and conduct candidate-gene-based association mapping analysis. In order to identify SNPs within the PtUXS1, the entire gene was sequenced and analyzed in 44 unrelated individuals from the association population, without considering Insertions/deletions (INDELs), using the software MEGA 4.0 and DnaSP4.90.1 [74]. All 44 sequences described have been deposited in the GenBank daases (GenBank Accession No. KC311169 - KC311212). Subsequently, common SNPs (minor allele frequencies >0.10) were genotyped by the single-nucleotide primer extension method using a Beckman Coulter sequencing system across all DNA samples.
Data Analysis
Linkage disequilibrium analysis: To assess the pattern of linkage disequilibrium in the sequenced candidate gene region, the decay of LD with physical distance (base pairs) between SNP sites within PtUXS1 was estimated by linear regression analysis of linkage disequilibrium in DnaSP program version 4.90.1. The squared correlation of allele frequencies r2 [75] was used to test the LD between pairs of SNP markers using the software package HAPLOVIEW (http://www.broad.mit.edu/mpg/haploview.html). The interval value of the parameter varies from 0 to 1. The significance (P-values) of r2 for each SNP locus was calculated using 100,000 permutations.
Association testing: In the association population (discovery population), all trait-SNP association tests between 82 SNP markers and 10 traits were conducted, using the mixed linear model (MLM) with 104 permutations in the software package TASSEL Ver. 2.0.1 (http://www.maizegenetics.net/) [76]. The MLM can be described as follows: y = µ +Qv+ Zu+e, where y is a vector of phenotype observation, µ is a vector of intercepts; v is a vector of population effects; u is a vector of random polygene background effects; e is a vector of random experimental errors; Q is a matrix defining the population structure from STRUCTURE, and Z is a matrix relating y to u. Var (u) = G = σ2aK with σ2a as the unknown additive genetic variance and K as the kinship matrix (Yu et al., 2006). In this Q+K model, the relative kinship matrix (K) was obtained using the method proposed by Ritland [77], which is built into the program SPAGeDi, Ver. 1.2 [78], and the population structure matrix (Q) was identified based on the significant subpopulations (K = 11) [79], as assessed according to the statistical model described by Evanno et al [80], using 20 neutral genomic SSR markers. The positive false discovery rate (FDR) method was applied to correct for multiple testing by using QVALUE software [81].
A panel of 16 SNPs (P<0.05, Table S1) producing significant associations in the discovery population using the MLM was genotyped in the validation population. Inheritance tests of all significant SNP loci were first examined in the validation population by performing a chi-squared (χ2) test at the 0.01 probability level; and then SNP markers following Mendelian expectations (P≥0.01), were used in single-marker analysis in this hybrid population (excluding the genotype data involving null allele in each locus). Significant SNP loci detection was calculated by fitting the data to the model y = µ+mi+eij, where y is the trait value, µ is the mean, mi is the genotype of the ith marker, and eij is the residual associated with the jth individual in the ith genotypic class. The FDR method was used to correct for multiple testing.
Haplotype analysis: Haplotype frequencies from genotype data were estimated and haplotype association tests were done on a three-marker sliding window, using haplotype trend regression software [82]. The significances of the haplotype associations were based on 1000 permutation tests.
Modes of gene action: The modes of gene action were quantified using the ratio of dominant (d) to additive (a) effects estimated from least-square means for each genotypic class. Partial or complete dominance was defined as values in the range 0.50< |d/a| <1.25, whereas additive effects were defined as values in the range |d/a| ≤0.5. Values of |d/a| >1.25 were equated with under- or overdominance. Details of the algorithm and formulas for calculating gene action were previously described [17], [31].
Supporting Information
Figure S1.
Comparison of the amino acid sequences of plant UXS enzymes. Amino acid sequences of UXSs from Populus (PtUXS1-PtUXS7), cotton (GhUXS1, accession no. ACI46983.1), Arabidopsis (AtUXS1, accession no. AT3G53520.1), and rice (accession no. LOC_Os05g29990.1) were aligned using the DNAMAN 6.0 software. The conserved motifs GxxGxxG (NAD+-binding), YxxxK, and the transmembrane domain are highlighted in red.
https://doi.org/10.1371/journal.pone.0060880.s001
(TIF)
Figure S2.
(a–f) Significant pairwise linkage disequilibrium (r2>0.75, P<0.001) between SNP markers. The significant common genotyped SNP blocks 1–6 are shown on a schematic of PtUXS1 and the pairwise r2 values are shown by color coding in the matrix below.
https://doi.org/10.1371/journal.pone.0060880.s002
(TIF)
Table S1.
Summary of significant SNP marker-trait pairs identified at the threshold of P<0.05, using the mixed linear model (MLM) in the discovery population.
https://doi.org/10.1371/journal.pone.0060880.s003
(DOC)
Table S2.
The real-time PCR primers used in this study.
https://doi.org/10.1371/journal.pone.0060880.s004
(DOC)
Author Contributions
Conceived and designed the experiments: DZ. Performed the experiments: QD WP DZ JT. Analyzed the data: QD WP DZ. Wrote the paper: QD WP DZ.
References
- 1. Li L, Lu S, Chiang VL (2006) A genomic and molecular view of wood formation. Crit Rev Plant Sci 25: 213–233.
- 2. Ahmed S, Haider A, Main MAS (2011) Efficiency of wood degradation of some white-rot fungi. Pak J Phytopathol 23(1): 86–87.
- 3. Zhang DQ, Xu BH, Yang XH, Zhang ZY, Li BL (2011) The sucrose synthase gene family in Populus: structure, expression, and evolution. Tree Genet Genomes 7: 443–456.
- 4. Abramson M, Shoseyov O, Shani Z (2010) Plant cell wall reconstruction toward improved lignocellulosic production and processability. Plant Science 178: 61–72.
- 5. Plomion C, Leprovost G, Stokes A (2001) Wood formation in trees. Plant Physiol 127: 1513–1523.
- 6. Bar-Peled M, Griffith CL, Doering TL (2001) Functional cloning and characterization of a UDP-glucuronic acid decarboxylase: The pathogenic fungus Cryptococcus neoformans elucidates UDP-xylose synthesis. Proc Natl Acad Sci USA 98(21): 12003–12008.
- 7. Zheng HY, York WS, Darvill AG (2006) Important new players in secondary wall synthesis. Trends Plant Sci 11: 162–164.
- 8. Pan YX, Wang XF, Liu HW, Zhang GY, Ma ZY (2010) Molecular Cloning of Three UDP-Glucuronate Decarboxylase Genes That Are Preferentially Expressed In Gossypium Fibers From Elongation to Secondary Cell Wall Synthesis. J Plant Biol 53: 367–373.
- 9. Harper AD, Bar- Peled M (2002) Biosynthesis of UDP-Xylose. Cloning and Characterization of a Novel Arabidopsis Gene Family, UXS, Encoding Soluble and Putative Membrane-Bound UDP-Glucuronic Acid Decarboxylase Isoforms. Plant Physiol 130: 2188–2198.
- 10. Suzuki K, Watanabe K, Masumura T, Kitamura S (2004) Characterization of soluble and putative membrane-bound UDP-glucuronic acid decarboxylase (OsUXS) isoforms in rice. Arch Biochem Biophys 431: 169–177.
- 11. Zhang Q, Shirley N, Lahnstein J, Fincher GB (2005) Characterization and Expression Patterns of UDP-D-Glucuronate Decarboxylase Genes in Barley. Plant Physiol 138: 131–141.
- 12. Bindschedler LV, Tuerck J, Maunders M, Ruel K, Petit-Coni M, et al. (2007) Modification of hemicellulose content by antisense downregulation of UDP-glucuronate decarboxylase in tobacco and its consequences for cellulose extracility. Phytochemistry 68: 2635–2648.
- 13. Collard BCY, Mackill DJ (2008) Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Phil. Trans. R. Soc. B 363: 557–572.
- 14. Neale DB, Kremer A (2011) Forest tree genomics: growing resources and applications. Nat Rev Genet 12: 111–122.
- 15. Nordborg M, Borevitz JO, Bergelson J, Berry CC, Chory J, et al. (2002) The extent of linkage disequilibrium in Arabidopsis thaliana. Nat Genet 30: 190–193.
- 16. Ingvarsson PK, Garcia MV, Luquez V, Hall D, Jansson S (2008) Nucleotide polymorphism and phenotypic associations within and around the phytochrome B2 locus in European aspen (Populus tremula, Salicaceae). Genetics 178: 2217–2226.
- 17. Eckert AJ, Bower AD, Wegrzyn JL, Pande B, Jermstad KD, et al. (2009) Association genetics of coastal Douglas fir (Pseudotsuga menziesii var. menziesii, Pinaceae). I. Cold hardiness related traits. Genetics 182: 1289–1302.
- 18. Varshney RK, Nayak SN, May GD, Jackson SA (2009) Next-generation sequencing technologies and their implications for crop genetics and breeding. Trends biotechnol 27: 522–530.
- 19. Neale DB, Savolainen O (2004) Association genetics of complex traits in conifers. Trends Plant Sci 9: 325–330.
- 20. Ingvarsson PK (2005) Nucleotide polymorphism and linkage disequilibrium within and among natural populations of European aspen (Populus tremula L., Salicaceae). Genetics 169: 945–953.
- 21. Yu Q, Li B, Nelson CD, McKeand SE, Batista VB, et al. (2006) Association of the cad-n1 allele with increased stem growth and wood density in full-sib families of loblolly pine. Tree Genet Genomes 2: 98–108.
- 22. González-Martínez SC, Wheeler NC, Ersoz E, Nelson CD, Neale DB (2007) Association genetics in Pinus taeda L. I. Wood property traits. Genetics 175: 399–409.
- 23. Beaulieu J, Doerksen T, Boyle B, Clement S, Deslauriers M, et al. (2011) Association genetics of wood physical traits in the conifer White Spruce and relationships with gene expression. Genetics 188: 197–214.
- 24. Dillon SK, Nolan M, Li W, Bell C, Wu HX, et al. (2010) Allelic variation in cell wall candidate genes affecting solid wood properties in association populations and land races of Pinus radiata. Genetics 185: 1477–1487.
- 25. Dillon SK, Brawner JT, Meder R, Lee DJ, Southerton SG (2012) Association genetics in Corymbia citriodora subsp. Variegate identifies single nucleotide polymorphisms affecting wood growth and cellulosic pulp yield. New Phytol 195: 596–608.
- 26. Thumma BR, Nolan MF, Evans R, Moran GF (2005) Polymorphisms in Cinnamoyl CoA Reductase (CCR) are associated variation in microfibril angle in Eucalyptus spp. Genetics 171: 1257–1265.
- 27. Thumma BR, Matheson BA, Zhang D, Meeske C, Meder R, et al. (2009) Identification of a cis-acting regulatory polymorphism in a eucalypt COBRA-like gene affecting cellulose content. Genetics 183: 1153–1164.
- 28. Sexton TR, Henry RJ, McManus LJ, Henson M, Thomas DS, et al. (2010) Genetic association studies in Eucalyptus pilularis Smith (blackbutt). Aust Forestry 73: 254–258.
- 29. Sexton TR, Henry RJ, Harwood CE, Thomas DS, McManus LJ, et al. (2011) Pectin methyltransferase genes influence solid wood properties of Eucalyptus pilularis. Plant Physiol 158: 531–541.
- 30. Tuskan GA, DiFazio S, Jansson S, Bohlmann J, Grigoriev I, et al. (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313: 1596–1604.
- 31. Wegrzyn JL, Eckert AJ, Choi M, Lee JM, Stanton BJ, et al. (2010) Association genetics of traits controlling lignin and cellulose biosynthesis in black cottonwood (Populus trichocarpa, Salicaceae) secondary xylem. New Phytol 188: 515–532.
- 32. Guerra FP, Wegrzyn JL, Sykes R, Davis MF, Stanton BJ, et al. (2012) Association genetics of chemical wood properties in black poplar (Populus nigra). New Phytol 197: 162–176.
- 33. Zhang DQ, Du QZ, Xu BH, Zhang ZY, Li BL, et al. (2010) The actin multigene family in Populus: organization, expression and phylogenetic analysis. Mol Genet Genomics 284: 105–119.
- 34. Reiter WD, Vanzin GF (2001) Molecular genetics of nucleotide sugar interconversion pathways in plants. Plant Mol Biol 47: 95–113.
- 35. Pattathil S, Harper AD, Bar-Peled M (2005) Biosynthesis of UDP-xylose: characterization of membrane-bound At Uxs2. Planta 221: 538–548.
- 36. Rethmeier N, Seurinck J, Van Montagu M, Cornelissen M (1997) Intronmediated enhancement of transgene expression in maize is a nuclear, gene-dependent process. Plant J 12: 895–899.
- 37. Liu JL, Sivamani E, Azhakanandam K, Samadder P, Li XG, et al. (2008) Gene expression enhancement mediated by the 5′UTR intron of the rice rubi3 gene varied remarkably among tissues in transgenic rice plants. Mol Genet Genomics 279: 563–572.
- 38. Sivamani E, Qu R (2006) Expression enhancement of a rice polyubiquitin gene promoter. Plant Mol Biol 60: 225–239.
- 39. Chung BYW, Simons C, Firth AE, Brownand CM, Hellens RP (2006) Effect of 5′UTR introns on gene expression in Arabidopsis thaliana. BMC Genomics 7: 120.
- 40. Dalessandro G, Northcote DH (1977) Changes in enzymic activities of nucleoside diphosphate sugar interconversions during differentiation of cambium to xylem in sycamore and poplar. Biochem J 162(2): 267–279.
- 41. Wheatley ER, Davies DR, Bolwell GP (2002) Characterization and immunolocation of an 87 kDa polypeptide associated with UDP-glucuronic acid decarboxylase activity from differentiating tobacco cells (Nicotiana acum L.). Phytochemistry 61: 771–780.
- 42. Geisler-Lee J, Geisler M, Coutinho PM, Segerman B, Nishikubo N, et al. (2006) Poplar Carbohydrate-Active Enzymes. Gene Identification and Expression Analyses. Plant Physiol 140: 946–962.
- 43. Aschan G, Wittmann C, Pfanz H (2001) Age-dependent bark photosynthesis of aspen twigs. Trees-Struct Funct 15: 431–437.
- 44. Pfanz H (2008) Bark photosynthesis. Trees-Struct Funct 22: 137–138.
- 45. Teskey RO, Saveyn A, Steppe K, McGuire MA (2008) Origin, fate and significance of CO2 in tree stems. New Phytol 177: 17–32.
- 46. Saveyn A, Steppe K, Ubierna N, Dawson TE (2010) Woody tissue photosynthesis and its contribution to trunk growth and bud development in young plants. Plant Cell Environ 33: 1949–1958.
- 47. Persson S, Wei H, Milne J, Page GP, Somerville CR (2005) Identification of genes required for cellulose synthesis by regression analysis of public microarray data sets. Proc Natl Acad Sci USA 102: 8633–8638.
- 48. González-Martínez SC, Krutovsky KV, Neale DB (2006) Forest-tree population genomics and adaptive evolution. New Phytol 170: 227–238.
- 49. Brown GR, Gill GP, Kuntz RJ, Langley CH, Neale DB (2004) Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc Natl Acad Sci USA 101: 15255–15260.
- 50. Dvornyk V, Sirvio A, Mikkonene M, Savolainen O (2002) Low nucleotide diversity at two phytochrome loci along a latitudinal cline in Pinus sylvestris. Mol Biol Evol 19: 179–199.
- 51. Krutovsky KV, Neale DB (2005) Nucleotide diversity and linkage disequilibrium in cold-hardiness and wood quality-related candidate genes in Douglas-fir. Genetics 171: 2029–2041.
- 52. Slavov GT, DiFazio SP, Martin J, Schackwitz W, Muchero W, Rodgers-Melnick E, et al. (2012) Genome resequencing reveals multiscale geographic structure and extensive linkage disequilibrium in the forest tree Populus trichocarpa, New Phytol. 196: 713–725.
- 53. Al-Ghazi Y, Bourot S, Arioli T, Dennis ES, Liewellyn DJ (2009) Transcript profiling during fiber development identifies pathways in secondary meolism and cell wall structure that may contribute to cotton fiber quality. Plant Cell Physiol 50: 1364–1381.
- 54. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, et al. (2008) The transcriptional landscape of the yeast genome defined by RNA sequencing. Science 320: 1344–1349.
- 55. Tsankov AM, Thompson DA, Socha A, Regev A, Rando OJ (2010) The role of nucleosome positioning in the evolution of gene regulation. PLoS Biol. 8: e1000414.
- 56. Vilela C, McCarthy JEG (2003) Regulation of fungal gene expression via short open reading frames in the mRNA 5′untranslated region. Mol microbiol 49: 859–867.
- 57. Lin Z, Li WH (2012) Evolution of 5′ untranslated region length and gene expression reprogramming in yeasts. Mol Biol Evol 29(1): 81–9.
- 58. Miyamoto Y, Mabuchi A, Shi D, Kubo T, Takatori Y, et al. (2007) A functional polymorphism in the 5′UTR of GDF5 is associated with susceptibility to osteoarthritis. Nat Genet 39: 529–533.
- 59. Zeng Y, Bannon G, Thomas VH, Rice K, Drake R, et al. (1997) Purification and specificity of β1,2-xylosyltransferase, an enzyme that contributes to the allergenicity of some plant proteins. J Biol Chem 272: 31340–31347.
- 60. Strasser R, Mucha J, Mach L, Altmann F, Wilson IB, et al. (2000) Molecular cloning and functional expression of β1,2-xylosyltransferase cDNA from Arabidopsis thaliana. FEBS Lett 472: 105–108.
- 61. Meinert MC, Delmer DP (1977) Changes in biochemical composition of the cell wall of the cotton fiber during development. Plant Physiol 59: 1088–1097.
- 62. Kurek I, Kawagoe Y, Jacob-Wilk D, Doblin M, Delmer D (2002) Dimerization of cotton fiber cellulose synthase catalytic subunits occurs via oxidation of the zinc-binding domains. Proc Natl Acad Sci USA 99: 11109–11114.
- 63. Jacob-Wilk D, Kurek I, Hogan P, Delmer DP (2006) The cotton fiber zinc-binding domain of cellulose synthase A1 from Gossypium hirsutum displays rapid turnover in vitro and in vivo. Proc Natl Acad Sci USA 103: 12191–12196.
- 64. Duan J, Wainwright MS, Comeron JM, Saitou N, Sanders AR (2003) Synonymous mutations in the human dopamine receptor D2 (DRD2) affect mRNA sility and synthesis of the receptor. Hum Mol Genet 12: 205–216.
- 65. Kien ND, Quang TH, Jansson G, Harwood C, Clapham D, et al. (2009) Cellulose content as a selection trait in breeding for kraft pulp yield in Eucalyptus urophylla. Ann Forest Sci 66: 1–8.
- 66. Sari-Gorla M, Krajewski P, Di Fonzo N, Villa M, Frova C (1999) Genetic analysis of drought tolerance in maize by molecular markers. II. Plant height and flowering. Theor Appl Genet 99: 289–295.
- 67. Andersen JR, Zein I, Wenzel G, Darnhofer B, Eder J, et al. (2008) Characterization of phenylpropanoid pathway genes within European maize (Zea mays L.) inbreds. BMC Plant Biol 8: 2.
- 68. Haigler CH, Zhang DS, Wilkerson CG (2005) Biotechnological improvement of cotton fibre maturity. Physiol Plant 124: 285–294.
- 69. Tian JX, Du QZ, Chang MQ, Zhang DQ (2012) Allelic variation in PtGA20Ox associates with growth and wood properties in Populus spp. PloS ONE 7: e53116 .
- 70. Greene CS, Penrod NM, Williams SM, Moore JH (2009) Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS ONE 4: e5639 .
- 71.
Du QZ, Pan W, Xu BH, Li BL, Zhang DQ (2012) Polymorphic simple sequence repeat (SSR) loci within cellulose synthase (PtoCesA) genes are associated with growth and wood properties in Populus tomentosa. New Phytol 2013, 197: 763776.
- 72. Djerbi S, Aspeborg H, Nilsson P, Sundberg B, Mellerowicz E, et al. (2004) Identification and expression analysis of genes encoding putative cellulose synthases (CesA) in the hybrid aspen, Populus tremula (L.) × P. tremuloides (Michx.). Cellulose 11: 301–312.
- 73. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) Software Version 4.0. Mol Biol Evol 24: 1596–1599.
- 74. Rozas J, Sánchez-Delbarrio JC, Messeguer X, Rozas R (2003) DnaSP, DNA polymorphism analyses by the coalescent and other methods. Bioinformatics 19: 2496–2497.
- 75. Hill WG, Robertson A (1968) Linkage disequilibrium in finite populations. Theor Appl Genet 38: 226–231.
- 76. Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, et al. (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38: 203–208.
- 77. Ritland K (1996) Estimators for pairwise relatedness and individual inbreeding coefficients. Genet Res 67: 175–185.
- 78. Hardy OJ, Vekemans X (2002) spagedi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes 2: 618–620.
- 79. Du QZ, Wang BW, Wei ZZ, Zhang DQ, Li BL (2012) Genetic diversity and population structure of Populus tomentosa (Chinese white poplar) revealed by SSR markers. J Hered 103: 853–862.
- 80. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14: 2611–2620.
- 81. Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. P Nati Acad Sci, USA 100: 9440–9445.
- 82.
Higuchi T (1997) Biochemistry and molecular biology of wood. Springer Verlag, London.