Actinorhizal root nodule symbioses are very diverse, and the symbiosis of Datisca glomerata has previously been shown to have many unusual aspects. In order to gain molecular information on the infection mechanism, nodule development and nodule metabolism, we compared the transcriptomes of D. glomerata roots and nodules. Root and nodule libraries representing the 3′-ends of cDNAs were subjected to high-throughput parallel 454 sequencing. To identify the corresponding genes and to improve the assembly, Illumina sequencing of the nodule transcriptome was performed as well. The evaluation revealed 406 differentially regulated genes, 295 of which (72.7%) could be assigned a function based on homology. Analysis of the nodule transcriptome showed that genes encoding components of the common symbiosis signaling pathway were present in nodules of D. glomerata, which in combination with the previously established function of SymRK in D. glomerata nodulation suggests that this pathway is also active in actinorhizal Cucurbitales. Furthermore, comparison of the D. glomerata nodule transcriptome with nodule transcriptomes from actinorhizal Fagales revealed a new subgroup of nodule-specific defensins that might play a role specific to actinorhizal symbioses. The D. glomerata members of this defensin subgroup contain an acidic C-terminal domain that was never found in plant defensins before.
Citation: Demina IV, Persson T, Santos P, Plaszczyca M, Pawlowski K (2013) Comparison of the Nodule vs. Root Transcriptome of the Actinorhizal Plant Datisca glomerata: Actinorhizal Nodules Contain a Specific Class of Defensins. PLoS ONE 8(8): e72442. https://doi.org/10.1371/journal.pone.0072442
Editor: Frederik Börnke, Justus-Liebig-University Giessen, Germany
Received: May 27, 2013; Accepted: July 9, 2013; Published: August 29, 2013
Copyright: © 2013 Demina et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was funded by a grant from Forskningsrådet Formas (Swedish Research Council for Environment, Agricultural Sciences and Spatial Planning, grant 2005-2773-3234-80) and a grant from Vetenskapsrådet (Swedish Research Council, grant 2007-17840-52674-16) to KP. PS acknowledges support by Fundação para a Ciência e Tecnologia (FCT; grant SFRH/BPD/43228/2008). Web site of Formas: www.formas.se. Web site of VR: www.vr.se. Web site of FCT: www.fct.pt. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Two types of nitrogen-fixing root nodule symbioses are known: legume-rhizobia symbioses and actinorhizal symbioses. The actinorhizal symbiosis is a symbiosis between actinobacteria of the genus Frankia and a diverse group of dicotyledonous plant species from eight different families, collectively called actinorhizal plants . Phylogenetic analysis led to the identification of three major subgroups of actinorhizal plants: Fagales (Betulaceae, Myricaceae and Casuarinaceae), Cucurbitales (Datiscaceae and Coriariaceae) and Rosales (Rhamnaceae, Rosaceae and Elaeagnaceae) .
Actinorhizal nodules are coralloid organs composed of multiple lobes, each of which represents a modified lateral root without root cap, with a superficial periderm and infected cells in the expanded cortex . In nodules formed on the roots of Cucurbitales, the pattern of infected cells is different from that in other actinorhizal nodules; the infected cells form a continuous patch on one side of the acentric stele, not interspersed with uninfected cells , . Nodules of Cucurbitales are unusual in other respects as well; nodule physiology , , anatomy ,  and metabolism , ,  of the best-examined member of actinorhizal Cucurbitales, Datisca glomerata, differ from those of actinorhizal nodules formed on Fagales or Rosales. The mechanism of nodule induction on roots of actinorhizal Cucurbitales has not been examined yet, but detailed cytological analyses of mature nodules of D. glomerata  have led to the conclusion that the mechanism by which the bacteria enter plant cells may be different from those found in actinorhizal Fagales (intracellular infection) and Rosales (intercellular infection), respectively . The absence of prenodules in Cucurbitales would argue for the intercellular infection pathway, but the transcellular growth of infection threads for the intracellular pathway. Yet, in D. glomerata transcellular infection thread growth was not preceded by the formation of pre-infection thread structures . In summary, actinorhizal Cucurbitales may have a unique mechanism for transcellular infection thread growth .
In order to understand nodule development, many studies have been conducted on the comparison of gene expression patterns in legume nodules vs. roots (e.g., , ). For actinorhizal plants, several differential screenings have been carried out (e.g., , , ); however, a large scale transcriptomics analysis has only been performed for Casuarina glauca and Alnus glutinosa (Fagales) , . To date, most transcriptome studies have been conducted by microarray hybridization analysis, but the production of microarrays relies on information from extensive EST sequencing. With the reduced cost of sequencing, transcript profiling is becoming the standard technique for analysing both expression patterns  and quantitative traits . Nevertheless, the short sequence reads of serial analysis of gene expression (SAGE)  and related techniques are severely limited by the requirement of a genome sequence with reliable annotation, which is not available for many plant species including D. glomerata. Use of the 454 GS FLX sequencing technology (Roche), which creates reads of 200 bp or more in length , , while providing a lower depth of sequencing compared to short-read technologies like Solexa 1-G, offers the possibility to yield sufficient sequence information to overcome this limitation. The parallel 454 sequencing method applies high-throughput sequencing for the use with multiple samples by attaching sample-specific barcoding adaptors to blunt-end repaired DNA samples by ligation and strand-displacement (Figure S1) . With this procedure, 3′-anchored template cDNA libraries are constructed in order to generate gene-specific sequence reads. The 5′-end is generated by cutting with the restriction endonuclease NlaIII.
The parallel 454 sequencing was used to obtain an overview of the D. glomerata root and nodule transcriptomes and to enable a comparison with other root nodule symbioses. However, even with this method only 13.4% of transcripts could be identified based on sequence homology. In order to improve the assembly and the identification of genes, Illumina sequencing of the nodule transcriptome was performed as well. With these data, transcript identification could be improved to reach 72.7%, leading to a better understanding of the similarities and differences between actinorhizal root nodules from Cucurbitales and Fagales, respectively.
Results and Discussion
Sequencing of SAGE-type libraries from roots and nodules of Datisca glomerata
In order to obtain an overview of the D. glomerata root and nodule transcriptome, a high-throughput method for sequencing of serial analysis of gene expression (SAGE)-type cDNA libraries by 454 GS FLX technology (Roche) was used . 3′-End cDNA libraries were prepared from high quality total RNA from roots and nodules of D. glomerata, respectively (Figure S1). Altogether, 103,949 individual cDNA 3′-end sequences were obtained; of these 54,833 came from the root library and 49,116 from the nodule library. Clustering of these sequences led to 6,918 unique contigs (GenBank accession no. SRA012607.3).
The functions of the corresponding genes were analysed using BlastX searches of the contig sequences against the DNA databases at www.ncbi.nlm.nih.gov. Identification frequency was 15.4% when significant homology with database sequences was considered (e-value <10−5), 13.4% when homology with unknown/unidentified/hypothetical proteins was excluded. I.e., 930 contigs could be assigned homology to a gene/transcript from another organism or a gene previously characterized in D. glomerata.
The assembled length of the contigs varied: 983 contigs <100 bp, 1,900 contigs of 100–200 bp, 3,552 contigs of 200–300 bp and 483 contigs >400 bp. Statistical analysis was performed using the method described by Journet et al.  to decide whether the differences in the number of representatives in the root vs. the nodule SAGE-type library indicated significantly different expression levels. R = 10 was set as a threshold above which a difference observed was considered significant. 419 contigs were considered significantly upregulated and of these 86 had homology to transcripts in the databases at www.ncbi.nlm.nih.gov.
However, the number of 6,918 unique contigs was overrated because several contigs identified as unique in the assembly were shown to represent two previously characterized genes (see, e.g., Dgc217 and Dgc63, Table S1A).
Illumina sequencing of the nodule transcriptome and assembly
In order to improve the cDNA identification rate, the sequences had to be extended in the 5′-direction. 3′-UTR sequences tend to be AT-rich and generally are not suited to devise gene-specific primers for successful 5′-RACEs. Therefore, paired-end sequencing of the nodule transcriptome was performed using an Illumina HiSeq2000 instrument (NIH short read archive, study accession number SRP026310). Application of CLC Bio Workbench v. 4 (CLC Bio) resulted in the assembly of 117,511 contigs with an average length of 685 bp. Application of Trinity  resulted in the assembly of 64,142 contigs with an average length of 1,318 bp (N50 = 3,397 bp). This collection of sequences will be referred to as ”Trinity assembly“ in this manuscript. Then, CAP3  was used to perform a ”meta-assembly”, in which the contigs of the Trinity assembly were combined with the contigs obtained from 454 sequencing. This resulted in a total of 9,180 contigs, 272 of which consisted only of assembled 454-contigs, while 3,128 were mixed contigs of Trinity and 454-origin and the remaining 5,780 contigs were based solely on Trinity contigs. This collection of sequences will be referred to as ”meta-assembly”.
The new sequence information was to be used to improve identification of the functions of the genes represented by 454-contigs and to determine which 454-contigs represented the same genes, and thereby to make the statistical analysis more reliable. The extended sequence length for 3,128 of the mixed contigs of the meta-assembly improved the identification of homologies via BlastX. A detailed analysis of 288 mixed contigs of the meta-assembly showed that 234 of them (81.35%) showed homology with database sequences (78.5% when homology with conserved proteins of unknown function was not included). Of these 288 meta-assembly contigs, only 15 (5.2%) were chimeric, i.e., they consisted of more than one cDNA based on homology analyses using blastX (data not shown). However, when 73 of the mixed meta-assembly contigs were analysed in detail, it was found that for 48 (55%) of them no significant sequence overlap existed with the 454-contigs that had supposedly been used for their assembly. In conclusion, the quality of the meta-assembly was too low to be used as a template to sort the 454-sequences, since the links between meta-assembly contigs and 454-contigs were not reliable.
In order to assess the possibility of using the Trinity assembly as a template for sorting the 454-sequences, blastN was used to compare all 6,918 of the 454-contigs to the Trinity contigs, and all results were quality-checked by eye. This resulted in 3,756 Trinity contigs or groups of Trinity contigs that represented 5,258 different 454-contigs and 6,936 different Trinity contigs, respectively. However, a blastX search for GenBank homologs of the 101 Trinity sequences with the highest R values revealed that 40 of them were chimeric, i.e., consisted of two or more cDNAs (five of the 101 Trinity contigs examined showed no homology to GenBank sequences and therefore could not be evaluated). In conclusion, the contribution of chimeric contigs in the Trinity assembly was far too high to use this assembly as a template for sorting the 454-sequences.
Therefore, the initial 454-assembly was improved for homology identification and for the identification of contigs representing the same gene, based on the added sequence information. The alignments of the 3,756 Trinity contigs with 454-contigs were checked individually and used as a basis to combine 454-contigs unambiguously derived from the same cDNA as 454-/Trinity supercontigs (Table S1A and S1B). For the 454-contigs with R>10, blastN searches were performed against the Trinity assembly, the meta-assembly and the 454-assembly to (a) get sequence extension for identification of gene functions and (b) improve the 454-assembly by finding more supercontigs. Poisson statistics was applied to this superassembly based on the occurrences of the original 454-contigs in the root and nodule SAGE-type libraries. This way, 406 genes were identified that were expressed differentially in nodules vs. roots of D. glomerata. 86.7% (352) of these genes showed homology to database sequences, which after subtraction of homologies with conserved proteins of unknown function resulted in an identification rate of 72.7% (Table S1A and S1B). 4.6% of the 454-contigs were not represented in either the Trinity assembly or the meta-assembly. All three assemblies are available for Blast searches at fido.nsc.liu.se.
Several D. glomerata genes the cDNAs of which had been characterized previously were found in the combined assemblies. The only result on transcription levels that contradicted previously published information was on DgGHRP1 (Dgc1 in Table S1B; glycine- and histidine-rich protein ). This gene had been published as showing nodule-enhanced expression based on RNA gel blot hybridization analysis and here appeared as root-enhanced. However, in situ hybridization had shown that DgGHRP1 is expressed mainly in the periderm of roots and nodules, and the roots used for RNA isolation in this study were older than the roots used in the previous study (ca. 4 cm root length vs. 1 cm root length, starting at the tip). Hence, the difference in relative expression levels can be explained by the fact that the young root parts used in the previous study hardly contained any periderm, while the longer roots used in this study contained periderm and thus much more DgGHRP1 mRNA.
Confirmation of transcriptomics data by quantitative real-time PCR (qPCR) analysis
In order to analyse the reliability of the SAGE-type library sequencing method for determining relative levels of transcription, qPCR was performed for 23 genes for which homology with database sequences had been found. These analyses were performed on newly isolated RNAs from plants grown under the same conditions as those used for the isolation of RNA for the SAGE-type libraries, except in soil instead of sand. The results are depicted in Table 1. Altogether, with regard to tendency, applying t-test (p<0.05) to the qPCR results and setting a cut-off ratio of fold change as 2, confirmation of the SAGE-type library sequencing results was found for 73.9% of the genes examined, i.e., for 17 out of 23 genes. With regard to sensitivity/sequencing depth, organ-specificity (when set as fold change ≥100) could only be confirmed in six out of 14 cases. When taking the R values of the SAGE-type library results into consideration, it is clear that tendencies were always confirmed for R values >300, i.e., the results of SAGE-type library sequencing were basically reliable though with a higher cut-off value than would be expected based on modified Poisson statistics. The lack of confirmation of tendency for DgMnSOD1 (Dgc73; Table S1A), a superoxide dismutase gene, might be explained by physiological differences of the root samples used for SAGE-type library construction vs. those used for qPCR. It can be expected that the expression of a superoxide dismutase is controlled by various abiotic stresses .
Genes induced or downregulated in D. glomerata nodules compared to roots: overview
Analysis of the functions of the genes that are significantly up- or downregulated in nodules compared to roots implies that primary C and N metabolism is more complex in nodules than in roots, while secondary metabolism is more complex in roots (Table 2). There seem to be more transporter genes induced in nodules, presumably due to the fact that the induction of genes encoding transporters involved in nutrient exchange with the microsymbionts in nodules is higher than the induction of transporters involved in nutrient uptake from the soil in roots. The latter might easily be below the detection level. Interestingly, some chaperonins are induced in nodules, while none seems to be induced in roots, suggesting osmotic stress in nodules.
Homologs of genes encoding the components of the common symbiosis signal transduction pathway of legumes are transcribed in D. glomerata nodules
The signal transduction pathway of arbuscular mycorrhizal signal factors (Myc factors)  that was recruited for the evolution of root nodule symbiosis and therefore is also used by rhizobial nodulation (Nod) factors is referred to as the common symbiosis pathway. Results of Markmann et al.  have shown that a major component of this common symbiosis pathway of legumes, SymRK, is required for the induction of D. glomerata nodules by the homologous Frankia strain.
To find out whether genes encoding other components of the common symbiosis signal transduction pathway as well as other genes encoding proteins essential for legume nodule induction were present in the D. glomerata transcriptome, tblastN analyses were performed on the Trinity assembly and meta-assembly of the nodule transcriptome. The results showed that most legume genes encoding components of the common symbiosis pathway and genes essential for legume nodulation have homologs that are transcribed in D. glomerata nodules (Table 3). E.g., homologs of the LysM receptor kinases that act as Nod factor receptors  were found. Most of these signal transduction pathway component homologs (the nucleoporin NUP133, the calcium- and calmodulin-dependent protein kinase DMI3/CCaMK, the cation channels DMI1/CASTOR/POLLUX, the cytokinin receptor histidine kinase HK1, the transcription factor PIR1, IPD3/CYCLOPS and the transcription factor ERN1) were represented in the Trinity assembly, but not in the SAGE-type libraries (Table 3).
Analysis of the differentially expressed genes showed that a gene encoding a homolog of the nodule-specific transcription factor NIN (nodule inception protein) of Lotus japonicus, a central regulator in nodulation responsible for the de-differentiation of root cortical cells and required for nodule initiation , , was induced in D. glomerata nodules compared to roots (Dgc1007; Table S1A). In legumes, NIN expression is induced by cytokinin signaling, which is induced by Nod factor signaling. The full-size sequence of Dgc1007 (DgNIN; GenBank accession no. JX912727) was obtained, and its nodule-enhanced expression was confirmed by RT-qPCR (Table 3). Interestingly, in D. glomerata NIN expression was induced only 7.3 times in nodules compared to roots, i.e., it was not nodule-specific as in legumes (Table 1). Nodule-upregulated NIN homologs had also been found in Alnus glutinosa and Casuarina glauca . Thus, the transcription factor NIN plays a role in all root nodule symbioses; however, it remains to be shown whether in actinorhizal symbioses NIN expression is induced by cytokinin signaling like in legume symbioses.
There are some differences in differential expression of the D. glomerata homologs in that the genes encoding the closest homologs of NFR1/LYK3 and of NSP2 are not expressed root-specifically in D. glomerata, which might be explained by a different gene family situation in this species. Instead of recruitment by gene duplication, recruitment by expansion of function might have taken place in the evolution of actinorhizal Cucurbitales. For some homologs (CCaMK/DMI3, CASTOR/POLLUX, NUP133, CYCLOPS, PIR1, ERN1, HAP2-1, CRE1), no data on differential expression in D. glomerata are available (Table 3).
In summary, homologs of most components of the common symbiosis pathway known from legumes and of several genes essential for legume nodule formation were found in the D. glomerata nodule transcriptome. Furthermore, Table 3 shows that in most cases where the expression profiles of these genes in roots and nodules could be assessed, they are similar to those in legumes. The data support the conclusion of Markmann et al.  that the effect of SymRK on nodule induction shows that the common symbiosis pathway is required for the induction of actinorhizal nodules on D. glomerata roots, as it is required for the induction of actinorhizal nodules on C. glauca (Fagales) , , .
Homologs of genes encoding proteins involved in infection thread formation are present, but do not always show the same differential expression as in legumes
Two exceptions were found where the expression profile of the D. glomerata homolog of a nodulation-related gene did not fit the expression profile of the legume homolog. The function of the corresponding genes (PUB1, CERBERUS/LIN) ,  are related to infection thread growth. While infection thread growth mechanisms are similar in legumes and actinorhizal Fagales, they seem to be different in actinorhizal Cucurbitales (reviewed in ). In particular, no pre-infection thread structures, which have been described for legumes and A. glutinosa, are formed in D. glomerata nodules , , . Furthermore, transcellular infection threads in Cucurbitales do not grow through the cell center as in legumes and actinorhizal Fagales, filling the cell with branching infection threads from the center outward, but remain in the periphery of the cell, filling it from the periphery inward . Thus, in contrast with actinorhizal Fagales and Rosales, infected cells of nodules of Cucurbitales retain a large central vacuole . In this context, it is interesting that homologs of all genes encoding proteins that have been associated with infection thread growth in legumes (VPY, PUB1, CERBERUS/LIN, RIT/NAP1, PIR1, SYMREM1) , , , ,  have been found in the D. glomerata nodule transcriptome. In legumes, expression of VPY, CERBERUS/LIN and PUB1 is induced in nodules compared to roots , , . Upregulation of CERBERUS/LIN in nodules was also confirmed for C. glauca (homologs of the other genes have not been identified in Fagales yet) . However, the D. glomerata homologs of CERBERUS/LIN and of PUB1 (GenBank accession number KC145163) were not upregulated in nodules when analysed using qPCR. Only the VPY homolog (GenBank accession number KC145164) was induced in nodules (Tables 1,3). The situation for the symbiotic remorin (SYMREM1; MtREM2.1 in ) was more complicated as there were three remorin homologs present in the Trinity assembly and due to the intrinsically high sequence variability of the N-terminal domains it is difficult to determine which is the closest homolog of the symbiotic remorin (data not shown). Phylogenetic analysis using the remorin protein families from Arabidopsis thaliana and M. truncatula  showed that with the inclusion of the three D. glomerata nodule remorins, the latter map in a sister clade to the symbiotic remorins of M. truncatula (Figure S2). Expression of one of them, DgREM2 (Table S1A), was enhanced in nodules compared to roots as confirmed by qPCR (Tables 1, 3), similar to the symbiotic remorin genes in M. truncatula , A. glutinosa and C. glauca . In summary, so far the relative expression levels in roots and nodules of two D. glomerata genes (DgVPY, DgREM2) the products of which have been implicated in infection thread growth are consistent with those of their legume homologs, while the relative expression levels of two other genes (DgCERBERUS/LIN, DgPUB1) are not. Yet, this might be explained by the recruitment of different members of the corresponding gene families in Cucurbitales. Hence, no conclusion can be drawn regarding infection thread growth mechanisms in Cucurbitales vs. Fagales/legumes based on gene expression data.
Genes encoding proteases and cysteine-rich peptides transcribed in roots and nodules of actinorhizal plants
The composition of proteases and cysteine-rich peptides differs between roots and nodules and between A. glutinosa, C. glauca and D. glomerata. The family of cysteine proteases that is strongly upregulated in A. glutinosa nodules  has no representatives in either C. glauca or D. glomerata nodules or roots . However, there are cysteine protease genes expressed at high levels in roots of A. glutinosa, C. glauca and D. glomerata which are downregulated in nodules; in the case of A. glutinosa and C. glauca, they are homologs of xylem cysteine protease 1 , while in the case of D. glomerata, they are papain-type cysteine proteases (Table S1B). Among aspartic proteases, nepenthesin-type proteases are downregulated in A. glutinosa as well as in C. glauca nodules, but upregulated in D. glomerata nodules (Table S1A) , while other types of aspartic proteases are upregulated in C. glauca nodules, but none is upregulated in A. glutinosa nodules. Among serine proteases, a homolog of the nodule-specific subtilisin-type proteases characterized in infection thread-containing cells of A. glutinosa and C. glauca , ,  is also present, and seemingly also nodule-specific, in D. glomerata nodules (Table S1A). The expression of cucumisin-type proteases is induced in nodules of C. glauca and D. glomerata, but not of A. glutinosa (Table S1A) .
It is striking that cytosolic metallothioneins (MTs) constitute the bulk of cysteine-rich peptides in roots of D. glomerata (Table S1B). In principle, this is also the case in A. glutinosa and C. glauca, although there MT gene expression levels in roots and nodules are more similar . In all three actinorhizal species examined, apoplastic defensins are induced in nodules compared to roots. The full-size sequences of the two nodule-specific cysteine-rich peptide cDNAs representing the genes expressed at high levels in D. glomerata nodules (Table S1A), Dgc156 and Dgc232, were obtained (GenBank accession numbers HQ005271 and HQ005272, respectively).
Dgc156, a 639 bp cDNA, encodes a protein of altogether 120 amino acids with a molecular weight of 13.77 kDa. According to Euk-mPLoc 2.0 (http://www.csbio.sjtu.edu.cn/bioinf/euk-multi-2/), the protein localizes to the apoplast; according to SignalP , the first 26 amino acids represent the signal peptide, resulting in a mature protein of 94 amino acids, a molecular weight of 11.44 kDa and an IEP of 5.79. The amino acid sequence shows homology with defensins; accordingly, the protein was termed DgDEF1. Plant defensins are small basic apoplastic proteins of typically 45–55 amino acids and a net positive charge, with eight cysteine residues that form four disulfide bridges . It should be pointed out that the net negative charge and the acidic IEP of DgDEF1 are due to the acidic C-terminal domain comprising 40 amino acids; the N-terminal defensin domain does indeed have a net positive charge and an alkaline IEP (Figure 1).
The amino acid sequences of the mature peptides of DgDEF1 and DgDEF2 are compared with the sequences of four mature defensin peptides of the A3 class, a defensin from Arabidopsis halleri (AhPDF1.4; GenBank accession no. AY961379.1), the Antifungal Protein 1 from Raphanus sativus seeds (RsAFP1) , a defensin from D. merckii (AMP1)  and the aluminum-induced tobacco protein (NtPit1) . Gaps to optimize the alignment were introduced using the program ClustalW (EMBL), and the editor GeneDoc was used to present the alignment . Identical amino acids at conserved positions are labeled by inverse print, whereas positions without full amino acid conservation are shaded in gray. Asterisks mark the cysteine residues conserved in the defensins from plants other than D. glomerata. All negatively charged amino acids in the unique C-terminal domains of DgDEF1 and DgDEF1 are marked by bold print and underlined.
The eight cysteine residues in the N-terminal domain of the mature DgDEF1 protein form a pattern of C-X10-C-X5-C-X3-C-X10-C-X9-C-X-C-X4-C, which is very similar, though not identical, to the cysteine pattern of class A3 defensins (Figure 1) and group III of defensin-like proteins in grapevine (C-X5–10-C-X4–6-C-X3-C-X9–15-C-X5–12-C-X-C-X3-C) . Among characterized defensins, DgDEF1 shows the highest homology with representatives of class A3, an aluminum-induced tobacco protein , a defensin of Dahlia merckii ,  and the Antifungal Protein 1 from radish seeds (RsAFP1) , . It shows lower homology with the A2 defensin from Aesculus hippocastanum  and even lower homology with the B2 thionins from wheat and barley . Detailed analysis revealed that DgDEF1 is a member of a small nodule-specific defensin subfamily including DgDEF2 (Dgc845; Table S1A and Table 1; GenBank accession no. JX912726). DgDEF2 represents a 123 amino acid protein the first 26 amino acids of which represent the signal peptide; as a result, the mature protein consists of 97 amino acids and has a molecular weight of 10.74 kDa and an IEP of 5.87. Thus, both members of this defensin family contain the unusual C-terminal domain resulting in an acidic IEP of the putative mature protein. The cysteine spacing differs between DgDEF1 and DgDEF2 (Figure 1).
While both A. glutinosa and C. glauca contain gene families of defensin-like peptides the expression of which is highly induced in nodules compared to roots  (Figure S3), no defensin from either species contains the unusual C-terminal domain found in DgDEF1 and DgDEF2. Nevertheless, apart from this feature, the nodule-specific or highly nodule-enhanced defensin-like peptides from A. glutinosa and C. glauca belong to the same group of defensins as DgDEF1 and DgDEF2 (Figure S3).
Dgc232, a 645 bp cDNA, encodes a small basic cysteine-rich peptide (CRP) of which no homologs exist in nodules of A. glutinosa and C. glauca or in the databases at www.ncbi.nlm.nih.gov; the gene was termed DgCRP1. The encoded protein has a molecular weight of 12.38 kDa and consists of 115 amino acids. According to Euk-mPLoc 2.0, it is targeted to the apoplast, with a signal peptide (SignalP) of 29 amino acids, resulting in a mature protein of 86 amino acids, a molecular weight of 9.37 kDa and an IEP of 8.36. DgCRP1 contains seven cysteine residues, one near the N-terminus and the other six in the C-terminal domain, the latter forming a pattern of C–X3–C–X4–C–X10–C–X3–C–X3–C. So far, this pattern has not been found in small cysteine-rich proteins from plants .
Eukaryotes produce small cysteine-rich antimicrobial peptides (CRPs) as an innate defense against pathogens . Defensins, a large group of those peptides, induce the permeabilization of fungal membranes , , . However, the function of CRPs is not restricted to defense. Legumes have been shown to contain large gene families encoding different groups of CRPs , and a group of nodule-specific cysteine-rich peptides (NCRs) has been found to control bacterial differentiation including the amplification of the rhizobial genome in nodules of certain legumes . In actinorhizal symbioses, the plant affects bacterial differentiation as exemplified by the fact that shape and subcellular position of Frankia vesicles formed in planta are host-specific , though no data on genome amplification of Frankia in symbiosis are available. It is tempting to speculate that not only legumes but also actinorhizal plants control the differentiation of their bacterial endosymbionts by cysteine-rich peptides. However, CRPs have also been found to be involved in developmental processes , , for instance in guiding pollen tube growth , . Since both pollen tubes and infection threads employ the tip growth mechanism, this might also offer a function for nodule-specific defensins and other cysteine-rich peptides in controlling the infection process.
Thiamine biosynthesis is upregulated in actinorhizal nodules
Nodule-enhanced genes encoding enzymes involved in thiamine biosynthesis (Dgc1186, Dgc1757, Dgc1813, Dgc2133, Dgc1072 and Dgc1622, respectively; Table S1A) show a parallel with actinorhizal nodules from A. glutinosa, where a gene encoding AgThi1 was strongly induced in nodules compared to roots . This was also observed for the AgThi1 homolog in C. glauca . No similar induction of thiamine biosynthesis genes has been reported for legume nodules. Thus, actinorhizal plants from two different phylogenetic subgroups seem to induce thiamine biosynthesis in nodules, while legumes do not. Do actinorhizal plants provide thiamine to their microsymbionts in symbiosis? At least in the microsymbiont of A. glutinosa, ACN14a, thiamine biosynthesis, as indicated by the expression levels of ThiC, is not significantly downregulated in symbiosis compared to N-replete conditions in the free-living state, nor is it induced during free-living nitrogen fixation .
The identification of many components of the common symbiosis signal transduction pathway in D. glomerata nodules opens possibilities for detailed comparisons between root nodule symbiosis of legumes, actinorhizal Fagales and actinorhizal Cucurbitales. The mechanisms of induction of organogenesis and internalization of the microsymbionts in nodule cells can be assessed, at least with regard to the question after the conservation of the mechanisms identified for legumes. A certain subgroup of nodule-specific defensin-like peptides which do not appear in legume nodules has now been found in transcriptomes of actinorhizal plants from different phylogenetic subgroups indicating that these peptides might play a role specific to actinorhizal symbioses. Which role defensin-like peptides play and why the D. glomerata representatives contain an acidic C-terminal domain remains to be examined.
Materials and Methods
Datisca glomerata (Presl.) Baill seeds were originally obtained from plants in Vaca Hills, California. No specific permissions were required because the collection took place from plants growing in a stream-bed, which was not privately owned. D. glomerata is not an endangered or protected species in any part of its geographical range, by either state or federal law. Plants were grown in a greenhouse and watered with 1/4 strength Hoagland's  once per week and twice per week with deionized water. Light conditions in the greenhouse were 150–300 μmol photons m−2 s−1; temperature set points were 22°C/19°C at 13 h light/11 h dark. Seeds were germinated on germination soil (Weibull Trädgard AB, Hammenhög, Sweden). When the plants had reached a height of about 20 cm, they were transferred to pots containing sand (0–2 mm Quartz; Rådasand AB, Lidköping, Sweden) and soil from nodulated D. glomerata plants containing spores of Candidatus Frankia datiscae Dg1, a non-cultured Frankia strain originating from Coriaria nepalensis nodules from Pakistan . For transcriptome analysis, nodules and roots were harvested four to six weeks after transfer. Nodules with one or two lobes were considered young, and noduels composed of more than two lobes were considered mature. Roots from nodulated plants were cut off ca. 4 cm above the root tip. For RT-qPCR analysis, plants were grown on germination soil throughout.
Preparation of cDNA from roots and nodules of D. glomerata for transcriptome analysis
The RNA isolation protocol used was modified after Chomczynski . Plant tissue ground in liquid N2 and then transferred to a 2 ml microcentrifuge tube containing 1 ml of pre-warmed extraction buffer (65°C, 2% CTAB, 2% PVP (K30), 100 mM Tris-HCl pH 8.0, 25 mM EDTA, 2 M NaCl, 0.5 g/l spermidine, 2% ß-mercaptoethanol) per 200 mg of plant material and mixed by inverting and vortexing. The mixture was subjected to RNA extraction twice with an equal volume of (25∶24∶1) phenol:chloroform:isoamyl alcohol and once with (24∶1) chloroform:isoamyl alcohol. Separation of phases was achieved by centrifugation at 10,000×g at room temperature for 10 min. The resulting RNA was precipitated using 1/10 volume of 3 M sodium acetate pH 5.2 and 2.5 volumes of absolute ethanol at −20°C overnight. The RNA from D. glomerata roots was labeled ‘DgR’, from young nodules ‘DgNy’ and from mature nodules ‘DgNm’.
Poly(A) RNA was prepared from RNAs DgR, DgNy and DgNm by Eurofins MWG Operon (Ebergsberg, Germany). Prior to purification, nodule RNA samples were mixed (DgNy/DgNm 1∶2). The poly(A) RNA was used to synthesize double-stranded cDNA  using a specific oligo(dT) adapter primer, which carried the 454 adapter B sequence. Then, the cDNA was cut with the restriction enzyme NlaIII. Subsequently, the purified NlaIII fragments were ligated to a short double-stranded adaptor which carried the 454 adapter A sequence. Finally, the distal 3′-cDNA fragments which carried the poly(A) tails were specifically PCR-amplified to about 40 ng/μl (for cycle numbers see Table S3). The barcode sequences which were attached to the 5′-ends of the cDNAs are included in Table S3.
Massive parallel sequencing and sequence evaluation
Contig assembly and calculation of occurrences of a particular contig in the nodule library vs. root library was performed by Eurofins MWG Operon. Sequence homologies were analysed using BlastX at www.ncbi.nlm.nih.gov. The statistical significance of differences in transcripts levels between roots and nodules was analysed by calculation of the R parameter using modified Poisson statistics as described by Journet et al. . For every contig, the probability was calculated for two hypotheses. H0: number of copies is equal in both libraries. H1: one library contains more copies than the other. The ratio (R) between the probabilities H1/H0 was used to decide whether the difference of copy number between nodule and root libraries was significant. A difference between expression levels was considered significant when R>10.
Protein sequence analysis was performed using the GCG program package (Wisconsin Genetics Computer Group) and the I-TASSER platform .
Illumina paired-end sequencing and sequence evaluation
Nodule RNA was isolated as described before and prepared for sequencing using the TruSeq cDNA preparation kit (Illumina, San Diego, CA, USA). Paired-end sequencing to 100 bp was performed on an Illumina HiSeq2000 instrument using v. 1.5 flow cells, resulting in about 132 million paired-end reads.
Reads were assembled using Trinity version 2011-10-29 . CAP3  was then used to perform a ”meta-assembly” where the contigs of the Trinity assembly were assembled together with the contigs obtained from 454 sequencing. Default parameters were used for both programs.
Amplification of full-size cDNAs
Total RNA was isolated from D. glomerata nodules using RNeasy Plant Mini Kit (Qiagen, Hilden, Germany). To obtain the full-size cDNA sequences, 5′- and 3′-rapid amplification of cDNA ends (RACE) was performed. Reverse transcription was performed on 1 µg of total RNA using MuLV RT (Fermentas, St. Leon-Rot, Germany) and 5′-CDS primer A together with the SMART II oligo for 5′-RACE-Ready cDNA, or 3′-CDS primer A (Clontech, Mountain View, CA, USA) for 3′-RACE-Ready cDNA, respectively, in a final volume of 20 µl following recommendations of the manufacturer. RACEs were performed according to the SMARTTM RACE cDNA Amplification protocol (Clontech) on 2.5 µl aliquots of the first-strand cDNA, diluted 1∶13 with Tricine-EDTA buffer, with Universal Primer A Mix (Clontech) and the first gene-specific primer (Table S4). The diluted product of the primary PCR was used in the secondary PCR with Nested Universal Primer A (Clontech) and the second gene-specific primer (Table S4). Full-length cDNAs were generated by long distance PCR using 2.5 µl of 5′-RACE-Ready cDNA as template. The PCR program used was 35 cycles of 94°C for 30 s, annealing at temperature 5°C lower than the melting temperature of the primer pair for 30 s and 72°C for 3 min. The PCR products were cloned in pCR2.1-TOPO (5′-RACE products for DgMnSOD1 (Dgc73) and DgDCAT1 (Dgc108); Invitrogen, Carlsbad, CA, USA) or pJET1.2 (all other PCR products; Fermentas) and sequenced.
Reverse transcription and quantitative real-time PCR (qPCR)
Total RNA was isolated from D. glomerata nodules and from roots of non-inoculated plants, using the RNeasy Plant Mini Kit with on-column DNase digestion (Qiagen, Hilden, Germany). Reverse transcription was performed on 2.3 µg total RNA with NotI-d(T)18 primers in a final volume of 33 µl, using the First-Strand cDNA Synthesis Kit (GE Healthcare AB, Stockholm, Sweden) according to the protocol provided by the manufacturer. All qPCR assays contained 1X Maxima SYBR Green qPCR Master Mix (Fermentas, Vilnius, Lithuania), 325 nM of each primer, 5 µl of diluted cDNA in a total reaction volume of 20 µl. qPCR was conducted on a LightCycler480 (Roche, Mannheim, Germany) under the conditions of 10 min of initial denaturation at 94°C, 40 cycles of 15 sec at 94°C and 30 sec at 60°C followed by a melt curve analysis. Assays were analysed in triplicate with standard curve method . PCR efficiency was calculated in LightCycler480 software with data obtained from the exponential phase of each amplification plot. The transcript data were normalized against the constitutively expressed D. glomerata ubiquitin gene (Dgc205; Table S5). Primer sequences used in the transcript analysis (Table S5) were designed using the software Primer 3 v. 0.4.0 (http://frodo.wi.mit.edu/primer3/). Data pre-processing, normalization and t-test (p<0.05) were performed with GenEx (version 5.4.1, MultiD Analyses, Göteborg, Sweden).
Preparation of the libraries for sequencing, based on Eveland et al. .
Phylogenetic tree of the Arabidopsis and Medicago truncatula remorin protein families  and the three remorins from the Datisca glomerata nodule transcriptome (arrows). The sequences were aligned using ClustalW . The phylogenetic trees were estimated by neighbor-joining analysis using the software PAUP* 4.0b10 (PPC/Altivec) for Macintosh . Bootstrap analysis with 1000 bootstrap replications using the neighborjoining search option of the program PAUP* 4.0b10 was carried out to test the robustness of the internal branches. A remorin from the liver moss Physcomitrella patens (GenBank accession no. XP_001752001) served as outgroup.
Alignment of the amino acid sequences of DgDEF1 and DgDEF2 with the defensin-like peptides from Alnus glutinosa and Casuarina glauca that show nodule-specific or strongly nodule-enhanced expression: AgDEF1, FQ334620; AgDEF2, FQ344001; AgDEF3, FQ334074; CgDEF1, FQ318729; CgDEF2, FQ362615; CgDEF3, FQ363112; CgDEF4, FQ363205; CgDEF5, FQ320471 . Gaps to optimize the alignment were introduced using the program ClustalW (EMBL), and the editor GeneDoc was used to present the alignment . Identical amino acids at conserved positions are labeled by inverse print, whereas chemically similar amino acids are shaded in gray. Asterisks label every 10th amino acid position. Putative signal peptides are underlined. Sequence AgDEF1 is clearly truncated at the 5′-end, and sequences AgDEF2 and CgDEF3 most probably are truncated since the encoded proteins lack a signal peptide.
A. Datisca glomerata genes upregulated in nodules compared to roots. Genes mentioned in the manuscript and not published earlier are given in bold print. B. Datisca glomerata genes upregulated in roots compared to nodules. Genes mentioned in the manuscript and not published before are given in bold print.
Homology between legume proteins (GenBank accession numbers given) involved in nodule induction, and the corresponding proteins encoded by Datisca glomerata nodule contigs. The Lotus japonicus (Lj) and/or Medicago truncatula (Mt) protein sequences were used for a tBlastN search on the different assemblies of the D. glomerata nodule transcriptome. The homology values (E values) for the contigs given in Table 3 are listed. Homologies for D. glomerata SYMRK/DMI2 are not given as this gene was already functionally characterized .
Preparation of cDNA libraries (1, root cDNA library; 2, nodule cDNA library). PCR cycles used for cDNA amplification and barcode sequences attached to 5′-ends of cDNAs
Primers used for amplification of cDNAs from Datisca glomerata.
We would like to thank Mikael Huss and Sara Light (SciLifeLab, Stockholm, Sweden) for help with the Illumina sequencing results, Helge Küster (University of Hannover, Germany) for helpful discussions and Jenna Persson (Karolinska Institutet, Sweden) for critical reading of the manuscript.
Conceived and designed the experiments: KP IVD TP. Performed the experiments: IVD TP. Analyzed the data: IVD KP MP PS TP. Contributed reagents/materials/analysis tools: MP. Wrote the paper: KP IVD TP MP.
- 1. Benson DR, Silvester WB (1993) Biology of Frankia strains, actinomycete symbionts of actinorhizal plants. Microbiol Rev 57: 293–319.
- 2. Swensen S, Mullin B (1997) Phylogenetic relationships among actinorhizal plants. The impact of molecular systematics and implications for the evolution of actinorhizal symbiosis. Physiol Plant 99: 565–573.
- 3. Pawlowski K, Demchenko KN (2012) The diversity of actinorhizal symbiosis. Protoplasma 249: 967–979.
- 4. Newcomb W, Pankhurst CE (1982) Fine structure of actinorhizal root nodules of Coriaria arborea (Coriariaceae). N Z J Bot 20: 93–103.
- 5. Hafeez F, Akkermans ADL, Chaudhary AH (1984) Observations on the ultrastructure of Frankia sp. in root-nodules of Datisca cannabina L. Plant Soil 79: 383–402.
- 6. Tjepkema JD, Schwintzer CR, Monz CA (1988) Time course of acetylene reduction in nodules of five actinorhizal genera. Plant Physiol 86: 581–583.
- 7. Silvester WB, Langenstein B, Berg RH (1999) Do mitochondria provide the oxygen diffusion barrier in root nodules of Coriaria and Datisca? Can J Bot 77: 1358–1366.
- 8. Jacobsen KR, Berry AM (2002) Callose in Frankia-infected tissue of Datisca glomerata is an artifact of specimen preparation. Plant Biol 4: 46–52.
- 9. Berry AM, Murphy TM, Okubara PA, Jacobsen KR, Swensen SM, et al. (2004) Novel expression pattern of cytosolic glutamine synthetase in nitrogen-fixing root nodules of the actinorhizal host, Datisca glomerata. Plant Physiol 135: 1849–1862.
- 10. Schubert M, Melnikova AN, Mesecke N, Zubkova EK, Fortte R, et al. (2010) Two novel disaccharides, rutinose and methylrutinose, are involved in carbon metabolism in Datisca glomerata. Planta 231: 507–521.
- 11. Schubert M, Koteeva NK, Wabnitz PW, Santos P, Büttner M, et al. (2011) Carbon partitioning in roots and nitrogen-fixing root nodules of Datisca glomerata. Planta 233: 139–152.
- 12. Berg RH, Langenstein B, Silvester WB (1999) Development in the Datisca-Coriaria nodule type. Can J Bot 77: 1334–1350.
- 13. Berg RH (1999) Cytoplasmic bridge formation in the nodule apex of actinorhizal root nodules. Can J Bot 77: 1351–1357.
- 14. Colebatch G, Desbrosses G, Ott T, Krusell L, Montanari O, et al. (2004) Global changes in transcription orchestrate metabolic differentiation during symbiotic nitrogen fixation in Lotus japonicus. Plant J 39: 487–512.
- 15. Manthey K, Krajinski F, Hohnjec N, Firnhaber C, Pühler A, et al. (2004) Transcriptome profiling in root nodules and arbuscular mycorrhiza identifies a collection of novel genes induced during Medicago truncatula root endosymbioses. Mol Plant-Microbe Interact 17: 1063–1077.
- 16. Goetting-Minesky P, Mullin BC (1994) Differential gene expression in an actinorhizal symbiosis: evidence for a nodule-specific cysteine protease. Proc Natl Acad Sci USA 91: 9891–9895.
- 17. Ribeiro A, Akkermans ADL, van Kammen A, Bisseling T, Pawlowski K (1995) A nodule-specific gene encoding a subtilisin-like protease is expressed in early stages of actinorhizal nodule development. Plant Cell 7: 785–794.
- 18. Kim HB, An CS (2002) Differential expression patterns of an acidic chitinase and a basic chitinase in the root nodule of Elaeagnus umbellata. Mol Plant-Microbe Interact 15: 209–215.
- 19. Hocher V, Alloisio N, Auguy F, Fournier P, Doumas P, et al. (2011) Transcriptomics of actinorhizal symbioses reveals homologs of the whole common symbiotic signaling cascade. Plant Physiol 156: 700–711.
- 20. Hocher V, Auguy F, Argout X, Laplaze L, Franche C, et al. (2006) Expressed sequence-tag analysis in Casuarina glauca actinorhizal nodule and root. New Phytol 169: 681–688.
- 21. Weber APM, Weber KL, Carr K, Wilkerson C, Ohlrogge JB (2007) Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing. Plant Physiol 144: 32–42.
- 22. Borevitz JO, Chory J (2004) Genomics tools for QTL analysis and gene discovery. Curr Op Plant Biol 7: 132–136.
- 23. Velculescu VE, Zhang L, Vogelstein B, Kinzler KW (1995) Serial analysis of gene expression. Science 270: 484–487.
- 24. Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, et al. (2005) Genome sequencing in microfabricated high-density picolitre reactors. Nature 437: 376–380.
- 25. Torres TT, Metta M, Ottenwälder B, Schlötterer C (2008) Gene expression profiling by massively parallel sequencing. Genome Res 18: 172–177.
- 26. Eveland AL, McCarty DR, Koch KE (2008) Transcript profiling by 3′-untranslated region sequencing resolves expression of gene families. Plant Physiol 146: 32–44.
- 27. Journet E-P, van Tuinen D, Gouzy J, Crespeau H, Carreau V, et al. (2002) Exploring root symbiotic programs in the model legume Medicago truncatula using EST analysis. Nucl Acids Res 30: 5579–5592.
- 28. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29: 644–652.
- 29. Huang X, Madan A (1999) CAP3: A DNA sequence assembly program. Genome Res 9: 868–877.
- 30. Pawlowski K, Swensen S, Guan C, Hadri A–E, Berry AM, et al. (2003) Distinct patterns of symbiosis-related gene expression in actinorhizal nodules from different plant families. Mol Plant-Microbe Interact 16: 796–807.
- 31. Li W, Qi L, Lin X, Chen H, Ma Z, et al. (2009) The expression of manganese superoxide dismutase gene from Nelumbo nucifera responds strongly to chilling and oxidative stresses. J Integr Plant Biol 51: 279–386.
- 32. Maillet F, Poinsot V, André O, Puech-Pagès V, Haouy A, et al. (2011) Fungal lipochitooligosaccharide symbiotic signals in arbuscular mycorrhiza. Nature 469: 58–63.
- 33. Markmann K, Giczey G, Parniske M (2008) Functional adaptation of a plant receptor-kinase paved the way for the evolution of intracellular root symbioses with bacteria. PLoS Biol 6: e68.
- 34. Crespi M, Frugier F (2008) De novo organ formation from differentiated cells: root nodule organogenesis. Sci Signal 1: re11.
- 35. Schauser L, Roussis A, Stiller J, Stougaard J (1999) A plant regulator controlling development of symbiotic root nodules. Nature 402: 191–195.
- 36. Radutoiu S, Madsen LH, Madsen EB, Felle HH, Umehara Y, et al. (2003) Plant recognition of symbiotic bacteria requires two LysM receptor-like kinases. Nature 425: 585–592.
- 37. Gherbi H, Markmann K, Svistoonoff S, Estevan J, Autran D, et al. (2008) SymRK defines a common genetic basis for plant root endosymbioses with arbuscular mycorrhiza fungi, rhizobia, and Frankia bacteria. Proc Natl Acad Sci USA 105: 4928–4932.
- 38. Svistoonoff S, Benabdoun FM, Nambiar-Veetil M, Imanishi L, Vaissayre V, et al. (2013) The independent acquisition of plant root nitrogen-fixing symbiosis in fabids recruited the same genetic pathway for nodule organogenesis. PLoS One 8: e64515.
- 39. Murray JD, Muni RR, Torres-Jerez I, Tang Y, Allen S, et al. (2011) Vapyrin, a gene essential for intracellular progression of arbuscular mycorrhizal symbiosis, is also essential for infection by rhizobia in the nodule symbiosis of Medicago truncatula.. Plant J 65: 244–252.
- 40. Mbengue M, Camut S, de Carvalho-Niebel F, Deslandes L, Froidure S, et al. (2010) The Medicago truncatula E3 ubiquitin ligase PUB1 interacts with the LYK3 symbiotic receptor and negatively regulates infection and nodulation. Plant Cell 22: 3474–3488.
- 41. van Brussel AAN, Bakhuizen R, van Spronsen PC, Spaink HP, Tak T, et al. (1992) Induction of pre-infection thread structures in the leguminous host plant by mitogenic lipooligosaccharides of Rhizobium. Science 257: 70–72.
- 42. Yano K, Shibata S, Chen WL, Sato S, Kaneko T, et al. (2009) CERBERUS, a novel U-box protein containing WD-40 repeats, is required for formation of the infection thread and nodule development in the legume-Rhizobium symbiosis. Plant J 60: 168–180.
- 43. Yokota K, Fukai E, Madsen LH, Jurkiewicz A, Rueda P, et al. (2009) Rearrangement of actin cytoskeleton mediates invasion of Lotus japonicus roots by Mesorhizobium loti. Plant Cell 21: 267–284.
- 44. Lefebvre B, Timmers T, Mbengue M, Moreau S, Hervé C, et al. (2010) A remorin protein interacts with symbiotic receptors and regulates bacterial infection. Proc Natl Acad Sci USA 107: 2343–2348.
- 45. Raffaele S, Mongrand S, Gamas P, Niebel A, Ott T (2007) Genome-wide annotation of remorins, a plant-specific protein family: evolutionary and functional perspectives. Plant Physiol 145: 593–600.
- 46. Laplaze L, Ribeiro A, Franche C, Duhoux E, Auguy F, et al. (2000) Characterization of a Casuarina glauca nodule-specific subtilisin-like protease gene, a homolog of Alnus glutinosa ag12. Mol Plant-Microbe Interact 13: 113–117.
- 47. Svistoonoff S, Laplaze L, Auguy F, Runions J, Duponnois R, et al. (2003) cg12 expression is specifically linked to infection of root hairs and cortical cells during Casuarina glauca and Allocasuarina verticillata actinorhizal nodule development. Mol Plant-Microbe Interact 16: 600–607.
- 48. Chou K-C, Shen H-B (2008) Cell-PLoc: A package of web-servers for predicting subcellular localization of proteins in various organisms. Nature Protocols 3: 153–162.
- 49. Bendtsen JD, Nielsen H, von Heijne G, Brunak S (2004) Improved prediction of signal peptides: SignalP 3.0. J Mol Biol 340: 783–795.
- 50. Thomma BP, Cammue BP, Thevissen K (2002) Plant defensins. Planta 216: 193–202.
- 51. Giacomelli L, Nanni V, Lenzi L, Zhuang J, Dalla Serra M, et al. (2012) Identification and characterization of the defensin-like gene family of grapevine. Mol Plant-Microbe Interact 25: 1118–1131.
- 52. Ezaki B, Yamamoto Y, Matsumoto H (1995) Cloning and sequencing of the cDNAs induced by aluminium treatment and Pi starvation in cultured tobacco cells. Physiol Plant 93: 11–18.
- 53. Osborn RW, De Samblanx GW, Thevissen K, Goderis I, Torrekens S, et al. (1995) Isolation and characterisation of plant defensins from seeds of Asteraceae, Fabaceae, Hippocastanaceae and Saxifragaceae. FEBS Lett 368: 257–262.
- 54. Thevissen K, Ghazi A, De Samblanx GW, Brownlee C, Osborn RW, et al. (1996) Fungal membrane responses induced by plant defensins and thionins. J Biol Chem 271: 15018–15025.
- 55. Terras FRG, Eggermont K, Kovaleva V, Raikhel NV, Osborn RW, et al. (1995) Small cysteine-rich antifungal proteins from radish: their role in host defence. Plant Cell 7: 573–588.
- 56. Fant F, Vranken W, Broekaert W, Borremans F (1998) Determination of the three-dimensional solution structure of Raphanus sativus antifungal protein 1 by 1H NMR. J Mol Biol 279: 257–270.
- 57. Fant F, Vranken WF, Borremans FA (1999) The three-dimensional solution structure of Aesculus hippocastanum antimicrobial protein 1 determined by 1H nuclear magnetic resonance. Proteins 37: 388–403.
- 58. Bruix M, Jimenez MA, Santoro J, Gonzalez C, Colilla FJ, et al. (1993) Solution structure of gamma 1-H and gamma 1-P thionins from barley and wheat endosperm determined by 1H-NMR: a structural motif common to toxic arthropod proteins. Biochem 32: 715–724.
- 59. Silverstein KA, Moskal WA Jr, Wu HC, Underwood BA, Graham MA, et al. (2007) Small cysteine-rich peptides resembling antimicrobial peptides have been under-predicted in plants. Plant J 51: 262–280.
- 60. Theis T, Stahl U (2004) Antifungal proteins: targets, mechanisms and prospective applications. Cell Mol Life Sci 61: 437–455.
- 61. Terras FRG, Schoofs HME, De Bolle MFF, Van Leuven F, Rees SB, et al. (1992) Analysis of two novel classes of antifungal proteins from radish (Raphanus sativus L.) seeds. Biol Chem 267: 15301–5309.
- 62. Aerts AM, François IEJA, Cammue BPA, Thevissen K (2008) The mode of antifungal action of plant, insect and human defensins. Cell Mol Life Sci 65: 2069–2079.
- 63. Silverstein KA, Graham MA, VandenBosch KA (2006) Novel paralogous gene families with potential function in legume nodules and seeds. Curr Opin Plant Biol 9: 142–146.
- 64. Van de Velde W, Zehirov G, Szatmari A, Debreczeny M, Ishihara H, et al. (2010) Plant peptides govern terminal differentiation of bacteria in symbiosis. Science 327: 1122–1126.
- 65. Fletcher JC, Brand U, Running MP, Simon R, Meyerowitz EM (1999) Signaling of cell fate decisions by CLAVATA3 in Arabidopsis shoot meristems. Science 283: 1911–1914.
- 66. Oelkers K, Goffard N, Weiller GF, Gresshoff PM, Mathesius U, et al. (2008) Bioinformatic analysis of the CLE signaling peptide family. BMC Plant Biol 8: 1.
- 67. Dresselhaus T, Márton ML (2009) Micropylar pollen tube guidance and burst: adapted from defense mechanisms? Curr Opin Plant Biol 12: 773–780.
- 68. Okuda S, Tsutsui H, Shiina K, Sprunck S, Takeuchi H, et al. (2009) Defensin-like polypeptide LUREs are pollen tube attractants secreted from synergid cells. Nature 458: 357–361.
- 69. Ribeiro A, Praekelt U, Akkermans ADL, Meacock PA, van Kammen A, et al. (1996) Identification of agthi1, encoding an enzyme involved in biosynthesis of the thiamine precursor thiazole, in actinorhizal nodules of Alnus glutinosa.. Plant J 10: 361–368.
- 70. Alloisio N, Queiroux C, Fournier P, Pujic P, Normand P, et al. (2010) The Frankia alni symbiotic transcriptome. Mol Plant Microbe Interact 23: 593–607.
- 71. Hoagland DR, Arnon DT (1938) The water-culture method for growing plants without soil. California Agriculture Experiment Station Circular 347. Berkeley, CA: University of California.
- 72. Persson T, Benson DR, Normand P, Vanden Heuvel B, Pujic P, et al. (2011) The genome of Candidatus Frankia datiscae Dg1, the uncultured microsymbiont from nitrogen-fixing root nodules of the dicot Datisca glomerata. J Bacteriol 193: 7017–7018.
- 73. Chomczynski P (1993) A reagent for the single-step simultaneous isolation of RNA, DNA and proteins from cell and tissue samples. Biotechniques 15: 532-534, 536–537.
- 74. Gubler U, Hoffman BJ (1983) A simple and very efficient method for generating cDNA libraries. Gene 25: 263–269.
- 75. Roy A, Kucukural A, Zhang Y (2010) I-TASSER: a unified platform for automated protein structure and function prediction. Nat Protoc 5: 725–738.
- 76. Pfaffl MW (2001) A new mathematical model for reative quantification in real-time RT-PCR. Nucl Acid Res 29: 2002–2007.
- 77. Catoira R, Galera C, de Billy F, Penmetsa RV, Journet EP, et al. (2000) Four genes of Medicago truncatula controlling components of a Nod factor transduction pathway. Plant Cell 12: 1647–1665.
- 78. Lévy J, Bres C, Geurts R, Chalhoub B, Kulikova O, et al. (2004) A putative Ca2+ and calmodulin-dependent protein kinase required for bacterial and fungal symbioses. Science 303: 1361–1364.
- 79. Imaizumi-Anraku H, Takeda N, Charpentier M, Perry J, Miwa H, et al. (2005) Plastid proteins crucial for symbiotic fungal and bacterial entry into plant roots. Nature 433: 527–531.
- 80. Smit P, Raedts J, Portyanko V, Debellé F, Gough C, et al. (2005) NSP1 of the GRAS protein family is essential for rhizobial Nod factor-induced transcription. Science 308: 1789–1791.
- 81. Combier JP, Frugier F, de Billy F, Boualem A, El-Yahyaoui F, et al. (2006) MtHAP2-1 is a key transcriptional regulator of symbiotic nodule development regulated by microRNA169 in Medicago truncatula. Genes Dev 20: 3084–3088.
- 82. Miwa H, Sun J, Oldroyd GE, Downie JA (2006) Analysis of Nod-factor-induced calcium signaling in root hairs of symbiotically defective mutants of Lotus japonicus.. Mol Plant Microbe Interact 19: 914–923.
- 83. Andriankaja A, Boisson-Dernier A, Frances L, Sauviac L, Jauneau A, et al. (2007) AP2-ERF transcription factors mediate Nod factor dependent MtENOD11 activation in root hairs via a novel cis-regulatory motif. Plant Cell 19: 2866–2885.
- 84. Kevei Z, Lougnon G, Mergaert P, Horváth GV, Kereszt A, et al. (2007) 3-hydroxy-3-methylglutaryl coenzyme a reductase 1 interacts with NORK and is crucial for nodulation in Medicago truncatula. Plant Cell 19: 3974–3989.
- 85. Murakami Y, Miwa H, Imaizumi-Anraku H, Kouchi H, Downie JA, et al. (2006) Positional cloning identifies Lotus japonicus NSP2, a putative transcription factor of the GRAS family, required for NIN and ENOD40 gene expression in nodule initiation. DNA Res 13: 255–265.
- 86. Kiss E, Oláh B, Kaló P, Morales M, Heckmann AB, et al. (2009) LIN, a novel type of U-box/WD40 protein, controls early infection by rhizobia in legumes. Plant Physiol 151: 1239–1249.
- 87. Yano K, Yoshida S, Müller J, Singh S, Banba M, et al. (2008) CYCLOPS, a mediator of symbiotic intracellular accommodation. Proc Natl Acad Sci USA 105: 20540–20545.
- 88. Murray JD, Karas BJ, Sato S, Tabata S, Amyot L, et al. (2007) A cytokinin perception mutant colonized by Rhizobium in the absence of nodule organogenesis. Science 315: 101–104.
- 89. Nicholas K, Nichola HB Jr, Deerfield DWII (1997) GeneDoc: analysis and visualization of genetic variation. EMBnet NEWS 4: 14.
- 90. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997) The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl Acids Res 25: 4876–4882.
- 91. Swofford DL (1998) PAUP*: Phylogenetic Analysis Using Parsimony (and Other Methods). Sunderland, MA: Sinauer Associates.