Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Conserved oligomeric Golgi (COG) complex genes functioning in defense are expressed in root cells undergoing a defense response to a pathogenic infection and exhibit regulation my MAPKs

  • Vincent P. Klink ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft

    vincent.klink@usda.gov

    Affiliation USDA ARS NEA BARC Molecular Plant Pathology Laboratory, Beltsville, MD, United States of America

  • Omar Darwish,

    Roles Data curation, Formal analysis, Methodology, Validation

    Affiliation Department of Mathematics Computer Science, Texas Woman’s University, Denton, TX, United States of America

  • Nadim W. Alkharouf,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Supervision, Validation, Visualization

    Affiliation Department of Computer and Information Sciences, Towson University, Towson, MD, United States of America

  • Bisho R. Lawaju,

    Roles Formal analysis, Investigation, Methodology, Validation

    Affiliation Department of Entomology and Plant Pathology, Auburn University, Auburn, AL, United States of America

  • Rishi Khatri,

    Roles Formal analysis, Investigation, Methodology, Validation

    Affiliation Department of Biological Sciences, Mississippi State University, Mississippi, MS, United States of America

  • Kathy S. Lawrence

    Roles Conceptualization, Investigation, Methodology, Supervision, Validation

    Affiliation Department of Entomology and Plant Pathology, Auburn University, Auburn, AL, United States of America

Abstract

The conserved oligomeric Golgi (COG) complex maintains correct Golgi structure and function during retrograde trafficking. Glycine max has 2 paralogs of each COG gene, with one paralog of each gene family having a defense function to the parasitic nematode Heterodera glycines. Experiments presented here show G. max COG paralogs functioning in defense are expressed specifically in the root cells (syncytia) undergoing the defense response. The expressed defense COG gene COG7-2-b is an alternate splice variant, indicating specific COG variants are important to defense. Transcriptomic experiments examining RNA isolated from COG overexpressing and RNAi roots show some COG genes co-regulate the expression of other COG complex genes. Examining signaling events responsible for COG expression, transcriptomic experiments probing MAPK overexpressing roots show their expression influences the relative transcript abundance of COG genes as compared to controls. COG complex paralogs are shown to be found in plants that are agriculturally relevant on a world-wide scale including Manihot esculenta, Zea mays, Oryza sativa, Triticum aestivum, Hordeum vulgare, Sorghum bicolor, Brassica rapa, Elaes guineensis and Saccharum officinalis and in additional crops significant to U.S. agriculture including Beta vulgaris, Solanum tuberosum, Solanum lycopersicum and Gossypium hirsutum. The analyses provide basic information on COG complex biology, including the coregulation of some COG genes and that MAPKs functioning in defense influence their expression. Furthermore, it appears in G. max and likely other crops that some level of neofunctionalization of the duplicated genes is occurring. The analysis has identified important avenues for future research broadly in plants.

Introduction

The conserved oligomeric Golgi (COG) complex maintains the correct Golgi apparatus structure as well as function with a role in retrograde trafficking in eukaryotes [1]. The COG complex performs functions in homeostasis, in particular, regarding protein glycosylation [1, 2]. By virtue of their role in retrograde trafficking, the COG complex has a central cellular role, broadly, in eukaryotes.

The COG complex is composed of 8 subunits that coalesce into A and B sub-complexes [1, 36]. The A sub-complex is composed of COGs1-4 while the B sub-complex is composed of COGs5-8 [1, 7]. Notably, COG complex components interact with other proteins including the soluble N-ethylmaleimide-sensitive factor attachment protein receptor (SNARE) which is a major component of the cellular membrane fusion apparatus [4, 810]. The COG complex functions with several other associated proteins, including Rabs, various tethers containing coiled-coil proteins, as well as molecular motors which facilitate its many functions [4, 9]. Due to its retrograde trafficking role, the COG complex performs a central function in the delivery of materials between the Golgi cisternae.

The initial understanding of the COG complex came from genetic studies made in the experimental model Saccharomyces cerevisiae [3, 11, 12]. The experiments revealed the growth deficiencies of mutants came from Golgi complex impairment involving retrograde trafficking [3, 11, 12]. COG complex genes have also been identified in humans, with their mutations causing various types of disease and growth defects [1315]. In plants, experiments examining the genetic model Arabidopsis thaliana show COG7 mutations impair cell expansion and meristem organization [16]. While much information has been obtained for COG genes in human and S. cerevisiae, very little is known regarding the COG complex in plants with even less understood regarding their role during their pathogenic interactions.

Recent experiments focusing in on COG complex biology occurring during plant pathogenic interactions have been performed on G. max infected with the parasitic nematode Heterodera glycines (soybean cyst nematode [SCN]) [17]. H. glycines is the most economically important pathogen of G. max, accounting for a 7–10% decrease in yield while causing more economic loss than the rest of its pathogens combined so any knowledge on defense is urgently needed [18, 19]. More broadly, information in this pathosystem can aid in understanding plant defense mechanisms in other pathosystems [20]. G. max may show clear signs of H. glycines activity, including stunting and even chlorosis. However, in some cases no adverse signs of parasitism are evident, except a decrease in yield of approximately 15% [21].

The H. glycines life cycle has a 30-day duration, dependent on ambient temperatures [22]. The life cycle of H. glycines begins as a hardened female carcass (cyst) containing 250–500 fertilized eggs, present within the soil and may remain dormant for up to 9 years [22]. Proper conditions lead to egg hatch, liberating second stage juveniles (J2s) which migrate toward and subsequently burrow into the root, slicing through root cells with a rigid, tubular mouth apparatus known as a stylet. This process takes approximately 24 hours for the J2 to reach its site of parasitism [23, 24]. The H. glycines stylet then is used to deliver effectors into a G. max pericycle or neighboring cell that it will parasitize. The H. glycines-parasitized root cell walls then dissolve. The cell walls dissolve due to processes facilitated by the nematode, resulting in 200–250 neighboring root cells becoming incorporated into a common cytoplasm producing a syncytium. The syncytium is also the site of the localized defense responses, driven by plant-mediated processes that include pathogen activated molecular pattern (PAMP) triggered immunity (PTI) and effector triggered immunity (ETI) [17, 2332]. The original H. glycines resistance loci identified in this plant-pathosystem include the recessive rhg1, rhg2 and rhg3 and the dominant Rhg4 and Rhg5 [3335]. The rhg1 locus is the most effective at combating H. glycines parasitism, containing copies of tandemly repeated gene-containing cassettes composed of an amino acid transporter, a wound inducible protein and a membrane fusion protein known as alpha soluble N-ethylmaleimide-sensitive fusion protein (α-SNAP) [28, 36]. The α-SNAP gene has a role in resistance in G. max to H. glycines which is in agreement with observations of how membrane fusion functions in the plant defense to pathogens [29, 3640]. To further exemplify the importance of membrane trafficking to plant defense to pathogens, in particular retrograde trafficking, experiments performed on Hordeum vulgare (wheat) identified a defense role for COG3 (HvCOG3) to fungal infection by Blumeria graminis f.sp. hordei [41]. The result indicated that the COG complex may function broadly in defense across different plant species to different pathogens. To determine a defense role for the G. max COG complex, a genomic analysis using S. cerevisiae COG protein sequences led to the identification of 2 paralogs for each COG gene. Functional studies demonstrated that one of the two paralogs of each COG gene family function in the defense process [17]. Furthermore, seed treatment with the bacterial effector harpin that functions in ETI leads to the induced transcript abundance of COG paralogs that function in the defense process [17]. Experiments have shown the syntaxin 31 homolog of S. cerevisiae, suppressors of the erd2-deletion 5 (Sed5p) which is a SNARE component, binds to Sec17p, COG4 and COG6 [4246]. The G. max syntaxin 31 homolog, SYP38, functions in the defense process to H. glycines with its overexpression co-regulating α-SNAP-5 expression [30]. Lawaju et al. (2020) took those experiments further, showing an increased syntaxin 31 transcript level in each of the transgenic COG overexpressing roots that impair H. glycines parasitism [17].

A shortcoming in the experiments of Lawaju et al. (2020) [17] was the lack of a demonstration of whether any of the COG genes that function in the defense process are actually expressed within the root cells that are parasitized by H. glycines. This knowledge is important because syncytium-transcription has been an important trait in identifying genes functioning in the defense process that G. max has toward H. glycines [28, 29, 32, 47]. In the experiments presented here, transcriptomic data is presented showing the COG gene expression occurring within H. glycines-parasitized root cells undergoing a defense response in the G. max[Peking/PI 548402] and G. max[PI 88788] genotypes that are capable of a defense response to H. glycines. The newly presented data strengthens the functional transgenic data of Lawaju et al. (2020) [17] obtained from COG overexpressing and RNAi roots by showing that the COG paralogs that are expressed within the syncytia undergoing a defense response are those that function in the defense response. Importantly, experiments reveal that one of those genes is a splice variant of a COG complex gene (COG7-2, Glyma.12G013000.2 [COG7-2-b]), other than its primary transcript (Glyma.12G013000.1 [COG7-2-a]), is expressed within the syncytium undergoing the defense response and functions in the defense process. To obtain a basic understanding of G. max COG gene expression, RNA seq data has been extracted from Phytozome and analyzed to examine the relative transcript abundance of these COG complex gene splice variants in RNA samples from leaf, nodule, pod, root, root hair, seed shoot apical meristem (SAM) and stem examined in biological triplicate using Phytomine [4850]. Due to the functionality of the Hordeum vulgare HvCOG3 in defense, COG complex gene family member gene sequence data have been extracted from various genome sources, including their alternate splice variants, from a number of important crop plants [41]. In some cases, extensive numbers of splice variants have been identified from the genomes of these plant species. Prior experiments in the G. max-H. glycines pathosystem have identified the co-regulation of components of the vesicle transport apparatus and mitogen activated protein kinase (MAPK) signaling. Experiments presented here show COG overexpressing and RNAi roots exhibit COG gene co-regulation. Furthermore, prior experiments have shown the specific G. max MAPK genes function during its defense response to H. glycines parasitism [32]. To gain an understanding of signaling processes relating to the G. max COG complex expression, RNA seq analyses, followed by RT-qPCR experiments, determine whether the relative transcript abundance of COG genes becomes affected by MAPK gene overexpression [51].

Materials and methods

COG component gene identification

The genome sequences, assemblies and annotations for A. thaliana, G. max, M. esculenta, Z. mays, O. sativa, T. aestivum, H. vulgare, S. bicolor, B. rapa, S. tuberosum, S. lycopersicum and G. hirsutum are housed at Phytozome (https://phytozome.jgi.doe.gov) [49, 5269]. The genome sequences for E. guineensis (http://gbrowse.mpob.gov.my); S. officinalis (https://sugarcane-genome.cirad.fr/ and B. vulgaris (https://bvseq.boku.ac.at/) have also been mined [59, 7072]. The proteomes have been queried with the conceptually translated A. thaliana COG gene sequences using Arabidopsis thaliana TAIR10. This process has been performed using the Basic Local Alignment Search Tool (BLAST) [73]. Some queries have been performed in Phytozome [49]. In those cases, the default settings have been used. The default parameters include: Target type: Proteome; Program: BLASTP-protein query (BLASTP 2.2.26+) to protein database; Expect (E) threshold: -1; Comparison matrix: BLOSUM62; Word (W) length: default = 3; number of alignments to show: 100 allowing for gaps and filter query. The proteomes analyzed at Phytozome (https://phytozome.jgi.doe.gov) have included Glycine max Wm82.a2.v1 (soybean), Manihot esculenta v6.1 (cassava), Zea mays Ensembl-18 (Maize), Oryza sativa v7_JGI (rice), Triticum aestivum v2.2 (common wheat), Hordeum vulgare r1 (barley), Sorghum bicolor v3.1.1 (Cereal grass), Brassica rapa FPsc v1.3 (Turnip mustard–FasPlant), Solanum lycopersicum iTAG2.4 (Tomato), Solanum tuberosum v4.03 (Potato), Gossypium hirsutum v1.1 (Upland cotton) [53, 5557, 6166, 69, 7476]. Additional proteomes for E. guineensis; S. officinalis and B. vulgaris have also been mined [59, 70, 71]. The oil palm genome has been mined using E. guineensis Genes.faa (v3) employing BLOSUM62 under their default settings [70]. The sugarcane R570 cultivar genome has been mined using BLOSUM62 on the default settings [59]. The sugar beet KWS2320 genotype RefBeet-1.2 proteome has been mined employing BLOSUM62 under their default settings [71]. The analyses have permitted the identification of COG genes and alternate splice variants. G. max transcriptomic data used to determine expression in leaf, nodule, pod, root, root hair, seed, shoot apical meristem (SAM) and stem has been analyzed using Phytomine in Phytozome using default parameters [49]. COG protein motifs were determined using MOTIF (MOTIF: Searching Protein Sequence Motifs (genome.jp) under default settings. Details of PFam (PF) families can be determined at https://pfam.xfam.org/ [77].

Determination of COG complex gene expression occurring during the resistant reaction in G. max

The identification and selection of the G. max COG genes that have been used in the functional transgenic studies has occurred by using the gene expression data of Matsye et al. (2011) [28]. The procedure is summarized here for clarity. Matsye et al. (2011) have performed microarray analyses that have employed the GeneChip Soybean Genome Array (Affymetrix) [28]. In those studies, Matsye et al. (2011) [28] have infected two different G. max genotypes that are each capable of undergoing either a susceptible or resistant reaction with those reactions dependent on the H. glycines genotype used in the infections. Infection of G. max[Peking/PI 548402] and G. max[PI 88788] with H. glycines[race 14/HG-type 1.3.6.7/TN8] leads to a susceptible reaction. In contrast, infection of G. max[Peking/PI 548402] and G. max[PI 88788] with H. glycines[NL1-Rhg/HG-type 7/race 3] leads to a resistant reaction. The pericycle (control) cells collected at 0 days post infection (dpi) and H. glycines-parasitized syncytia undergoing the process of resistance have been collected at 3 and 6 dpi using laser microdissection (LM) [28]. These time points have been selected for specific reasons. The syncytia develop from pericycle and surrounding cells (0 dpi). Syncytia collected at an earlier stage of parasitism (3 dpi) during susceptible or resistant reactions at 3 dpi appear similar cytologically. The similarities include hypertrophy, an increase in endoplasmic reticulum (ER) and ribosome content, an enlargement of nuclei and the development of dense cytoplasm. Consequently, a 6 dpi time point is selected that functions in better differentiating between a susceptible and resistant reaction. By 6 dpi, syncytia undergoing a susceptible reaction exhibit hypertrophy of nuclei and nucleoli, have a reduction and dissolution of the vacuole, experience a proliferation of cytoplasmic organelles and exhibit an increase in cell expansion by incorporating adjacent cells. Conversely, the resistant reaction cytology is genotype-specific. The 6 dpi G. max[Peking/PI 548402] resistant reaction is characterized by cells having cell wall appositions (CWAs). CWAs are structures that develop through actin polarization and vesicle-mediated delivery of cargo aggregate cytoplasmic components. Also, the 6 dpi G. max[Peking/PI 548402] resistant reaction includes the production of a necrotic layer of cells that surrounds the syncytium. The cells undergoing the resistant reaction also accumulate ER, leading to the blockage of H. glycines development at the parasitic J2 stage. The G. max[PI 88788] resistant reaction also has an accumulation of ER, but differs from G. max[Peking/PI 548402] by lacking cell wall appositions and lacking a necrotic layer of cells that surrounds the syncytium during the resistant reaction. The G. max[PI 88788] resistant reaction, however, leads to blockage of H. glycines development at J3-J4 stage [23, 24].

The cDNA probes that have been used in the Affymetrix GeneChip Soybean Genome Array (arrays) microarray hybridizations are made from the 0, 3 and 6 dpi RNA samples. The arrays are composed, in part, of 37,744 G. max probe sets. The probe sets cover 35,611 transcripts. The microarray experiments have been run in triplicate for each G. max genotype and time point under study. Consequently, the experimental process leads to the production of 6 total arrays for each time point (G. max[Peking/PI 548402]: arrays 1–3; G. max[PI 88788]: arrays 1–3). The detection call methodology (DCM) that has been used in the analysis has been implemented in Bioconductor®. The Bioconductor implementation of the standard Affymetrix® microarray DCM analysis consists of four steps. The four steps include (1) saturated probe removal, (2) discrimination score calculation, (3) Wilcoxon’s rank test p-value calculation, and (4) detection call assignment. The quantitative procedure determines whether the expression of a gene is provably different from zero (present [P]), exhibits uncertain measurement (marginal [G]), or is not provably different from zero (absent [A]). Here, a COG gene is considered measured [M] when the probe signal is detectable above threshold (p < 0.05) on all 6 arrays for a given time point. In contrast, the expression of a COG gene is considered not measured (NM) if probe signal is not detected at a statistically significant level (p ≥ 0.05) on any one of the 6 arrays using the Mann–Whitney–Wilcoxon (MWW) Rank-Sum Test which is a nonparametric test of the null hypothesis not requiring the assumption of normal distributions [78]. In some cases, there are genes that have no probe set fabricated onto the microarray. Consequently, gene expression is not determined and is not applicable (n/a). For the microarray analysis that has been performed by Matsye et al. (2011) [28], the Affymetrix annotations are mapped to the original G. max genome release Wm82.a1.v1.1 (2010). This annotation had to be used at that time (2011) because just that annotation had been available. These older annotations have undergone a comparison here to update the accessions to the more recent Glycine max Wm82.a2.v1 (2015) genome assembly and annotation.

COG gene expression in COG overexpressing and RNAi transgenic roots

COG complex gene expression for the targeted COG gene and the remaining COG complex genes were examined by RT-qPCR using RNA isolated in Lawaju et al. (2020) [17]. The RNA isolation procedure is presented here for clarity. The RNA acquiring procedure involved isolation of G. max root mRNA used the UltraClean®Plant RNA Isolation Kit according to the manufacturer’s instructions (Mo Bio Laboratories®, Inc.) [17]. DNase I (Invitrogen®) has been used to remove genomic DNA. The cDNA synthesis from mRNA used the SuperScript®FirstStrand Synthesis System for RT-PCR (Invitrogen) according to the manufacturer’s instructions. The cDNA synthesis reaction employed the oligo d(T) primer according to the manufacturer’s instructions (Invitrogen). Genomic DNA contamination has been assessed using a β-conglycinin primer pair that amplifies DNA across an intron [79]. The PCR reaction yields different sized amplicons based on intron presence or absence [79]. The COG genes examined in Lawaju et al. (2020) [17] are presented here for clarity. In those analyses, Lawaju et al. (2020) [17] isolated RNA from roots individually undergoing overexpression or RNAi for each of the 16 COG G. max complex genes in biological triplicate, including COG1-1 (Glyma.10G201900), COG1-2 (Glyma.20G188500), COG2-1 (Glyma.17G129100), COG2-2 (Glyma.05G047300), COG3-1 (Glyma.13G114900), COG3-2(Glyma.17G045100), COG4-1 (Glyma.19G260100), COG4-2 (Glyma.03G261100), COG5-1 (Glyma.14G029500), COG5-2 (Glyma.02G286300), COG6-1 (Glyma.01G154500), COG6-2 (Glyma.11G090100), COG7-1 (Glyma.09G224000), COG7-2 (Glyma.12G013000), COG8-1 (Glyma.16G120600) and COG 8–2 (Glyma.02G043400). From these Lawaju et al. (2020) [17] analyses, COG1-2, COG2-2, COG3-1, COG4-2, COG5-1, COG6-1, COG7-2 and COG8-1 expression has been analyzed further here by RT-qPCR because they were the paralogs that functioned in defense. The RT-qPCR procedure is presented in the next section. RT-qPCR primers are provided (S1 Table).

RT-qPCR

Confirmation of COG gene expression has been accomplished by RT-qPCR according to Lawaju et al. (2020) [17]. RT-qPCR involved RNA isolated in 2 different prior analyses [17, 32]. Firstly, RNA has been used from COG overexpression and RNAi roots and controls in experiments that demonstrated specific COG genes functioned in defense in the G. max-H. glycines pathosystem [17]. The specific COG overexpressing and RNAi roots were COG1-2, COG2-2, COG3-1, COG4-2, COG5-1, COG6-1, COG7-2 and COG8-1, along with their respective controls. Secondly, the confirmation of COG gene expression has been done on the same RNA used previously in RNA seq analyses of MAPK overexpressing roots and controls because MAPKs are important genes functioning in defense signaling processes that lead to altered transcription in the G. max-H. glycines pathosystem [32, 51]. These MAPK overexpressing roots include MAPK2 (Glyma.06G029700), MAPK3-1 (Glyma.U021800), MAPK 3–2 (Glyma.12G073000), MAPK 4–1 (Glyma.07G066800), MAPK 5–3 (Glyma.08G017400), MAPK6-2 (Glyma.07G206200), MAPK 13–1 (Glyma.12G073700), MAPK16-4 (Glyma.07G255400) and MAPK20-2 (Glyma.14G028100), in comparison to their control. Taqman 6-Carboxyfluorescein (6-FAM) labeled probes and Black Hole Quencher (BHQ1) (MWG Operon) have been used in the analysis (S1 Table) according to the manufacturer’s instructions. A ribosomal S21 (RPS21) protein coding gene (Glyma.15G147700) has been used as the control in the RT-qPCR experiments (S1 Table). The 2-ΔΔCT method of Livak and Schmittgen (2002) has been used to determine the relative change in gene expression caused by the genetic MAPK-OE engineering event as compared to the control [80]. The same approach has been employed for the RNA isolated from the COG-OE and COG-RNAi roots as compared to their respective pRAP15 and pRAP17 controls. A Student’s t-test has been used to calculate the p-values for the replicated RT-qPCR reactions [81]. Experiments and statistical analyses have been performed from 3 independent biological replicates [32].

COG gene expression in MAPK overexpressing transgenic roots

RNA sequencing (RNA seq) data is available as BioProject ID PRJNA664992, Submission ID: SUB8182387 [47, 51]. Single replicate generation of RNA seq data is derived from 9 defense MAPK overexpressing roots whose gene accessions have been presented in the previous section [32]. These MAPK overexpression roots include MAPK2, MAPK3-1, MAPK 3–2, MAPK 4–1, MAPK 5–3, MAPK6-2, MAPK 13–1, MAPK16-4 and MAPK20-2 and the appropriate pRAP15 controls. The data is shown as normalized log2(fold change) with a p-value cutoff of < 0.05.

Results

Identification of G. max COG complex gene expression in H. glycines-parasitized root cells

The COG complex is composed of 8 proteins that regulate endosome-to-trans Golgi network (TGN) retrograde transport (Fig 1). The purpose of the first analysis presented here is to determine whether the G. max COG complex genes that have been shown to function during the resistant reaction to H. glycines parasitism are expressed within the parasitized cells undergoing a defense response [17]. This objective is relevant since the α-SNAP (rhg1) binding protein syntaxin 31 (SYP38) functions in defense in the G. max-H. glycines pathosystem and is known to bind COG4 and COG6 [30, 45]. To facilitate the analysis presented here, protein sequences of the eight COG complex subunits have been identified in A. thaliana and used to query the G. max proteome employing protein BLAST analyses [17]. To compliment the analysis of Lawaju et al. (2020) [17], G. max COG gene paralog accessions have been used to query a database linked to the accompanying Affymetrix microarray probe set. The results of those analyses are the identification that one G. max COG paralog for each of the 8 different COG gene families is expressed in at least one studied time point (0, 3, 6 dpi) samples relating to H. glycines-parasitized root cells undergoing a defense response in two different G. max genotypes that are capable of a defense response (Tables 1 and S2). Consequently, syncytium gene expression could be determined for 13 of the 16 G. max COG genes (Tables 1 and S2). From these analyses, COG1-2, COG2-2, COG4-2, COG5-1, COG6-1 and COG7-2 exhibit expression in analyzed RNA samples that have been obtained from at least one of the studied time point samples occurring during the resistant reaction. COG1-1, COG2-1, COG3-2, COG4-1, COG6-2, COG7-1 and COG8-2 are not observed to be expressed at the 0, 3 or 6 dpi time point samples that have been analyzed. In contrast, COG3-1, COG5-2 and COG8-1 lack probe sets on the Affymetrix microarray so gene expression could not be determined under the analysis procedures. Therefore, it is possible that COG3-1, COG5-2 and COG8-1 exhibit syncytium expression. The results largely corroborate the functional studies presented by Lawaju et al. (2020) [17] already showing that at least one COG gene family paralog functions in the defense response. Notably, COG7-2-b (Glyma.12G013000.2) is an alternative splice variant of the primary transcript COG7-2-a (Glyma.12G013000.1).

thumbnail
Fig 1. The COG complex.

a. Vesicle trafficking occurring between the endosome and trans-Golgi, facilitated by the COG complex. b. Components of the COG complex.

https://doi.org/10.1371/journal.pone.0256472.g001

COG complex gene family structure present in other agriculturally significant plant species

COG complex genes are important to the defense process in other plant species, making a broad understanding of the complex relevant [41]. Recent studies have presented the 10 most significant agricultural plant species [82]. In addition to G. max, these plant species include M. esculenta, Z. mays, O. sativa, T. aestivum, H. vulgare, S. bicolor, B. rapa, E. guineensis and S. officinalis [83]. Analyses are presented here that identify the COG gene families and their structure in these crops and others that are significant components of U.S. agriculture including B. vulgaris, S. tuberosum, S. lycopersicum and G. hirsutum. To accomplish these analyses, A. thaliana COG proteins have been queried into the proteomes of G. max, M. esculenta, Z. mays, O. sativa, T. aestivum, H. vulgare, S. bicolor, B. rapa, S. lycopersicum, S. tuberosum and G. hirsutum found at Phytozome [49]. These analyses have been complimented by querying the A. thaliana COG proteins to the proteomes of E. guineensis, S. officinalis and B. vulgaris [59, 70, 71]. The results of those analyses are the identification of the gene family structure of the COG genes in those plant species (Tables 2 and S3S17). Duplication of some COG genes are observed. Notably, some COG genes are tandemly duplicated (Table 3). These tandemly duplicated COG genes include the M. esculenta COG4 (Manes.05G016300, Manes.05G016400), S. lycopersicon COG3 (Solyc07g017520.2.1, Solyc07g017530.2.1) and G. hirsutum (Gohir.D08G170700, Gohir.D08G170800) (Tables 3 and S3S17). A larger quantity of segmental duplication is observed than tandem duplication (Tables 3 and S3S17). Some conclusions can be drawn from Lawaju et al. (2020) [17] regarding the specialization of function (neofunctionalization) of some of the G. max COG genes. The Lawaju et al. (2020) analysis demonstrated that COG1-2, COG2-2, COG3-1, COG4-2, COG5-1, COG6-2, COG7-2 and COG8-1 have a defense role [17]. Therefore, with regard to neofunctionalization, COG1-2, COG2-2, COG3-1, COG4-2, COG5-1, COG6-2, COG7-2 and COG8-1 have a defense role that COG1-1, COG2-1, COG3-2, COG4-1, COG5-2, COG6-1, COG7-1 and COG8-2 appear to lack. Furthermore, COG3-1, COG4-1 and COG5-1 also have a role in root growth that appears to be lacking in for the other COG genes [17]. COG3-1 and COG4-1 overexpression decreases root mass while COG3-1 and COG4-1 RNAi increases root mass. COG5-1 overexpression and RNAi increase root mass. COG3-1 and COG5-1 also have a defense role while COG4-1 does not, possibly indicating neofunctionalization. Furthermore, COG1-2, COG7-2 and COG8-1 are induced by harpin treatment while COG1-1, COG7-1 and COG8-2 are not, indicating neofunctionalization [17]. COG4-2 and COG5-1 overexpression increases syntaxin 31 transcript abundance while COG4-2 and COG5-1 RNAi decreases syntaxin 31 transcript abundance. This coupled influence on COG4-1 and COG5-2 is not observed, indicating neofunctionalization [17].To obtain a greater understanding of potential neofunctionalization, the COG protein sequences obtained from the plant genomes have been analyzed using MOTIF in cases where clear duplication has occured. T. aestivum was not analyzed here because the protein sequences appear to be largely composed of fragments of unclear nature. Consequently, 8 proteomes have been analyzed, including G. max, B. rapa, G. hirsutum, M. esculenta, E. guineensis, S. tuberosum, S. officinalis and Z. mays (S18 Table). In contrast, H. vulgare, O. sativa, S. bicolor, S. tuberosum and B. vulgaris have not been analyzed since they lacked duplication of their COG genes. Based off the MOTIF-determined annotations, it appears as though some of the G. max COG paralogs have differences in their deduced protein structure which could lead to the differences in properties that have been described previously. For example, COG1-1 has vacuolar protein sorting 51 (Vps51) (PFAM: PF08700), Secretion 5 (Sec5) (PFAM: PF15469), dependent on RIC1 (Dor1) (PF04124) and KxDL (PF10241) domains while COG1-2 only has Vps51 and Dor1 domains. In contrast, COG2-1 and COG2-2 both have COG2 (PF06148), VPS51 and domain of unknown function (DUF) 3510 (DUF3510) (PF12022) domains, indicating they may have similar functions even though COG2-2 is constitutively expressed and functions in defense while COG2-1 is not [3]. Other notable observations for the G. max COG proteins sequences are the presence of a COG3-3 (Glyma.09G134300) and COG6-3 (Glyma.20G085000), apparently the products of gene truncations caused by premature stop codons. Other examples of truncated COG genes were identified in, but not limited to B. rapa COG4 (Brara.E02159) and COG5 (Brara.A01419), S. lycopersicon, COG3 (Solyc07g017530.2.1), G. hirsutum COG4 (Gohir.A06G116700), M. esculentum COG2 (Manes.14G052400, COG4(Manes.05G016300.1 and Manes.05G016400). E. guineensis COG5 (p5.00_sc00515_p0005) and COG7 (p5.00_sc00013_p0068) are truncated in length as compared to their paralogs (S18 Table). T. aestivum had many COG gene fragments which require further confirmation of their nature before they can be adequately assessed here and therefore have not been included in this analysis.

thumbnail
Table 3. COG genes that have experienced duplication in the studied plants.

https://doi.org/10.1371/journal.pone.0256472.t003

COG gene families have alternative splice variants expressed in root cells undergoing defense

The analyses presented in Tables 2 and S3S17 also provide the alternative splice variants for COG genes. This facet of COG gene transcription may be relevant to the ability that plants have in defending themselves from pathogenic attack [41]. The H. vulgare defense gene HvCOG3 (HORVU7Hr1G062190) has 4 alternate splice variants, indicating that perhaps there are specific variants that may confer specialized functions that facilitate the defense role as shown in G. max for COG7-2-b (Glyma.12G013000.2) [17]. To examine COG complex splice variant structure, the G. max COG gene expression data available in Phytozome has been mined using Phytomine and examined for whether the RNA seq data confirms the expression of these alternate splice variants. The results show the expression of each COG splice variant in relation to 9 different tissue types, including leaf, nodule, pod, root, root hair, seed, shoot apical meristem (SAM) and stem (Fig 2). Each sample type, except for the root hair sample is a plant organ so a clear understanding of the individual cellular expression profiles could not be performed. Unfortunately, since these data have been obtained from a public data base (Phytozome), it was not possible to examine these same RNA samples using RT-qPCR. An examination of the G. max microarray data in relation to these splice variants show that specific splice variant data is available for some of the COG genes. The only COG gene whose alternate splice variant has been shown to function in defense is COG7-2, splice variant 2 (COG7-2-b [Glyma.12G013000.2]). However, in that analysis, each individual G. max COG gene splice variant has not been examined in functional analyses because they were beyond the scope of that analysis and the analysis presented here of [17].

thumbnail
Fig 2. COG RNA seq expression abundance in different tissue types in G. max.

* along with the red arrowhead indicates the gene functions in defense (Lawaju et al. 2020 [17]). In this image, the spice variants are labeled. For example, COG7-2-a is Glyma.12G013000.1, COG7-2-b is Glyma.12G013000.2 and COG7-2-c is Glyma.12G013000.3. COG7-2-b is Glyma.12G013000.2 is the examined splice variant that functions in defense (Lawaju et al. 2020 [17]). Gene expression data has been obtained from Phytomine in Phytozome (Libult et al. 2010; Goodstein et al. 2012 [49]; Wang et al. 2019).

https://doi.org/10.1371/journal.pone.0256472.g002

In contrast, RNA seq analysis have identified that some COG genes, including specific splice variants, are not expressed in the G. max MAPK overexpression roots as compared to controls (S19 Table). Some of these splice variants are expressed in either some or all of the examined tissue types including COG2-1, COG3-1, COG4-1, COG5-2 and COG7-2 in seed, flower, nodules, root, SAM, root hair, leaves, pods and stems [49]. COG2-1 (Glyma.17G129100.4) is expressed in all of the sample types, but has not been observed in syncytia undergoing a defense response under the analysis procedures, nor tested for a function in the defense process [17, 49]. COG3-1 (Glyma.13G114900.3) is not expressed in nodules, roots, SAM and stems (S19 Table) [49]. COG3-1 expression in syncytia could not be determined due to the analysis procedures, but does function in the defense process G. max has to H. glycines [17]. COG5-2 (Glyma.02G286300.2) is expressed in all samples except in seed and is not expressed in syncytia and does not function in the G. max defense process to H. glycines (S19 Table) [17, 49]. The COG7-2 (Glyma.12G013000.3) is not expressed in roots or stems and has not been examined in functional experiments, testing if it functions in defense in G. max to H. glycines (S19 Table) [17, 49].

COG genes exhibit co-regulated expression

Experiments show that co-regulated gene expression exists between SNARE genes in the G. max-H. glycines pathosystem [30, 38]. Roots undergoing COG complex gene overexpression or RNAi have had their RNA isolated from whole transgenic roots, unlike the prior experiments examining gene expression of specific cells (pericycle and syncytia) collected by LM [17]. The RNA has been used in a series of RT-qPCR experiments examining the level of expression for each COG complex component shown to function in the defense process (Fig 2). In some cases, co-regulated gene expression is observed whereby the overexpression/RNAi of one COG gene influences the relative transcript abundance of another COG gene while the affected COG gene, when engineered for overexpression or RNAi affects the relative transcript abundance of the other COG gene in the examined pair in the same manner. Similar results have been obtained in the G. max-H. glycines pathosystem for other genes functioning in vesicle transport and specific MAPKs [30, 32, 38]. In the experiments presented here, roots engineered to undergo overexpression or RNAi of the COG genes have already been produced and had their RNA isolated [17]. Those samples have been used to examine by RT-qPCR the relative transcript abundance of the COG gene targeted for transgenic overexpression or RNAi, along with the other COG genes that function in the defense response (Fig 3).

thumbnail
Fig 3. RT-qPCR analysis of COG gene expression in COG-OE and COG-RNAi transgenic roots.

The COG transgenic roots are a. COG1-2 (Glyma.20G188500), b. COG2-2 (Glyma.05G047300), c. COG3-1 (Glyma.13G114900), d. COG4-2 (Glyma.03G261100), e. COG5-1 (Glyma.14G029500), f. COG6-1 (Glyma.01G154500), g. COG7-2 (Glyma.12G013000) and h. COG8-1 (Glyma.16G120600) in comparison to the appropriate pRAP15 and pRAP17 controls. The control gene is RPS21 (Glyma.15G147700), Lawaju et al. 2020 [17]. The 2-ΔΔCT method has been used to determine the relative change in COG gene expression (the RT-qPCR target) caused by the COG-OE or COG-RNAi genetic engineering event as compared to the control (Livak and Schmittgen 2002). *Statistically significant, Student’s t-test p < 0.05.

https://doi.org/10.1371/journal.pone.0256472.g003

COG gene expression in defense MAPK overexpressing roots

Analyses show that 9 G. max MAPKs out of the 32 occurring in the G. max genome function in the defense response to H. glycines [32]. To obtain an understanding of the potential regulation of COG gene expression relating to defense in the G. max-H. glycines pathosystem, RNA-seq data has been generated from RNA isolated from whole roots overexpressing the defense MAPKs and their control. The results are presented (Fig 4). In certain cases, MAPK overexpression leads to an increase in relative transcript abundance of at least 1.5 fold, p < 0.05, of certain COG genes when examining the RNA seq data. Induced COG1-2 gene expression is observed in MAPK2-OE (1.99 fold), MAPK3-1-OE (1.61 fold), MAPK3-2-OE (1.61 fold), MAPK4-1-OE (1.67 fold), MAPK5-3-OE (1.63 fold) and MAPK20-2-OE (1.74 fold) roots as compared to their controls. The relative transcript abundance of COG1-2 in the MAPK6-2-OE (1.34 fold), MAPK13-3-OE (1.07 fold) and MAPK16-4-OE (0.43 fold) roots are statistically significant (p < 0.05), but did not meet the criteria of the level of induced expression (1.5 fold or greater) as compared to their controls. Similar results are observed in the RT-qPCR experiments. In the RT-qPCR analyses, increased COG1-2 transcript abundance is observed in the MAPK2-OE (2.07 fold), MAPK3-1-OE (1.77 fold), MAPK3-2-OE (1.75 fold), MAPK4-1-OE (1.81 fold), MAPK5-3-OE (1.71 fold), MAPK6-2-OE (1.54 fold) and MAPK20-2-OE (1.91 fold) roots as compared to their controls. The relative transcript abundance of COG1-2 in the MAPK13-1-OE (1.13 fold) and MAPK16-4-OE (1.11 fold) roots did not meet either of the induced expression criteria as compared to their controls.

thumbnail
Fig 4. Relative transcript abundance of COG genes in MAPK-overexpressing roots.

a. COG1-1 analyzed by RNA seq, b. COG1-2 RNA analyzed by RT-qPCR, c. COG2-2 analyzed by RNA seq, d. COG2-2 analyzed by RT-qPCR, e. COG3-1 analyzed by RNA seq, f. COG3-1 analyzed by RT-qPCR, g. COG4-2 analyzed by RNA seq. h. COG4-2 analyzed by RT-qPCR. i. COG5-1 analyzed by RNA seq, j. COG5-1 analyzed by RT-qPCR, k. COG6-1 analyzed by RNA seq, l. COG6-1 analyzed by RT-qPCR, m. COG7-2-b analyzed by RNA seq, n. COG7-2-b analyzed by RT-qPCR, o. COG8-1 analyzed by RNA seq, p. COG8-2 analyzed by RT-qPCR. Single replicate RNA seq analyses have been performed of RNA isolated from MAPK overexpressing roots. These results have been confirmed by RT-qPCR. The MAPK overexpressing roots include MAPK2 (Glyma.06G029700), MAPK3-1 (Glyma.U021800), MAPK 3–2 (Glyma.12G073000), MAPK 4–1 (Glyma.07G066800), MAPK 5–3 (Glyma.08G017400), MAPK6-2 (Glyma.07G206200), MAPK 13–1 (Glyma.12G073700), MAPK16-4 (Glyma.07G255400) and MAPK20-2 (Glyma.14G028100) and the appropriate pRAP15 control. The RNA seq data is shown as normalized log2(fold change) with a p-value cutoff of < 0.05. The RT-qPCR data is shown after employing the 2-ΔΔCT method of Livak and Schmittgen (2002) to determine the relative change in COG gene expression caused by the MAPK-OE genetic engineering event as compared to the control. *Statistically significant and meeting the 1.5 fold induced criteria, Student’s t-test p < 0.05. **Statistically significant and meeting the 1.5 fold induced criteria in RNA seq and RT-qPCR analyses.

https://doi.org/10.1371/journal.pone.0256472.g004

RNA seq analyses identify induced COG4-2 gene expression to be statistically significant (p < 0.05) in the MAPK4-1-OE roots, but did not meet the criteria (≥ 1.5 fold) for induced expression (1.35 fold) as compared to their controls. However, the RT-qPCR analyses determine change in COG4-2 gene expression is statistically significant and meeting the ≥ 1.5 fold criteria in the MAPK4-1-OE roots (1.51 fold) as compared to controls.

RNA seq analyses identify that induced COG6-1 gene expression is statistically significant (p < 0.05) in the MAPK2-OE and MAPK3-1-OE roots as compared to their controls, but did not meet the criteria (≥ 1.5 fold) for induced expression (1.29 fold and 1.48 fold, respectively). However, the RT-qPCR analyses determine COG4-2 gene expression is induced at a statistically significant level and meeting the ≥ 1.5 fold criteria in the MAPK2-OE (1.53 fold) and MAPK3-1-OE (1.58 fold) roots as compared to their controls.

RNA seq analyses identify induced COG7-2-b gene expression is statistically significant (p < 0.05) only in the MAPK3-1-OE roots (1.5 fold) as compared to their controls. In contrast, RNA seq analyses identify induced COG7-2-b gene expression is statistically significant (p < 0.05) in the MAPK2-OE, MAPK5-3-OE and MAPK6-2-OE roots, but did not meet the criteria (≥ 1.5 fold) for induced expression (1.42, 1.29, 1.42 fold, respectively) as compared to their controls. However, the RT-qPCR analyses determine COG7-2-b gene expression is induced at a statistically significant level and meeting the ≥ 1.5 fold criteria in the MAPK2-OE (1.61 fold), MAPK3-1-OE (1.63 fold), MAPK5-3-OE (1.53 fold) and MAPK6-2-OE (1.59 fold) roots as compared to their controls.

Discussion

The goal of the analysis presented here is to analyze data to complement previous studies involving functional transgenic experiments performed in G. max, examining the role that its COG genes have during its defense response to H. glycines parasitism [17]. That goal, obtained in the analysis presented here, allows for an understanding of the COG genes more broadly across different plant species by revealing the expression they have prior to and during the G. max defense process [17, 41]. The experiments of Lawaju et al. (2020) [17] identify the existence of two COG complex paralogs for each of the 8 G. max COG genes. The experiments of Lawaju et al. (2020) [17] then demonstrated that only one of the two paralogs of each COG gene family functions in the defense process that G. max has toward H. glycines parasitism. However, in all, one COG gene of each of the 8 COG complex gene families functions in defense [17]. The role that the COG complex, as a vesicle transport component, performs in homeostasis may have broad implications regarding plant responses to pathogens in general, newly emerging pathogens and to climate change, making this study important [17, 8285]. Because of these roles, analyses done to identify COG genes and their potential splice variants in other important crop species and obtaining an understanding of their regulated expression have been done here.

The relationship between COG complex genes and defense

The presented results are important from the standpoint that the syntaxin 31 homolog of S. cerevisiae, Sed5p, binds to Sec17p, COG4 and COG6 [4346]. Notably, the G. max Sec17p homolog, α-SNAP-5, is stated as being the major H. glycines resistance gene rhg1 although the locus is complex in nature [2830, 38, 39]. Therefore, the results presented here clearly link the vesicle transport system and membrane fusion apparatus to the defense process that G. max has toward H. glycines parasitism [40]. Understanding where, how and why various genes are expressed during the defense process and the ordering of the expression of those genes will provide needed insight into the cellular processes that underlie resistance and the functionality of the COG complex not only in G. max, but broadly in different plant species [86, 87].

Expression of the COG complex genes during the resistant reaction

A comparative analysis of the G. max COG genome accessions identified here is made to accessions accompanying previously reported gene expression patterns occurring within the root cells relating to its resistant reaction (syncytium) to H. glycines. Analyses presented here determine that 13 of the 16 G. max COG complex genes (81.25%) had Affymetrix probe sets on the GeneChip Soybean Genome Array, including at least one COG complex component from each of its 8 gene families [88]. The analyses presented here then identify COG complex gene expression occurring within the syncytium undergoing a resistant reaction. The analyses show that COG2-2, COG4-2 and COG5-1 have measurable expression within the pericycle and surrounding cells (control) at 0 dpi prior to H. glycines infestation of the soil. However, by 6 dpi which would be at a time point occurring as the resistant reaction is concluding, measurable COG gene expression within the syncytium is observed for COG1-2, COG2-2, COG4-2, COG5-1, COG6-1 and COG7-2-b. Consequently, expression is detected within the cells undergoing a resistant reaction for members of 6 of the 8 COG gene families. In contrast, the detection of expression for G. max COG3-1 and COG8-1 paralogs could not be made under the analysis procedures due to the lack of probe sets on the microarray, remaining avenues of research for future study.

The COG complex gene expression identified here to be occurring within certain cell types indicates that these genes may serve a basic function in the cell biology of G. max. This observation is in agreement with the original observations made in S. cerevisiae for Sec35p (COG2), Sec38p (COG4) and Cod4p (COG5) [3, 11, 12, 89]. In contrast, the G. max COG1-2 (Sec36p/Cod3p), COG6-1 (Sec37p/Cod2p) and COG7—b (Cod5p) exhibit measurable amounts of gene expression only at 6 dpi. Therefore, it appears that unlike the other COG complex genes, COG1-2, COG6-1 and COG7-2-b may exhibit a level of gene regulation that is related specifically to the development of the resistant reaction in G. max in the syncytium. Studies performed on H. vulgare demonstrate HvCOG3 functions during the resistant reaction to fungal penetration into the host cell [41]. Consequently, it is not without precedent that multi-subunit structures requiring all of its components are important for the integrity of the structure. For example, the exocyst functions upstream of membrane fusion at the tethering stage of vesicle transport in relation to SNARE which acts downstream at docking stage [90, 91]. The exocyst requires all 8 of its component parts for the functionality of the structure [92]. Not surprising, the elimination of even one component leads to the loss of function of the structure [92]. Consistent with this observation, the expression of each component of the G. max exocyst, like its COG complex, is important to its defense process to H. glycines parasitism [17, 93].

In A. thaliana, its COG7 ortholog, embryo yellow (EYE) gene functions in the maintenance of the meristem, indicating a specialized role in its cellular biology and metabolism [16]. The eye mutants are bushy, have SAMs with aberrant organization and have an altered composition of their cell walls [16]. This is an important observation because in G. max the secreted, hemicellulose-modifying gene xyloglucan endotransglycosylase/hydrolase (XTH), XTH43, is one of the most highly expressed genes in the syncytium undergoing a defense response [28]. XTH43 also has a significant role in the resistant reaction to H. glycines [30]. XTH43 increases xyloglucan (XyG) content, shortens XyG chains and makes more of those shorter chains while it can also be expressed in other plants (i.e. G. hirsutum) to generate a defense response to the root knot nematode Meloidogyne incognita where one does not exist [20, 40]. XTH is targeted to the Golgi apparatus prior to its secretion into the apoplast where it functions in cell wall modification [9496]. The Golgi apparatus, thus, serves prominently in processes involving cell wall modification, requiring the import of enzymes and glycoproteins from the ER to the Golgi via transition vesicles [97, 98]. However, the synthesis of XyG and modification of XyG, itself, occurs in the Golgi apparatus, first in the cisternae then moving to the medial- and trans- Golgi as XyG matures [99, 100]. Transport of the matrix polysaccharides and enzymes to the cell membrane then occurs through secretory vesicles [101]. In the experiments presented here 7 of the targeted COG complex genes are shown to have probe sets fabricated onto the microarray, but those genes did not exhibit measurable amounts of expression (i.e., COG1-1, COG2-1, COG3-2, COG4-1, COG6-2, COG7-1 and COG8-2). The remaining 3 G. max COG complex genes (COG3-1, COG5-2 and COG8-1) did not have corresponding probe sets fabricated onto the array, complicating an understanding of the relationship of these COG complex genes to the resistant reaction under study here. Since the HvCOG3 has been shown to function in the resistant reaction in wheat, it was going to be important to the understanding of the complex to analyze the entire G. max COG complex in transgenic functional analyses to obtain a clear understanding of the structure in relation to the resistant reaction to H. glycines parasitism as shown in Lawaju et al. (2020) [17].

The complexity of the COG complex gene families

The COG complex is an integrated structure made challenging to understand because of the intricate nature of the plant genome, with all plant genomes believed to have undergone polyploidization events [102]. These events are then followed by rearrangement and/or reduction that have various effects on growth and development [103]. For example, A. thaliana has experienced 3 genome duplication events referred to as paleopolyploidy [104, 105]. During its evolutionary history there was an initial paleohexaploidy event that occurred in the asterales and rosids, followed by a paleotetraploidy event that was limited to the Brassicales [104, 105]. Subsequent genome rearrangement and reduction then occurred [104, 105]. While the genome of the A. thaliana ancestors underwent these duplication events, it is functionally diploid (2n = 10) [52]. The diploid nature of the A. thaliana genome is reflected in a single COG gene existing for each of the 8 COG members [17, 52]. A. thaliana COG complex protein sequences have been used to mine the available genomes of agriculturally important crops on a world-wide scale and then some more specific to the U.S. The proteome mining of several plant genomes with the A. thaliana COG sequences is consistent with the diploid nature of other species presented here. O. sativa (2n = 24), H. vulgare (2n = 14), S. tuberosum (double monoploid Phureja DM1-3 516 R44, 2n = 24), S. bicolor (2n = 20) and the B. vulgaris (2n = 18) KWS2320 reference genomes are diploid [64, 71, 106, 107]. However, S. tuberosum is typically tetraploid (2n = 4x = 48) and B. vulgaris can also be triploid [64, 71]. Results of the proteomic analyses are consistent with the diploid nature of O. sativa, H. vulgare, B. vulgaris and S. tuberosum in that they lack duplication of any of their COG genes. Some of the plants that have been studied here are also diploids, but have a limited amount of COG gene duplication. For example, S. lycopersicon (2n = 24) has a duplication limited only to COG3 with the rest of the COG genes not existing as duplicated gene families [108, 109]. The duplicated SlCOG3 is not the product of localized gene amplification, a process shown to be important in generating plant defense capabilities [36]. E. guineensis is diploid (2n = 32), but has duplications of COG1, 2, 4, 5 and 7. It is possible that further analysis of the E. guineensis genome may reveal its genetic structure is more complicated with various levels of polyploidy since its current coverage is at 79% [72]. Evidence has been presented that E. guineensis has experienced 2 polyploidization events, one that is lineage specific and one that is more evolutionarily broad, found in commelinid plants [61]. The remainder of the plants under study, G. max, Z. mays, T. aestivum, B. rapa, S. officinalis and G. hirsutum are polyploid. G. max, (2n = 40) is an allotetraploid with each of its COG genes being at least duplicated [62]. B. rapa, as a member of the Brassicaceae, has shared the same evolutionary history as A. thaliana, but with the addition of a whole-genome triplication (WGT) event that is believed to have occurred, resulting in a mesohexaploid [68, 110, 111]. The diploid B. rapa genome (2n = 2x = 20) has duplicated COG1, 2, 4, 5 and 8 [112]. Z. mays (2n = 20) is a replicated diploid having undergone a whole genome duplication as a paleopolyploid with a subsequent duplication that differentiates it from S. bicolor [113]. Z. mays, consequently unlike S. bicolor, has 2 copies of COG1, 2, 4 and 8 with the rest of the gene families having a single copy. T. aestivum, a hexaploid having 2 copies of its AB and D genomes (2n = 6x = 42), has duplicated COG genes for each of its family members in multiples of 3 except for COG5 (4 copies) and COG6 (5 copies) [114]. The confirmation and annotation of these genes appears to require more work since some of the duplicated gene sequences appear to be fragments that are not the result of premature stop codons. The S. officinalis genome is a polyploid (2n = 8x = 80) with the generation of an accurate genome has been hampered by its polyploid nature [115]. Currently, COG3 is understood as being duplicated. COG1, 2, 4, 5 and 7 are not duplicated. Homologs of COG6 and COG8 could not be identified even in BLASTP searches of the available sequences deposited in Genbank. The G. hirsutum, an allotetraploid (2n = 4x = 52), possessing A and D genomes, has multiples of 2 COG genes composing each family except COG6 which has 3 copies [61, 85]. The results presented here show that species understood to be diploid have a single COG gene for each family member except in very few cases, a duplication. In contrast, species understood to have genome duplications and, in particular, more recent duplications have more extensively duplicated copies of its COG genes. In the examples provided here, the hexaploid T. aestivum stands out in mostly having multiples of 3 COG genes per family. As stated, more work needs to be done to understand these genes better. The results point out another interesting feature of the COG genes that appears to be the maintenance of a fixed number of COG genes in relation to the different gene families in the diploid species where at least the maintenance of gene duplication events appears not to be occurring. In contrast, more complicated features of the COG genes that relate to their transcription (alternate splicing) and possible protein functions, in particular in the polyploid species, have been determined and will be addressed in the next section.

Localized COG gene duplication is uncommon

The results show that localized, tandem duplication of the COG genes is uncommon. Notably, G. max did not experience tandem duplication of its COG genes as observed for some of its other defense genes [36]. However, M. esculenta had tandemly duplicated COG4. G. hirsutum had tandem duplication of COG6. S. lycopersicon COG3 was tandemly duplicated. The nature of these tandem duplications remains unclear.

Alternate splicing of COG genes

The results presented here so far have focused in on identifying the numbers of COG genes in the studied plants. During the course of the proteomic studies, a number of COG protein products that are the outcomes of possible RNA splicing have also been identified. For example A. thaliana COG1 has 4 alternate splice variants AT5G16300.1, AT5G16300.2, AT5G16300.3 and AT5G16300.4. Expression experiments for AT5G16300.1 show it is expressed in low quantities in leaves treated with ammonia, leaves treated with urea, and high in roots treated with nitrate [49]. Unfortunately, the RNA seq experiments did not differentiate between the transcript abundance for the different splice variants. Alternative splicing is known to occur as a stress response and reaction to various stimuli with experiments showing that in A. thaliana 22–30% of intron-containing genes undergo alternative splicing [116118]. Mutants of COG8 impair proper splicing in humans, leading to congenital disease [119]. Recent experiments performed in A. thaliana show that MAPK3, MAPK4 and MAPK6 regulate pathogen activated molecular pattern (PAMP)-induced differentially alternative spliced events through alternative splicing of splicing factors, themselves, and protein kinases that are related to immunity are altered [120]. The G. max MAPK3, 4 and 6 all function in its defense response to H. glycines parasitism [32]. From these studies, a model by which the splice variants could function in the defense process is presented (Fig 5). Pathogen effectors capable of binding to G. max membrane fusion proteins that function in defense have been identified [121]. Consequently, the impairment of binding through changes in the primary structure of the plant protein occurring through alternate splicing of its mRNA could occur.

thumbnail
Fig 5. Model. A.

The COG complex functions in defense. B. A pathogen effector alters the functionality of the COG complex, leading to susceptibility. C. The COG complex composition becomes altered with a splice variant (COG7-2-b*, Glyma.12G013000.2) which alters the ability of the pathogen effector to bind, restoring the ability of the COG complex to function in defense, leading to a resistant reaction. VM, vesicle membrane; GM, Golgi membrane. The position of the COG proteins in relation to the Golgi and vesicle membranes does not imply specific interactions.

https://doi.org/10.1371/journal.pone.0256472.g005

Transcriptomic data has been useful in identifying genes that function in defense in other plant pathosystems. The analyses presented here have extracted data on splice variants of important crop species to the U.S. and globally. In most cases, the number of variants is low or have not been adequately studied or confirmed by RNA seq analyses. For example in B. vulgaris, only a single transcript type has been identified for each COG gene. It is possible that the B. vulgaris genome has not been examined extensively enough to identify all of its COG gene RNA splice variants since even reduced genomes like A. thaliana have multiple splice variants. In the most extensive example of COG gene splice variants identified in the analysis presented here, H. vulgare HORVU7Hr1G107700 (COG7) has 31 different splice variants. Splice variants perform important functions in plants and alternate splicing occurs during defense to pathogen attack [122]. The diversity of COG gene splice variants could be an important feature to consider when examining the role pathogen effectors may have on plant multiprotein complexes [121, 123, 124]. COG7-2-b appears to be a COG gene that is typically expressed to low levels, consistent with it not being the primary transcript. However, its expression occurs in a specialized cell type (syncytia) undergoing a defense response. Consequently, perhaps COG7-2-b has a function that is important to the defense response, explaining why it is otherwise expressed at low levels in whole roots. A similar expression profile is shown for COG8-1-a which functions in defense as compared to COG8-1-b. The results demonstrate that there is much left to be learned regarding the splice variants and their biological function(s).

COG genes exhibit some co-regulated expression

Analyses reveal that α-SNAP and SYP38 expression level influences each other’s relative transcript abundance as revealed in overexpression and RNAi experiments [30]. This concept has been further examined in other SNARE components that function in defense in the H. glycines pathosystem [38]. The experiments presented here reveal that several COG components also exhibit co-regulated gene expression. The results expand on the knowledge of genes functioning in the defense in the G. max-H. glycines pathosystem that experience co-regulation. Related experiments show that MAPKs functioning during defense in this pathosystem also are co-regulated [32]. MAPKs, as an important signaling platform, could be expected to greatly influence gene expression occurring during biotic stress [32].

COG gene expression is influenced by MAPK expression

McNeece et al. (2019) [32] performed a functional transgenic analysis of the 32 members of the G. max MAPK gene family, finding that 9 of them function in its defense response to H. glycines parasitism. Subsequent RNA seq analyses of RNA isolated from the 9 defense MAPK-OE roots have been performed [51]. Analyses of the RNA seq data presented here has led to the identification that some cases, G. max COG gene expression is influenced by the defense MAPKs. The results have been confirmed by RT-qPCR analyses. The genetic analyses of McNeece et al. (2019) [32] have determined that NON-RACE-SPECIFIC DISEASE RESISTANCE1 (NDR1) and BOTRYTIS INDUCED KINASE1 (BIK1), functioning in ETI and PTI, respectively, converge on the MAPK network to induce the transcription of defense genes that themselves have been proven to function in the defense process. More recent experiments presented by Klink et al. (2021) [25] have shown G. max BRI1-ASSOCIATED RECEPTOR KINASE 1 (BAK1) overexpression increases MAPK3 transcript anundance while BAK1 RNAi decreases MAPK3 transcript abundance as compared to controls. Furthermore, G. max BAK1 overexpression decreases H. glycines parasitism by 67% while BAK1 RNAi increases it by 4.9 fold as compared to controls [25]. The result implies specific pathogen recognition receptors alone or in combination play important roles in the defense response that G. max has to H. glycines [25]. The result is consistent with the involvement of the BAK1-interacting cytoplasmic kinase BIK1 in the G. max defense response to H. glycines parasitism [30]. The experiments presented here reveal that those MAPK-induced genes include COG complex genes that function in in G. max during its defense response to H. glycines. These observations have broad implications. The results presented here may aid in the determination of genetic platforms that underlie defense in other important crop plants [125, 126]. An interesting finding from these results is the identification that MAPK-OE samples, including MAPK2, MAPK3-1, MAPK3-2, MAPK4, MAPK5-3, MAPK6-2, and MAPK20-2, measure induced levels of expression of COG1-2. COG1 binds to the Golgi face and is the point of attachment for the other COG proteins of the A and B subcomplexes [1, 36]. Another interesting observation is the induced expression of COG7-2-b in the MAPK2, MAPK3-1 and MAPK6-2 overexpression lines, MAPKs that are important defense components in other plant systems. COG7 binds to the vesicle surface. Consequently, it is possible that having a sufficient amount of the point of attachment for vesicles (COG7) is an important for functionality under certain circumstances like plant defense.

The G. max COG gene family has already been presented and, unlike the diploid A. thaliana, has two paralogs of each gene which is consistent with its allotetrapoid nature [17, 62]. Polyploidization plays an important role in plant evolution and in particular, many agricultural crops are known to be polyploid in nature [127, 128]. The analysis presented here shows that the crop plants under study here exhibit various levels of duplication of their COG gene family members. In addition to this gene duplication, there appears to exist multiple splice variants for each of the COG genes. The G. max splice variant COG7-2-b which functions in defense to H. glycines, appears to exhibit that alternative splice variants are expressed in specific cell types at certain times of a defense response and are important to the defense process [17]. The results may have important implications for understanding basic aspects of the COG gene families of the other studied significant crop plants presented here including M. esculenta, Z. mays, O. sativa, T. aestivum, H. vulgare, S. bicolor, B. rapa, E. guineensis, S. officinalis, S. tuberosum, S. lycopersicum, G. hirsutum and B. vulgaris. In humans, a mutation in a splice site of COG8 results in a congenital disease, demonstrating that proper splicing of COG genes relates to their functionality [119]. Experiments aimed at understanding the diversity of these alternate splice variants and the regulation of their expression, in particular, using cell-type specific procedures should provide important insight and a tool to understand their biological role(s).

Supporting information

S1 Table. RT-qPCR primers used in the analysis.

https://doi.org/10.1371/journal.pone.0256472.s001

(XLSX)

S2 Table. Gene expression of control and syncytium cells relating to Fig 2.

https://doi.org/10.1371/journal.pone.0256472.s002

(XLSX)

S3 Table. COG gene information for A. thaliana.

https://doi.org/10.1371/journal.pone.0256472.s003

(XLSX)

S5 Table. COG gene information for M. esculenta.

https://doi.org/10.1371/journal.pone.0256472.s005

(XLSX)

S6 Table. COG gene information for Z. mays.

https://doi.org/10.1371/journal.pone.0256472.s006

(XLSX)

S7 Table. COG gene information for O. sativa.

https://doi.org/10.1371/journal.pone.0256472.s007

(XLSX)

S8 Table. COG gene information for T. aestivum.

https://doi.org/10.1371/journal.pone.0256472.s008

(XLSX)

S9 Table. COG gene information for H. vulgare.

https://doi.org/10.1371/journal.pone.0256472.s009

(XLSX)

S10 Table. COG gene information for S. bicolor.

https://doi.org/10.1371/journal.pone.0256472.s010

(XLSX)

S11 Table. COG gene information for B. rapa.

https://doi.org/10.1371/journal.pone.0256472.s011

(XLSX)

S12 Table. COG gene information for E. guineensis.

https://doi.org/10.1371/journal.pone.0256472.s012

(XLSX)

S13 Table. COG gene information for S. officinalis.

https://doi.org/10.1371/journal.pone.0256472.s013

(XLSX)

S14 Table. COG gene information for B. vulgaris.

https://doi.org/10.1371/journal.pone.0256472.s014

(XLSX)

S15 Table. COG gene information for S. lycopersicon.

https://doi.org/10.1371/journal.pone.0256472.s015

(XLSX)

S16 Table. COG gene information for S. tuberosum.

https://doi.org/10.1371/journal.pone.0256472.s016

(XLSX)

S17 Table. COG gene information for G. hirsutum.

https://doi.org/10.1371/journal.pone.0256472.s017

(XLSX)

S18 Table. COG protein motifs for selected plant species used in the analyses, including G. max, B. rapa, G. hirsutum, M. esculenta, E. guineensis, S. tuberosum, S. officinalis and Z. mays.

Only plants with duplicated COG genes are analysed for the purpose of identifying protein domains that may be indicative of neofunctionalization of paralogs. Details of PFam (PF) families can be determined at https://pfam.xfam.org/.

https://doi.org/10.1371/journal.pone.0256472.s018

(XLSX)

S19 Table. COG gene expression data for alternative splice variants not expressed in any of the G. max MAPK-OE roots, but whose expression data is obtained from Phytomine from leaf, nodule, pod, root, root hair, seed, SAM, and stem (Goodstein et al. 2012).

https://doi.org/10.1371/journal.pone.0256472.s019

(XLSX)

Acknowledgments

VK is thankful to Robert L. Nichols for his mentorship over the years. VK is also thankful to Gary W. Lawrence for mentorship and plant pathology expertise. VK is appreciative to Aaron R. Klink Sr. for his mentorship leading through to this work.

References

  1. 1. Ungar D, Oka T, Brittle EE, Vasile E, Lupashin V V., Chatterton JE, et al. Characterization of a mammalian Golgi-localized protein complex, COG, that is required for normal Golgi morphology and function. J Cell Biol. 2002;157: 405–415. pmid:11980916
  2. 2. Pokrovskaya ID, Willett R, Smith RD, Morelle W, Kudlyk T, Lupashin V V. Conserved oligomeric Golgi complex specifically regulates the maintenance of Golgi glycosylation machinery. Glycobiology. 2011;21: 1554–1569. pmid:21421995
  3. 3. Whyte JRC, Munro S. The Sec34/35 Golgi transport complex is related to the exocyst, defining a family of complexes involved in multiple steps of membrane traffic. Dev Cell. 2001;1: 527–537. pmid:11703943
  4. 4. Willett R, Ungar D, Lupashin V. The Golgi puppet master: COG complex at center stage of membrane trafficking interactions. Histochem Cell Biol. 2013;140: 271–283. pmid:23839779
  5. 5. Ungar D, Oka T, Vasile E, Krieger M, Hughson FM. Subunit architecture of the conserved oligomeric Golgi complex. J Biol Chem. 2005;280: 32729–32735. pmid:16020545
  6. 6. Fotso P, Koryakina Y, Pavliv O, Tsiomenko AB, Lupashin V V. Cog1p plays a central role in the organization of the yeast conserved oligomeric Golgi complex. J Biol Chem. 2005;280: 27613–27623. pmid:15932880
  7. 7. Blackburn JB D’Souza Z, Lupashin V V. Maintaining order: COG complex controls Golgi trafficking, processing, and sorting. FEBS Lett. 2019;593: 2466–2487. pmid:31381138
  8. 8. Söllner T, Whiteheart SW, Brunner M, Erdjument-Bromage H, Geromanos S, Tempst P, et al. SNAP receptors implicated in vesicle targeting and fusion. Nature. 1993;362: 318–324. pmid:8455717
  9. 9. Cottam NP, Ungar D. Retrograde vesicle transport in the Golgi. Protoplasma. 2012;249: 943–955. pmid:22160157
  10. 10. Jahn R, Fasshauer D. Molecular machines governing exocytosis of synaptic vesicles. Nature. 2012. pp. 201–207. pmid:23060190
  11. 11. VanRheenen SM, Cao X, Lupashin V V., Barlowe C, Waters MG. Sec35p, a novel peripheral membrane protein, is required for ER to Golgi vesicle docking. J Cell Biol. 1998;141: 1107–1119. pmid:9606204
  12. 12. Ram RJ, Li B, Kaiser CA. Identification of Sec36p, Sec37p, and Sec38p: Components of yeast complex that contains Sec34p and Sec35p. Mol Biol Cell. 2002;13: 1484–1500. pmid:12006647
  13. 13. Climer LK, Dobretsov M, Lupashin V. Defects in the COG complex and COG-related trafficking regulators affect neuronal Golgi function. Front Neurosci. 2015;9: 405. pmid:26578865
  14. 14. Moremen KW, Tiemeyer M, Nairn A V. Vertebrate protein glycosylation: Diversity, synthesis and function. Nature Reviews Molecular Cell Biology. 2012. pp. 448–462. pmid:22722607
  15. 15. Smith RD, Lupashin V V. Role of the conserved oligomeric Golgi (COG) complex in protein glycosylation. Carbohydrate Research. 2008. pp. 2024–2031. pmid:18353293
  16. 16. Ishikawa T, Machida C, Yoshioka Y, Ueda T, Nakano A, Machida Y. EMBRYO YELLOW gene, encoding a subunit of the conserved oligomeric Golgi complex, is required for appropriate cell expansion and meristem organization in Arabidopsis thaliana. Genes to Cells. 2008;13: 521–535. pmid:18422605
  17. 17. Lawaju BR, Niraula PM, Lawrence GW, Lawrence KS, Klink VP. The Glycine max conserved oligomeric golgi (COG) complex functions during a defense response to Heterodera glycines. Front Plant Sci. 2020;11: 1–17. pmid:32117356
  18. 18. Wrather JA, Stienstra WC, Koenning SR. Soybean disease loss estimates for the United States from 1996 to 1998. Can J Plant Pathol. 2001;23: 122–131.
  19. 19. Wrather JA, Koenning SR. Estimates of disease effects on soybean yields in the United States 2003 to 2005. J Nematol. 2006;38: 173–180. pmid:19259444
  20. 20. Niraula PM, Lawrence KS, Klink VP. The heterologous expression of a soybean (Glycine max) xyloglucan endotransglycosylase/hydrolase (XTH) in cotton (Gossypium hirsutum) suppresses parasitism by the root knot nematode Meloidogyne incognita. PLoS One. 2020;15: e0235344. pmid:32628728
  21. 21. Wang J, Niblack TL, Tremain JA, Wiebold WJ, Tylka GL, Marett CC, et al. Soybean cyst nematode reduces soybean yield without causing obvious aboveground symptoms. Plant Dis. 2003;87: 623–628. pmid:30812850
  22. 22. Lauritis JA, Rebois R V, Graney LS. Development of Heterodera glycines Ichinohe on Soybean, Glycine max (L.) Merr., under Gnotobiotic Conditions. J Nematol. 1983;15: 272–81. pmid:19295802
  23. 23. Endo BY. Ultrastructure of initial responses of susceptible and resistant soybean roots to infection by Heterodera glycines. Rev Nétnatol i. 1991;4: 73–94.
  24. 24. Endo B. Histological responses of resistant and susceptible soybean varieties, and backcross progeny to entry development of Heterodera glycines. Phytopathology. 1965;55: 375–381.
  25. 25. Klink VP, Darwish O, Alkharouf NW, Lawrence KS. The impact of pRAP vectors on plant genetic transformation and pathogenesis studies including an analysis of BRI1-ASSOCIATED RECEPTOR KINASE 1 (BAK1)-mediated resistance. J Plant Interact. 2021;16: 270–283.
  26. 26. Ross J. Host-Parasite relationship of the soybean cyst nematode in resistant soybean roots. Phytopathology. 1958;48: 578–579.
  27. 27. Jones JDG, Dangl JL. The plant immune system. Nature. 2006;444: 323–329. pmid:17108957
  28. 28. Matsye PD, Kumar R, Hosseini P, Jones CM, Tremblay A, Alkharouf NW, et al. Mapping cell fate decisions that occur during soybean defense responses. Plant Mol Biol. 2011;77: 513–528. pmid:21986905
  29. 29. Matsye PD, Lawrence GW, Youssef RM, Kim KH, Lawrence KS, Matthews BF, et al. The expression of a naturally occurring, truncated allele of an α-SNAP gene suppresses plant parasitic nematode infection. Plant Mol Biol. 2012;80: 131–155. pmid:22689004
  30. 30. Pant SR, Matsye PD, McNeece BT, Sharma K, Krishnavajhala A, Lawrence GW, et al. Syntaxin 31 functions in Glycine max resistance to the plant parasitic nematode Heterodera glycines. Plant Mol Biol. 2014;85: 107–121. pmid:24452833
  31. 31. McNeece BT, Pant SR, Sharma K, Niruala P, Lawrence GW, Klink VP. A Glycine max homolog of NON-RACE SPECIFIC DISEASE RESISTANCE 1 (NDR1) alters defense gene expression while functioning during a resistance response to different root pathogens in different genetic backgrounds. Plant Physiol Biochem. 2017;114: 60–71. pmid:28273511
  32. 32. McNeece BT, Sharma K, Lawrence GW, Lawrence KS, Klink VP. The mitogen activated protein kinase (MAPK) gene family functions as a cohort during the Glycine max defense response to Heterodera glycines. Plant Physiol Biochem. 2019;137: 25–41. pmid:30711881
  33. 33. Caldwell BE, Brim CA, Ross JP. Inheritance of Resistance of Soybeans to the Cyst Nematode, Heterodera Glycines 1. Agron J. 1960;52: 635–636.
  34. 34. Matson A, Williams L. Evience of a fourth gene for resistance to the soybean cyst nematode. Crop Sci. 1965;544: 427–433.
  35. 35. Rao-Arrelli AP. Inheritance of resistance to Heterodera glycines race 3 in soybean accessions. Plant Dis. 1994;78: 898–900.
  36. 36. Cook D, Lee T, Guo X. Copy number variation of multiple genes at Rhg1 mediates nematode resistance in soybean. Science (80-). 2012;338: 1206–1209.
  37. 37. Collins NC, Thordal-Christensen H, Lipka V, Bau S, Kombrink E, Qiu JL, et al. SNARE-protein-mediated disease resistance at the plant cell wall. Nature. 2003;425: 973–977. pmid:14586469
  38. 38. Sharma K, Pant SR, McNeece BT, Lawrence GW, Klink VP. Co-regulation of the Glycine max soluble N-ethylmaleimide-sensitive fusion protein attachment protein receptor (SNARE)-containing regulon occurs during defense to a root pathogen. J Plant Interact. 2016;11: 74–93.
  39. 39. Lakhssassi N, Liu S, Bekal S, Zhou Z, Colantonio V, Lambert K, et al. Characterization of the Soluble NSF Attachment Protein gene family identifies two members involved in additive resistance to a plant pathogen. Sci Rep. 2017;7: 1–11. pmid:28127051
  40. 40. Niraula PM, Zhang X, Jeremic D, Lawrence KS, Klink VP. Xyloglucan endotransglycosylase/hydrolase increases tightly-bound xyloglucan and chain number but decreases chain length contributing to the defense response that Glycine max has to Heterodera glycines. PLoS One. 2021;16: e0244305. pmid:33444331
  41. 41. Ostertag M, Stammler J, Douchkov D, Eichmann R, Hückelhoven R. The conserved oligomeric Golgi complex is involved in penetration resistance of barley to the barley powdery mildew fungus. Mol Plant Pathol. 2013;14: 230–240. pmid:23145810
  42. 42. Novick P, Field C, Schekman R. Identification of 23 complementation groups required for post-translational events in the yeast secretory pathway. Cell. 1980;21: 205–215. pmid:6996832
  43. 43. Hardwick KG, Pelham HRB. SED5 encodes a 39-kD integral membrane protein required for vesicular transport between the ER and the Golgi complex. J Cell Biol. 1992;119: 513–521. pmid:1400588
  44. 44. Lupashin V V, Pokrovskaya ID, McNew JA, Waters MG. Characterization of a novel yeast SNARE protein implicated in golgi retrograde traffic. Mol Biol Cell. 1997;8: 2659–2676. pmid:9398683
  45. 45. Shestakova A, Suvorova E, Pavliv O, Galimat K, Vladimir L. Interaction of the conserved oligomeric Golgi complex with t-SNARE Syntaxin5a/Sed5 enhances intra-Golgi SNARE complex stability. J Cell Biol. 2007;179: 1179–1192. pmid:18086915
  46. 46. Bubeck J, Scheuring D, Hummel E, Langhans M, Viotti C, Foresti O, et al. The syntaxins SYP31 and SYP81 control ER-Golgi trafficking in the plant secretory pathway. Traffic. 2008;9: 1629–1652. pmid:18764818
  47. 47. Niraula PM, Sharma K, McNeece BT, Troell HA, Darwish O, Alkharouf NW, et al. Mitogen activated protein kinase (MAPK)regulated genes with predicted signal peptides function in the Glycine max defense response to the root pathogenic nematode Heterodera glycines. PLoS One. 2020;15: e0241678. pmid:33147292
  48. 48. Libault M, Farmer A, Brechenmacher L, Drnevich J, Langley RJ, Bilgin DD, et al. Complete transcriptome of the soybean root hair cell, a single-cell model, and its alteration in response to Bradyrhizobium japonicum infection. Plant Physiol. 2010;152: 541–552. pmid:19933387
  49. 49. Goodstein D, Shu S, Howson R. Phytosome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40: D1178–D1186. pmid:22110026
  50. 50. Wang J, Hossain MS, Lyu Z, Schmutz J, Stacey G, Xu D, et al. SoyCSN: Soybean context-specific network analysis and prediction based on tissue-specific transcriptome data. Plant Direct. 2019;3: 1. pmid:31549018
  51. 51. Alshehri HA, Alkharouf NW, Darwish O, McNeece BT, Klink VP. MAPKDB: A MAP kinase database for signal transduction element identification. Bioinformation. 2019;15: 338–341. pmid:31249436
  52. 52. Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408: 796–815. pmid:11130711
  53. 53. Saski CA, Scheffler BE, Hulse-Kemp AM, Liu B, Song Q, Ando A, et al. Sub genome anchored physical frameworks of the allotetraploid Upland cotton (Gossypium hirsutum L.) genome, and an approach toward reference-grade assemblies of polyploids. Sci Rep. 2017;7: 1–14. pmid:28127051
  54. 54. Bredeson J V., Lyons JB, Prochnik SE, Wu GA, Ha CM, Edsinger-Gonzales E, et al. Sequencing wild and cultivated cassava and related species reveals extensive interspecific hybridization and genetic diversity. Nat Biotechnol. 2016;34: 562–570. pmid:27088722
  55. 55. Mascher M, Gundlach H, Himmelbach A, Beier S, Twardziok SO, Wicker T, et al. A chromosome conformation capture ordered sequence of the barley genome. Nature. 2017;544: 427–433. pmid:28447635
  56. 56. Beier S, Himmelbach A, Colmsee C, Zhang XQ, Barrero RA, Zhang Q, et al. Construction of a map-based reference genome sequence for barley, Hordeum vulgare L. Sci Data. 2017;4: 1–24. pmid:28448065
  57. 57. McCormick RF, Truong SK, Sreedasyam A, Jenkins J, Shu S, Sims D, et al. The Sorghum bicolor reference genome: improved assembly, gene annotations, a transcriptome atlas, and signatures of genome organization. Plant J. 2018;93: 338–354. pmid:29161754
  58. 58. Zimin A V., Puiu D, Hall R, Kingan S, Clavijo BJ, Salzberg SL. The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum. Gigascience. 2017;6: 1–7. pmid:29069494
  59. 59. Garsmeur O, Droc G, Antonise R, Grimwood J, Potier B, Aitken K, et al. A mosaic monoploid reference sequence for the highly complex genome of sugarcane. Nat Commun. 2018;9: 4300. pmid:30327463
  60. 60. Zhang L, Cai X, Wu J, Liu M, Grob S, Cheng F, et al. Improved Brassica rapa reference genome by single-molecule sequencing and chromosome conformation capture technologies. Hortic Res. 2018;5: 50. pmid:30131865
  61. 61. Wang M, Tu L, Yuan D, Zhu D, Shen C, Li J, et al. Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense. Nature Genetics. 2019. pp. 224–229. pmid:30510239
  62. 62. Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010;463: 178–183. pmid:20075913
  63. 63. Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, et al. The TIGR Rice Genome Annotation Resource: Improvements and new features. Nucleic Acids Res. 2007;35: D883–D887. pmid:17145706
  64. 64. The potato genome consortium. The potato genome sequncing consortium: The potato genome consortium (participants are listed alphabetically by institution-BGI-Shenzhen et al.). Genome sequence and analysis of the tuber crop potato. Nature. 2011;475: 189–195. pmid:21743474
  65. 65. Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, et al. The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011;43: 1035–1040. pmid:21873998
  66. 66. Tomato Genome Constortium. The tomato genome sequence provides insights into fleshly fruit evolution. Nature. 2012;485: 635–641. pmid:22660326
  67. 67. Lamesch P, Berardini TZ, Li D, Swarbreck D, Wilks C, Sasidharan R, et al. The Arabidopsis Information Resource (TAIR): Improved gene annotation and new tools. Nucleic Acids Res. 2012;40: D1202–D1210. pmid:22140109
  68. 68. Consortium TB rapa GSP. The genome of the mesopolyploid crop sciences Brassica rapa. Nat Genet. 2011;43: 1035–1039. pmid:21873998
  69. 69. Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol. 2015;33: 531–537. pmid:25893781
  70. 70. Singh R, Ong-Abdullah M, Low ETL, Manaf MAA, Rosli R, Nookiah R, et al. Oil palm genome sequence reveals divergence of interfertile species in old and new worlds. Nature. 2013;500: 335–339. pmid:23883927
  71. 71. Dohm J, Minoche A, Holtgrawe D. The genome of the recently domesticated crop plant sugar beet (Beta vulgaris). Nature. 2014;505: 546–549. pmid:24352233
  72. 72. Ong AL, Teh CK, Mayes S, Massawe F, Appleton DR, Kulaveerasingam H. An improved oil palm genome assembly as a valuable resource for crop improvement and comparative genomics in the Arecoideae subfamily. Plants. 2020;9: 1–15. pmid:33152992
  73. 73. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215: 403–410. pmid:2231712
  74. 74. Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, et al. The Sorghum bicolor genome and the diversification of grasses. Nature. 2009;457: 551–556. pmid:19189423
  75. 75. Consortium IWGS. A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science (80-). 2014;345: 1251788–1251789. pmid:25035500
  76. 76. Zhang T, Zheng Q, Yi X, An H, Zhao Y, Ma S, et al. Establishing RNA virus resistance in plants by harnessing CRISPR immune system. Plant Biotechnol J. 2018;16: 1415–1423. pmid:29327438
  77. 77. Mistry J, Chuguransky S, Williams L, Qureshi M, Salazar GA, Sonnhammer ELL, et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 2021;49: D412–D419. pmid:33125078
  78. 78. Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics. 1947. pp. 50–60.
  79. 79. Klink VP, Kim KH, Martins V, MacDonald MH, Beard HS, Alkharouf NW, et al. A correlation between host-mediated expression of parasite genes as tandem inverted repeats and abrogation of development of female Heterodera glycines cyst formation during infection of Glycine max. Planta. 2009;230: 53–71. pmid:19347355
  80. 80. Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2 C T method. Methods. 2001;25: 402–408. pmid:11846609
  81. 81. Yuan JS, Reed A, Chen F, Stewart CN. Statistical analysis of real-time PCR data. BMC Bioinformatics. 2006;7: 1–12. pmid:16393334
  82. 82. Tilman D, Balzer C, Hill J, Befort BL. Global food demand and the sustainable intensification of agriculture. Proc Natl Acad Sci. 2011;108: 20260–20264. pmid:22106295
  83. 83. Ray DK, West PC, Clark M, Gerber JS, Prishchepov A V., Chatterjee S. Climate change has likely already affected global food production. PLoS One. 2019;14: e0217148. pmid:31150427
  84. 84. González-Villagra J, Leonid •, Kurepin V, Reyes-Díaz MM. Evaluating the involvement and interaction of abscisic acid and miRNA156 in the induction of anthocyanin biosynthesis in drought-stressed plants. Planta. 2017;246: 299–312. pmid:28534253
  85. 85. Wang J, Fedoseienko A, Chen B, Burstein E, Jia D, Billadeau DD. Endosomal receptor trafficking: Retromer and beyond. Traffic. 2018. pp. 578–590. pmid:29667289
  86. 86. Novick P, Ferro S, Schekman R. Order of events in the yeast secretory pathway. Cell. 1981;25: 461–469. pmid:7026045
  87. 87. Klink VP, Sharma K, Pant SR, McNeece B, Niraula P, Lawrence GW. Components of the SNARE-containing regulon are co-regulated in root cells undergoing defense. Plant Signal Behav. 2017;12. pmid:28010187
  88. 88. Klink V, Overall C, Alkharouf N, MacDonald M, Matthews B. Laser capture microdissection (LCM) and comparative microarray expression analysis of syncytial cells isolated from incompatible and compatible soybean (Glycine max) roots infected by the soybean cyst nematode (Heterodera glycines). Planta. 2007;226: 1389–1409. pmid:17668236
  89. 89. Suvorova ES, Duden R, Lupashin V V. The Sec34/Sec35p complex, a Ypt1p effector required for retrograde intra-Golgi trafficking, interacts with Golgi SNAREs and COPI vesicle coat proteins. J Cell Biol. 2002;157: 631–643. pmid:12011112
  90. 90. Mizuno-Yamasaki E, Rivera-Molina F, Novick P. GTPase networks in membrane traffic. Annu Rev Biochem. 2012;81: 637–659. pmid:22463690
  91. 91. Guo W, Roth D, Walch-Solimena C, Novick P. The exocyst is an effector for Sec4p, targeting secretory vesicles to sites of exocytosis. EMBO J. 1999;18: 1071–1080. pmid:10022848
  92. 92. TerBush DR, Maurice T, Roth D, Novick P. The Exocyst is a multiprotein complex required for exocytosis in Saccharomyces cerevisiae. EMBO J. 1996;15: 6483–6494. pmid:8978675
  93. 93. Sharma K, Niraula PM, Troell HA, Adhikari M, Alshehri HA, Alkharouf NW, et al. Exocyst components promote an incompatible interaction between Glycine max (soybean) and Heterodera glycines (the soybean cyst nematode). Sci Rep. 2020;10: 15003. pmid:32929168
  94. 94. Fry S. The structure and functions of xyloglucan. J Exp Bot. 1989;40: 1–11.
  95. 95. Fry SC, Smith RC, Renwick KF, Martin DJ, Hodge SK, Matthews KJ. Xyloglucan endotransglycosylase, a new wall-loosening enzyme activity from plants. Biochem J. 1992;282: 821–828. pmid:1554366
  96. 96. Nishitani K, Tominaga R. Endo-xyloglucan transferase, a novel class of glycosyltransferase that catalyzes transfer of a segment of xyloglucan molecule to another xyloglucan molecule. J Biol Chem. 1992;267: 21058–21064. pmid:1400418
  97. 97. Handford M, Rodríguez-Furlán C, Marchant L, Segura M, Gómez D, Alvarez-Buylla E, et al. Arabidopsis thaliana AtUTr7 encodes a golgi-localized UDP-glucose/UDP- galactose transporter that affects lateral root emergence. Mol Plant. 2012;5: 1263–1280. pmid:22933714
  98. 98. Zhao W, Colley KJ. Nucleotide sugar transporters of the Golgi apparatus. Golgi Appar State Art 110 Years after Camillo Golgi’s Discov. 2008; 190–206.
  99. 99. Cocuron J-C, Lerouxel O, Drakakaki G, Alonso AP, Liepman AH, Keegstra K, et al. A gene from the cellulose synthase-like C family encodes a beta-1,4 glucan synthase. Proc Natl Acad Sci U S A. 2007;104: 8550–5. pmid:17488821
  100. 100. Chevalier L, Bernard S, Ramdani Y, Lamour R, Bardor M, Lerouge P, et al. Subcompartment localization of the side chain xyloglucan-synthesizing enzymes within Golgi stacks of tobacco suspension-cultured cells. Plant J. 2010;64: 977–989. pmid:21143678
  101. 101. Kim SJ, Brandizzi F. The plant secretory pathway for the trafficking of cell wall polysaccharides and glycoproteins. Glycobiology. 2016;26: 940–949. pmid:27072815
  102. 102. Paterson AH, Bowers JE, Chapman BA. Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci U S A. 2004;101: 9903–9908. pmid:15161969
  103. 103. Doyle JJ, Coate JE. Polyploidy, the nucleotype, and novelty: The impact of genome doubling on the biology of the cell. Int J Plant Sci. 2019;180: 1–52.
  104. 104. Bowers JE, Chapman BA, Rong J, Paterson AH. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003;422: 433–438. pmid:12660784
  105. 105. Blanc G, Wolfe KH. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004;16: 1667–1678. pmid:15208399
  106. 106. Xu X, Pan S, Cheng S, Zhang B, Mu D, Ni P, et al. Genome sequence and analysis of the tuber crop potato. Nature. 2011;475: 189–195. pmid:21743474
  107. 107. Price HJ, Dillon SL, Hodnett G, Rooney WL, Ross L, Johnston JS. Genome evolution in the genus Sorghum (Poaceae). Annals of Botany. 2005. pp. 219–227. pmid:15596469
  108. 108. Peralta I, Spooner D, Knapp S. Taxonomy of wild tomatoes and their relatives (Solanum sect. Lycopersicoides, sect. Juglandifolia, sect. Lycopersicon; Solanaceae). Am Soc Plant Taxon. 2008;84: 151–160.
  109. 109. Jenkins JA. The origin of the cultivated tomato. Econ Bot. 1948;2: 379–392.
  110. 110. Town CD, Cheung F, Maiti R, Crabtree J, Haas BJ, Wortman JR, et al. Comparative genomics of Brassica oleracea and Arabidopsis thaliana reveal gene loss, fragmentation, and dispersal after polyploidy. Plant Cell. 2006;18: 1348–1359. pmid:16632643
  111. 111. Yang Y-W, Lai K-N, Tai P-Y, Li W-H. Rates of Nucleotide Substitution in Angiosperm Mitochondrial DNA Sequences and Dates of Divergence Between Brassica and Other Angiosperm Lineages. J Mol Evol. 1999;48: 597–604. pmid:10198125
  112. 112. Lu K, Wei L, Li X, Wang Y, Wu J, Liu M, et al. Whole-genome resequencing reveals Brassica napus origin and genetic loci involved in its improvement. Nat Commun. 2019;10: 1–12. pmid:30602773
  113. 113. Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, et al. The B73 maize genome: Complexity, diversity, and dynamics. Science (80-). 2009;326: 1112–1115.
  114. 114. Dubcovsky J, Luo M-C, Zhong G-Y, Bransteitter R, Desai A, Kilian A, et al. Genetic Map of Diploid Wheat, Triticum monococcum L., and Its Comparison With Maps of Hordeum vulgare L. Genetics. 1996;143: 983–999. pmid:8725244
  115. 115. Grivet L, Glaszmann J, DHont A. Molecular evidence of sugarcane evolution and domestication, " in Darwins Harvest: New approaches to the origins, evolution and conservation of crops. Columbia Univ Press. 2006; 49–66.
  116. 116. Campbell MA, Haas BJ, Hamilton JP, Mount SM, Robin CR. Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics. 2006;7: 1–17. pmid:16403227
  117. 117. Wang BB, Brendel V. Genomewide comparative analysis of alternative splicing in plants. Proc Natl Acad Sci U S A. 2006;103: 7175–7180. pmid:16632598
  118. 118. Barbazuk WB, Fu Y, McGinnis KM. Genome-wide analyses of alternative splicing in plants: Opportunities and challenges. Genome Res. 2008;18: 1382–1391. pmid:18669480
  119. 119. Arora V, Puri RD, Bhai P, Sharma N, Bijarnia–Mahay S, Dimri N, et al. The first case of antenatal presentation in COG8-congenital disorder of glycosylation with a novel splice site mutation and an extended phenotype. Am J Med Genet Part A. 2019;179: 480–485. pmid:30690882
  120. 120. Bazin J, Mariappan K, Jiang Y, Blein T, Voelz R, Crespi M, et al. Role of MPK4 in pathogen-associated molecular pattern-triggered alternative splicing in Arabidopsis. PLoS Pathog. 2020;16: e1008401. pmid:32302366
  121. 121. Bekal S, Domier LL, Gonfa B, Lakhssassi N, Meksem K, Lambert KN. A SNARE-like protein and biotin are implicated in soybean cyst nematode virulence. PLoS One. 2015;10: e0145601. pmid:26714307
  122. 122. Mandadi KK, Scholthof KBG. Genome-wide analysis of alternative splicing landscapes modulated during plant-virus interactions in Brachypodium distachyon. Plant Cell. 2015;27: 71–85. pmid:25634987
  123. 123. Staiger D, Korneli C, Lummer M, Navarro L. Emerging role for RNA-based regulation in plant immunity. New Phytologist. 2013. pp. 394–404. pmid:23163405
  124. 124. Gonzalez TL, Liang Y, Nguyen BN, Staskawicz BJ, Loqué D, Hammond MC. Tight regulation of plant immune responses bycombining promoter and suicide exon elements. Nucleic Acids Res. 2015;43: 7152–7161. pmid:26138488
  125. 125. Cooke DA. Pests. The sugar beet Crop. 1993. pp. 429–483.
  126. 126. Srivastava SN. Management of sugarbeet diseases. Fruit and vegetable diseases. 2006. pp. 307–355.
  127. 127. Soltis DE, Albert VA, Leebens-Mack J, Bell CD, Paterson AH, Zheng C, et al. Polyploidy and angiosperm diversification. Am J Bot. 2009;96: 336–348. pmid:21628192
  128. 128. Tang H, Bowers JE, Wang X, Paterson AH. Angiosperm genome comparisons reveal early polyploidy in the monocot lineage. Proc Natl Acad Sci U S A. 2010;107: 472–477. pmid:19966307