The centromere, on which kinetochore proteins assemble, ensures precise chromosome segregation. Centromeres are largely specified by the histone H3 variant CENP-A (also known as Cse4 in yeasts). Structurally, centromere DNA sequences are highly diverse in nature. However, the evolutionary consequence of these structural diversities on de novo CENP-A chromatin formation remains elusive. Here, we report the identification of centromeres, as the binding sites of four evolutionarily conserved kinetochore proteins, in the human pathogenic budding yeast Candida tropicalis. Each of the seven centromeres comprises a 2 to 5 kb non-repetitive mid core flanked by 2 to 5 kb inverted repeats. The repeat-associated centromeres of C. tropicalis all share a high degree of sequence conservation with each other and are strikingly diverged from the unique and mostly non-repetitive centromeres of related Candida species: Candida albicans, Candida dubliniensis, and Candida lusitaniae. Using a plasmid-based assay, we further demonstrate that pericentric inverted repeats and the underlying DNA sequence provide a structural determinant in CENP-A recruitment in C. tropicalis, as opposed to epigenetically regulated CENP-A loading at centromeres in C. albicans. Thus, the centromere structure and its influence on de novo CENP-A recruitment have been significantly rewired in closely related Candida species. Strikingly, the centromere structural properties, along with the role of pericentric repeats in de novo CENP-A loading, in C. tropicalis are more reminiscent of those of the distantly related fission yeast Schizosaccharomyces pombe. Taken together, we demonstrate, for the first time, fission yeast-like repeat-associated centromeres in an ascomycetous budding yeast.
Centromeres aid in high fidelity chromosome segregation. Paradoxically, centromere DNA sequences are rapidly evolving in fungi, plants, and animals. Centromere DNA sequences in fungi can be unique in each chromosome or share conserved features such as motifs for sequence specific protein binding, pericentric repeats, or transposon-rich elements. Ascomycetous fungi, in particular, show a wide range of diversity in centromere sequence elements. However, no ascomycetous budding yeast species is known to possess repeat-associated centromeres in all of its chromosomes. Here, we identified and mapped all seven centromeres in an ascomycete, a rapidly emerging human pathogenic yeast, Candida tropicalis. The repeat-associated centromeres of highly homogeneous DNA sequences in C. tropicalis are significantly diverged from the mostly non-repetitive unique centromeric DNA sequences of its closely related sequenced species, Candida albicans, Candida dubliniensis and Candida lusitaniae. Structurally, the centromeres of C. tropicalis more closely resemble those of the distantly related fission yeast Schizosaccharomyces pombe. Thus, we discover rapidly diverging repeat-associated centromeres in an ascomycetous budding yeast and provide evidence of emergence of repeat-associated centromeres via two independent evolutionary events in ascomycetous fungi.
Citation: Chatterjee G, Sankaranarayanan SR, Guin K, Thattikota Y, Padmanabhan S, Siddharthan R, et al. (2016) Repeat-Associated Fission Yeast-Like Regional Centromeres in the Ascomycetous Budding Yeast Candida tropicalis. PLoS Genet 12(2): e1005839. doi:10.1371/journal.pgen.1005839
Editor: Beth A. Sullivan, Duke University, UNITED STATES
Received: August 4, 2015; Accepted: January 11, 2016; Published: February 4, 2016
Copyright: © 2016 Chatterjee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The sequencing data generated by the ChIP-seq experiments in this study have been submitted to the NCBI under the accession number SUB432163. The sequencing data generated by Sanger sequencing of Scnt 1, Scnt 7 and Scnt 9 have been deposited to the NCBI with following accession numbers KJ398406, KJ425116 and KJ398405 respectively.
Funding: This work is supported by a grant from DBT (Grant number: BT/PR14840/BRB/10/880/2010), Govt. of India and intramural funding of JNCASR to KS. RS acknowledges the PRISM project funded by DAE at his institution. GC was a senior research fellow of CSIR, Govt. of India, SRS is a senior research fellow supported by JNCASR and KG acknowledges SPM fellowship from CSIR, Govt. of India. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
The high fidelity segregation of replicated chromosomes to daughter cells during cell division is essential in maintaining genome integrity. It is achieved by a dynamic and well-coordinated kinetochore-microtubule interaction on a specialized chromosomal element, known as the centromere. Strikingly, the centromere DNA shows rapid diversification in its sequence, length, and the organization of sequence elements across different species [1–3]. The centromere has been categorized into point and regional primarily based on its length. In addition, there are kinetochore protein complexes which are associated specifically to either the point or regional centromere . Point centromeres, which are typically <400 bp long with conserved DNA elements (CDEs) but lacking DNA sequence repeats, appear to have evolved only once and are restricted to the Saccharomyces lineage . However, the centromeres of most other organisms are regional in nature and span from as small as a few tens of kilobases (kb) as in fission yeast Schizosaccharomyces pombe to as large as multiple megabases (Mb) in length as observed in plants and animals. The large regional centromeres of most plants (reviewed in [5, 6]) and animals (reviewed in ) are composed of an array of either repetitive sequences or transposable elements. A classic example is the human centromeres that are organized as 171 bp monomeric repeats arranged into a higher ordered alpha satellite sequence (reviewed in ). The regional centromeres of two ascomycetous fungi, Neurospora crassa and S. pombe, and a basidiomycetous fungus Cryptococcus neoformans are much shorter (40 to 300 kb in length) and composed of either transposon-rich repetitive sequences as in N. crassa [9, 10] and in C. neoformans , or a heterogeneous central core sequence (cnt) flanked by two distinct inverted repeats (imr and otr) that are conserved across the centromeres in S. pombe [12–14]. 
It is noteworthy that the repeat-associated fungal centromeres lack tandem arrays of repeats as observed in the centromeres of higher metazoans. Interestingly, centromeres of chicken, potato and the unicellular red alga Cyanidioschyzon merolae represent a distinct class where both repetitive and repeat-less centromeres exist in the same genome. On the other hand, small regional centromeres consisting of 3 to 5 kb of non-repetitive, unique sequence have been identified in three Candida species: Candida albicans, Candida dubliniensis and Candida lusitaniae. Interestingly, the centromeres in these organisms lack any sequence conservation shared among different chromosomes in the same species. However, CEN1, CEN5, and CENR in C. albicans as well as in C. dubliniensis possess pericentric inverted repeats which are unique to each centromere. The driving force enabling the evolution of centromeres with such remarkable diversity in both DNA sequence and structure, rather than a common optimized centromere configuration, across eukaryotes remains an enigma.
The centromere DNA sequence and the organization of the sequence elements are rapidly evolving even in closely related species of three major forms of eukaryotic life—fungi, plants, and animals [1, 18]. In addition, a series of events including–(a) neocentromere formation [19–24] by centromere repositioning at ectopic sites with no obvious DNA sequence homology to the native centromere, (b) selective inactivation of a centromere in a dicentric chromosome [25–28], and (c) the presence of identical sequences elsewhere in the genome that do not serve as centromere/neocentromere sites in various organisms support the conclusion that centromere specification is largely epigenetically regulated (reviewed in [29, 30]).
The centromere specific histone H3 variant CENP-A (also known as Cse4 in yeasts)  is considered to be an epigenetic hallmark of active centromeres . The unique structure of CENP-A chromatin provides the foundation to recruit other kinetochore proteins belonging to the Constitutive Centromere Associated Network (CCAN), Ndc80 complex and Dam1/ Ska complex , and nucleates kinetochore assembly in most organisms . However, the mechanism(s) of CENP-A loading at a particular locus across species required for centromere specification and its propagation in subsequent generations remains unclear. As shown in S. pombe, CENP-A loading at the centromere is probably regulated via distinct processes leading to the establishment and propagation of a centromere in most organisms . De novo CENP-A recruitment without any pre-existing mark is crucial to establish a centromere, whereas loading of CENP-A molecules during every cell cycle is important for the propagation of already established centromeres .
A common feature of the large regional centromeres in ascomycetous fungi is their inherent association with DNA repeats. Detailed studies on the centromeres of S. pombe revealed that centromere-associated repeats provide structural determinants in de novo CENP-A recruitment. In contrast, studies in the human pathogenic budding yeast C. albicans, which possesses small regional centromeres, reveal that a centromere DNA sequence lacking pericentric repeats (CEN7) fails to form a functional centromere de novo on a naked plasmid, because CENP-A could not be recruited to the plasmid-borne CEN7. This result implies that centromeres are epigenetically specified in the absence of pericentric repeats in C. albicans. However, it remains to be tested whether centromeres with inverted repeats (such as CEN5) can recruit CENP-A de novo in C. albicans.
Candida species, the most commonly encountered human fungal pathogens, cause a wide variety of mucosal infections and organ invasion in immunocompromised patients. Although C. albicans has long been known to be the most abundant Candida species isolated from patients, recent global surveillance programs suggest that non-albicans Candida (NAC) species are rapidly emerging as a serious threat due to widespread use of antifungal drugs [40, 41]. In particular, infections caused by Candida tropicalis, a parasexual human pathogenic yeast, have increased dramatically worldwide. In sub-tropical regions of Asia-Pacific especially, the number of patients with C. tropicalis infection is higher than the number infected with C. albicans [40, 42]. Earlier, we reported centromere properties of C. albicans and C. dubliniensis. Here, we report the identification of the centromeres as binding sites of four evolutionarily conserved kinetochore proteins in C. tropicalis, which has a 30 Mb sequenced diploid genome arranged into 23 supercontigs. A comparative analysis of centromeres suggests a rapid divergence not only in the centromere DNA sequence but also in the organization of the sequence elements in these closely related Candida species. Interestingly, pericentric repeats are shown to be important for de novo CENP-A recruitment on C. tropicalis centromeres. Based on the striking structural resemblance of centromeres and the necessity of pericentric repeats for de novo centromere formation both in C. tropicalis and S. pombe, we propose an independent evolution of repeat-associated centromeres in budding and fission yeasts.
Kinetochore proteins are well conserved in C. tropicalis
We identified four putative kinetochore proteins in C. tropicalis: CtCENP-A (Cse4), CtCENP-C (Mif2), CtNuf2 and CtDad1 (Fig 1A). Each of these proteins shares a high degree of sequence conservation with its counterpart in the closely related species C. albicans (S1A Fig). Subcellular localization of these proteins in C. tropicalis revealed localization patterns typical of kinetochore proteins in related yeasts [44–47]: a single punctate structure representing clustered kinetochores in unbudded G1 cells that then segregated into two puncta in large-budded cells undergoing mitosis (Fig 1B). In addition, indirect immunofluorescence microscopy with anti-Cse4 antibodies, which are specific to CtCENP-A (S1B Fig), and anti-tubulin antibodies revealed CENP-A to be localized near the spindle pole bodies (S1C Fig). On the basis of the sequence similarities and localization patterns at two different stages of the cell cycle, we conclude that these genes encode conserved kinetochore proteins in C. tropicalis.
(A) An illustration showing the kinetochore organization in yeasts. (B) Live cell fluorescence microscopic images of indicated proteins at two different stages of the cell cycle: interphase (unbudded) and mitotic (large-budded). Scale bar, 5 μm.
Kinetochore proteins are essential for chromosome segregation during mitosis in C. tropicalis
Kinetochore proteins are important for chromosome segregation in eukaryotes and their depletion results in chromosome segregation defects due to improper microtubule-kinetochore interactions, which may lead to cell cycle arrest due to activation of the spindle assembly checkpoint. For conditional expression of genes, we identified the GAL1 promoter sequence in C. tropicalis (See Methods). To test the function of these putative kinetochore proteins on chromosome segregation in this diploid organism, one copy of each gene was replaced by a marker gene and the remaining copy was placed under the control of the GAL1 promoter. The inability of the conditional mutant strains to grow under non-permissive conditions confirmed that each of these four kinetochore proteins is essential for viability in C. tropicalis (Fig 2A). Moreover, flow cytometry (FACS) analysis revealed an accumulation of large budded cells at the G2/M stage during growth in non-permissive conditions (Fig 2B and S2 Fig). A significant number of the arrested cells had either an unsegregated nuclear mass at the bud neck, or unequally segregated nuclei indicating an arrest due to mitotic checkpoint activation (Fig 2C and S2 Fig). Taken together, these results strongly suggest that each of these proteins is essential for proper chromosome segregation in C. tropicalis.
(A) CENP-A, CENP-C, Nuf2 and Dad1 are essential for viability in C. tropicalis. C. tropicalis conditional mutant strains expressing the only copy of the above-mentioned genes under the GAL1 promoter were streaked on plates with galactose (permissive) or glucose (restrictive) as the sole carbon source and were photographed after 2 to 3 days of incubation at 30°C. The GAL1 promoter is induced in the presence of galactose but repressed in glucose-containing media in C. tropicalis. (B) FACS analysis of the conditional mutant strains of CENP-A, CENP-C, Nuf2 and Dad1 grown in either permissive (galactose) or non-permissive (glucose) media. The x-axis and y-axis represent the DNA content and number of cells respectively. (C) The distribution of unbudded, small-budded and large-budded (G2/M stage) cells of the indicated mutant strains grown in either permissive (+) or non-permissive (-) conditions. The nuclear morphology was visualized by DAPI staining after 6 h of growth in permissive or non-permissive conditions, and the cells exhibiting proper or improper chromosome segregation during the G2/M stage were counted (n > 250 cells). The y-axis represents the percentage of the cell population.
Genome-wide mapping reveals seven unique but overlapping CENP-A- and CENP-C- rich regions in C. tropicalis
Having identified authentic kinetochore proteins, we next sought to map the centromeres in the C. tropicalis genome as the binding sites of CENP-A and CENP-C by chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). The sequenced C. tropicalis strain MYA-3404 (CSE4/ CSE4) and its derivative CtKS201 (MIF2/ MIF2-TAP) were used for CENP-A (anti-Cse4 antibodies) and CENP-C (anti-Protein A antibodies) ChIP experiments respectively. Analysis of the ChIP-seq reads against the C. tropicalis genome identified seven CENP-A- and CENP-C-bound overlapping but unique regions as centromeres in C. tropicalis (Fig 3A and S3A and S3B Fig). Primers designed from the seven unique enriched regions identified by ChIP-seq were used to validate the enrichment of CENP-A and CENP-C binding by analyzing ChIP DNA of each of the four kinetochore proteins, namely CENP-A, CENP-C, Nuf2, and Dad1, on seven supercontigs as compared to a non-centromeric locus (CtLEU2) using semi-quantitative PCR assays (Fig 3B). Moreover, each of these seven regions resides within a long ORF-free region (S1 Table), a common centromeric feature observed in most organisms.
(A) CENP-A and CENP-C ChIP-seq reads along the seven enriched supercontigs are shown. Here, x-axis and y-axis represent the coordinates of the chromosomal regions and the distribution of sequence reads of the specific supercontig respectively. The asterisk (*) denotes the peak observed in input library. (B) Enrichment of indicated proteins at the centromeres on different supercontigs. ChIP DNA fractions of the indicated proteins were analyzed by PCR using a primer-pair unique to each supercontig (see S4 Table for primer sequences). CtLEU2, a non-centromeric locus, was used as negative control. ‘T’, Total DNA, ‘+’, IP DNA with antibodies and ‘-’, beads only control.
To determine chromosomal identity of each of these centromeres, the chromosomes of C. tropicalis (MYA-3404) were first separated on CHEF gels (see methods). The probes were PCR amplified from a unique region adjacent to each centromere. The specific signals on the Southern blot of the CHEF gels revealed that at least five regions reside on different chromosomes (S4 Fig). Due to limited resolution of higher molecular weight chromosomes, the chromosomal identity of two regions (Scnt 3 and Scnt 4) could not be unambiguously verified by the CHEF analysis. These analyses, together with the previously reported 14 telomeric-linked scaffold ends , strongly suggest that there are seven pairs of chromosomes in C. tropicalis.
Kinetochore proteins bind specifically to 2 to 3 kb regions on each chromosome
ChIP-seq analyses show a complete overlap in binding of CENP-A and CENP-C to a 2 to 3 kb region on each of the seven centromeres (Fig 4A and S1 Table). To validate the length of the CENP-A/CENP-C binding regions obtained by the ChIP-seq analysis, we scanned the enrichment of each of the four above mentioned kinetochore proteins on the ORF-free region of Scnt 8 by ChIP followed by quantitative PCR (ChIP-qPCR) with primers designed at approximately 1 kb intervals across the 10 kb region of Scnt 8. This analysis revealed that these kinetochore proteins were enriched over a 3 kb (632950–636200) region on Scnt 8 (Fig 4B) and confirms the results obtained from the ChIP-seq experiment. Binding of evolutionarily conserved kinetochore proteins on the same locus proves that the region on each of the seven chromosomes is an important part of a functional centromere in C. tropicalis.
(A) ChIP-seq analysis revealed that CENP-A and CENP-C bind to the mid core region in C. tropicalis. Here, the x-axis represents the structural components of a centromere in C. tropicalis and the y-axis represents the distribution of sequence reads of CENP-A (red) or CENP-C (green) ChIP DNA of the respective supercontig. Schematic representations of the structural components of a centromere in C. tropicalis are shown below each ChIP-seq read profile. Black boxes represent the mid regions; blue arrows indicate the inverted repeats, the left repeat (LR) and the right repeat (RR). Scale bar, 2 kb. (B) ChIP-qPCR assays confirm the binding of kinetochore proteins across the centromere in Scnt 8. The x-axis represents coordinates on the supercontig and the y-axis denotes the qPCR value as a percentage of the total chromatin input with standard error of the mean (SEM). Scale bar, 2 kb. (C) Sequence conservation between the mid core regions (mids) and inverted repeats (IRs) from different centromeres in C. tropicalis. Homology is calculated as the percentage of aligned nucleotides in a pair-wise alignment, measured from the shorter sequence. Identity is the percentage of aligned and conserved nucleotides in the pair-wise alignment, again measured from the shorter sequence. Averages are calculated from all pair-wise alignments, weighted by length. IR* and IR# denote the average value with respect to either others or self respectively.
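The homology and identity measures defined in panel (C) can be illustrated with a minimal Python sketch. It assumes the two sequences are supplied as an already gapped pair-wise alignment of equal length, with "-" marking gaps, and it assumes that the weight used for averaging is the length of the shorter ungapped sequence; the caption only states that averages are weighted by length. The function names `homology_identity` and `weighted_average` are hypothetical and are not taken from the study.

```python
def homology_identity(aligned_a: str, aligned_b: str, gap: str = "-"):
    """Return (homology %, identity %, shorter length) for one gapped alignment.

    Homology: aligned (gap-free) columns as a percentage of the shorter sequence.
    Identity: aligned columns with the same nucleotide, same denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    a, b = aligned_a.upper(), aligned_b.upper()
    # Ungapped lengths; the shorter one is the denominator for both measures.
    shorter = min(sum(c != gap for c in a), sum(c != gap for c in b))
    if shorter == 0:
        raise ValueError("at least one sequence contains no nucleotides")
    aligned = identical = 0
    for x, y in zip(a, b):
        if x != gap and y != gap:
            aligned += 1
            if x == y:
                identical += 1
    return 100.0 * aligned / shorter, 100.0 * identical / shorter, shorter


def weighted_average(values_and_weights):
    """Length-weighted mean of (value, weight) pairs (weight choice is assumed)."""
    total = sum(w for _, w in values_and_weights)
    if total == 0:
        raise ValueError("total weight is zero")
    return sum(v * w for v, w in values_and_weights) / total


if __name__ == "__main__":
    # Toy alignment, not real centromere sequence.
    hom, ident, length = homology_identity("ACGT-ACGTA", "ACCTTAC--A")
    print(f"homology={hom:.1f}% identity={ident:.1f}% (shorter length {length})")
```

Under these assumptions, a pair in which every position of the shorter sequence is aligned scores 100% homology regardless of mismatches, while identity additionally requires the aligned nucleotides to match.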
A dramatic divergence in centromere sequence and its organization in related Candida species
The sequence analysis of centromeric DNA revealed that all seven centromeres in C. tropicalis have common structural elements comprising a non-repetitive mid core flanked by inverted repeats (IRs) (Fig 4A and S2 Table). The mid core region has an average length of 3.5 kb and is flanked by IRs that each average 3.5 kb in length. This is a dramatic transition in the centromere organization in comparison with the centromeres of other closely related Candida species, C. albicans [15, 49], C. dubliniensis and C. lusitaniae. Incidentally, CEN1, CEN5 and CENR of C. albicans and C. dubliniensis also contain non-conserved short pericentric repeats. The binding of both CENP-A and CENP-C is restricted to the 2 to 3 kb non-repetitive mid core region in all centromeres in C. tropicalis (Fig 4A). A similar length of CENP-A binding (3 to 5 kb) has been observed in C. albicans, C. dubliniensis, and C. lusitaniae, suggesting a striking conservation in the length of CENP-A chromatin that provides the platform for kinetochore formation [50–52]. On the other hand, the AT-content of the CENP-A-bound mid core regions is found to be 64% in C. tropicalis, which is marginally less than the overall AT-content of the genome (67%). A similar AT-content of CENP-A bound centromere DNA (65%) was observed in C. albicans. Thus, in spite of the observed rapid change in the centromere DNA sequence and its organization, these closely related species employ a similar length and composition (in terms of the AT-content) of the centromere DNA for the recruitment of kinetochore proteins.
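As an illustration of the AT-content comparison, the following minimal Python sketch computes the percentage of A and T among the unambiguous nucleotides of a sequence. The function name `at_content` and the choice to ignore ambiguous bases such as N are assumptions made for this example; the text does not describe how the reported values were computed.

```python
def at_content(sequence: str) -> float:
    """Percentage of A/T among unambiguous bases (A, C, G, T) in a DNA sequence.

    Ambiguous symbols such as N are skipped; this is an assumption of the sketch.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous nucleotides")
    at = sum(b in "AT" for b in bases)
    return 100.0 * at / len(bases)


if __name__ == "__main__":
    # Toy sequence for illustration only, not a C. tropicalis mid core.
    print(f"{at_content('ATATGCNNAT'):.1f}% AT")
```

Applied to a mid core sequence and to the whole genome, a function of this kind would yield the two percentages being compared.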
The centromeres of C. tropicalis possess highly homogenized inverted repeats (IRs) around a non-repetitive mid core
In silico analysis of centromere sequences in C. tropicalis revealed that the inverted repeats (IRs) and mid core regions share a high degree of sequence homology across the centromeres (Fig 4C and S5A–S5C Fig). Between different chromosomes, the mid core regions share 80% homology and 63% identity while the IRs show an average of 92% homology and 82% identity. However, the conservation is much higher between the left and right repeats (LR and RR) of the same centromere, with an average of 97% homology and 93% identity (Fig 4C and S5D Fig). In addition, we also observed that tandem direct repeats are present within each inverted repeat (S5E Fig and S3 Table). These groups of tandem repeats were prominent across all arms (except Scnt 9, which lacks the final group). However, the copy number varied significantly among arms (S3 Table). The observed high level of sequence conservation of the mid core and inverted repeats among the different centromeres in C. tropicalis suggests that these regions might have undergone homogenization via intra- and inter-chromosomal recombination events. Such a process may be facilitated by the close association of centromeres in the clustered kinetochores of C. tropicalis (Fig 1B).
Chromosomal rearrangement involving centromeric regions in closely related Candida species
Rapid divergence in the centromere sequence and structure is often associated with karyotypic changes [53, 54], a hallmark of speciation [55–58]. Previously, we demonstrated rapidly changing DNA sequence at the centromeres of orthologous chromosomes in C. albicans and C. dubliniensis without any significant changes in synteny across chromosomes. Here we performed a synteny dot plot analysis between C. albicans and C. tropicalis genomes. This analysis revealed massive chromosomal rearrangements, involving several breaks in synteny, between these two species. Unusually, it appears that intra-chromosomal transpositions and inversions are far more common than inter-chromosome recombination (Fig 5). Strikingly, inter-chromosome recombination, though uncommon, tends to occur more often near the centromeres (Fig 5 and S6A Fig). For example, in CtScnt3, the large number of genes to the left of the centromere, and a few on the right, map to CaChr3. But immediately after the centromere, there are some segments on CtScnt3 that are in synteny with CaChrR and CaChr5, and then the remainder of CtScnt3 is largely from CaChr6 (Fig 5). Similar patterns of rearrangement can be seen in most other supercontigs, except CtScnt7, which comes almost entirely from CaChr7 but has been heavily rearranged (Fig 5). This pattern of rearrangement probably arose through recombination between the highly identical sequences of the inverted repeats. A similar phenomenon of centromeric repeat-mediated rearrangement and subsequent gain of a chromosome has been observed in two laboratory strains of S. pombe. Incidentally, there is a change in chromosome number from eight pairs in C. albicans and C. dubliniensis to seven pairs in C. tropicalis, indicating a possible structural rearrangement involving a centromere which might have given rise to a centromere gain or loss (S6B Fig).
The unusual preponderance of intra-chromosomal transposition and reversal compared to the smaller numbers of inter-chromosomal translocation may merit further study.
Orthologous genes are plotted on the x-axis as per C. tropicalis coordinates (start to end on that supercontig), and on the y-axis as per C. albicans coordinates for the respective chromosome, and colour-coded according to the C. albicans chromosome. The vertical grey bar indicates the position of the centromere on the C. tropicalis supercontig. Continuous segments of lines indicate rows of syntenic genes.
In addition, a putative retrotransposon present at the centromere in C. tropicalis is found to be conserved at CEN7 in C. dubliniensis (S5B Fig). A similar retrotransposon was also found within a 50 kb region of CEN7 in C. albicans (S6A Fig). This putative transposon is a member of the Ty3/Gypsy family but is not present at putative centromeres in any other Candida species. These results, together with the conservation in the CENP-A chromatin length, indicate that the centromere position of these related species was shared by a common ancestor and may have undergone chromosomal rearrangement involving the centromeres of more than one chromosome during evolution.
C. tropicalis and S. pombe share common centromere properties
The structural features of C. tropicalis centromeres strikingly resemble those of the distantly related fission yeast S. pombe. To understand the function of the underlying centromere DNA sequences in C. tropicalis, we engineered plasmids carrying either the full length centromere (pCEN8) or a part of it (pmid8) on a replicative plasmid pARS2 (Fig 6A). The replicative plasmid pARS2 harbors CaARS2, which functions as an autonomously replicating sequence (ARS) on a circular plasmid in C. tropicalis (S7 Fig). While the pmid8 plasmid conferred 10 to 13-fold increased mitotic stability as compared to pARS2, inclusion of the full length centromere sequence harboring inverted repeats in the pARS2 plasmid (pCEN8) resulted in a 37 to 42-fold higher mitotic stability after 10 generations of nonselective growth (Fig 6B). A size-dependent stabilization of circular replicative plasmids has been reported previously in S. cerevisiae. To rule out this possibility, we cloned 10 kb of heterologous DNA sequence from bacteriophage λ into pARS2 (pARS2-λ) and measured the mitotic stability of the resulting plasmid. This plasmid, which is of similar length (15 kb) to pCEN8, did not show the increase in mitotic stability observed for pCEN8 (Fig 6B). In addition, pCEN8 is 3 to 4-fold more stable mitotically than pmid8 carrying only the mid core sequence (Fig 6B). These results suggest that the inverted repeats flanking the mid core can significantly improve the mitotic stability of an otherwise unstable replicative plasmid in C. tropicalis. Because CENP-A is known to bind to only functional centromeres, functionality of a centromere sequence cloned into the replicative plasmid was further assayed by the extent of CENP-A enrichment on these exogenously introduced plasmid DNA constructs. It should be noted that a unique SalI restriction site was introduced at the edge of the mid region of the plasmid-borne centromere sequence to differentiate it from the endogenous chromosomal ones (see S8 Fig).
CENP-A ChIP-qPCR analysis with plasmid specific primer-pair revealed that CENP-A is enriched at the mid core region on only the full length centromere DNA in pCEN8 (Fig 6C) suggesting that the inverted repeats (LR and RR) flanking the mid core are important for de novo CENP-A deposition. Thus, we conclude that the CENP-A recruitment process has been significantly rewired in closely related Candida species. Centromere function was shown to be dependent on the presence of inverted pericentric repeats in S. pombe as well . On the other hand, Candida species and S. pombe shared a common ancestor more than 330 mya . Thus, we demonstrate an extraordinary example of evolution of inverted repeat containing ‘fission yeast-like’ centromeres that appeared independently at least once in the Candida clade.
(A) Schematic of plasmids used in this study. The replicative plasmid pARS2 harbors CaARS2 and CaURA3 sequences. pmid8 has only the mid core region of CEN8, pCEN8 carries the full length centromere (CEN8), and pARS2-λ harbors a ~10 kb lambda DNA. pCEN801 carries LR8 in a direct orientation with respect to RR8. On the other hand, pCEN802 harbors CaLR5 and CaRR5 of chromosome 5 of C. albicans. The size of each of these plasmids is also mentioned. (B) The relative mitotic stability of various plasmids in C. tropicalis. The mitotic stability of each plasmid is normalized to the average mitotic stability of the replicative plasmid (pARS2). The mitotic stability for each class of plasmids was calculated for five independent transformants (n = 5). One way ANOVA and Bonferroni post tests were performed to determine statistical significance. Error bars represent the standard error of the mean (SEM). (C) CENP-A-ChIP assays were performed in the C. tropicalis strain CtKS102 (CSE4/CSE4-TAP) transformed with pmid8, pCEN8, pCEN801 and pCEN802. Immunoprecipitated (IP) DNA fractions were analyzed by qPCR with primer-pairs (see S4 Table) specific to each cloned insert to determine the extent of de novo CENP-A recruitment exclusively on the plasmid-borne centromere DNA sequence. The enrichment of CENP-A on these exogenously introduced centromere sequences is represented as a percentage of the total chromatin input with standard error of the mean (SEM) and validated with three independent biological replicates (n = 3). The relative enrichment was calculated using the formula: (pCEN-LEU2)/ (nCEN-LEU2), where nCEN and pCEN indicate the percent input values of CENP-A enrichment at the native centromere (Scnt 8) and on the plasmid centromere sequence respectively. LEU2 is used as a non-centromeric negative control. Similarly, one way ANOVA and Bonferroni post tests were performed to determine statistical significance.
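The relative enrichment formula in panel (C), (pCEN-LEU2)/(nCEN-LEU2), can be written out as a short Python sketch. The function name `relative_enrichment`, the guard against a non-positive denominator, and the numbers in the usage example are hypothetical illustrations, not values or code from the study.

```python
def relative_enrichment(p_cen: float, n_cen: float, leu2: float) -> float:
    """Relative CENP-A enrichment on a plasmid centromere.

    p_cen: percent input at the plasmid-borne centromere sequence
    n_cen: percent input at the native centromere (Scnt 8)
    leu2:  percent input at the non-centromeric LEU2 control
    Implements (pCEN - LEU2) / (nCEN - LEU2) as stated in the caption.
    """
    background_corrected_native = n_cen - leu2
    if background_corrected_native <= 0:
        # Guard added for this sketch: the ratio is not meaningful if the
        # native centromere shows no enrichment over the LEU2 background.
        raise ValueError("native centromere signal does not exceed LEU2 background")
    return (p_cen - leu2) / background_corrected_native


if __name__ == "__main__":
    # Hypothetical percent-input values chosen only to show the arithmetic.
    print(relative_enrichment(p_cen=2.5, n_cen=4.5, leu2=0.5))
```

Subtracting the LEU2 signal from both numerator and denominator expresses the plasmid signal as a fraction of the background-corrected native centromere signal, so a value near 1 would indicate enrichment comparable to the chromosomal centromere and a value near 0 would indicate none above background.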
The inverted repeats and the underlying DNA sequence provide a structural determinant for centromere function in C. tropicalis
To determine the role of the pericentric inverted repeats and their DNA sequence in centromere function in C. tropicalis, we constructed two engineered plasmids, pCEN801 and pCEN802 (Fig 6A). The pCEN801 plasmid harbors the left repeat of CEN8 (CtLR8) cloned in a direct orientation with respect to the right repeat of the same centromere (CtRR8). Thus, the only difference between pCEN8 (inverted orientation) and pCEN801 (direct orientation) is the orientation of the pericentric repeats with respect to each other. Nevertheless, pCEN801 is significantly less stable mitotically than pCEN8 (Fig 6B). Moreover, CENP-A ChIP-qPCR analysis revealed that CENP-A does not bind to the engineered pCEN801 plasmid (Fig 6C). These results suggest that the inverted orientation of the pericentric repeats is an important structural feature for centromere function in C. tropicalis.
To understand the function of the underlying DNA sequence of the inverted repeats, we cloned the inverted repeats of CEN5 (CaIR5) from C. albicans into the pmid8 plasmid to generate pCEN802. The mitotic stability of pCEN802 in C. tropicalis is 4- to 6-fold lower than that of pCEN8, which harbors the pericentric inverted repeats (IR8) of C. tropicalis (Fig 6B). This observation was further verified by CENP-A ChIP-qPCR analysis (Fig 6C) and confirms that the sequence of the inverted repeats per se is also crucial for centromere function in this species. Thus, we conclude that both the DNA sequence of the repeats and their arrangement in an inverted orientation are important for centromere function in C. tropicalis.
The centromeres of C. tropicalis provide evidence of appearance of pericentric inverted repeats in the Saccharomycotina
To elucidate the route of centromere diversification, we reconstructed a phylogenetic tree of 13 species representing all major lineages of Ascomycota (Fig 7A). It demarcates three distinct monophyletic subphyla within the Ascomycota—Taphrinomycotina, Pezizomycotina and Saccharomycotina (Fig 7A). Moreover, this study also supports that Taphrinomycotina and Pezizomycotina are the early radiating branches in Ascomycetes. Thus, it is evident from both the phylogenetic relationship and the centromere structures of S. pombe and N. crassa that the invasion of transposons or symmetric repetitive elements shaped centromere structure in Taphrinomycotina and Pezizomycotina during an early era of ascomycete evolution (Fig 7A). In contrast, a dramatic reduction in centromere length with a concurrent absence or loss of centromeric transposons or repeats, is evidenced from the centromeres of Candida and Saccharomyces species, and, therefore, evolved in Saccharomycotina (Fig 7A). The identification of the centromere in C. tropicalis in this study is the first report that shows the evolution of repeat-associated centromeres in the clade of Saccharomycotina (Fig 7A).
(A) Phylogeny of ascomycetous fungi showing the diverse nature of centromere structure. An unrooted phylogenetic tree was constructed using 573 uniformly evolving orthologous gene families (see methods). The phylogenetic relationship of the species with different types of centromeres is illustrated with colored shadows. The species names shown in white letters designate those with uncharacterized centromeres. The centromeres are mostly point or regional in nature in Ascomycota. However, the centromere of Y. lipolytica is an example of an unconventional intermediate centromere, which shares properties of both point and regional centromeres. The centromeres of Y. lipolytica are small (<200 bp in size) and mutations in the partial palindrome lead to centromere dysfunction. These are the characteristics of a point centromere. On the other hand, YlCENs lack the conserved DNA elements (CDEs). Y. lipolytica also does not code for the point centromere-specific protein complex (the CBF3 complex). Conversely, Y. lipolytica harbors Sim4 and Fta1 proteins, which are kinetochore proteins associated only with regional centromeres. Moreover, it should also be noted that the recently identified centromeres of Naumovozyma castellii represent an unconventional class of point centromere with unique centromere DNA elements. See text for detailed information about the classification of the centromeres. (B) Schematic showing a possible route of evolution of the structural components of the centromere in ascomycetous yeasts. The length of the centromeres is also indicated. However, it should be noted that the size of the centromeres in C. albicans, C. dubliniensis and C. lusitaniae is based on the length of the CENP-A binding domain. The inverted repeats of S. pombe and C. tropicalis centromeres aid in de novo CENP-A recruitment. It was also evident from the study in S. pombe that inverted repeats are essential for the establishment of the centromere, but are no longer required for the maintenance of an already established centromere. On the other hand, the centromeres mostly lack pericentric repeats in C. albicans, where the role of DNA elements in de novo CENP-A recruitment is unknown. From these lines of evidence, we propose that the pericentric repeats would have been gradually lost in C. albicans and C. dubliniensis. It should also be noted that point centromeres might have originated from the regional ones.
In this study, we identified and analyzed the centromeres in C. tropicalis. We demonstrate that each centromere consists of a central non-repetitive mid core region, which is bound by evolutionarily conserved proteins from various layers of the kinetochore, and is flanked by inverted repeats. This is the first known saccharomycetous yeast in which all seven native centromeres are repeat-associated. Moreover, the inverted repeats of the same chromosome as well as across different chromosomes of C. tropicalis are highly similar in sequence. Taken together, these centromere properties of C. tropicalis and those of other saccharomycetous yeasts make it evident that centromeres of all types evolved in Saccharomycotina (Fig 7B): point centromeres with conserved motifs that are <400 bp in length (as in Saccharomyces cerevisiae), shorter non-repetitive regional centromeres with unique CENP-A-rich regions of 3 to 5 kb (as in C. albicans, C. dubliniensis and C. lusitaniae), and repeat-associated regional centromeres of 10 to 11 kb (as in C. tropicalis). Although centromere structures are known in only a limited number of organisms, the discovery of all major types of centromeres in the saccharomycetes makes it a unique subphylum for tracing the path of evolution of monocentric chromosomes.
The CENP-A-bound DNA sequence is the most preferred site of kinetochore assembly in an entire chromosome. Despite the conserved motifs shared among centromeres, the CENP-A-bound DNA sequences are often variable, even in the genetically defined point centromeres of S. cerevisiae. Intriguingly, a comparative analysis between S. cerevisiae and its closest relative, Saccharomyces paradoxus, identified the CENP-A-bound CDE-II elements as the fastest evolving regions of the genomes. Similarly, in S. pombe, the flanking repeat sequences are conserved among the different chromosomes but the CENP-A-rich central core sequences are heterogeneous. The most extreme cases of rapid divergence have been observed in the centromeres of C. albicans, C. dubliniensis, and C. lusitaniae, where CENP-A-rich centromere DNA sequences are all unique and different in each species. In contrast, CENP-A is found to be enriched on highly homogenized arrays in most plants, mouse, and humans (reviewed in ). Thus, homogenization of CENP-A-bound mid core regions in C. tropicalis, as observed in this study, represents a unique feature of yeast centromeres that is more reminiscent of metazoan centromeres. It has been proposed that transposable elements are a major source of centromeric satellite repeats, which gradually homogenized over time by an unknown mechanism in a metazoan system (reviewed in ). We also observed a similar association of a retrotransposon in one centromere in C. tropicalis. More recently, it has been shown that the CENP-A-bound central core has a sequence feature enabling de novo recruitment of CENP-A molecules in S. pombe. Thus, it will be intriguing to investigate whether the CENP-A-enriched mid core regions in C. tropicalis have a feature that facilitates CENP-A recruitment.
Centromeres are known to be species-specific as centromeres of one organism do not function even in a related species . Inter-species crosses, mostly in plants, suggest that functional incompatibility of centromeres is a frequent cause of uniparental genome elimination [69–71]. Recently, it has been reported that perturbation of the length of the CENP-A binding domain to adopt a uniform size is a prerequisite for a successful inter-species hybridization between maize and oat . Thus, the length of the CENP-A-rich region at the centromere may be a key factor for centromere incompatibility in close relatives. In addition, the length of the CENP-A binding domain is found to be uniform in an organism regardless of the chromosome size or the nature of the centromere. Indeed, we observed that the length of the CENP-A binding region (3 to 5 kb) is surprisingly conserved in related Candida species, in spite of the dramatic transition in the centromere organization. A uniform length of the CENP-A-bound regions in these related species may thus suggest a possible role in maintaining uniform kinetochore-microtubule interactions. This is further supported by the fact that the Dam1 complex is essential in C. tropicalis. Essentiality of the Dam1 complex has been previously correlated to a one microtubule-one kinetochore type of interaction as observed in S. cerevisiae and C. albicans [44, 46]. Recently, it was proposed that DNA sequence repeats might have evolved to provide a ‘safety buffer’ against drifts in kinetochore position . Interestingly, we found that the binding of kinetochore proteins is restricted to a non-repetitive mid core region in all cases in C. tropicalis and does not spread to the surrounding inverted repeats. 
CENP-A chromatin is generally repressive (reviewed in ) and thus the safety buffer provided by the pericentric inverted repeats perhaps acts as a barrier that prevents drift of the kinetochore position and maintains the size of the CENP-A binding domain in this organism.
A series of growing lines of evidence suggest that fungal centromeres are rapidly evolving genomic loci (reviewed in ). It has been proposed that rapid evolution of centromere DNA may contribute to its functional incompatibility and perhaps aids in speciation [1, 18]. Speciation is, however, a poorly defined and less understood process in asexual organisms . Some Candida species with known centromere structures (C. albicans, C. dubliniensis and C. tropicalis) are primarily parasexual and capable of mating but lack a recognized meiotic program. In spite of this, we observed in this study a high degree of divergence in the centromere DNA sequences as well as in the organization of centromere elements in these related Candida species. Why does the centromere structure diverge so rapidly in these related organisms? It has been proposed that the loss of centromere function followed by the birth of a centromere in a new position can be viewed as a life cycle of a centromere that operates during evolution (reviewed in ). For such an event, massive chromosomal rearrangements including the loss of an existing centromere would have to occur. Coincidentally, a comparative analysis among the relatives of both yeasts  and mammals  identified frequent breakpoints adjacent to centromeres. These results suggest that centromeres are among the most fragile sites in a genome. We also observed a gross chromosomal rearrangement between C. albicans and C. tropicalis specifically at the centromeres. It is also clear that the centromere loss or gain happened in these two organisms during their divergence from a common ancestor. Being both commensal and opportunistic pathogens, Candida species show considerable genome plasticity possibly as a means to survive in a hostile host environment. Genome rearrangements including karyotype changes, aneuploidy, and loss of heterozygosity have been frequently observed in clinical isolates of Candida species (reviewed in ). 
Thus, it is likely that the evolutionary life cycle of a centromere may have contributed to their rapid divergence in these related pathogenic yeast species.
Evolution is typically thought to proceed to generate diversity . However, independent evolutionary origins of similar biological structures or functions in distantly related taxa challenge this common paradigm . In this study, we observed that structural features of the C. tropicalis centromeres resemble a shorter version (10 to 11 kb) of the distantly related S. pombe centromeres (40 to 110 kb) [12, 65]. However, the pericentric inverted repeats observed in C. tropicalis have no sequence identity to either the pericentric repeats of S. pombe or the centromere associated inverted repeats of C. albicans or C. dubliniensis. A notable difference between the centromeres of S. pombe and C. tropicalis is the absence of outer repeats (otr in S. pombe) in C. tropicalis. The otr is the site of small RNA (siRNA) generation and subsequently otr recruits other heterochromatin proteins (such as Swi6 and Clr4 in S. pombe) to make the centromeric region heterochromatic in S. pombe (reviewed in ). Heterochromatin proteins and siRNAs play a vital role in centromere identity in this organism. Unlike S. pombe, C. tropicalis genome neither possesses the full RNAi machinery nor several key players required for heterochromatin formation such as an ortholog of Clr4 (H3K9 methyltransferase) . Thus involvement of repeat elements in establishing RNAi-dependent H3K9me heterochromatin formation, as observed in S. pombe, is unlikely in C. tropicalis. In conclusion, we demonstrate for the first time the evolution of repeat-associated centromeres in an ascomycetous budding yeast (Fig 7B). The most reasonable explanation for the appearance of the repeat-associated centromere structure is the contribution of repeats to de novo CENP-A deposition. CENP-A is a universal marker of functional centromeres and does not localize at inactivated centromeres. 
Studies on artificial CENP-A recruitment, either by direct tethering of CENP-A or its chaperone HJURP (also known as Scm3 in yeasts) to an ectopic locus [83, 84], suggest that de novo CENP-A deposition is in general one of the most significant rate limiting steps to the acquisition of centromere function. The process of CENP-A recruitment is known to be regulated by both genetic and epigenetic means (reviewed in [2, 29]). However, neither the DNA elements nor epigenetic factors are conserved across the kingdom implying an astounding flexibility in centromere specification. In this study, we demonstrate that a dramatic transition in centromere organization has rewired the genetic and epigenetic regulation of CENP-A deposition in related species. Thus, the ways in which the genetic and the epigenetic factors are co-evolving to orchestrate de novo CENP-A recruitment on a DNA sequence to establish a functional centromere may determine the shape of the centromere structure in an organism.
Media and transformation procedure
C. tropicalis strains were grown either in YPDU (1% yeast extract/2% peptone/2% glucose/0.010% uracil) or in complete minimal (CM) media unless stated otherwise. C. tropicalis cells were transformed by the standard lithium acetate method as described previously. It is important to note that C. tropicalis requires uracil, not uridine, in the medium to supplement the Ura auxotrophy.
Identification of CENP-A, CENP-C, NUF2 and DAD1 genes in C. tropicalis
The centromeric histone H3 CENP-A homolog in C. tropicalis was identified by a BLAST analysis using C. albicans CENP-A (CaCse4) as the query sequence against the Candida tropicalis genome. The BLAST analysis revealed that the proteins with high scores (score >213) were the putative CENP-A homolog, CtCse4 (CTRG_02639.3), and histone H3 proteins (CTRG_04732.3, CTRG_00676.3 and CTRG_05645.3). CtCse4 (Scnt 3: 1334129–1334845) is a 238-aa-long protein that shows 90% homology with the C-terminal histone fold domain of CaCse4 (S1A Fig). Similarly, the CENP-C (Mif2), Nuf2, and Dad1 homologs of C. tropicalis were identified by BLAST analysis. CtMif2 (CTRG_05763.3) is a 523-aa-long protein (Scnt 9: 474053–475624+) with a conserved CENP-C box, which is identical in sequence between CaMif2 and CtMif2 (S1A Fig). CtNUF2 (CTRG_05381.3) and CtDAD1 (CTRG_03625.3) encode 492-aa- and 99-aa-long proteins, respectively. Both of these proteins show a high degree of sequence conservation in comparison to those of C. albicans (S1A Fig).
Identification of CtGAL1 promoter (CtGAL1 Pr.) in C. tropicalis
The sequence upstream of the GAL1 gene in S. cerevisiae, harboring the upstream activation sequence (UAS), is used as the GAL1 promoter to regulate the expression of desired genes [86, 87]. However, no such regulatable promoter has been identified previously in C. tropicalis to control the expression level and study the essentiality of proteins. The C. tropicalis homolog of GAL1 was identified as the ORF (CTRG_04617) by BLAST using S. cerevisiae GAL1 as the query sequence. Further, on analyzing the genomic location of this gene, we found that the synteny of GAL1 and GAL10 genes was maintained as observed in S. cerevisiae.
Cells of C. tropicalis strains expressing GFP-tagged kinetochore proteins were grown overnight, harvested, and washed twice with sterile distilled water. Cells were then resuspended in sterile distilled water to obtain the desired density before imaging with a DeltaVision microscopy imaging system. Indirect immunofluorescence was done as described before. Asynchronously grown C. tropicalis cells were fixed with a 1/10th volume of formaldehyde (37%) for 1 h at room temperature. Antibodies used were diluted as follows: 1:500 for rabbit anti-Cse4 antibodies and 1:30 for rat anti-tubulin antibodies (Abcam, Cat No. ab6161). The secondary antibodies used were Alexa Fluor 568 goat anti-rabbit IgG (Invitrogen, Cat No. A11011) at 1:500 and Alexa Fluor 488 goat anti-rat IgG (Invitrogen, Cat No. A11006) at 1:500. DAPI (4′,6-diamidino-2-phenylindole) (Sigma, D9542) was used to stain the nuclei of the cells. Cells were examined at 100× magnification using a confocal laser scanning microscope (LSM 510 META, Carl Zeiss). The digital images were processed with Adobe Photoshop.
Flow cytometry (FACS) analysis
C. tropicalis cells were harvested at two different time points and processed as described before. Prior to injection of the sample into the flow cytometer, the cell suspension was sonicated briefly (30% amplitude, 7 s pulse). The sonicated samples were diluted to the desired cell density in 1X PBS and injected into the flow cytometer (BD FACSCalibur) for analysis. The output was analyzed using the BD CellQuest Pro software.
The conditional mutant strains of C. tropicalis grown in both permissive and non-permissive media were harvested, washed, and resuspended in 300 μl of sterile distilled water. These cells were fixed by adding 700 μl of absolute ethanol and incubating at room temperature for 1 h. After fixing, the cells were washed twice with 1 ml of sterile distilled water and resuspended in sterile distilled water to obtain the desired cell density prior to imaging. To 5 μl of cell suspension, 3 μl of DAPI (100 ng/ml) was added in the well and mixed gently by pipetting, and the cover slip was then placed. After 5 min of incubation, the cells were imaged using a fluorescence microscope (Olympus BX51) at 100× magnification.
Western blot analysis
C. tropicalis strains were grown overnight in YPDU and cells were harvested. The harvested cells were washed with lysis buffer (0.2 M Tris, 1 mM EDTA, 0.39 M ammonium sulphate, 4.9 mM magnesium sulphate, 20% glycerol, 0.95% acetic acid, pH 7.8) and resuspended in 0.5 ml of the same buffer. The cells were disrupted using acid-washed glass beads (Sigma, Cat. No. G8772) by vortexing for 5 min (1 min of vortexing followed by 1 min of cooling on ice) at 4°C. C. tropicalis cell lysates were electrophoresed on a 12% SDS-PAGE gel and blotted onto a nitrocellulose membrane in a semi-dry apparatus (Bio-Rad). The blotted membranes were blocked with 5% skim milk in 1X PBS (pH 7.4) for 1 h at room temperature and were then incubated for 1 h at room temperature with the following dilutions of primary antibodies: anti-Cse4 antibodies, 1:500; anti-H3 antibodies (Abcam, Cat No. ab1791), 1:2500. Next, the membranes were washed three times with PBST (0.1% Tween-20 in 1X PBS). Anti-rabbit HRP-conjugated antibodies (Bangalore Genei, Cat No. 105499) were added at a 1:1000 dilution and incubated for 1 h at room temperature, followed by three to four washes with PBST. Signals were detected using the chemiluminescence method (SuperSignal West Pico Chemiluminescent Substrate, Thermo Scientific, Cat No. 34080).
ChIP assays and antibodies
The ChIP assays were done as described previously. Briefly, each strain was grown until exponential phase (~2×10⁷ cells/ml) and cells were cross-linked with formaldehyde (final concentration 1%). Chromatin was isolated and sonicated to yield an average fragment size of 300–500 bp. The DNA was then immunoprecipitated with anti-Cse4 antibodies (final concentration 6 μg/ml), anti-protein A antibodies (final concentration 24 μg/ml), or anti-V5 antibodies (Life Technologies, Cat No. R960-25; final concentration 0.94 μl/ml) and purified. The duration of cross-linking varied: 15 min for CENP-A, 20 min for CENP-C, 1 h 45 min for Nuf2 and 3 h 15 min for Dad1. The total DNA, the immunoprecipitated (IP) DNA, and the beads-only material were used to determine the binding of kinetochore proteins at all seven putative centromeres by semi-quantitative PCR. The PCR conditions for the primers (listed in S5 Table) were as follows: 94°C for 2 min, Tm for 30 s (Tm varies with the primers), 72°C for 1 min for 1 cycle; 94°C for 30 s, Tm for 30 s, 72°C for 1 min for 24 cycles in the case of CENP-A and CENP-C, and for 27 cycles for Nuf2 and Dad1; 72°C for 10 min.
ChIP sequencing, Sanger sequencing and analysis
Pulsed field gel electrophoresis
C. tropicalis strain MYA-3404 was grown until exponential phase (~2×10⁷ cells/ml). Cells were washed with 50 mM EDTA and counted with a hemocytometer. Approximately 6×10⁸ cells were used for the preparation of 1 ml of genomic DNA plugs. The plugs were made according to the instruction manual protocol (Bio-Rad, Cat No. 170–3593) with CleanCut agarose (0.6%) and the lyticase enzyme provided in the kit. A 0.6% pulsed field certified agarose gel was prepared using 0.5X TBE buffer (0.1 M Tris, 0.09 M boric acid, 0.01 M EDTA, pH 8) and the PFGE was performed on a CHEF-DR II (Bio-Rad) for 72 h (24 h at 4.5 V/cm/106° with initial and final switch times of 200 s; 48 h at 3 V/cm/106° with initial and final switch times of 700 s). The gel was stained with ethidium bromide (EtBr) and analyzed using the Quantity One software (Bio-Rad).
Quantitative PCR (qPCR)
To determine the extent of binding of kinetochore proteins at the centromere of Scnt 8, real-time PCR (qPCR) was performed. The templates used were as follows: 1 μl of a 1:100 dilution for input and 1 μl of a 1:5 dilution for IP. The conditions used in qPCR were as follows: 94°C for 2 min; 94°C for 30 s, Tm for 30 s (Tm varies with the primers), 72°C for 45 s for 30 cycles. The results were plotted according to the percentage input method using the formula: 100 × 2^(adjusted Ct of input − adjusted Ct of IP). Here, the adjusted Ct is the Ct value of the diluted input or IP minus log2 of the respective dilution factor. Similar conditions were used to determine the enrichment of CENP-A proteins on the centromeric plasmids.
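The percentage input calculation can be sketched in Python as follows. This is a minimal illustration under the dilution factors stated above (1:100 for input, 1:5 for IP); the function names and default arguments are assumptions introduced here, not part of the original analysis.

```python
import math


def adjusted_ct(ct, dilution_factor):
    """Ct corrected for sample dilution: Ct minus log2 of the dilution factor."""
    return ct - math.log2(dilution_factor)


def percent_input(ct_input, ct_ip, input_dilution=100, ip_dilution=5):
    """Percent input = 100 * 2^(adjusted Ct of input - adjusted Ct of IP).

    The default dilution factors follow the template dilutions in the text;
    the function and parameter names are hypothetical.
    """
    delta = adjusted_ct(ct_input, input_dilution) - adjusted_ct(ct_ip, ip_dilution)
    return 100 * 2 ** delta
```

With these defaults, equal raw Ct values for input and IP correspond to an enrichment of 5% of input, because the input template is 20-fold more dilute than the IP template.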
To determine conservation rates for inverted repeats (IRs) within and across centromeres, and for mid regions across different centromeres, we used Sigma version 2, a program that aims to minimize spurious alignments by using a stringent p-value for all local alignments, and that uses a background model with correlations and an evolutionary model to link sequences. The background model and substitution matrix were drawn from S. cerevisiae and close relatives, and are not expected to vary significantly across Saccharomycetes. The branch lengths were determined dynamically. The conservation rates in Fig 4C were determined from these alignments using custom Python scripts. The visual representation of the alignments shown in S5A, S5C and S5D Fig was created with an in-house program. The dotplot in S5E Fig was created with dotmatcher from EMBOSS 6.3.1. Additionally, the inverted repeats and mid regions were scanned for tandem repeats using Tandem Repeats Finder version 4.04. The parameters used were “filename 2 5 5 80 10 2 2000” (maximum period size 2000). The results are summarized in S3 Table. For the synteny analysis in Fig 5, orthology information was obtained from the Fungal Orthogroups Repository (http://www.broadinstitute.org/regev/orthogroups). Genes in each species within 100 kb of each centromere were examined, and orthologous genes were plotted using an in-house program.
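For readers unfamiliar with the positional arguments of Tandem Repeats Finder, the short Python sketch below labels the parameter string quoted above. The field order (match, mismatch, delta, PM, PI, minimum score, maximum period) is assumed from the program's usual command-line convention; only the last field is confirmed by the text as the maximum period size. The helper name is hypothetical.

```python
# Positional order assumed from the Tandem Repeats Finder command-line convention.
TRF_FIELDS = ("match", "mismatch", "delta", "pm", "pi", "minscore", "maxperiod")


def parse_trf_parameters(parameter_string):
    """Split a TRF argument string into the input file name and named integer settings."""
    file_name, *values = parameter_string.split()
    if len(values) != len(TRF_FIELDS):
        raise ValueError(f"expected {len(TRF_FIELDS)} numeric parameters, got {len(values)}")
    return file_name, dict(zip(TRF_FIELDS, (int(v) for v in values)))


# The parameter string reported in the text; "filename" stands for the input FASTA file.
file_name, settings = parse_trf_parameters("filename 2 5 5 80 10 2 2000")
```

Under this reading, the scan used a match weight of 2, mismatch and indel penalties of 5, match and indel probabilities of 80 and 10, a minimum alignment score of 2, and a maximum period size of 2000.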
ARS plasmid assay
Approximately 1 μg of DNA of each of the pARS2 and control parental (pUC19-CaURA3) plasmids was used to transform the CtKS04 strain by both the lithium acetate and the spheroplast transformation methods as described before. After transformation, the cells were plated on complete medium lacking uracil (CM-Ura) and incubated at 30°C for 3 to 5 days before photographs were taken. The ARS activity of pARS2 was determined as the transformation efficiency (i.e., the number of transformants/μg of DNA). Each transformation was performed at least 3 times.
Mitotic stability assay
The mitotic stability assay was performed to determine the loss rate of pARS2, pmid8, pCEN8, pARS2-λ, pCEN801 and pCEN802 in C. tropicalis. Briefly, the C. tropicalis strain CtKS102 transformed with the above-mentioned plasmids was streaked on CM-Ura plates for single colonies. Single colonies thus obtained were subsequently inoculated in nonselective medium (YPDU) and incubated at 30°C overnight for at least 10 generations. The next day, equal numbers of cells were plated simultaneously on YPDU and CM-Ura and incubated at 30°C for 2 days. Colonies grown on both plates were counted and the mitotic stability was calculated as a percentage as follows: Mitotic stability = (S/NS) × 100, where S and NS denote the number of colonies grown on selective and nonselective media, respectively.
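The calculation can be sketched in Python as shown below. The first function expresses the ratio of colonies on selective and nonselective media as a percentage; the second normalizes per-transformant values to the average of the replicative plasmid pARS2, as described for Fig 6B. Function names and the colony counts used in any call are hypothetical.

```python
def mitotic_stability(selective_colonies, nonselective_colonies):
    """Percentage of cells retaining the plasmid after nonselective growth."""
    if nonselective_colonies <= 0:
        raise ValueError("nonselective colony count must be positive")
    return 100.0 * selective_colonies / nonselective_colonies


def relative_to_reference(plasmid_values, reference_values):
    """Normalize each stability value to the mean of the reference plasmid (e.g., pARS2)."""
    reference_mean = sum(reference_values) / len(reference_values)
    return [value / reference_mean for value in plasmid_values]
```

For example, 45 colonies on CM-Ura and 180 colonies on YPDU from equal platings would correspond to a mitotic stability of 25%.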
A phylogenetic tree with estimated geological time was created via a multiple alignment of 573 gene orthologue sets in 13 sequenced species of Ascomycetous fungi (as shown in Fig 7A), namely C. tropicalis, S. cerevisiae, C. glabrata, K. lactis, A. gossypii, N. castellii, C. dubliniensis, C. albicans, C. lusitaniae, D. hansenii, C. guilliermondii, Y. lipolytica, N. crassa, A. nidulans, S. japonicus, S. octosporus, and S. pombe. The orthologous genes were identified using the Fungal Orthogroups Repository (http://www.broadinstitute.org/regev/orthogroups/), except in the case of C. dubliniensis, for which orthologues to C. albicans were used as annotated in the gene sequences from the Candida Genome Database (http://www.candidagenome.org/). Only genes for which there were unique reciprocal orthologues between S. cerevisiae and each of the 13 other species, and which lacked introns (or from which we could easily remove introns), were considered. To remove bias from outliers, the orthologous genes in all species were further sub-selected for genes that evolve uniformly. For this, the average rates of synonymous (ds) and non-synonymous (dn) substitution were calculated separately from codon-level alignments. Only genes whose ds and dn both fell within 1.5 standard deviations of the mean for the full set were considered. This yielded a list of 573 genes. These coding DNA sequences were aligned at the codon level with FSA (command line option “--translated”), concatenated with gaps removed, and a tree was generated with codonphyml (command line “-d codon -q”). Since the quantity of sequence was very large (nearly 10 Mbp, or over 0.5 Mbp per species), bootstrapping was not done.
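The sub-selection of uniformly evolving genes can be illustrated with a minimal Python sketch. It keeps genes whose average ds and dn both lie within 1.5 standard deviations of the respective means of the full set. The function name, the input layout (a mapping from gene name to a (ds, dn) pair), and the use of the population standard deviation are assumptions; the text does not specify which estimator was used.

```python
from statistics import mean, pstdev


def uniformly_evolving_genes(rates, n_sd=1.5):
    """Return genes whose ds and dn both lie within n_sd standard deviations of the mean.

    rates maps a gene name to a (ds, dn) tuple of average substitution rates.
    The population standard deviation is used here as an assumption.
    """
    ds_values = [ds for ds, _ in rates.values()]
    dn_values = [dn for _, dn in rates.values()]
    ds_mean, ds_sd = mean(ds_values), pstdev(ds_values)
    dn_mean, dn_sd = mean(dn_values), pstdev(dn_values)
    return sorted(
        gene
        for gene, (ds, dn) in rates.items()
        if abs(ds - ds_mean) <= n_sd * ds_sd and abs(dn - dn_mean) <= n_sd * dn_sd
    )
```

A gene is dropped if either rate is an outlier, so a gene with a typical ds but an unusually high dn is excluded just as one with an unusual ds would be.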
S1 Text. Supplemental text discussion and detailed methods.
S1 Fig. Kinetochore proteins in C. tropicalis.
(A) A pair-wise alignment of each of the four putative kinetochore proteins in C. albicans and C. tropicalis. The domain architecture of CENP-A and CENP-C is shown below the sequence alignment. (B) A pair-wise alignment of first 18 amino acids of CaCENP-A and CtCENP-A as shown in above panel revealed a high level of amino acid sequence conservation. Anti-Cse4 antibodies were raised against the first 18 amino acids of CaCENP-A. Western blot analysis with anti-Cse4 and anti-histone H3 antibodies was performed to detect the specificity of anti-Cse4 antibodies. (C) CENP-A is localized at the kinetochores in C. tropicalis. Fixed cells of C. tropicalis strain MYA-3404 were stained with DAPI, anti-Cse4, and anti-tubulin antibodies. The intense single red dot-like CENP-A signals were observed in DAPI-stained (blue) nuclei at G1 unbudded cells and segregate to become two dots during mitosis. Corresponding spindle structures (green) are shown by co-immunostaining with anti-tubulin antibodies. Scale bar, 5 μm.
S2 Fig. Depletion of kinetochore proteins leads to defective nuclear segregation in C. tropicalis.
Cells of respective conditional mutant strains grown in the repressive condition (glucose) were stained with DAPI and images were taken by fluorescence microscopy. Arrows indicate an unsegregated mass of nucleus at the mother bud neck, which is a typical feature of G2/M arrest in yeasts. The corresponding panels show the DIC images. Scale bar, 5 μm.
S3 Fig. Genome-wide CENP-A and CENP-C ChIP-seq analyses in C. tropicalis.
(A) Plots of CENP-A and CENP-C ChIP-seq reads along individual supercontigs of C. tropicalis. The x-axis and y-axis represent the coordinates of the chromosomal regions and the distribution of sequence reads of the specific supercontig, respectively, as described before. However, it should be noted that ChIP-seq analysis using standardized protocols detected reads on all supercontigs except Supercontig 19 (Scnt 19) (a supercontig is referred to as ‘Scnt’ followed by the supercontig number). (B) The resequenced portion of Scnt 1 shows regions of high sequence similarity (stretches of a few hundred base pairs (bp) with no or very few mismatches) with the original Scnt 1, Scnt 3 and Scnt 19. The similar regions are marked as (i), (ii), (iii) and (iv). This suggested that these two supercontigs share long stretches of nearly identical sequence. Because the ChIP-seq analysis algorithm allows only uniquely aligning reads (see methods), this high degree of identity would cause problems both in the original assembly and in uniquely aligning our ChIP-seq reads. Thus, we carried out the ChIP-seq analysis against a reference that consisted of all supercontigs except Scnt 19.
S4 Fig. Centromeric regions belong to different chromosomes in C. tropicalis.
Chromosomes of C. tropicalis were resolved on CHEF gels and stained with ethidium bromide (EtBr) along with C. albicans chromosomes used as size markers (left-most lane). The gels were blotted and probed with unique sequences of the corresponding supercontigs that carry a CENP-A-rich region (right lanes). Southern hybridization shows that each CENP-A-rich centromeric region belongs to a unique chromosomal band in C. tropicalis. All Southern blot images were derived by reprobing the same membrane.
S5 Fig. The centromere of C. tropicalis comprises a homogenized mid core flanked by inverted repeats.
(A) Homologous segments in the middle regions (mids) of the seven centromeres are shown. The extended length of mid7 is due to the presence of two annotated retrotransposons in C. tropicalis. (B) The centromeric location of one of these retrotransposons (CTRG_05088.3) and its homolog (Cd36_71790) is conserved between C. tropicalis (CtCEN7) and C. dubliniensis (CdCEN7). (C) Homologous segments in the inverted repeat arms of the seven centromeres. The red bars indicate the arms of the inverted repeats and are drawn to scale. The coloured bands crossing the arms indicate homologous segments; a band that passes under an arm indicates no homology on that arm. 'LR' and 'RR' represent the left and right repeats, respectively; supercontig numbers are shown on the right. In (A) and (C), an asterisk (*) represents a reverse-complementary sequence. (D) Homologous segments in pair-wise alignments of the seven pairs of inverted repeat arms at the centromeres, demonstrating that the conservation between inverted repeats of the same centromere is significantly higher than across centromeres. (E) A comparison of the inverted repeats of the centromere in Scnt 5 with a dotplot, showing that regions with tandem repeats (squares along the diagonal) tend to correspond with breaks in the sequence alignment. This pattern of three regions of tandem repeats is seen in most arms.
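How a dotplot such as the one in (E) exposes inverted and tandem repeats can be sketched in a few lines of Python. This is an illustrative word-match dotplot, not the software used to draw the figure; the toy sequences, the word sizes and the function names `reverse_complement` and `dotplot` are assumptions made for the example.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq))


def dotplot(seq_x, seq_y, word=4):
    """Return the set of (i, j) where the word at seq_x[i:] equals the word at seq_y[j:]."""
    index = {}
    for j in range(len(seq_y) - word + 1):
        index.setdefault(seq_y[j:j + word], []).append(j)
    dots = set()
    for i in range(len(seq_x) - word + 1):
        for j in index.get(seq_x[i:i + word], []):
            dots.add((i, j))
    return dots


# Hypothetical inverted repeat: the right arm is the reverse complement of the left arm.
left_arm = "ATGCGTACGTTAGC"
right_arm = reverse_complement(left_arm)

# Plotting the left arm against the reverse complement of the right arm
# puts the conserved sequence on the main diagonal.
arm_dots = dotplot(left_arm, reverse_complement(right_arm))

# A tandem repeat plotted against itself also yields off-diagonal dots,
# which together form a square block along the diagonal.
tandem_dots = dotplot("ACACACAC", "ACACACAC", word=2)
```

Conserved arms appear as an unbroken diagonal, whereas a tandem repeat fills a square around the diagonal because each repeat unit matches every other unit.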
S6 Fig. Orthologous genes present across 100 kb of centromeres in C. albicans and C. tropicalis.
(A) Orthologous genes or groups of genes on the same strand are joined with blue bands, and those on the opposite strand with red bands. Genes in grey may have orthologs 100 kb or farther from centromeres in these species. Green regions show the centromeres of both organisms. (B) Phylogeny of Candida species with the chromosome number of each species. The phylogeny shown here is adapted from Fitzpatrick et al. [99].
S7 Fig. Characterization of pARS2 as a replicative plasmid in C. tropicalis.
A map of pARS2 with the cloned sites of CaURA3 and CaARS2 is shown. The plate pictures show the ARS function assay of pARS2 as compared to the control parent plasmid (pUC19-CaURA3). A table shows the transformation frequency of pARS2 as compared to the control, performed either by spheroplasting (ST) or by the lithium acetate method (LT). The transformation experiment was done with three replicates (n = 3) and the mean with standard deviation is indicated in each case. The control plasmid did not yield any transformants.
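The mean and standard deviation reported for three replicates can be computed as in the following Python sketch. The colony counts are hypothetical and are not the values in the table. Two further assumptions are made here: transformation frequency is expressed as transformants per microgram of plasmid DNA, and the sample standard deviation (n - 1 denominator) is used.

```python
from statistics import mean, stdev


def transformation_frequency(colonies, dna_micrograms):
    """Transformants per microgram of plasmid DNA (assumed definition)."""
    return colonies / dna_micrograms


def summarize(replicates):
    """Return (mean, sample standard deviation) across replicate transformations."""
    freqs = [transformation_frequency(c, d) for c, d in replicates]
    return mean(freqs), stdev(freqs)


# Hypothetical replicate data: (colony count, micrograms of DNA used).
replicates = [(120, 1.0), (150, 1.0), (135, 1.0)]
avg, sd = summarize(replicates)
print(f"{avg:.1f} +/- {sd:.1f} transformants per microgram (n = {len(replicates)})")
```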
S8 Fig. Plasmids used in this study.
Schematics represent the locations of plasmid-specific primer pairs for each plasmid used in the mitotic stability assays. The brown color demarcates a unique 6-bp SalI site in a plasmid, which is absent at the native locus. The specificity of the amplicon carrying plasmid-borne CEN8 was achieved by the addition of a unique engineered SalI site at the 3’ end of each primer. Schematics are not drawn to scale.
S1 Table. The length and coordinates of the CENP-A- and CENP-C-binding regions, as identified by ChIP-seq analysis, within an ORF-free region in C. tropicalis.
S2 Table. The length and coordinates of the inverted repeats (IRs) along with the mid core region of each centromere in C. tropicalis.
S3 Table. Tandem repeats (TRs) within the IR arms at the pericentric regions, identified using Tandem Repeats Finder version 4.04.
S4 Table. Primers used in this study.
S5 Table. Strains used in this study.
We are thankful to Genotypic Technology (Bangalore), B. Suma and G. Anitha for their help with NGS, confocal microscopy and Sanger sequencing, respectively. We thank R. Bennett and J. Morschhäuser for sharing the plasmids. We thank J. Heitman, V. Yadav and L. Sreekumar for critically reading the manuscript.
Conceived and designed the experiments: GC SRS KG YT SP RS KS. Performed the experiments: GC SRS KG YT SP RS. Analyzed the data: GC SRS KG YT SP RS KS. Contributed reagents/materials/analysis tools: GC SRS KG YT SP RS KS. Wrote the paper: GC SRS KG YT SP RS KS.
- 1. Henikoff S, Ahmad K, Malik HS. The centromere paradox: stable inheritance with rapidly evolving DNA. Science. 2001;293(5532):1098–102. Epub 2001/08/11. pmid:11498581
- 2. Roy B, Sanyal K. Diversity in requirement of genetic and epigenetic factors for centromere function in fungi. Eukaryot Cell. 2011;10(11):1384–95. Epub 2011/09/13. doi: 10.1128/EC.05165-11. pmid:21908596
- 3. Sanyal K. How do microbial pathogens make CENs. PLoS Pathog. 2012;8(2):e1002463. doi: 10.1371/journal.ppat.1002463. pmid:22346745
- 4. Meraldi P, McAinsh AD, Rheinbay E, Sorger PK. Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins. Genome Biol. 2006;7(3):R23. Epub 2006/03/28. pmid:16563186
- 5. Wang G, Zhang X, Jin W. An overview of plant centromeres. J Genet Genomics. 2009;36(9):529–37. doi: 10.1016/S1673-8527(08)60144-7. pmid:19782954
- 6. Lamb JC, Yu W, Han F, Birchler JA. Plant chromosomes from end to end: telomeres, heterochromatin and centromeres. Curr Opin Plant Biol. 2007;10(2):116–22. pmid:17291819
- 7. Plohl M, Meśtrović N, Mravinac B. Centromere identity from the DNA point of view. Chromosoma. 2014;123(4):313–25. doi: 10.1007/s00412-014-0462-0. pmid:24763964
- 8. Schueler MG, Sullivan BA. Structural and functional dynamics of human centromeric chromatin. Ann Rev Genomics Hum Genet. 2006;7(1):301–13.
- 9. Cambareri EB, Aisner R, Carbon J. Structure of the chromosome VII centromere region in Neurospora crassa: Degenerate transposons and simple repeats. Mol Cell Biol. 1998;18(9):5465–77. pmid:9710630
- 10. Smith KM, Phatale PA, Sullivan CM, Pomraning KR, Freitag M. Heterochromatin is required for normal distribution of Neurospora crassa CenH3. Mol Cell Biol. 2011;31(12):2528–42. doi: 10.1128/MCB.01285-10. pmid:21505064
- 11. Janbon G, Ormerod KL, Paulet D, Byrnes EJ III, Yadav V, Chatterjee G, et al. Analysis of the genome and transcriptome of Cryptococcus neoformans var. grubii reveals complex RNA expression and microevolution leading to virulence attenuation. PLoS Genet. 2014;10(4):e1004261. doi: 10.1371/journal.pgen.1004261. pmid:24743168
- 12. Chikashige Y, Kinoshita N, Nakaseko Y, Matsumoto T, Murakami S, Niwa O, et al. Composite motifs and repeat symmetry in S. pombe centromeres: Direct analysis by integration of NotI restriction sites. Cell. 1989;57(5):739–51. pmid:2541922
- 13. Clarke L, Baum MP. Functional analysis of a centromere from fission yeast: a role for centromere-specific repeated DNA sequences. Mol Cell Biol. 1990;10(5):1863–72. pmid:2325639
- 14. Hahnenberger KM, Carbon J, Clarke L. Identification of DNA regions required for mitotic and meiotic functions within the centromere of Schizosaccharomyces pombe chromosome I. Mol Cell Biol. 1991;11(4):2206–15. pmid:2005906
- 15. Sanyal K, Baum M, Carbon J. Centromeric DNA sequences in the pathogenic yeast Candida albicans are all different and unique. Proc Natl Acad Sci USA. 2004;101(31):11374–9. pmid:15272074
- 16. Padmanabhan S, Thakur J, Siddharthan R, Sanyal K. Rapid evolution of Cse4p-rich centromeric DNA sequences in closely related pathogenic yeasts, Candida albicans and Candida dubliniensis. Proc Natl Acad Sci USA. 2008;105(50):19797–802. doi: 10.1073/pnas.0809770105. pmid:19060206
- 17. Kapoor S, Zhu L, Froyd C, Liu T, Rusche LN. Regional centromeres in the yeast Candida lusitaniae lack pericentromeric heterochromatin. Proc Natl Acad Sci USA. 2015;112(39):12139–44. doi: 10.1073/pnas.1508749112. pmid:26371315
- 18. Malik HS, Henikoff S. Major evolutionary transitions in centromere complexity. Cell. 2009;138(6):1067–82. Epub 2009/09/22. doi: 10.1016/j.cell.2009.08.036. pmid:19766562
- 19. Voullaire LE, Slater HR, Petrovic V, Choo KH. A functional marker centromere with no detectable alpha-satellite, satellite III, or CENP-B protein: activation of a latent centromere? Am J Hum Genet. 1993;52(6):1153–63. pmid:7684888
- 20. Platero JS, Ahmad K, Henikoff S. A distal heterochromatic block displays centromeric activity when detached from a natural centromere. Mol Cell. 1999;4(6):995–1004. pmid:10635324
- 21. Ishii K, Ogiyama Y, Chikashige Y, Soejima S, Masuda F, Kakuma T, et al. Heterochromatin integrity affects chromosome reorganization after centromere dysfunction. Science. 2008;321(5892):1088–91. doi: 10.1126/science.1158699. pmid:18719285
- 22. Ketel C, Wang HSW, McClellan M, Bouchonville K, Selmecki A, Lahav T, et al. Neocentromeres form efficiently at multiple possible loci in Candida albicans. PLoS Genet. 2009;5(3):e1000400. doi: 10.1371/journal.pgen.1000400. pmid:19266018
- 23. Thakur J, Sanyal K. Efficient neocentromere formation is suppressed by gene conversion to maintain centromere function at native physical chromosomal loci in Candida albicans. Genome Res. 2013;23(4):638–52. doi: 10.1101/gr.141614.112. pmid:23439889
- 24. Shang W-H, Hori T, Martins NMC, Toyoda A, Misu S, Monma N, et al. Chromosome engineering allows the efficient isolation of vertebrate neocentromeres. Dev Cell. 2013;24(6):635–48. doi: 10.1016/j.devcel.2013.02.009. pmid:23499358
- 25. Earnshaw WC, Migeon BR. Three related centromere proteins are absent from the inactive centromere of a stable isodicentric chromosome. Chromosoma. 1985;92(4):290–6. Epub 1985/01/01. pmid:2994966
- 26. Agudo M, Abad JP, Molina I, Losada A, Ripoll P, Villasante A. A dicentric chromosome of Drosophila melanogaster showing alternate centromere inactivation. Chromosoma. 2000;109(3):190–6. Epub 2000/08/10. pmid:10929197
- 27. Stimpson KM, Song IY, Jauch A, Holtgreve-Grez H, Hayden KE, Bridger JM, et al. Telomere disruption results in non-random formation of de novo dicentric chromosomes involving acrocentric human chromosomes. PLoS Genet. 2010;6(8). Epub 2010/08/17.
- 28. Sato H, Masuda F, Takayama Y, Takahashi K, Saitoh S. Epigenetic inactivation and subsequent heterochromatinization of a centromere stabilize dicentric chromosomes. Curr Biol. 2012;22(8):658–67. Epub 2012/04/03. doi: 10.1016/j.cub.2012.02.062. pmid:22464190
- 29. Stimpson KM, Sullivan BA. Epigenomics of centromere assembly and function. Curr Opin Cell Biol. 2010;22(6):772–80. doi: 10.1016/j.ceb.2010.07.002. pmid:20675111
- 30. McKinley KL, Cheeseman IM. The molecular basis for centromere identity and function. Nat Rev Mol Cell Biol. 2016;17(1):16–29. doi: 10.1038/nrm.2015.5. pmid:26601620
- 31. Earnshaw WC, Allshire RC, Black BE, Bloom K, Brinkley BR, Brown W, et al. Esperanto for histones: CENP-A, not CenH3, is the centromeric histone H3 variant. Chromosome Res. 2013;21(2):101–6. Epub 2013/04/13. doi: 10.1007/s10577-013-9347-y. pmid:23580138
- 32. Allshire RC, Karpen GH. Epigenetic regulation of centromeric chromatin: old dogs, new tricks? Nat Rev Genet. 2008;9(12):923–37. Epub 2008/11/13. doi: 10.1038/nrg2466. pmid:19002142
- 33. Guse A, Carroll CW, Moree B, Fuller CJ, Straight AF. In vitro centromere and kinetochore assembly on defined chromatin templates. Nature. 2011;477(7364):354–8. doi: 10.1038/nature10379. pmid:21874020
- 34. Fachinetti D, Diego Folco H, Nechemia-Arbely Y, Valente LP, Nguyen K, Wong AJ, et al. A two-step mechanism for epigenetic specification of centromere identity and function. Nat Cell Biol. 2013;15(9):1056–66. doi: 10.1038/ncb2805. pmid:23873148
- 35. Folco HD, Pidoux AL, Urano T, Allshire RC. Heterochromatin and RNAi are required to establish CENP-A chromatin at centromeres. Science. 2008;319(5859):94–7. doi: 10.1126/science.1150944. pmid:18174443
- 36. Buscaino A, Allshire R, Pidoux A. Building centromeres: home sweet home or a nomadic existence? Cur Opin Genet Dev. 2010;20(2):118–26.
- 37. Folco HD, Pidoux AL, Urano T, Allshire RC. Heterochromatin and RNAi are required to establish CENP-A chromatin at centromeres. Science. 2008;319(5859):94–7. Epub 2008/01/05. doi: 10.1126/science.1150944. pmid:18174443
- 38. Baum M, Sanyal K, Mishra PK, Thaler N, Carbon J. Formation of functional centromeric chromatin is specified epigenetically in Candida albicans. Proc Natl Acad Sci USA. 2006;103(40):14877–82. Epub 2006/09/27. pmid:17001001
- 39. Papon N, Courdavault V, Clastre M, Bennett RJ. Emerging and emerged pathogenic Candida species: beyond the Candida albicans paradigm. PLoS Pathog. 2013;9(9):e1003550. Epub 2013/10/03. doi: 10.1371/journal.ppat.1003550. pmid:24086128
- 40. Pfaller MA, Diekema DJ, Gibbs DL, Newell VA, Ellis D, Tullio V, et al. Results from the ARTEMIS DISK Global Antifungal Surveillance Study, 1997 to 2007: a 10.5-year analysis of susceptibilities of Candida Species to fluconazole and voriconazole as determined by CLSI standardized disk diffusion. J Clin Microbiol. 2010;48(4):1366–77. Epub 2010/02/19. doi: 10.1128/JCM.02117-09. pmid:20164282
- 41. Kothavade RJ, Kura MM, Valand AG, Panthaki MH. Candida tropicalis: its prevalence, pathogenicity and increasing resistance to fluconazole. J Med Microbiol. 2010;59(Pt 8):873–80. Epub 2010/04/24.
- 42. Chakrabarti A, Sood P, Rudramurthy Shivaprakash M, Chen S, Kaur H, Capoor M, et al. Incidence, characteristics and outcome of ICU-acquired candidemia in India. Intensive Care Medicine. 2015;41(2):285–95. doi: 10.1007/s00134-014-3603-2. pmid:25510301
- 43. Butler G, Rasmussen MD, Lin MF, Santos MAS, Sakthikumar S, Munro CA, et al. Evolution of pathogenicity and sexual reproduction in eight Candida genomes. Nature. 2009;459(7247):657–62. doi: 10.1038/nature08064. pmid:19465905
- 44. Burrack LS, Applen SE, Berman J. The requirement for the Dam1 complex is dependent upon the number of kinetochore proteins and microtubules. Curr Biol. 2011;21(10):889–96. Epub 2011/05/10. doi: 10.1016/j.cub.2011.04.002. pmid:21549601
- 45. Sanyal K, Carbon J. The CENP-A homolog CaCse4p in the pathogenic yeast Candida albicans is a centromere protein essential for chromosome transmission. Proc Natl Acad Sci USA. 2002;99(20):12969–74. pmid:12271118
- 46. Thakur J, Sanyal K. The essentiality of the fungus-specific Dam1 complex is correlated with a one-kinetochore-one-microtubule interaction present throughout the cell cycle, independent of the nature of a centromere. Eukaryot Cell. 2011;10(10):1295–305. Epub 2011/05/17. doi: 10.1128/EC.05093-11. pmid:21571923
- 47. Thakur J, Sanyal K. A coordinated interdependent protein circuitry stabilizes the kinetochore ensemble to protect CENP-A in the human pathogenic yeast Candida albicans. PLoS Genet. 2012;8(4):e1002661. Epub 2012/04/27. doi: 10.1371/journal.pgen.1002661. pmid:22536162
- 48. Mitra S, Rai LS, Chatterjee G, Sanyal K. Chromatin immunoprecipitation (ChIP) assay in Candida albicans. in Candida species: Methods and Protocols eds. Calderone Richard and Cihlar Ronald. (in Press). 2015.
- 49. Mishra P, Baum M, Carbon J. Centromere size and position in Candida albicans are evolutionarily conserved independent of DNA sequence heterogeneity. Mol Genet Genomics. 2007;278(4):455–65. pmid:17588175
- 50. Blower MD, Karpen GH. The role of Drosophila CID in kinetochore formation, cell-cycle progression and heterochromatin interactions. Nat Cell Biol. 2001;3(8):730–9. pmid:11483958
- 51. Howman EV, Fowler KJ, Newson AJ, Redward S, MacDonald AC, Kalitsis P, et al. Early disruption of centromeric chromatin organization in centromere protein A (Cenpa) null mice. Proc Natl Acad Sci USA. 2000;97(3):1148–53. pmid:10655499
- 52. Oegema K, Desai A, Rybina S, Kirkham M, Hyman AA. Functional analysis of kinetochore assembly in Caenorhabditis elegans. J Cell Biol. 2001;153(6):1209–26. pmid:11402065
- 53. Bulazel KV, Ferreri GC, Eldridge MD, O'Neill RJ. Species-specific shifts in centromere sequence composition are coincident with breakpoint reuse in karyotypically divergent lineages. Genome Biol. 2007;8(8):R170. Epub 2007/08/22. pmid:17708770
- 54. Carbone L, Nergadze SG, Magnani E, Misceo D, Francesca Cardone M, Roberto R, et al. Evolutionary movement of centromeres in horse, donkey, and zebra. Genomics. 2006;87(6):777–82. pmid:16413164
- 55. Chmátal L, Gabriel SI, Mitsainas GP, Martínez-Vargas J, Ventura J, Searle JB, et al. Centromere strength provides the cell biological basis for meiotic drive and karyotype evolution in mice. Curr Biol. 2014;24(19):2295–300. doi: 10.1016/j.cub.2014.08.017. pmid:25242031
- 56. Coghlan A, Eichler EE, Oliver SG, Paterson AH, Stein L. Chromosome evolution in eukaryotes: a multi-kingdom perspective. Trends Genet. 2005;21(12):673–82. pmid:16242204
- 57. Fischer G, James SA, Roberts IN, Oliver SG, Louis EJ. Chromosomal evolution in Saccharomyces. Nature. 2000;405(6785):451–4. pmid:10839539
- 58. Hou J, Friedrich A, de Montigny J, Schacherer J. Chromosomal rearrangements as a major mechanism in the onset of reproductive isolation in Saccharomyces cerevisiae. Curr Biol. 2014;24(10):1153–9. doi: 10.1016/j.cub.2014.03.063. pmid:24814147
- 59. Brown WRA, Thomas G, Lee NCO, Blythe M, Liti G, Warringer J, et al. Kinetochore assembly and heterochromatin formation occur autonomously in Schizosaccharomyces pombe. Proc Natl Acad Sci USA. 2014;111(5):1903–8. doi: 10.1073/pnas.1216934111. pmid:24449889
- 60. Cannon RD, Jenkinson HF, Shepherd MG. Isolation and nucleotide sequence of an autonomously replicating sequence (ARS) element functional in Candida albicans and Saccharomyces cerevisiae. Mol Gen Genet. 1990;221(2):210–8. Epub 1990/04/01. pmid:2196431
- 61. Hieter P, Mann C, Snyder M, Davis RW. Mitotic stability of yeast chromosomes: A colony color assay that measures nondisjunction and chromosome loss. Cell. 1985;40(2):381–92. pmid:3967296
- 62. Baum M, Ngan VK, Clarke L. The centromeric K-type repeat and the central core are together sufficient to establish a functional Schizosaccharomyces pombe centromere. Mol Biol Cell. 1994;5(7):747–61. pmid:7812044
- 63. Sipiczki M. Where does fission yeast sit on the tree of life? Genome Biol. 2000;1(2):reviews1011.1–reviews.4.
- 64. Bensasson D, Zarowiecki M, Burt A, Koufopanou V. Rapid evolution of yeast centromeres in the absence of drive. Genetics. 2008;178(4):2161–7. doi: 10.1534/genetics.107.083980. pmid:18430941
- 65. Wood V, Gwilliam R, Rajandream MA, Lyne M, Lyne R, Stewart A, et al. The genome sequence of Schizosaccharomyces pombe. Nature. 2002;415(6874):871–80. pmid:11859360
- 66. Wong LH, Choo KHA. Evolutionary dynamics of transposable elements at the centromere. Trends Genet. 2004;20(12):611–6. pmid:15522456
- 67. Catania S, Pidoux AL, Allshire RC. Sequence features and transcriptional stalling within centromere DNA promote establishment of CENP-A chromatin. PLoS Genet. 2015;11(3):e1004986. doi: 10.1371/journal.pgen.1004986. pmid:25738810
- 68. Roach KS, Ross BD, Malik HS. Rapid evolution of centromeres and centromeric/ kinetochore proteins. in Rapidly evolving genes and genetic systems, eds Singh RS, Xu J and Kulathinal RJ (Oxford University Press). 2012:83–93.
- 69. Wang K, Wu Y, Zhang W, Dawe RK, Jiang J. Maize centromeres expand and adopt a uniform size in the genetic background of oat. Genome Res. 2014;24(1):107–16. Epub 2013/10/09. doi: 10.1101/gr.160887.113. pmid:24100079
- 70. Ravi M, Chan SW. Haploid plants produced by centromere-mediated genome elimination. Nature. 2010;464(7288):615–8. Epub 2010/03/26. doi: 10.1038/nature08842. pmid:20336146
- 71. Sanei M, Pickering R, Kumke K, Nasuda S, Houben A. Loss of centromeric histone H3 (CENH3) from centromeres precedes uniparental chromosome elimination in interspecific barley hybrids. Proc Natl Acad Sci U S A. 2011;108(33):E498–505. Epub 2011/07/13. doi: 10.1073/pnas.1103190108. pmid:21746892
- 72. Fukagawa T, Earnshaw WC. The centromere: chromatin foundation for the kinetochore machinery. Dev Cell. 2014;30(5):496–508. Epub 2014/09/10. doi: 10.1016/j.devcel.2014.08.016. pmid:25203206
- 73. Chan FL, Wong LH. Transcription in the maintenance of centromere chromatin identity. Nucleic Acids Res. 2012;40(22):11178–88. doi: 10.1093/nar/gks921. pmid:23066104
- 74. Barraclough TG, Birky CW Jr., Burt A. Diversification in sexual and asexual organisms. Evolution. 2003;57(9):2166–72. Epub 2003/10/25. pmid:14575336
- 75. Kalitsis P, Choo KHA. The evolutionary life cycle of the resilient centromere. Chromosoma. 2012;121(4):327–40. doi: 10.1007/s00412-012-0369-6. pmid:22527114
- 76. Gordon JL, Byrne KP, Wolfe KH. Mechanisms of chromosome number evolution in yeast. PLoS Genet. 2011;7(7):e1002190. Epub 2011/08/04. doi: 10.1371/journal.pgen.1002190. pmid:21811419
- 77. Murphy WJ, Larkin DM, Everts-van der Wind A, Bourque G, Tesler G, Auvil L, et al. Dynamics of mammalian chromosome evolution inferred from multispecies comparative maps. Science. 2005;309(5734):613–7. Epub 2005/07/26. pmid:16040707
- 78. Selmecki A, Forche A, Berman J. Genomic plasticity of the human fungal pathogen Candida albicans. Eukaryot Cell. 2010;9(7):991–1008. Epub 2010/05/25. doi: 10.1128/EC.00060-10. pmid:20495058
- 79. Hughes JF, Skaletsky H, Pyntikova T, Graves TA, van Daalen SKM, Minx PJ, et al. Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature. 2010;463(7280):536–9. doi: 10.1038/nature08700. pmid:20072128
- 80. Foote AD, Liu Y, Thomas GWC, Vinar T, Alfoldi J, Deng J, et al. Convergent evolution of the genomes of marine mammals. Nat Genet. 2015;47(3):272–5. doi: 10.1038/ng.3198. pmid:25621460
- 81. Martienssen R, Moazed D. RNAi and heterochromatin assembly. Cold Spring Harb Perspect Biol. 2015;7(8):a019323. Epub 2015/08/05. doi: 10.1101/cshperspect.a019323. pmid:26238358
- 82. Nakayashiki H, Kadotani N, Mayama S. Evolution and diversification of RNA silencing proteins in fungi. J Mol Evol. 2006;63(1):127–35. pmid:16786437
- 83. Barnhart MC, Kuich PHJL, Stellfox ME, Ward JA, Bassett EA, Black BE, et al. HJURP is a CENP-A chromatin assembly factor sufficient to form a functional de novo kinetochore. J Cell Biol. 2011;194(2):229–43. doi: 10.1083/jcb.201012017. pmid:21768289
- 84. Mendiburo MJ, Padeken J, Fulop S, Schepers A, Heun P. Drosophila CENH3 is sufficient for centromere formation. Science. 2011;334(6056):686–90. doi: 10.1126/science.1206880. pmid:22053052
- 85. Baker RE, Rogers K. Phylogenetic analysis of fungal centromere H3 proteins. Genetics. 2006;174(3):1481–92. pmid:17028330
- 86. Johnston M, Davis RW. Sequences that regulate the divergent GAL1-GAL10 promoter in Saccharomyces cerevisiae. Mol Cell Biol. 1984;4(8):1440–8. pmid:6092912
- 87. West RW, Yocum RR, Ptashne M. Saccharomyces cerevisiae GAL1-GAL10 divergent promoter region: location and function of the upstream activating sequence UASG. Mol Cell Biol. 1984;4(11):2467–78. pmid:6392852
- 88. Mukhopadhyay A, Deplancke B, Walhout AJ, Tissenbaum HA. Chromatin immunoprecipitation (ChIP) coupled to detection by quantitative real-time PCR to study transcription factor binding to DNA in Caenorhabditis elegans. Nat Protoc. 2008;3(4):698–709. Epub 2008/04/05. doi: 10.1038/nprot.2008.38. pmid:18388953
- 89. Jayaraman G, Siddharthan R. Sigma-2: Multiple sequence alignment of non-coding DNA via an evolutionary model. BMC Bioinformatics. 2010;11:464. Epub 2010/09/18. doi: 10.1186/1471-2105-11-464. pmid:20846408
- 90. Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16(6):276–7. Epub 2000/05/29. pmid:10827456
- 91. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573–80. pmid:9862982
- 92. Wapinski I, Pfeffer A, Friedman N, Regev A. Automatic genome-wide reconstruction of phylogenetic gene trees. Bioinformatics. 2007;23(13):i549–i58. pmid:17646342
- 93. Mitra S, Gómez-Raja J, Larriba G, Dubey DD, Sanyal K. Rad51-Rad52 mediated maintenance of centromeric chromatin in Candida albicans. PLoS Genet. 2014;10(4):e1004344. doi: 10.1371/journal.pgen.1004344. pmid:24762765
- 94. Bradley RK, Roberts A, Smoot M, Juvekar S, Do J, Dewey C, et al. Fast statistical alignment. PLoS Comput Biol. 2009;5(5):e1000392. Epub 2009/05/30. doi: 10.1371/journal.pcbi.1000392. pmid:19478997
- 95. Gil M, Zanetti MS, Zoller S, Anisimova M. CodonPhyML: fast maximum likelihood phylogeny estimation under codon substitution models. Mol Biol Evol. 2013;30(6):1270–80. Epub 2013/02/26. doi: 10.1093/molbev/mst034. pmid:23436912
- 96. Yamane T, Sakai H, Nagahama K, Ogawa T, Matsuoka M. Dissection of centromeric DNA from yeast Yarrowia lipolytica and identification of protein-binding site required for plasmid transmission. J Biosci Bioengineering. 2008;105(6):571–8.
- 97. Kobayashi N, Suzuki Y, Schoenfeld LW, Müller CA, Nieduszynski C, Wolfe KH, et al. Discovery of an unconventional centromere in budding yeast redefines evolution of point centromeres. Curr Biol. 2015;25(15):2026–33. doi: 10.1016/j.cub.2015.06.023. pmid:26166782
- 98. Lefrancois P, Auerbach RK, Yellman CM, Roeder GS, Snyder M. Centromere-like regions in the budding yeast genome. PLoS Genet. 2013;9(1):e1003209. doi: 10.1371/journal.pgen.1003209. pmid:23349633
- 99. Fitzpatrick DA, Logue ME, Stajich JE, Butler G. A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis. BMC Evol Biol. 2006;6:99. Epub 2006/11/24. pmid:17121679