Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Sequencer-Based Capillary Gel Electrophoresis (SCGE) Targeting the rDNA Internal Transcribed Spacer (ITS) Regions for Accurate Identification of Clinically Important Yeast Species

  • Xin Hou ,

    Contributed equally to this work with: Xin Hou, Meng Xiao

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Meng Xiao ,

    Contributed equally to this work with: Xin Hou, Meng Xiao

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Sharon C.-A. Chen,

    Affiliation Centre for Infectious Diseases and Microbiology Laboratory Services, ICPMR– Pathology West, Westmead Hospital, University of Sydney, Darcy Road, Westmead, New South Wales, Australia

  • He Wang,

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Li Zhang,

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Xin Fan,

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Zhi-Peng Xu,

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Jing-Wei Cheng,

    Affiliations Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China, Graduate School, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

  • Fanrong Kong,

    Affiliation Centre for Infectious Diseases and Microbiology Laboratory Services, ICPMR– Pathology West, Westmead Hospital, University of Sydney, Darcy Road, Westmead, New South Wales, Australia

  • Yu-Pei Zhao,

    Affiliation Department of General Surgery, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China

  • Ying-Chun Xu

    Affiliation Department of Clinical Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China

Sequencer-Based Capillary Gel Electrophoresis (SCGE) Targeting the rDNA Internal Transcribed Spacer (ITS) Regions for Accurate Identification of Clinically Important Yeast Species

  • Xin Hou, 
  • Meng Xiao, 
  • Sharon C.-A. Chen, 
  • He Wang, 
  • Li Zhang, 
  • Xin Fan, 
  • Zhi-Peng Xu, 
  • Jing-Wei Cheng, 
  • Fanrong Kong, 
  • Yu-Pei Zhao


Accurate species identification of Candida, Cryptococcus, Trichosporon and other yeast pathogens is important for clinical management. In the present study, we developed and evaluated a yeast species identification scheme by determining the rDNA internal transcribed spacer (ITS) region length types (LTs) using a sequencer-based capillary gel electrophoresis (SCGE) approach. A total of 156 yeast isolates encompassing 32 species were first used to establish a reference SCGE ITS LT database. Evaluation of the ITS LT database was then performed on (i) a separate set of (n = 97) clinical isolates by SCGE, and (ii) 41 isolates of 41 additional yeast species from GenBank by in silico analysis. Of 156 isolates used to build the reference database, 41 ITS LTs were identified, which correctly identified 29 of the 32 (90.6%) species, with the exception of Trichosporon asahii, Trichosporon japonicum and Trichosporon asteroides. In addition, eight of the 32 species revealed different electropherograms and were subtyped into 2–3 different ITS LTs each. Of the 97 test isolates used to evaluate the ITS LT scheme, 96 (99.0%) were correctly identified to species level, with the remaining isolate having a novel ITS LT. Of the additional 41 isolates for in silico analysis, none was misidentified by the ITS LT database except for Trichosporon mucoides whose ITS LT profile was identical to that of Trichosporon dermatis. In conclusion, yeast identification by the present SCGE ITS LT assay is a fast, reproducible and accurate alternative for the identification of clinically important yeasts with the exception of Trichosporon species.


Fungi are a major cause of human disease especially in patients with immune compromise and serious underlying disease [1]. Candida species are the leading cause of fungemia, which in the USA, is ranked as the third most common cause of bloodstream infection in ICU patients with a crude mortality of 47.1% [2]. However, serious infections due to non-Candida yeasts, including Cryptococcus, Trichosporon, Rhodotorula and other previously rare species, are also increasingly [37]. Accurate and timely species identification is important for clinical management of patients because these yeast species have different antifungal susceptibilities and an increasing number e.g. Candida glabrata, Candida krusei and Trichosporon species, are resistant or less susceptible to many antifungal agents [8, 9].

The limitations of conventional phenotypic methods for yeast identification are well known. Currently, rDNA gene complex sequencing targeted at the internal transcribed spacer (ITS) regions is considered as the “gold-standard” method for identifying yeast species [1013]. However, this approach requires time for the sequencing process, is expensive and dependent on public sequence repositories which are not curated and contain errors [14]. Other molecular identification methods include reverse line blot (RLB) assays, rolling circle amplification (RCA) [9, 15, 16] and pyrosequencing [17, 18]. Proteomic approaches such as matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) also lends itself as an identification tool [19]. As a simple and less expensive alternative to ITS sequencing, another molecular identification method, sequencer-based capillary gel electrophoresis (SCGE) systems, has been reported to have good accuracy, reproducibility and inter-laboratory consistency, whilst retaining flexibility [20, 21].

In the present study, we applied SCGE to develop an identification scheme for the major pathogenic yeast species. We constructed an in-house database of rDNA ITS length types (LTs) of a large number (n = 156) and broad range of yeast species (n = 32) by SCGE. We then evaluated the performance of this LT library for its ability to identify a separate and unrelated set of 97 (nine species) clinical yeast isolates. To broaden the scope of the evaluation, and to examine for potential “cross-identifications” or species misidentifications, we also performed in silico analysis of the combined ITS1 and ITS sequences of 41 additional yeast species against the in-house database. The results of all SCGE-based identifications were compared against the definitive identification provide by a combined phenotypic and molecular approach established in our laboratory [9, 13, 22].

Materials and Methods


The study was approved by the Human Research Ethics Committee of Peking Union Medical College Hospital (No. S-263). Written informed consent was obtained from patients for the use of the samples in research.

Yeast isolates

(i) Database build set: A total of 156 isolates were used to construct the ITS LT database. These comprised five reference strains (Candida parapsilosis sensu stricto ATCC22019, C. krusei ATCC6258, Candida guilliermondii ATCC6260, Candida albicans ATCC90028 and Trichosporon asahii CBS2479) and 151 clinical isolates. The clinical isolates were part of the culture collection of the National China Hospital Invasive Fungal Surveillance Net (CHIF-NET) 2010 and 2011 [10, 13, 23] and encompassed 32 yeast species: Candida (21 species), Cryptococcus (four species), Trichosporon (four species) and other yeasts (three species) (see Table 1). All isolates were identified by MALDI-TOF MS (Vitek MS, bioMérieux, Marcy l’Etoile, France, database version IVD 2.0) according to the manufacturer’s instructions. Any isolates with unsatisfied identification confidence values (<99.9%), or with “no identification” results, were further identified by sequencing of the ITS region as described by Zhang et al. [13]. For non-Trichosporon isolates, this identification was taken as the definitive identification result. Definitive identification of Trichosporon species was provided by sequencing of the intergenic spacer 1 (IGS1) region [9].

Table 1. Summarization of ITS1 and ITS sizes called by SCGE and DNA sequencing for five reference strains and 151 clinical isolates used for establishment of the SCGE ITS length type (LT) database.

(ii) Clinical evaluation set: A “test” set of 97 clinical isolates cultured from patients with invasive fungal diseases (IFDs) as part of routine care at the Peking Union Medical College Hospital (PUMCH) in 2013 (Table 2) were studied. Their identification was challenged the in-house built SCGE ITS LT database. None of the isolates used in this evaluation were employed to establish the database.

(iii) GenBank in silico analysis “test” set: In addition, 41 ITS sequences of 41 additional yeast species that were not involved in development of the ITS LT database were studied (Table 3). Their sequences were downloaded from the GenBank database for in silico analysis against the SCGE LT database to determine potential misidentifications.

Table 3. In silico analysis of 41 yeast species from GenBank not being involve in the ITS length type (LT) database.

ITS Duplex PCR

For each isolate, both the ITS1 and full-length ITS regions were amplified by a duplex PCR using a unique forward primer ITS1-FAM (5’-TCCGTAGGTGAACCTGCGG-3’) and two reverse primers, ITS2 (5’-GCTGCGTTCTTCATCGATG-3’) and ITS4 (5’-TCCTCCGCTTATTGATATGC-3’), with the forward primer carboxyfluorescein (FAM)-labeled at the 5’end (Fig 1A). The duplex PCR was performed in a total volume of 25 μl: each reaction mixture contained 3 μl template DNA, 0.2 μl each of ITS1-FAM, ITS4 and 0.1 μl ITS2 primers (100 μM), 12.5 μl 2×Taq PCR StarMix (TransGen Biotech, Beijing, China), 9 μl molecular biology grade H2O (TransGen Biotech). The PCR parameters were: 95°C for 5 min, 30 cycles of 95°C for 1min, 52°C for 30 s, and 72°C for 1 min, followed by a final extension at 72°C for 10 min.

Fig 1. Principles for SCGE ITS length types (LTs) identification of yeast species.

Fig 1A, structure of yeast ITS region and primers used in this study (showed in arrows with dashed lines). The duplex PCR using primers ITS1-FAM, ITS2 and ITS4 will theoretically generate two amplicons (for ITS1 and full-length ITS region, respectively). As the forward primer ITS1 was 5’-end FAM labeled, the amplicons can be observed by SCGE examination (peaks “a” and “b” in Fig 1B). B. an example of interpreting the SCGE results (strain 12HX414). Peaks “a” and “b” represent amplicons of ITS1 and full-length ITS region, respectively. As peak “a” and “a’” were separated by less than 1 bp, only peak “a” was counted (see Methods section).

Establishment of the ITS SCGE identification LT database

PCR fragment analysis was performed as before [20]. Generally, the ABI 3730xl DNA Analyzer (Applied Biosystems, Foster City, CA) was used employing a 96-capillary 50-cm POP-7 gel. The sample was added into the internal target (96-well plate), followed by vibration (500–1000 rpm for 1 min) and blending, denatured at 95°C for 5 min, rapidly cooled on the ice, and then centrifuged at 3700 rpm for 1 min. The supernatant was injected and the sample injection was carried out at 1.6 kV over 15 s with a total running time of 6,200 s at 15-kV run voltage. A 20- to 1,200-bp LIZ 1200 ladder (Chimerx, Madison, WI) was used as an internal marker for each sample.

The size of each peak was determined using GeneMarker software (Version 2.2.0, SoftGenetics, PA, USA). Double peaks were counted only if they were separated by more than 1 bp, otherwise only the highest peak was counted (Fig 1B). Between different strains, amplicons within ±0.5 bp differences was supposed to have the same length. LTs were defined by the combination of SCGE ITS1 and ITS amplicon lengths. Thereafter, the LTs and their matching “species identification” were recorded into an electronic database. The nomenclature of different ITS LTs was as follows: three lowercase Roman letters representing species, dash followed by an Arabic numeral representing potential length subtypes within each species. For example, “cor-2” represented species Candida orthopsilosis, ITS LT subtype 2 in the database.

One to four isolates of each LT were further selected to determine the physical lengths of the ITS1 and full-length ITS regions, by DNA sequencing, and to compare these with corresponding SCGE-derived lengths. The full-length ITS regions were amplified by primer pair ITS1/ITS4 as previously described [10] and sequenced from both directions. The physical lengths of the ITS1 and ITS regions (including primers) were analyzed by CLC Sequencing Viewer 7.5 (QIAGEN, Dusseldorf, Germany). The full-length ITS sequences were then used for phylogenetic analysis by the maximum-likelihood algorithm with 1000 bootstrap replication to ensure robustness using MEGA software (version 6.0, MEGA Inc., Englewood, NJ).

SCGE reproducibility

SCGE was performed in triplicate on three separate occasions by three different technicians using fresh yeast subcultures for the five reference strains, and four isolates of rare Candida or non-Candida yeast species (Candida rugosa, Trichosporon japonicum, Trichosporon asteroides and Trichosporon dermatis).

Clinical and in silico evaluation of the SCGE ITS LT database

Ninety-seven test isolates (seven Candida species, and two Cryptococcus species were used to evaluate the performance of the SCGE ITS LT database for yeast identification (Table 2). When a particular strain’s ITS LT exactly (with <0.5 bp difference) matched a recorded reference ITS LT in the database, it was designated as the corresponding species (Fig 1).

Because the ITS LT database did not cover all known species of yeasts, further, the ITS1 and ITS sequence length for 41 isolates of 41 additional yeast species, which were not included in the ITS LT database developed, were obtained from GenBank (Table 3). The ITS1 and ITS length of these isolates were in silico calculated, and queried against the developed ITS LT database to examine potential misidentifications.


SCGE analysis of ITS1 and ITS amplicons

Duplex amplification of the ITS1 and ITS regions yielded positive PCR products in all 156 isolates for the ITS LT database development. Calling of the sizes of the ITS1 and ITS regions by SCGE are shown in Table 1 and Fig 1A. A total of 35 and 37 different amplicon lengths were obtained for the ITS1 and ITS region, respectively. The average amplicon lengths ranged from 135.5 bp to 481.3 bp for the ITS1 region and from 356.3 bp to 881.3 bp for the ITS region, with the standard deviation for different lengths ranging from 0–0.3 bp (Table 1). Of note, different physical lengths of the ITS1 and ITS regions obtained by DNA sequencing (Fig 1A) were clearly separated by SCGE, and amplicons with the same SCGE lengths had unique physical lengths (Table 1).

Buildup of the SCGE ITS LT database

SCGE analysis of the five reference strains (four Candida, one Trichosporon species) revealed unique electropherograms, and established the premise of evaluating clinical strains.

Of 32 species represented amongst the 156 isolates, the ITS1 and full length ITS region directed SCGE assays distinguished 71.9% (23/32) and 78.1% (25/32) of the species, respectively. The ITS1 region lengths were identical for all Candida duobushaemulonii and Candida lusitaniae isolates (average length ± standard deviation 142.5±0.0 bp vs. 142.9±0.1 bp, Fig 2B Group I), between all Cryptococcus neoformans and Cryptococcus gattii isolates (197.5±0.1 bp vs. 197.5±0.1 bp, Fig 2B Group II), between all Candida tropicalis and the ITS LT cal-1 C. albicans isolates (214.4±0.1 bp vs. 214.3±0.1 vs. 214.4±0.1 bp, Fig 2B Group III), and between all T. asahii, T. japonicum and T. asteroides isolates (199.5±0.1 bp to 199.6±0.1 bp, Fig 2B Group VI) (Table 1). Meanwhile, the full-length ITS region was identical between all Cryptococcus curvatus and all T. dermatis isolates (526.2 bp vs. 526.4±0.1 bp, Fig 2B Group IV), Cryptococcus laurentii and the ITS LT cal-1 C. albicans isolates (533.0 bp vs. 533.0±0.2 bp, Fig 2B Group V), and T. asahii, T. japonicum and T. asteroides isolates (538.2±0.1 bp to 538.4±0.1 bp, Fig 2B Group VI) (Table 1).

Fig 2. SCGE ITS length types (LTs) database established in the present study, including amplicon lengths (bp) of the ITS1 and ITS regions called by SCGE for each LT, and an in silico gel-view diagram.

The amplicon sizes has been rounded up to the nearest whole number. Fig 2A, the whole database. Fig 2B, LTs of different species that had the same-length ITS1 or ITS region amplicons. In Fig 2A and 2B, gel-bands in blue or red represent amplicon of the ITS1 region, while gel-bands in purple or pink represent amplicon of the ITS region. Gel-bands in red or pink indicated the length of ITS1 or ITS amplicons were shared by different species.

Based on the combination of ITS1 and ITS region lengths called by SCGE, 41 ITS LTs were identified, which was able to identify 94.9% (148/156) isolates of 90.6% (29/32) species used for SCGE ITS LT database development, only except for isolates of T. asahii, T. japonicum and T. asteroides (Fig 2B Group VI, Table 1). Of note, the ITS LTs were able to distinguish the following genetically closely related species within species complexes e.g. C. parapsilosis sensu stricto, C. metapsilosis, C. orthopsilosis and L. elongisporus within C. parapsilosis species complex (Fig 3), Candida glabrata sensu stricto and Candida nivariensis within C. glabrata species complex, Candida haemulonii and C. duobushaemulonii within C. haemulonii species complex, and C. neoformans and C. gattii isolates within C. neoformans species complex.

Twenty-four of the 32 species studied had only one ITS LT, whilst eight species, including C. albicans, C. orthopsilosis (Fig 3C and 3D), C. tropicalis, C. glabrata sensu stricto, C. nivariensis, Candida pelliculosa, Candida norvegensis and Kodamaea ohmeri revealed 2–3 ITS LT subtypes (Table 1, Fig 1A). Dendrogram drawn by maximum-likelihood methods for full-length ITS sequences corresponding to 41 ITS LTs were shown in Fig 4. Different ITS LTs from the same or phylogenetically close-related yeast species, e.g. cal-1 and cal-2, cpa-1, cme-1, cor-1 and cor-2, cha-1 and cdh-1, cne-1 and cga-1, were clustered together.

Fig 4. Phylogenetic tree based on the ITS sequences of 41 ITS LT isolates and ITS LT ctr-3 is shown and was conducted using the maximum-likelihood method, using Aspergillus niger ATCC 16888T as an outgroup.


Reproducibility of the SCGE technique was assessed by repeating the method on nine isolates (five reference strains and four clinical isolates) on three different occasions, and results are summarized in Table 1. For the same amplicon of each strain, the range of measured differences between different repeats was 0.0–0.1 bp.

Clinical evaluation of the SCGE ITS LT database

The identification results by SCGE vs. the reference method (MALDI-TOF MS supplemented by DNA sequencing, see Methods) for the test set of 97 isolates is shown in Table 2. Using the ITS LT database, SCGE correctly identified 99.0% (96/97) isolates to species level, with 13 ITS LTs identified amongst the nine species studied. Only one isolate was not able to be identified—a putative “C. tropicalis” strain, which was not consistent and dissimilar to the ITS LTS of the two established C. tropicalis ITS LTs (i.e. ctr-1 and ctr-2, Table 1) in the database. ITS region sequence of the isolate was 99.6% identical to C. tropicalis type strain ATCC 750T (GenBank accession no. KJ651200), and phylogenetic analysis based on full-length ITS sequences clustered the isolate with ctr-1 and ctr-2 (Fig 4). In addition, the isolate was also identified as “C. tropicalis” by a variety of identification methods including API 20C AUX (bioMérieux), Vitek2 Compact YST (bioMérieux) and Vitek MS; hence, this isolate was assigned as C. tropicalis. A new ITS LT, ctr-3, was designated to this isolate, and ctr-3 was incorporated into the SCGE ITS LT database for future use.

In silico evaluation of ITS LT database

The ITS1 and ITS region sizes of the additional 41 isolates of 41 yeast species were called, as showed in Table 3. Using the ITS LT database developed in the present study, only Trichosporon mucoides was “misidentified”, which had identical ITS1 and ITS lengths as Trichosporon dermatis ITS LT tde-1.

Identification of yeast species by MALDI-TOF MS

Of 156 isolates used for SCGE-based ITS LT database, overall, 120 isolates (76.9%) were identified to species level by Vitek MS (bioMérieux), seven (4.5%) were misidentified, and 29 (18.6%) got “no identification” results (Table 1). Of the 32 yeast species studied, 20 species (62.5%) were included in the Vitek MS mass spectra database (database version IVD 2.0), whilst 12 species (37.5%) were absent from the database. All isolates of yeast species being included in the database (120 isolates) were correctly identified. However, amongst isolates of yeast species being absent from the database (36 isolates), 19.4% (seven isolates) were misidentified, and 80.6% (29 isolates) got “no identification” results.


The increasing spectrum and emergence of rare yeast pathogens continues to pose a challenge for microbiology laboratories to provide rapid and reliable identification [6, 19, 24, 25]. As a potential alternative molecular methods, here we have developed a SCGE-based approach for yeast identification and demonstrated that it can provide accurate and reproducible identification of major Candida, Cryptococcus and other rare yeast species with the exception of Trichosporon species.

By constructing an in-house database of SCGE patterns encompassing the electropherograms of 32 yeast species from six genera, we first established proof of principle that this technique was able to unambiguously identify and distinguish between the majority of the more commonly-encountered Candida and Cryptococcus species. Of note, both ITS1 and full length ITS SCGE amplicon results in combination were needed to make these distinctions allowing for correct identification of all isolates of these genera. Specifically, the ITS1-directed assay could not distinguish between certain Candida species including between C. tropicalis strains and one LT of C. albicans, both major Candida pathogens [10, 13, 23]. The inability of ITS1-diretced profiles on their own to differentiate between C. neoformans and C. gattii is consistent with the results of several ITS sequencing-based studies [26]. This reinforces the need for any method for fungal identification to be evaluated using more than one target, even within the same genetic locus (the ITS region in this case). Interestingly, the SCGE ITS LT assay was able to differentiate all genetically closed related species included in the study which cannot be differentiated by phenotypic methods [2629]. This distinction is important for epidemiological studies and because there may be differences in antifungal susceptibilities between members within the complexes [28, 30].

Another key finding was that SCGE lends itself as a potential typing tool. It has been demonstrated that within the same yeast species, there might be different levels of intra-species ITS sequence diversity [11, 31]. Species with high genetic diversity are most frequently human commensals, and this finding could explain the existence of additional genetic adaptation within normal microbiota with older evolutionary origins [11]. As observed in this study, for eight of the species used to construct the database, subtypes based on their ITS LTs were identified, although from aspect of typing, SCGE was less discriminatory than multi-locus sequence typing [32, 33].

The results obtained with the “database build cohort” of isolates are supported by results obtained on the test isolates, albeit only representing nine species. Only one isolate could not be identified to species, and this isolate represented a novel subtype or ITS LT of C. tropicalis, again raising the possibility of using SCGE as a typing tool. The study of larger numbers of different yeast species in this context would be a clinical interest.

Comparing to the “gold standard” ITS sequencing methods, SCGE-based method was cheaper and with less turnaround time (Table 4). Although both methods relied on the expensive DNA analyzer equipment (e.g. ABI 3730xl, Applied Biosystems), clinical laboratories will benefit from widely available commercial DNA sequencing companies, who provide cost effective (~US$1.5 per SCGE sample analyzed) and timely (24 h) services in urban China (Table 4). MALDI-TOF MS, if available, is another powerful tool for yeast identification, which enables rapid and accurate identification of common clinically-important yeasts, but may be hindered by its equipment acquisition costs (Table 4) [13, 19]. Moreover, insufficiencies in yeast spectrum databases may affect its identification capacity in identifying close-related and rare yeast species (Table 1) [6, 13, 19, 25]. As shown in the present study, 37.5% of yeast isolates could not be correctly assigned to species level by Vitek MS system because the corresponding species were being absent from the mass spectra database.

Table 4. Comparison of SCGE, Vitek MS and ITS sequencing for identification of yeast species.

In the broader sense, the usefulness of CGE-based assays for species identification of Candida has been reported previously by Monstein et al. and Mallus et al. [34, 35]. Both studies used the Seegene Seeplex PCR assay (Seegene Diagnostic, Seoul, South Korea) for nucleotide amplification, but CGE was carried out on two different platforms QIAxcel (Qiagen, Hilden, Germany) [35] and the MultiNA (Shimadzu Corp., Tokyo, Japan) [34], respectively. Our assay here differs as SCGE relies on fluorescent-labeled primers for PCR amplification whereas QIAxcel and MultiNA devices are adapted for analysis of conventional PCR products amplified by non-labeled primers requiring an additional step to identify nucleic acid with e.g. SYBR Green fluorescent stain. We chose the SCGE approach as the method has proved to be more sensitive in detecting low-intensity amplicons, more precise in size-calling, and has higher discriminatory power compared with the QIAxcel-based assay [36]. In addition, compared with previous CGE-focused Candida studies, that examined amplicons for less than 10 Candida species [34, 35], our study developed a relatively more comprehensive identification database, which comprised 21 Candida species, and 11 species of non-Candida yeasts e.g. Cryptococcus, Trichosporon and Rhodotorula. The isolates used for database development were from the CHIF-NET study, which was currently the largest nationwide multicenter surveillance program for IFDs in China, indicating its clinical relevance at least in the region.

The major limitation of the current SCGE ITS LT assay was its inability to identify Trichosporon species including T. asahii, T. japonicum and T. asteroides, T. mucoides and T. dermatis. It has been reported that the ITS and D1/D2 region sequences of above Trichosporon species were highly similar, with only few single nucleotide polymorphisms in nucleotide sequences but no difference in sequence-length [9, 37]. Therefore, the ITS region was not an appropriate target for identification of Trichosporon species. Instead the rDNA intergenic spacer (IGS) 1 region is most suitable for differentiating between phylogenetically close Trichosporon species because of its higher diversity [9, 37, 38]. Further work on IGS1 region for SCGE is warranted for study of this genus.

Another limitation of the study was that among the test 97 isolates, only seven Candida and two Cryptococcus species were analyzed. By studying 41 additional and including rare species of yeasts, by in silico analysis and by querying their identity against the ITS LT database, only T. mucoides was “misidentified” where the ITS LT profile was identical to that of T. dermatis (Table 3). This is encouraging and we plan to add to our in-house database the ITS LTS representative of more species so that it may be more widely applicable.

In summary, we have here established a database for SCGE-based identification for yeasts other than Trichosporon species, and the premise for performing a larger scale study to evaluate the identification capabilities of SCGE. To this end, DNA sequencing remains the “gold standard” identification method. However, the ITS SCGE assay described herein has shown promise to fulfill the function as a potential “reference” method since it is simple to use and adaptable for rapid identification. Moreover, it has the potential to detect mixed infections and to subtype species.


This study was financially supported by a National Research Special Fund for Public Welfare Industry of Health of China (grant number 201402001).

Author Contributions

Conceived and designed the experiments: YCX YPZ XH MX. Performed the experiments: XH HW LZ XF ZPX. Analyzed the data: XH MX. Contributed reagents/materials/analysis tools: HW LZ XF ZPX JWC. Wrote the paper: XH MX SC FK.


  1. 1. Dignani MC. Epidemiology of invasive fungal diseases on the basis of autopsy reports. F1000Prime Rep. 2014; 6: 81. pmid:25343038
  2. 2. Wisplinghoff H, Bischoff T, Tallent SM, Seifert H, Wenzel RP, Edmond MB. Nosocomial bloodstream infections in US hospitals: analysis of 24,179 cases from a prospective nationwide surveillance study. Clin Infect Dis. 2004; 39: 309–317. pmid:15306996
  3. 3. Capoor MR, Aggarwal S, Raghvan C, Gupta DK, Jain AK, Chaudhary R. Clinical and microbiological characteristics of Rhodotorula mucilaginosa infections in a tertiary-care facility. Indian J Med Microbiol. 2014; 32: 304–309. pmid:25008826
  4. 4. Chitasombat MN, Kofteridis DP, Jiang Y, Tarrand J, Lewis RE, Kontoyiannis DP. Rare opportunistic (non-Candida, non-Cryptococcus) yeast bloodstream infections in patients with cancer. J Infect. 2012; 64: 68–75. pmid:22101079
  5. 5. Roman AD, Salvana EM, Guzman-Penamora MA, Roxas EA, Leyritana KT, Saniel MC. Invasive trichosporonosis in an AIDS patient: case report and review of the literature. Int J STD AIDS. 2014; 25: 70–75. pmid:23970618
  6. 6. Xiao M, Wang H, Lu J, Chen SC, Kong F, Ma XJ, et al. Three clustered cases of candidemia caused by Candida quercitrusa and mycological characteristics of this novel species. J Clin Microbiol. 2014; 52: 3044–3048. pmid:24696025
  7. 7. Kwon-Chung KJ, Fraser JA, Doering TL, Wang Z, Janbon G, Idnurm A, et al. Cryptococcus neoformans and Cryptococcus gattii, the etiologic agents of cryptococcosis. Cold Spring Harb Perspect Med. 2014; 4: a019760. pmid:24985132
  8. 8. Alexander BD, Johnson MD, Pfeiffer CD, Jimenez-Ortigosa C, Catania J, Booker R, et al. Increasing echinocandin resistance in Candida glabrata: clinical failure correlates with presence of FKS mutations and elevated minimum inhibitory concentrations. Clin Infect Dis. 2013; 56: 1724–1732. pmid:23487382
  9. 9. Guo LN, Xiao M, Kong F, Chen SC, Wang H, Sorrell TC, et al. Three-locus identification, genotyping, and antifungal susceptibilities of medically important Trichosporon species from China. J Clin Microbiol. 2011; 49: 3805–3811. pmid:21900517
  10. 10. Wang H, Xiao M, Chen SC, Kong F, Sun ZY, Liao K, et al. In vitro susceptibilities of yeast species to fluconazole and voriconazole as determined by the 2010 National China Hospital Invasive Fungal Surveillance Net (CHIF-NET) study. J Clin Microbiol. 2012; 50: 3952–3959. pmid:23035204
  11. 11. Merseguel KB, Nishikaku AS, Rodrigues AM, Padovan AC, e Ferreira RC, de Azevedo Melo AS, et al. Genetic diversity of medically important and emerging Candida species causing invasive infection. BMC Infect Dis. 2015; 15: 57. pmid:25887032
  12. 12. Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for fungi. Proc Natl Acad Sci U S A. 2012; 109: 6241–6246. pmid:22454494
  13. 13. Zhang L, Xiao M, Wang H, Gao R, Fan X, Brown M, et al. Yeast identification algorithm based on use of the Vitek MS system selectively supplemented with ribosomal DNA sequencing: proposal of a reference assay for invasive fungal surveillance programs in China. J Clin Microbiol. 2014; 52: 572–577. pmid:24478490
  14. 14. Nilsson RH, Tedersoo L, Ryberg M, Kristiansson E, Hartmann M, Unterseher M, et al. A comprehensive, automatically updated fungal ITS sequence dataset for reference-based chimera control in environmental sequencing efforts. Microbes Environ. 2015; 30: 145–150. pmid:25786896
  15. 15. Zhou X, Kong F, Sorrell TC, Wang H, Duan Y, Chen SC. Practical method for detection and identification of Candida, Aspergillus, and Scedosporium spp. by use of rolling-circle amplification. J Clin Microbiol. 2008; 46: 2423–2427. pmid:18495860
  16. 16. Playford EG, Kong F, Sun Y, Wang H, Halliday C, Sorrell TC. Simultaneous detection and identification of Candida, Aspergillus, and Cryptococcus species by reverse line blot hybridization. J Clin Microbiol. 2006; 44: 876–880. WOS:000236095000032 pmid:16517870
  17. 17. Borman AM, Linton CJ, Oliver D, Palmer MD, Szekely A, Johnson EM. Rapid molecular identification of pathogenic yeasts by pyrosequencing analysis of 35 nucleotides of internal transcribed spacer 2. J Clin Microbiol. 2010; 48: 3648–3653. pmid:20702674
  18. 18. Pannanusorn S, Elings MA, Romling U, Fernandez V. Pyrosequencing of a hypervariable region in the internal transcribed spacer 2 to identify clinical yeast isolates. Mycoses. 2012; 55: 172–180. pmid:21736632
  19. 19. Xiao M, Fan X, Chen XX, Wang H, Zhang L, Xu ZP, et al. Misidentification of a rare species, Cryptococcus laurentii, by commonly used commercial biochemical methods and matrix-assisted laser desorption ionization-time of flight mass spectrometry systems: challenges for clinical mycology labs. J Clin Microbiol. 2015. pmid:26560541
  20. 20. Wehrhahn MC, Xiao M, Kong F, Xu YC, Chen SCA. A PCR-based intergenic spacer region-capillary gel electrophoresis typing method for identification and subtyping of Nocardia species. J Clin Microbiol. 2012; 50: 3478–3484. WOS:000309966900012 pmid:22875897
  21. 21. Gray TJ, Kong FR, Jelfs P, Sintchenko V, Chen SCA. Improved identification of rapidly growing mycobacteria by a 16S-23S internal transcribed spacer region PCR and capillary gel electrophoresis. Plos One. 2014; 9. ARTN e102290 WOS:000339378400084 pmid:25013955
  22. 22. Xiao M, Guo LN, Kong FR, Wang H, Sorrell TC, Li RY, et al. Practical identification of eight medically important Trichosporon species by reverse line blot hybridization (RLB) assay and rolling circle amplification (RCA). Medical Mycology. 2013; 51: 300–308. WOS:000316228100009 pmid:23186014
  23. 23. Fan X, Xiao M, Wang H, Zhang L, Kong F, Lu J, et al. Multilocus sequence typing indicates diverse origins of invasive Candida tropicalis isolates in China. Chin Med J (Engl). 2014; 127: 4226–4234. pmid:25533826
  24. 24. Fan X, Xiao M, Kong F, Kudinha T, Wang H, Xu YC. A rare fungal species, Quambalaria cyanescens, isolated from a patient after augmentation mammoplasty—environmental contaminant or pathogen? PLos One. 2014; 9: e106949. pmid:25330078
  25. 25. Miceli MH, Diaz JA, Lee SA. Emerging opportunistic yeast infections. Lancet Infect Dis. 2011; 11: 142–151. pmid:21272794
  26. 26. Katsu M, Kidd S, Ando A, Moretti-Branchini ML, Mikami Y, Nishimura K, et al. The internal transcribed spacers and 5.8S rRNA gene show extensive diversity among isolates of the Cryptococcus neoformans species complex. Fems Yeast Research. 2004; 4: 377–388. WOS:000188505600004 pmid:14734018
  27. 27. Tavanti A, Davidson AD, Gow NA, Maiden MC, Odds FC. Candida orthopsilosis and Candida metapsilosis spp. nov. to replace Candida parapsilosis groups II and III. J Clin Microbiol. 2005; 43: 284–292. pmid:15634984
  28. 28. Cendejas-Bueno E, Kolecka A, Alastruey-Izquierdo A, Theelen B, Groenewald M, Kostrzewa M, et al. Reclassification of the Candida haemulonii complex as Candida haemulonii (C. haemulonii group I), C. duobushaemulonii sp. nov. (C. haemulonii group II), and C. haemulonii var. vulnera var. nov.: three multiresistant human pathogenic yeasts. J Clin Microbiol. 2012; 50: 3641–3651. WOS:000309966900037 pmid:22952266
  29. 29. Swoboda-Kopec E, Sikora M, Golas M, Piskorska K, Gozdowski D, Netsvyetayeva I. Candida nivariensis in comparison to different phenotypes of Candida glabrata. Mycoses. 2014; 57: 747–753. WOS:000345694900005 pmid:25267284
  30. 30. Gil-Alonso S, Jauregizar N, Canton E, Eraso E, Quindos G. In vitro fungicidal activities of anidulafungin, caspofungin, and micafungin against Candida glabrata, Candida bracarensis, and Candida nivariensis evaluated by time-kill studies. Antimicrob Agents Chemother. 2015; 59: 3615–3618. pmid:25801575
  31. 31. Zhao Y, Tsang CC, Xiao M, Cheng J, Xu Y, Lau SK, et al. Intra-genomic internal transcribed spacer region sequence heterogeneity and molecular diagnosis in clinical microbiology. Int J Mol Sci. 2015; 16: 25067–25079. pmid:26506340
  32. 32. Bougnoux ME, Aanensen DM, Morand S, Theraud M, Spratt BG, d'Enfert C. Multilocus sequence typing of Candida albicans: strategies, data exchange and applications. Infect Genet Evol. 2004; 4: 243–252. pmid:15450203
  33. 33. Tavanti A, Davidson AD, Johnson EM, Maiden MC, Shaw DJ, Gow NA, et al. Multilocus sequence typing for differentiation of strains of Candida tropicalis. J Clin Microbiol. 2005; 43: 5593–5600. pmid:16272492
  34. 34. Mallus F, Martis S, Serra C, Loi G, Camboni T, Manzin A. Usefulness of capillary electrophoresis-based multiplex PCR assay for species-specific identification of Candida spp. J Microbiol Methods. 2013; 92: 150–152. pmid:23220101
  35. 35. Monstein HJ, Tarnberg M, Persis S, Johansson AG. Comparison of a capillary gel electrophoresis-based multiplex PCR assay and ribosomal intergenic transcribed spacer-2 amplicon sequencing for identification of clinically important Candida species. J Microbiol Methods. 2014; 96: 81–83. pmid:24252582
  36. 36. Xiao M, Kong F, Jin P, Wang Q, Xiao K, Jeoffreys N, et al. Comparison of two capillary gel electrophoresis systems for Clostridium difficile ribotyping, using a panel of ribotype 027 isolates and whole-genome sequences as a reference standard. J Clin Microbiol. 2012; 50: 2755–2760. pmid:22692737
  37. 37. Chagas-Neto TC, Chaves GM, Melo AS, Colombo AL. Bloodstream infections due to Trichosporon spp.: species distribution, Trichosporon asahii genotypes determined on the basis of ribosomal DNA intergenic spacer 1 sequencing, and antifungal susceptibility testing. J Clin Microbiol. 2009; 47: 1074–1081. pmid:19225102
  38. 38. Sugita T, Ikeda R, Nishikawa A. Analysis of Trichosporon isolates obtained from the houses of patients with summer-type hypersensitivity pneumonitis. J Clin Microbiol. 2004; 42: 5467–5471. pmid:15583267