
Multicolor Melting Curve Analysis-Based Multilocus Melt Typing of Vibrio parahaemolyticus

  • Ran Liu ,

    Contributed equally to this work with: Ran Liu, Zanzan Liu

    Affiliation Engineering Research Centre of Molecular Diagnostics, Ministry of Education, State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, Fujian, China

  • Zanzan Liu ,

    Contributed equally to this work with: Ran Liu, Zanzan Liu

    Affiliation Engineering Research Centre of Molecular Diagnostics, Ministry of Education, State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, Fujian, China

  • Ye Xu,

    Affiliation Engineering Research Centre of Molecular Diagnostics, Ministry of Education, State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, Fujian, China

  • Yiqun Liao,

    Affiliations Engineering Research Centre of Molecular Diagnostics, Ministry of Education, State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, Fujian, China, State Key Laboratory of Molecular Vaccinology and Molecular Diagnostic, School of Public Health, Xiamen University, Xiamen, Fujian, China

  • Qinghua Hu,

    Affiliations Shenzhen Major Infectious Disease Control Key Laboratory, Shenzhen Centre for Disease Control and Prevention, Shenzhen, Guangdong, China, School of Life Sciences, Shenzhen University, Shenzhen, Guangdong, China

  • Jianwei Huang,

    Affiliation Xiamen Center for Disease Control and Prevention, Xiamen, Fujian, China

  • Xiaolu Shi,

    Affiliation Shenzhen Major Infectious Disease Control Key Laboratory, Shenzhen Centre for Disease Control and Prevention, Shenzhen, Guangdong, China

  • Yinghui Li,

    Affiliation Shenzhen Major Infectious Disease Control Key Laboratory, Shenzhen Centre for Disease Control and Prevention, Shenzhen, Guangdong, China

  • Jianjun Niu ,

    niujianjun@xmu.edu.cn (JN); qgli@xmu.edu.cn (QL)

    Affiliation Zhongshan Hospital of Xiamen, Xiamen University, Xiamen, Fujian, China

  • Qingge Li

    niujianjun@xmu.edu.cn (JN); qgli@xmu.edu.cn (QL)

    Affiliations Engineering Research Centre of Molecular Diagnostics, Ministry of Education, State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, Fujian, China, State Key Laboratory of Molecular Vaccinology and Molecular Diagnostic, School of Public Health, Xiamen University, Xiamen, Fujian, China, Shenzhen Research Institute of Xiamen University, Shenzhen, Guangdong, China

Abstract

Vibrio parahaemolyticus is the leading cause of seafood-borne gastroenteritis outbreaks. To track the source of these diseases in a timely manner, a high throughput typing method is critical. We hereby describe a novel genotyping method for V. parahaemolyticus, termed multilocus melt typing (MLMT), based on multilocus sequence typing (MLST). MLMT utilizes melting curve analysis to interrogate the allelic types of a set of informative single nucleotide polymorphisms (SNPs) derived from the housekeeping genes used in MLST. For each SNP, one allelic type generates distinct Tm values, which are converted into a binary code. Multiple SNPs thus generate a series of binary codes, forming a melt type (MT) corresponding with a sequence type (ST) of MLST. Using a set of 12 SNPs, the MLMT scheme could resolve 218 V. parahaemolyticus isolates into 50 MTs corresponding with 56 STs. The discriminatory power of MLMT and MLST was similar, with Simpson’s index of diversity of 0.638 and 0.646, respectively. The global (adjusted Rand index = 0.982) and directional congruence (adjusted Wallace coefficient, MT→ST = 0.965; ST→MT = 1.000) between the two typing approaches was high. The entire procedure of MLMT could be finished within 3 h with negligible hands-on time in a real-time PCR machine. We conclude that MLMT provides a reliable and efficient approach for V. parahaemolyticus genotyping and might also find use in other pathogens.

Introduction

Vibrio parahaemolyticus is a gram-negative, halophilic marine bacterium, which can cause gastroenteritis through consumption of raw or undercooked seafood [1]. It is globally disseminated in estuarine areas and epidemic outbreaks are usually reported in coastal countries and regions [2–7]. V. parahaemolyticus is now recognized as the leading cause of seafood-borne gastroenteritis and has increasingly raised public health concerns [4]. The frequent incidence of infections underscores the importance of surveillance and epidemiological investigation of this pathogen. An easy to use, cost-effective, and discriminatory typing scheme is therefore warranted for V. parahaemolyticus.

Multilocus sequence typing (MLST) is a genotyping approach based on comparative sequences from housekeeping genes to analyze the population structure of microbial isolates. On account of the reproducibility and portability of data, it is regarded as a preferred choice for investigation of population structure of V. parahaemolyticus [8–10]. Nevertheless, MLST is relatively expensive and time-consuming owing to the sequencing analysis of seven fragments of housekeeping genes for each isolate. Alternative simplified MLST-derivative methods have been developed. Among them, one approach is to choose a set of single nucleotide polymorphisms (SNPs) of high Simpson’s index of diversity (SID). These informative SNP sets are then interrogated by allele-specific real-time PCR [11–14] or high resolution melting [15–18]. This approach has shown great potential as a rapid and cost-effective genotyping tool to complement MLST in the epidemiological study of various pathogens. However, many reactions have to be employed when multiple SNPs are detected, thus lowering the overall throughput of these methods.

We previously established a multicolor melting curve analysis to detect the genotype of mutations using dual-labeled, self-quenched probes on a real-time PCR platform [19]. This melting curve analysis approach proved to be a reliable and rapid tool in the detection of multiple mutations or SNPs in a single reaction tube [20–22]. In this study we sought to use this approach to establish a new genotyping scheme of high throughput. This scheme, termed multilocus melt typing (MLMT), could provide melting temperature (Tm) values for the allelic types of the SNPs. The Tm values were converted into binary codes to generate a melt type (MT), which could be defined in correspondence with sequence type (ST) of MLST. MLMT provided digital results, ensuring the portability of data, which is important for data storage and transfer. As a model, V. parahaemolyticus was genotyped using a 12-SNP scheme. By analyzing 218 isolates of V. parahaemolyticus, we concluded that MLMT could provide rapid and discriminatory typing with high throughput and cost-effectiveness.

Methods

Bacterial isolates and genomic DNA preparation

One hundred and thirty-five clinical isolates were from Shenzhen Center of Disease Control and Prevention (Shenzhen CDC, Shenzhen, China), and 83 isolates, including 49 clinical isolates and 34 environmental isolates, were provided by Xiamen CDC (Xiamen, China). Ten isolates with known STs (ST-3, 199, 332, 345) were from our laboratory collection and were used to establish MLMT. Genomic DNA was prepared by using the AxyPrep™ Bacterial Genomic DNA Miniprep Kit (Axygen Biosciences). Isolated DNA was quantified by a Nanodrop spectrophotometer (Nanodrop Products, Wilmington, Delaware, US). Prepared genomic DNA was stored at −20°C before use.

Ethics Statement

All clinical isolates were collected for routine diagnostics by Shenzhen CDC and Xiamen CDC and were supplied for this study as coded specimens without any patient information or identifiers that could be used to decode patient information. The current study was thus exempted from ethical approval by the Ethics Committee on Human Studies in Xiamen University.

SNPs selection

Sequences of 798 ST profiles including seven housekeeping genes (recA, dnaE, gyrB, dtdS, pntA, pyrC and tnaA) were retrieved from the V. parahaemolyticus MLST website (http://pubmlst.org/vparahaemolyticus/, accessed 21 June, 2013). The seven genes were aligned into concatenated sequences (3682 bps). The sequences were analyzed by Minimum SNPs software [14] to select a SNP set of high SID. “Simpson’s Index (0.0–1.0)” in the Allele Identification Parameters panel started at a low SID value of 0.90, and was raised up to a value of 0.9999. Dimorphic SNPs were considered as candidates for better discrimination by probes. The candidate SNPs were further evaluated regarding the conservation of the neighboring nucleotides by sequence alignment. The typing results were confirmed by the “Working Backwards Method” in the software, which can define STs based on the SNP profiles.
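As an illustration of the quantity being maximized during SNP selection, the following minimal sketch computes Simpson’s index of diversity for a list of type assignments. It assumes the Hunter-Gaston form commonly used for typing methods, SID = 1 − Σ n_j(n_j − 1) / (N(N − 1)); the function name and the example labels are hypothetical and are not taken from the Minimum SNPs software.

```python
from collections import Counter


def simpson_index(types):
    """Simpson's index of diversity (Hunter-Gaston form, assumed here).

    `types` is a sequence with one type label per isolate (or per ST).
    """
    counts = Counter(types)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("at least two items are needed")
    # Number of ordered pairs that share a type, over all ordered pairs.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical example: four isolates, two of which share a type.
print(round(simpson_index(["A", "A", "B", "C"]), 4))
```

A value near 1 means that two randomly drawn items almost always fall into different types, while a value of 0 means that all items share a single type.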

Probes design

Dual-labeled probes for the detection of selected SNPs are listed in S1 Table. Probes were designed using Biophysics Version 1.00 (http://biophysics.idtdna.com). When needed, locked nucleic acids (LNAs) were introduced on the SNP positions to increase Tm difference (ΔTm) [23] between matched and mismatched targets. Inosine (I) and mismatch bases were introduced to neutralize the influence from non-target SNPs.

Three types of Tm were used for confirmation of the probe design: i) The in silico Tm for probe:target were obtained by Biophysics under the conditions below: 200 nM probe, 200 nM target, 50 mM Na+, K+, 3 mM Mg2+ and 200 mM deoxyribonucleoside triphosphates (dNTPs). ii) Hybridization Tms were obtained by using synthesized target oligonucleotides hybridized with probes in a thermal denaturation experiment: each 25-μL solution contained 0.2 μM probe, 0.4 μM target oligonucleotides, 67 mM Tris-HCl (pH 8.8), 16 mM (NH4)2SO4, 0.01% (W/V) Tween-20 and 2.5 mM MgCl2. The thermal denaturation procedure started at 95°C for 1 min, 35°C for 2 min, followed by melting analysis ramping from 30°C to 85°C in 0.5°C increments. Probes were synthesized and purified by Sangon (Shanghai, China). iii) The PCR Tms were obtained using isolates of known MLST types. For those isolates not found in our collection, artificial plasmids containing the amplification region were constructed to simulate real isolates. The obtained Tms from post-PCR were used to determine the critical Tms, by which the binary code of the allelic types was defined. The PCR conditions were provided in “MLMT procedure” below.

Primers for MLST and MLMT

MLST primers are listed in S2 Table. A portion of MLST primers were redesigned for efficient amplification, which encompass the original MLST primers [8]. Primer design was based on the complete sequences of V. parahaemolyticus chromosome I and II in NCBI: RIMD 2210633 (BA000031, BA000032) [24], O1:K33 str. CDC_K4557 (CP006007, CP006008), BB22OP (CP003972, CP003973) [25], O1:Kuk str. FDA_R31 (CP006004, CP006005). MLMT primers are listed in S3 Table, with the size of amplicons from 113~248 bps. Primers were designed by using Premier Primer 5.0 (Premier Biosoft International, Palo Alto, CA), and Oligo 6.0 (AVG Technologies Inc., Chelmsford, MA). Primers were synthesized and purified by Sangon.

MLST procedure

PCR was conducted under the following thermocycling conditions: 95°C for 5 min, 15 cycles of touchdown PCR (95°C for 10 s, 69°C with 1°C/cycle decrease for 20 s, and 72°C for 20 s) and 35 cycles of 95°C for 10 s, 55°C for 20 s, and 72°C for 20 s. PCR products were resolved by gel electrophoresis. Sequencing was performed by BGI (Shenzhen, China). Sequences were analyzed online (http://pubmlst.org/vparahaemolyticus/) to assign allele numbers and define STs. New sequences for alleles and new ST profiles were submitted to the V. parahaemolyticus MLST database. The clonal complexes (CCs) of V. parahaemolyticus were analyzed by goeBURST [26] of Phyloviz software (http://www.phyloviz.net) [27]. Those STs that share identical alleles at six of the seven MLST loci with at least one other ST were classified as one CC [28].
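The CC definition above (identical alleles at six of the seven loci with at least one other ST) can be sketched as a simple single-linkage grouping. This is only an illustration of the grouping rule and not the goeBURST algorithm itself, which additionally applies tie-break rules to draw the links; the function name and the allelic profiles below are hypothetical.

```python
from itertools import combinations


def clonal_complexes(profiles):
    """Group STs that are linked by single-locus variation.

    `profiles` maps an ST identifier to a tuple of seven allele numbers.
    Returns a list of sets; STs without any single-locus variant
    (singletons) are left out.
    """
    parent = {st: st for st in profiles}

    def find(st):
        # Union-find lookup with path halving.
        while parent[st] != st:
            parent[st] = parent[parent[st]]
            st = parent[st]
        return st

    for a, b in combinations(profiles, 2):
        differing = sum(x != y for x, y in zip(profiles[a], profiles[b]))
        if differing == 1:  # identical at six of the seven loci
            parent[find(a)] = find(b)

    groups = {}
    for st in profiles:
        groups.setdefault(find(st), set()).add(st)
    return [group for group in groups.values() if len(group) > 1]


# Hypothetical allelic profiles (not taken from the MLST database).
example = {
    1: (1, 1, 1, 1, 1, 1, 1),
    2: (1, 1, 1, 1, 1, 1, 2),  # single-locus variant of ST 1
    3: (1, 1, 1, 1, 1, 2, 2),  # single-locus variant of ST 2
    4: (9, 9, 9, 9, 9, 9, 9),  # singleton
}
print(clonal_complexes(example))
```

In this example STs 1, 2 and 3 form one group because each is linked to at least one other member, although STs 1 and 3 differ at two loci.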

MLMT procedure

The dual-labeled, self-quenched probes alone are non-fluorescent or weakly fluorescent but become fluorescent when hybridizing with the reversely complementary single-stranded DNA. After asymmetric PCR, the produced excess single-stranded amplicons are targets for the dual-labeled, self-quenched probes. Post-PCR melting curve analysis would generate Tm values reflecting the sequence variations in the probe binding region of the amplicons. Due to the possible existence of polymorphic SNP sites in the probe-binding regions, a series of Tm values rather than a single Tm would be generated for one allelic type. The probe was designed in such a way that it is complementary with none of the sequence variants at these polymorphic SNP sites. Consequently, the Tm values for one allelic type would always be lower than those for the other allelic type. The principle of MLMT is illustrated by diagnosing a SNP with “A” and “T” alleles (Fig 1). The probe gives allelic type “A” Tm values (Tm1~Tm3) higher than allelic type “T” (Tm4~Tm6). The lowest Tm of allelic type “A” (Tm3) is defined as the critical Tm. The allele with a Tm equal to or higher than the critical Tm is defined as binary code “1” and the allele with a Tm lower than the critical Tm is defined as “0”. A series of binary codes could be obtained when multiple SNPs are genotyped. The concatenated serial binary codes are defined as MTs. The MTs can be further linked to STs or CCs.
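The conversion rule described above can be sketched in a few lines. This is a minimal illustration rather than the software used in the study; the function name and the Tm values are hypothetical and do not come from Table 1.

```python
from typing import Sequence


def melt_type_code(tms: Sequence[float], critical_tms: Sequence[float]) -> str:
    """Concatenate one binary digit per SNP.

    A digit is "1" when the measured Tm is equal to or higher than the
    critical Tm of that probe, and "0" otherwise.
    """
    if len(tms) != len(critical_tms):
        raise ValueError("one critical Tm is needed per measured Tm")
    return "".join(
        "1" if tm >= critical else "0"
        for tm, critical in zip(tms, critical_tms)
    )


# Hypothetical critical and measured Tm values (in °C) for three SNPs.
critical = [62.0, 58.5, 65.0]
measured = [63.5, 55.0, 65.0]
print(melt_type_code(measured, critical))  # 101
```

For the 12-SNP scheme the same function would be called with twelve measured Tm values and the twelve critical Tms, giving a 12-digit code.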

Fig 1. Flowchart of MLMT analysis of V. parahaemolyticus.

The flowchart illustrates the typing procedure from SNP detecting to data handling. Isolated genomic DNA is first aliquoted into four PCR reactions (R1-R4). Each reaction detects three SNP sites using three differently fluorophore-labeled probes (FAM, HEX, and ROX). The produced twelve Tm values by four PCR reactions are then converted into a 12-digit binary code, which forms a melt type (MT). Isolate A (MT-3) and isolate B (MT-68) are shown as examples. The rule of converting Tms into binary codes is illustrated in the insert.

https://doi.org/10.1371/journal.pone.0136998.g001

Extracted DNA from each isolate was analyzed in four PCR reactions. Asymmetric linear-after-the-exponential (LATE) PCR [29] was used to generate single-stranded products. Each 25-μL reaction mix contained 67 mM Tris-HCl (pH 8.8), 16 mM (NH4)2SO4, 0.01% (W/V) Tween 20, 3.5 mM MgCl2, 1 unit TaqHS DNA polymerase (Takara, Dalian, China), 200 μM of dNTPs, 0.04~0.08 μM limiting primers, 0.6~0.8 μM excess primers, 0.1~0.4 μM probes, and 5 μL of genomic DNA. The specific amounts of primers and probes are given in S4 Table. The real-time PCR program was 95°C for 10 min, followed by 10 cycles of touchdown PCR (95°C for 15 s, 64°C with 1°C/cycle decrease for 15 s, and 72°C for 20 s) and 40 cycles of 95°C for 15 s, 55°C for 15 s, and 72°C for 20 s. The melting curve analysis started with a denaturing step at 95°C for 1 min, a hybridization step at 30°C for 3 min, followed by a stepwise temperature increase from 30°C to 75°C in 0.5°C increments. The fluorescence from FAM, HEX and ROX channels was recorded during the melting analysis procedure. The assay was performed on a Bio-Rad CFX 96 real-time PCR system (Bio-Rad, Hercules, CA) and the Tm values from each reaction were used for conversion into allelic types and to form the binary codes.

Concordance between MLMT and MLST

The concordance between the two typing methods was evaluated by calculating the adjusted Rand index (AR) [30] and the adjusted Wallace coefficient (AW) [31]. The AR was used to evaluate the overall concordance of the typing methods. AR takes into account that the agreement between partitions could arise by chance alone. The AW was used to assess bidirectional concordance by estimating the probability that two strains are clustered together by one method when they have been assigned to the same group by the other method. The values of both AR and AW range from 0 to 1, with higher values reflecting better concordance. Confidence intervals for AR and AW were estimated using a jackknife pseudo-values re-sampling method [32]. All the calculations were carried out using Comparing Partitions (http://darwin.phyloviz.net/ComparingPartitions/).
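For readers who wish to see how the two coefficients are obtained from paired typing results, the following sketch computes both from pair counts. It assumes the standard Hubert-Arabie form of the adjusted Rand index and takes the adjusted Wallace coefficient as (W − Wi) / (1 − Wi), where Wi is the Wallace value expected under independence. It is not the Comparing Partitions implementation, it omits the jackknife confidence intervals, and the labels are hypothetical.

```python
from collections import Counter
from math import comb


def pair_counts(a, b):
    """Count isolate pairs placed together by both methods, by a, and by b."""
    both = sum(comb(c, 2) for c in Counter(zip(a, b)).values())
    same_a = sum(comb(c, 2) for c in Counter(a).values())
    same_b = sum(comb(c, 2) for c in Counter(b).values())
    return both, same_a, same_b, comb(len(a), 2)


def adjusted_rand(a, b):
    """Adjusted Rand index (undefined if both partitions are trivial)."""
    both, same_a, same_b, total = pair_counts(a, b)
    expected = same_a * same_b / total
    maximum = (same_a + same_b) / 2
    return (both - expected) / (maximum - expected)


def adjusted_wallace(a, b):
    """Adjusted Wallace coefficient in the direction a -> b.

    Undefined if method a places no two isolates together.
    """
    both, same_a, same_b, total = pair_counts(a, b)
    wallace = both / same_a      # P(same type in b | same type in a)
    expected = same_b / total    # Wallace value expected by chance
    return (wallace - expected) / (1 - expected)


# Hypothetical typing results for six isolates.
mt = ["m1", "m1", "m1", "m2", "m2", "m3"]
st = ["s1", "s1", "s2", "s3", "s3", "s4"]
print(round(adjusted_rand(mt, st), 3))
print(round(adjusted_wallace(mt, st), 3))  # MT -> ST
print(round(adjusted_wallace(st, mt), 3))  # ST -> MT
```

In this example every ST falls within a single MT, so the ST→MT coefficient equals 1, whereas the MT→ST coefficient is lower because MT m1 contains two STs.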

Results

Selection of discriminatory SNP set

A set of 12 SNPs were chosen with the cumulative SID values of 0.9974 (Table 1). The association between the ST and MT was given as a Microsoft Excel spreadsheet (S5 Table), which can be used to search for ST by MT and vice versa. The spreadsheet shows that 798 STs of V. parahaemolyticus could be resolved into 427 MTs with a SID value of 0.997 (95% CI, 0.997–0.998). Of the 427 MTs, 249 (58.3%) could be assigned each with a unique ST.

The critical Tms

Because the 0/1 codes for SNPs were determined by comparing Tms with the corresponding critical Tms, the reproducibility of Tm measurement was critical for the accuracy of the typing results. A collection of isolates and artificial plasmids that offer the critical Tm were tested in 10 replicates using three separately prepared batches of reaction mixes. The mean Tm±SD and variation (CV %) were listed in S6 Table. The low variations in Tm measurement ensure correct interrogation of the allelic types of SNP. The critical Tms for 12 probes are listed in Table 1.

In silico analysis of V. parahaemolyticus isolates

We performed an in silico analysis of the 1169 isolates available in the MLST database (April 15, 2014), which represented in total 589 STs (SID = 0.967, 95% CI, 0.959–0.974) in correspondence with 349 MTs (SID = 0.960, 95% CI, 0.952–0.969). The AR coefficient between MT and ST was 0.912 (95% CI, 0.875–0.946). The AW (MT→ST) was 0.838 (95% CI, 0.782–0.895) and the AW (ST→MT) was 1.000 (95% CI, 1.000–1.000). The above results indicated that MLMT had discriminatory power close to MLST.

Evaluation of MLMT

The practical performance of MLMT was assessed using 218 V. parahaemolyticus isolates. MLMT resolved 218 isolates into 50 MTs with SID of 0.638 (95%CI, 0.563–0.712) (Fig 2 and Table 2). Among them, the most common MT was MT-3, which was composed of 129 isolates and accounted for 59.2% of the total isolates (S7 Table). The second most common MT was MT-345, which was composed of 24 isolates and accounted for 11.0% of the total isolates. Forty two MTs were each composed of one isolate.

Fig 2. MLMT analysis results of 218 V. parahaemolyticus isolates.

The frequency of each MT is given together with the number of the corresponding ST. Also given are the type and number of STs of all the MTs obtained from the 218 isolates. The size of the pies illustrates the relative number of MTs but not in a true scale.

https://doi.org/10.1371/journal.pone.0136998.g002

Table 2. The discriminatory power of MLMT and MLST and their congruence.

https://doi.org/10.1371/journal.pone.0136998.t002

By comparison, MLST resolved these 218 isolates into 56 STs with SID of 0.646 (95% CI, 0.569–0.722) (Fig 2 and Table 2). Among them, ST-3 was the most prevalent sequence type, comprising the 129 isolates of MT-3. Forty-six of the STs contained a single isolate, and 34 of them were newly found. The 56 STs analyzed by goeBURST produced two CCs: CC345 (ST-189, 265, 345, 962) and CC527 (ST-527, ST-960). The four STs of CC345 and two STs of CC527 corresponded to MT-345 and MT-527, respectively. The singleton ST-1018, as predicted by S5 Table, was included in MT-527. ST-937, the double-locus variant of ST-8, was clustered by MT-8 with ST-8. Association analysis between MT and ST for the 218 isolates showed that MLMT results fully agreed with the theoretical prediction of the Microsoft Excel spreadsheet (S5 Table).

By comparing the MLMT and MLST results, we observed that the two alleles of each SNP could be completely discriminated from each other regardless of the presence of polymorphic SNPs that were encountered in these samples. The representative original melting curves presented in the 218 isolates are shown in Fig 3.

Fig 3. Melting curves obtained from the 218 isolates.

Melting curves from those isolates displaying unique Tm values are shown in color. The non-template controls are shown in grey.

https://doi.org/10.1371/journal.pone.0136998.g003

When we submitted the newly found STs to the MLST database (July, 2014), the number of STs of V. parahaemolyticus had increased to 1100. We wondered whether the newly added STs might have changed the discriminatory power of the 12-SNP MLMT scheme. An updated Microsoft Excel spreadsheet was prepared that includes all of the 1100 STs (S8 Table). We found that all STs could be resolved into 552 MTs with a SID of 0.998 (95% CI, 0.997–0.998). In comparison with the original database containing 798 STs, the discriminatory power of MLMT remains unchanged, demonstrating the robustness of the MLMT scheme.

The congruence between MLMT and MLST was evaluated by AR and AW based on the typing results of 218 isolates (Table 2). The AR index between MT and ST was calculated to be 0.982 (95% CI, 0.969–0.997), showing a good overall concordance between the two typing approaches. The AW coefficient from MT to ST was 0.965 (95%CI, 0.960–0.971), and from ST to MT was 1.000 (95%CI, 1.000–1.000), respectively, further yielding a high congruence between the two typing methods. Altogether, the results demonstrated a high level of concordance between MT and ST in typing the 218 isolates.

For a more intuitive understanding of the performance of MLMT, the relationship between MTs and population structure of STs obtained by goeBURST was investigated for the 218 V. parahaemolyticus isolates. In order to have a comprehensive overview, we also included those STs in the database that belonged to the corresponding CCs in addition to the 56 STs found in this work. Five levels of relatedness could be classified. First, one MT corresponds with one ST (Fig 4A), demonstrating a complete concordance between MLMT and MLST. Second, one MT corresponds with subgroups of one CC (Fig 4B). For example, in CC120, MT-663, MT-447, and MT-480 correspond respectively with different STs. MT-120 corresponds with a subgroup of CC120 containing three STs, i.e., ST-120, ST-188, and ST-133. Third, one MT corresponds with one CC (Fig 4C). For example, MT-345 corresponds with CC345 that contains seven STs. Similarly, MT-199 corresponds with CC199, and MT-527 corresponds with CC527. Both CC199 and CC527 contain varied numbers of STs. Fourth, one MT corresponds with subgroups of one CC, in which STs from other CCs might exist (Fig 4D). For example, CC3 is divided into 14 MTs, six of which contain non-CC3 STs. A similar situation is also found in CC8 and CC332. Fifth, one MT corresponds with a group of STs from different CCs (Fig 4E). For example, although each of the 15 MTs corresponds with a single ST of the 56 STs, these MTs also contain irrelevant STs according to the prediction on 798 STs.

Fig 4. A goeBURST snapshot for population structures of 56 STs derived from 218 V. parahaemolyticus isolates superimposed by the corresponding MTs.

Colored circles represent clinical isolates from Shenzhen (red), clinical isolates from Xiamen (blue), and environmental isolates from Xiamen (green). The size of the circle represents the relative abundance of the ST. The orange dots linked by grey lines represent those STs differed by a single locus variation from the ancestral ST within one CC. The boxes with dotted lines represent one MT. The numbers shown in grey color are from the MLST database but absent in this study. Panels from A to E represent five levels of relatedness between MT and ST.

https://doi.org/10.1371/journal.pone.0136998.g004

Discussion

MLST is a powerful typing tool in the study of clonal relationships and population structures of bacteria. Unfortunately, its use is often hindered by cost and time when analyzing a large number of samples or when tracking epidemic spread in a timely manner. In this context, MLMT described hereby can be regarded as a simplified version of MLST with increased efficiency and cost-effectiveness. The entire MLMT procedure could be finished within 3 hours on a real-time PCR machine and the cost was approximately 50 times less than MLST. The ability of detecting multiple SNPs in one reaction further helps to simplify the operation and increase the throughput. Moreover, the interpretation of Tms into binary codes can be easily automated and the efficiency of detection can be further improved.

As an MLST derivative method, it is essential for MLMT to keep its typing results associated with MLST. Because the discriminatory SNPs of MLMT are extracted from the concatenated sequences of the housekeeping genes used by MLST, the SNPs-derived MT could thus be associated with ST. This association was given in a Microsoft Excel spreadsheet in our study (S5 Table for 798 STs and S8 Table for 1100 STs). According to the spreadsheet, every ST can be assigned to an MT, and an MT can be assigned to an ST or a group of STs. For one isolate, if the MT obtained by MLMT is included in the spreadsheet, it can be assigned to one or a group of known STs in the spreadsheet, though it may also represent a new ST not included in the spreadsheet. If, on the other hand, the MT is not included in the spreadsheet, it must represent a new ST. Consequently, a known MT detected by MLMT may or may not be a new ST, but an unknown MT detected must be a new ST. Through this association, the portable nature of MLST is kept in MLMT, making MTs accessible and exchangeable among different laboratories.
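The interpretation rule in the preceding paragraph can be written as a short lookup. This is a sketch under the assumption that the association spreadsheet has been loaded into a mapping from MT code to the list of STs it contains; the function name, the status strings, and the codes and STs below are hypothetical.

```python
def interpret_melt_type(code, mt_to_sts):
    """Interpret an observed MT code against a conversion table.

    Returns a (status, candidate STs) pair. An MT that is absent from
    the table must represent a new ST; an MT that is present points to
    its known STs but may still be a new ST.
    """
    candidates = mt_to_sts.get(code)
    if candidates is None:
        return ("new ST", [])
    return ("known ST(s) or new ST", sorted(candidates))


# Hypothetical conversion table with 3-digit codes for brevity.
table = {"101": [3, 27], "110": [345]}
print(interpret_melt_type("101", table))
print(interpret_melt_type("111", table))
```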

The 12-SNP MLMT scheme could discriminate the majority of STs. In order to resolve more STs, more SNPs should be included to increase the SID but at the cost of additional detection reactions. Obviously, an optimal MLMT scheme should have a high SID but with fewer SNPs. This approach however might miss some SNPs that can discriminate single locus variants (SLVs) in certain CCs. For example, MT-3 contains 14 SLVs of CC3, meaning that MLMT could not discriminate all of these 14 SLVs despite its SID being as high as 0.997. To discriminate these SLVs, additional SNPs need to be searched within STs contained in MT-3. For this purpose, an association spreadsheet between MT and ST is needed to evaluate the concordance between MTs and STs. We are now developing software named “MTsum”, which is able to generate this spreadsheet. It can also calculate the total number of MTs and the number of MTs that have a single ST. Once a set of SNPs is chosen by Minimum SNPs, the generated spreadsheet can be used to guide a new round of SNP selection for improved discrimination. After several rounds of selection, a set of SNPs with ideal discrimination can be obtained. We thus believe that by the combined use of MTsum and Minimum SNPs, an optimal SNP set that is able to discriminate the largest number of STs could be produced.
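The kind of summary described above (an MT to ST association, the total number of MTs, and the number of MTs holding a single ST) can be sketched as follows. This is only an illustration of the bookkeeping and is not the MTsum software; it assumes that the binary code of each ST has already been derived from its SNP alleles, and all names and values are hypothetical.

```python
from collections import defaultdict


def summarize_melt_types(st_to_code):
    """Group STs by MT code and report two summary counts.

    Returns (mapping of MT code to STs, total number of MTs,
    number of MTs that contain exactly one ST).
    """
    mt_to_sts = defaultdict(list)
    for st, code in st_to_code.items():
        mt_to_sts[code].append(st)
    total_mts = len(mt_to_sts)
    single_st_mts = sum(1 for sts in mt_to_sts.values() if len(sts) == 1)
    return dict(mt_to_sts), total_mts, single_st_mts


# Hypothetical ST-to-code assignments with 3-digit codes for brevity.
codes = {1: "101", 2: "101", 3: "110", 4: "000"}
association, total, unique = summarize_melt_types(codes)
print(association)
print(total, unique)
```

MTs that hold several STs, such as code "101" in this example, mark the groups in which additional SNPs would have to be sought in a further round of selection.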

In our MLMT scheme, SNPs from the recA gene were excluded. Despite the high discriminatory power, recA is frequently recombined and is thus often not recommended as a molecular marker for evolutionary analyses of V. parahaemolyticus [8, 10, 33, 34]. Moreover, many SNPs in recA alleles (but not in other genes) were derived from long segment inserts of horizontal gene transfer [10, 34], and these SNPs may interfere with identification of the allelic types of the original recA. Therefore, the omission of recA could obviate the above uncertainties.

V. parahaemolyticus has a diverse population structure [35, 36]. For example, goeBURST analysis of the 798 STs in the MLST database of V. parahaemolyticus identified 94 CCs (or groups) and 513 singletons. At the current stage, the 12-SNP MLMT scheme could only resolve 798 STs into 427 MTs (S5 Table). However, once the MT is known, those STs contained in it can be identified by sequencing only the discrepant loci instead of all the seven housekeeping genes. In this regard, a preliminary MLMT analysis prior to MLST could substantially save time and cost, in particular when a large number of samples are involved.
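The idea of sequencing only the discrepant loci can be illustrated by comparing the allelic profiles of the STs that share one MT. The sketch below assumes that profiles are ordered as the seven genes are listed in the Methods; the function name and the allele numbers are hypothetical. Because an isolate with a known MT may still carry a new ST, the result should be read as the minimum set of loci needed to tell the known candidates apart.

```python
LOCI = ("recA", "dnaE", "gyrB", "dtdS", "pntA", "pyrC", "tnaA")


def discrepant_loci(profiles):
    """Return the loci at which the candidate STs do not all agree.

    `profiles` maps an ST identifier to a tuple of seven allele
    numbers ordered like LOCI.
    """
    return [
        locus
        for index, locus in enumerate(LOCI)
        if len({profile[index] for profile in profiles.values()}) > 1
    ]


# Hypothetical candidate STs sharing one MT.
candidates = {
    "ST-a": (3, 4, 4, 29, 4, 19, 22),
    "ST-b": (3, 4, 4, 29, 4, 11, 22),
    "ST-c": (3, 4, 5, 29, 4, 19, 22),
}
print(discrepant_loci(candidates))  # ['gyrB', 'pyrC']
```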

In conclusion, we developed a melting curve-based MLST derivative, MLMT, which allows rapid and cost-effective typing of V. parahaemolyticus. The ease of use and high throughput would facilitate processing a large number of isolates. We thus expect its immediate application in clinical microbiology laboratories where real-time PCR instruments are commonly equipped.

Supporting Information

S1 Table. The sequences of probes and their three types of Tm values.

https://doi.org/10.1371/journal.pone.0136998.s001

(DOCX)

S4 Table. The amount of primers and probes used in MLMT.

https://doi.org/10.1371/journal.pone.0136998.s004

(DOCX)

S5 Table. The conversion keys for MLMT to 798 STs (21 June, 2013) of V. parahaemolyticus MLST database.

https://doi.org/10.1371/journal.pone.0136998.s005

(XLSX)

S7 Table. 218 sequenced V. parahaemolyticus isolates in this study.

https://doi.org/10.1371/journal.pone.0136998.s007

(DOCX)

S8 Table. The conversion keys for MLMT to 1100 STs (23 July, 2014) of V. parahaemolyticus MLST database.

https://doi.org/10.1371/journal.pone.0136998.s008

(XLSX)

Acknowledgments

We are grateful to Dr. Philip M. Giffard for providing the Minimum SNPs software. We also thank Dr. Ineke Rood and Miss Jingyi Li for their assistance in the revision of the manuscript.

Author Contributions

Conceived and designed the experiments: QL JN RL ZL. Performed the experiments: RL ZL. Analyzed the data: RL ZL YX Y. Liao. Contributed reagents/materials/analysis tools: QH JH XS Y. Li. Wrote the paper: QL RL ZL JN.

References

  1. 1. Letchumanan V, Chan K-G and Lee L-H.Vibrio parahaemolyticus: a review on the pathogenesis, prevalence, and advance molecular identification techniques. Front Microbiol. 2014;5:705. pmid:25566219
  2. 2. Su YC, Liu C. Vibrio parahaemolyticus: a concern of seafood safety. Food Microbiol. 2007 Sep;24(6):549–558. pmid:17418305
  3. 3. Bag PK, Nandi S, Bhadra RK, Ramamurthy T, Bhattacharya S, Nishibuchi M, et al. Clonal Diversity among Recently Emerged Strains of Vibrio parahaemolyticus O3: K6 Associated With Pandemic Spread. http://www.medsci.cn/sci/submit.do?id=395b3600 J Clin Microbiol. 1999;37(7):2354–2357. pmid:10364615
  4. 4. Velazquez-Roman J, Leon-Sicairos N, de Jesus Hernandez-Diaz L, Canizalez-Roman A. Pandemic Vibrio parahaemolyticus O3:K6 on the American continent. Front Cell Infect Microbiol. 2014;3:110. pmid:24427744
  5. 5. Gil AI, Miranda H, Lanata CF, Prada A, Hall ER, Barreno CM, et al. O3:K6 serotype of Vibrio parahaemolyticus identical to the global pandemic clone associated with diarrhea in Peru. Int J Infect Dis. 2007 Jul;11(4):324–328. pmid:17321179
  6. 6. Martinez-Urtaza J, Simental L, Velasco D, DePaola A, Ishibashi M, Nakaguchi Y, et al. Pandemic Vibrio parahaemolyticus O3: K6, Europe. Emerg Infect Dis. 2005;11(8):1319–1320. pmid:16110585
  7. 7. Daniels NA, MacKinnon L, Bishop R, Altekruse S, Ray B, Hammond RM, et al. Vibrio parahaemolyticus infections in the United States, 1973–1998. J Infect Dis. 2000;181(5):1661–1666. pmid:10823766
  8. 8. Gonzalez-Escalona N, Martinez-Urtaza J, Romero J, Espejo RT, Jaykus LA, DePaola A. Determination of molecular phylogenetics of Vibrio parahaemolyticus strains by multilocus sequence typing. JBacteriol. 2008 Apr;190(8):2831–2840.
  9. 9. Sara Urmersbach TA, Madura Sanjeevani Gonsal Koralage, Lisa Sperling, Gunnar Gerdts, Ute Messelhäusser, Stephan Huehn. Population analysis of Vibrio parahaemolyticus originating from different geographical regions demonstrates a high genetic diversity. BMC Microbiol. 2014;59(14).
  10. 10. Theethakaew C, Feil EJ, Castillo-Ramirez S, Aanensen DM, Suthienkul O, Neil DM, et al. Genetic relationships of Vibrio parahaemolyticus isolates from clinical, human carrier, and environmental sources in Thailand, determined by multilocus sequence analysis. Appl Environ Microbiol. 2013 Apr;79(7):2358–2370. pmid:23377932
  11. 11. Robertson GA, Thiruvenkataswamy V, Shilling H, Price EP, Huygens F, Henskens FA, et al. Identification and interrogation of highly informative single nucleotide polymorphism sets defined by bacterial multilocus sequence typing databases. J Med Microbiol. 2004;53(1):35–45.
  12. 12. Sheludchenko MS, Huygens F, Hargreaves MH. Highly discriminatory single-nucleotide polymorphism interrogation of Escherichia coli by use of allele-specific real-time PCR and eBURST analysis. Appl Environ Microbiol. 2010 Jul;76(13):4337–4345. pmid:20453128
  13. 13. Huygens F, Inman-Bamber J, Nimmo GR, Munckhof W, Schooneveldt J, Harrison B, et al. Staphylococcus aureus genotyping using novel real-time PCR formats. J Clin Microbiol. 2006 Oct;44(10):3712–3719. pmid:17021101
  14. 14. Price EP, Inman-Bamber J, Thiruvenkataswamy V, Huygens F, Giffard PM. Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants. BMC Bioinformatics. 2007;8:278. pmid:17672919
  15. 15. Levesque S, Michaud S, Arbeit RD, Frost EH. High-resolution melting system to perform multilocus sequence typing of Campylobacter jejuni. PLoS One. 2011;6(1):e16167. pmid:21297862
  16. 16. Lilliebridge RA, Tong SY, Giffard PM, Holt DC. The utility of high-resolution melting analysis of SNP nucleated PCR amplicons—an MLST based Staphylococcus aureus typing scheme. PLoS One. 2011;6(6):e19749. pmid:21731606
  17. 17. Tong SY, Xie S, Richardson LJ, Ballard SA, Dakh F, Grabsch EA, et al. High-resolution melting genotyping of Enterococcus faecium based on multilocus sequence typing derived single nucleotide polymorphisms. PLoS One. 2011;6(12):e29189. pmid:22195020
  18. 18. Richardson L, Tong S, Towers R, Huygens F, McGregor K, Fagan P, et al. Preliminary validation of a novel high-resolution melt-based typing method based on the multilocus sequence typing scheme of Streptococcus pyogenes. Clin Microbiol Infect. 2011;17(9):1426–1434.
  19. 19. Huang Q, Liu Z, Liao Y, Chen X, Zhang Y, Li Q. Multiplex fluorescence melting curve analysis for mutation detection with dual-labeled, self-quenched probes. PLoS One. 2011;6(4):e19206. pmid:21552536
  20. 20. Xiong F, Huang Q, Chen X, Zhou Y, Zhang X, Cai R, et al. A Melting Curve Analysis–Based PCR Assay for One-Step Genotyping of β-Thalassemia Mutations: A Multicenter Validation. J Mol Diagn. 2011;13(4):427–435. pmid:21704277
  21. 21. Liao Y, Zhou Y, Guo Q, Xie X, Luo E, Li J, et al. Simultaneous detection, genotyping, and quantification of human papillomaviruses by multicolor real-time PCR and melting curve analysis. J Clin Microbiol. 2013;51(2):429–435. pmid:23175255
  22. 22. Hu S, Li G, Li H, Liu X, Niu J, Quan S, et al. Rapid Detection of Isoniazid Resistance in Mycobacterium tuberculosis Isolates by Use of Real-Time-PCR-Based Melting Curve Analysis. J Clin Microbiol. 2014;52(5):1644–1652. pmid:24599986
  23. 23. Singh SK, Koshkin AA, Wengel J, Nielsen P. LNA (locked nucleic acids): synthesis and high-affinity nucleic acid recognition. Chem Commun. 1998;(4):455–456.
  24. 24. Makino K, Oshima K, Kurokawa K, Yokoyama K, Uda T, Tagomori K, et al. Genome sequence of Vibrio parahaemolyticus: a pathogenic mechanism distinct from that of V cholerae. Lancet. 2003;361(9359):743–749. pmid:12620739
  25. 25. Jensen RV, Depasquale SM, Harbolick EA, Hong T, Kernell AL, Kruchko DH, et al. Complete Genome Sequence of Prepandemic Vibrio parahaemolyticus BB22OP. Genome Announc. 2013 Jan;1(1). pii: e00002-12.
  26. 26. Francisco AP, Bugalho M, Ramirez M, Carrico JA. Global optimal eBURST analysis of multilocus typing data using a graphic matroid approach. BMC Bioinformatics. 2009;10:152. pmid:19450271
  27. 27. Francisco AP, Vaz C, Monteiro PT, Melo-Cristino J, Ramirez M, Carriço JA. PHYLOViZ: phylogenetic inference and data visualization for sequence based typing methods. BMC Bioinformatics. 2012;13(1):87.
  28. 28. Feil EJ, Li BC, Aanensen DM, Hanage WP, Spratt BG. eBURST: Inferring Patterns of Evolutionary Descent among Clusters of Related Bacterial Genotypes from Multilocus Sequence Typing Data. J Bacteriol. 2004;186(5):1518–1530. pmid:14973027
  29. 29. Sanchez JA, Pierce KE, Rice JE, Wangh LJ. Linear-after-the-exponential (LATE)-PCR: an advanced method of asymmetric PCR and its uses in quantitative real-time analysis. Proc Natl Acad Sci U S A. 2004 Feb 17;101(7):1933–1938. pmid:14769930
  30. 30. Hubert L, Arabie P. Comparing partitions. J Classif. 1985;2(1):193–218.
  31. 31. Severiano A, Pinto FR, Ramirez M, Carrico JA. Adjusted Wallace coefficient as a measure of congruence between typing methods. J Clin Microbiol. 2011 Nov;49(11):3997–4000. pmid:21918028
  32. 32. Severiano A, Carriço JA, Robinson DA, Ramirez M, Pinto FR. Evaluation of Jackknife and Bootstrap for Defining Confidence Intervals for Pairwise Agreement Measures. PLoS One. 2011;6(5):e19539. pmid:21611165
  33. 33. Yu Y, Hu W, Wu B, Zhang P, Chen J, Wang S, et al. Vibrio parahaemolyticus isolates from southeastern Chinese coast are genetically diverse with circulation of clonal complex 3 strains since 2002. Foodborne Pathog Dis. 2011;8(11):1169–1176. pmid:21883006
  34. 34. Rapa RA, Islam A, Monahan LG, Mutreja A, Thomson N, Charles IG, et al. A genomic island integrated into recA of Vibrio cholerae contains a divergent recA and provides multi-pathway protection from DNA damage. Environ Microbiol. 2015;17(4):1090–1102.
  35. 35. Gavilan RG, Zamudio ML, Martinez-Urtaza J. Molecular epidemiology and genetic variation of pathogenic Vibrio parahaemolyticus in Peru. PLoS Negl Trop Dis. 2013;7(5):e2210. pmid:23696906
  36. 36. Turner JW, Paranjpye RN, Landis ED, Biryukov SV, Gonzalez-Escalona N, Nilsson WB, et al. Population structure of clinical and environmental Vibrio parahaemolyticus from the Pacific Northwest coast of the United States. PLoS One. 2013;8(2):e55726. pmid:23409028