Multiple displacement amplification (MDA) is a widely used technique for amplification of DNA from samples containing limited amounts of DNA (e.g., uncultivable microbes or clinical samples) before whole genome sequencing. Despite its advantages of high yield and fidelity, it suffers from high amplification bias and non-specific amplification when amplifying sub-nanogram of template DNA. Here, we present a microfluidic digital droplet MDA (ddMDA) technique where partitioning of the template DNA into thousands of sub-nanoliter droplets, each containing a small number of DNA fragments, greatly reduces the competition among DNA fragments for primers and polymerase thereby greatly reducing amplification bias. Consequently, the ddMDA approach enabled a more uniform coverage of amplification over the entire length of the genome, with significantly lower bias and non-specific amplification than conventional MDA. For a sample containing 0.1 pg/μL of E. coli DNA (equivalent of ~3/1000 of an E. coli genome per droplet), ddMDA achieves a 65-fold increase in coverage in de novo assembly, and more than 20-fold increase in specificity (percentage of reads mapping to E. coli) compared to the conventional tube MDA. ddMDA offers a powerful method useful for many applications including medical diagnostics, forensics, and environmental microbiology.
Citation: Rhee M, Light YK, Meagher RJ, Singh AK (2016) Digital Droplet Multiple Displacement Amplification (ddMDA) for Whole Genome Sequencing of Limited DNA Samples. PLoS ONE 11(5): e0153699. https://doi.org/10.1371/journal.pone.0153699
Editor: Chandan Kumar-Sinha, University of Michigan, UNITED STATES
Received: November 19, 2015; Accepted: April 3, 2016; Published: May 4, 2016
Copyright: © 2016 Rhee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All sequencing data files are available from the BioProject database (accession number SRR3182712) at the following link, http://www.ncbi.nlm.nih.gov/bioproject/312675.
Funding: Financial support for the work was provided by the grants: R01 DE020891, funded by the NIDCR and ENIGMA, a LBNL Scientific Focus Area Program supported by the U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research. Sandia is a multi-program laboratory operated by Sandia Corporation, a Lockheed Martin Company, for US DOE’s Nuclear Security Administration under contract DE-AC04-94AL85000. The funder provided support in the form of salaries for authors MR, YKL, RJM, and AKS, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.
Competing interests: Authors MR, YKL, RJM, and AKS are current or former employees of Sandia National Laboratories. Sandia National Laboratories did not have any role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors declare no competing interests related to their employment with Sandia National Laboratories. Author MR is a current employee of Illumina, Co. Illumina did not have any role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The author declares no competing interests related to his employment with Illumina. This did not alter the authors' adherence to PLOS ONE policies on sharing data and materials.
Whole genome sequencing is beneficial for the study of samples with limited DNA such as difficult-to-culture microorganisms and the analysis of clinical samples [1–6], but most DNA sequencing technologies require nanogram to microgram amounts of DNA for library preparation, while a single bacterial or human cell contains only a few femtograms or picograms of DNA template. When dealing with the limited amounts of DNA from a single or a few cells, it is necessary to perform whole genome amplification to obtain sufficient material for preparation of a sequencing library . Multiple displacement amplification (MDA) is the most common of several techniques used [8–9] for amplifying limited input DNA . MDA is an isothermal method using random primers and the strand-displacing ϕ29 DNA polymerase for high yield, high fidelity amplification [10–12]. The ϕ29 polymerase has an error rate at least one order of magnitude lower than other DNA polymerase enzymes, which is a major advantage of MDA for high fidelity genomic studies. MDA generates substantially more DNA than thermal cycling processes such as PCR . Despite its advantages, MDA is hampered by amplification bias and non-specific amplification that may compromise subsequent genome sequencing [14–15]. when using less than a nanogram of starting template [16–17] Amplification bias is caused by preferential priming of certain sequences, which leads to highly uneven representation of the template DNA [6, 18] after exponential amplification. Nonspecific amplification is caused by exogenous DNA present in the reagents, or by formation of amplifiable dimers of the random primers used for MDA [10, 19]. The first demonstration of single cell genome sequencing using MDA reported >109 fold amplification of isolated E. coli single cells (~5 fg) . However, only 30% of the DNA amplicon was specific to the original E. coli sequence, and 70% was derived from random primer dimers or other DNA contaminants.
One way to improve MDA amplification for small amounts of template is to use smaller reaction volumes to increase the effective template DNA concentration while maintaining the same concentrations of other reagents, including any contaminating DNA. For example, Hutchison et al. demonstrated that improved specificity could be achieved by reducing the volume of the MDA reaction from 50 μL to 600 nL, although no clear improvement was observed in amplification bias . Marcy et al. used a microfluidic device to further reduce the MDA reaction volume down to 60 nL. This increased the specificity up to 80–95% from a single cell and reduced sequencing bias as well . Applying the same principle of volume reduction, digital MDA (dMDA) in an array of ~6 nL microfluidic chambers was used as a method to detect extremely small amounts of DNA fragments of unknown sequence . More recently, MDA in microfabricated wells (~12 nL) was demonstrated, with a modified MDA protocol incorporating a second strand-displacing DNA polymerase . They confirmed more than 80% of assembled bases were mapped to the original E. coli template and showed 88–94% coverage of the entire E. coli genome.
All of the techniques presented above to improve MDA performance require complex or labor-intensive protocols, or rely on complex, custom-fabricated microfluidic devices that constrain the reaction volume to nanoliter level. Here, we present droplet-based whole bacterial genome amplification in millions of sub-nanoliter droplets, which we call droplet digital MDA (ddMDA). The only custom equipment required is a microfluidic droplet generator, versions of which are available commercially. We demonstrate that simply partitioning a conventional MDA reaction into many droplets with volumes of 150 pL improves the quality of whole genome amplification, compared to an identical reaction performed at a conventional microliter scale.
Materials and Methods
Device fabrication and operation
The microfluidic droplet generators were built as previously described . For the continuous phase, fluorinated oil (HFE-7500, 3M) with 2% (w/w) surfactant (PicoSurf®, Dolomite) was used. All fluids were actuated by pressurizing off-chip reservoirs using 0–15 psi scalable pressure modulators (Pneutronics) connected to in-house nitrogen gas. The typical range of operation was 1.9–2.1 psi. The pressure modulators were controlled by a custom LabView interface. Device operation was monitored using an Olympus IX71 inverted microscope equipped with an interline CCD camera (Andor Clara). Droplets were collected into a microcentrifuge tube for incubation immediately following droplet generation.
Escherichia coli K-12 MG1655 (ATCC, Manassas, VA) was cultured in Luria broth and the cell concentration was determined by optical density at 600 nm (OD600 of 1.0 = 1x 109 cells/ml) after washing with PBS. The cells were diluted to four different concentrations in water and heated at 95°C for 10 minutes for thermal lysis and denaturation followed by immediate cool-down on ice. MDA reaction mixture consisted of 1X MDA reaction buffer (40 mM Tris-HCl (pH7.5), 50 mM KCl, 10 mM MgCl2, and 5 mM (NH4)2SO4), 0.5 mM dNTPs, 50 μM random hexamers, 4% PEG-400, 2% DMSO, 20 μM SYBR Green and 0.15 μM of ϕ29 DNA polymerase (New England Biolabs, Ipswich, MA). MDA ready samples were prepared by adding the MDA reaction mixture to varying concentrations of the thermally lysed cells. The final volume for the samples was 300 μL. A 20 μL aliquot from each concentration was taken for tube MDA and transferred to a 200 μL PCR tube, while the rest 280 μL was used for droplet MDA. The reaction mixes containing template and MDA mix were maintained at 4°C until they were immediately emulsified into droplets of 150 pL average volume using the droplet generation chip. Multiple replicates of 60 μL volume of MDA droplets (approximately consisting of 42 μL of aqueous droplets and 18 μL of fluorinated oil) were collected into 200 μL PCR tubes. The PCR tubes for tube MDA and ddMDA were brought together to the thermocycler (PTC-225 DNA Engine Thermocycler, MJ Research) and incubated under the same condition at 30°C for 18 hours. We used a standard MDA protocol without extraordinarily stringent cleaning steps [24–25] or post-amplification treatments for purification. We excluded the effect of the increased effective concentration of template DNA by using the same concentrations of all reagents for both tube MDA and ddMDA.
Sequencing library preparation
The amplified DNA (60 μL of ddMDA or 20 μL of tube MDA) was cleaned using a DNA Clean & Concentrator™-5 (Zymo Research, Irvine, CA) and the control genomic DNA was prepared from 10ml of overnight culture of E.coli K-12 MG1655 using ZR Fungal/Bacterial DNA MiniPrep™ (Zymo Research) by following the manufactures’ protocols. After quantification using Qubit® dsDNA HS Assay Kit (Life Technologies, San Diego, CA), one nanogram each of purified DNA from the MDA samples and one nanogram of genomic DNA were prepared for a sequencing library using Nextera XT kit (Illumina, San Diego, CA) and pooled together as the manufacture’s protocol. Sixteen picomolar of the pooled library was loaded to MiSeq (Illumina) for a 75-cycle paired-end reads using the v3 chemistry.
Mapping, mapping visualization and genome assembly
MiSeq output fastq sequences were filtered and trimmed using Trimmomatic . For the mapping analysis, single-end reads were mapped to E. coli K-12 MG1655 reference genome (NCBI, NC_000913.3) using Bowtie 2 with default settings in ‘end-to-end’ alignment mode . The mapping coverage to the genome was visualized by BRIG 0.95 using the SAM files from Bowtie 2 as input files . GC content of the reference genome was calculated with sliding windows, 10000 bases for whole genome and 2000 bases for zoomed-in region, using Bioconductor packages and genome coverage was visualized with Sushi  using bedgraphs created by bedtools (v.2.24.0) genomecov suite  as input files. The de novo assembly of sequences was performed using SPAdes 3.5 genome assembler  with ‘single-cell’, ‘paired-end libraries’ and ‘careful’ options and was evaluated using QUAST . Lorez curve and Gini Index were generated with read depths at each genome position obtained using bedtools (v.2.24.0) genomecov suite and binned using R.
Results and Discussion
The amplification bias during conventional MDA is mostly caused by preferential priming, where certain preferred portions of the template are repeatedly favored and exponentially amplified as the reaction continues. This results in uneven representation of the template, and uneven coverage in sequencing . Small amounts of template DNA in the sample also leads to increased non-specific amplification as chimeras and contaminating DNA represent a larger fraction of the total amplified DNA relative to the template.
We address both the bias in amplification and non-specific amplification by performing MDA in droplets where a sample containing E. coli genomic DNA into millions of ~150 pL droplets which are amplified in parallel, and then pooled to generate a single sequencing library. This workflow is illustrated in Fig 1A. At the beginning of the reaction, each droplet contains a small subset of the total population of DNA fragments, and amplification in each droplet occurs independently from the other droplets. This reduces the effect of competition for preferential priming and access to polymerase, leading to more even amplification of fragments covering the entire genome (Fig 1B). Contaminating DNA fragments that are present in low initial concentration are also partitioned into a small number of droplets, limiting the representation of these sequences in the final pooled library. Fig 1C shows a fluorescence micrograph of the end product of ddMDA reaction, with an array of droplets each of which contains a discrete hyper-branched MDA product.
(A) The ddMDA procedures as a high quality alternative to the conventional tube MDA. The MDA ready E. coli samples were partitioned into millions of picoliter droplets using a microfluidic droplet generator. Upon collection, droplets were tightly sealed for isothermal incubation at 30°C for 18 hours. DNA amplicons were then purified, cleaned, and prepared for the following sequencing. (B) Denatured and fragmented whole genomes consist of highly amplifiable (yellow) and weakly amplifiable (red) sequences. During tube MDA, yellow fragments are preferred and repeatedly amplified with a high gain until it reaches a concentration plateau, whereas red fragments are less preferred and barely amplify. For ddMDA, DNA fragments are randomly partitioned into picoliter droplets, resulting in different subsets of the template DNA. When a droplet contains yellow fragments, the amplification kinetics favor the yellow fragments, ending up with significant biases on amplification. The enzyme will amplify red fragments at a slower rate only in the absence of yellow fragments. The overall gain of ddMDA is always lower than tube MDA because of the volume constraint. Every droplet is uniquely composed of fragments and ends up with a different amplification gain after MDA. (C) A fluorescence micrograph showing ddMDA endpoint with the initial template DNA concentration of 100 pg/μL. Having started with different parts of the E. coli genome, individual droplets expressed discrete levels of amplification by showing different sizes of DNA amplicon aggregates and different fluorescent signals. The scale bar shows 100μm.
Another feature of the ddMDA reactions was identified is the decreased amplification gain. It is likely that the volume restriction in each droplet limits the amplification reaction and decreases the gain. However, the reduction in amplification gain eventually prevents unlimited exponential growth of preferred sequences, resulting in improved coverage and uniformity of amplification. The amplification gain is thus not only an absolute measure of the amplification yield, but it is also an indirect indicator of the quality of whole genome amplification , where large gains generally correlates with increased amplification bias. It has been known that gains greater than 107 significantly deteriorate amplification chemistry, resulting in poor de novo assembly . This observation suggests that it the amplification gain should be minimized to the level that is barely sufficient for the subsequent sequencing. Average gains for different initial DNA concentrations are shown in Fig A in S1 File. Depending on the initial DNA concentrations, our ddMDA yielded 101−105 gains of amplification, which was far lower than the typical gains in tube MDA (>107) which are associated with poor quality amplification libraries. The amplification gain indeed gradually increased as the volume of droplets increased, which justified the small volume constraints on the gain (Fig B in S1 File). The sacrificed yields from the reduced gain of ddMDA can be compensated with an enormously large number of MDA droplets that enables ultra-high throughput. For sequencing purposes, we collected ~280,000 ddMDA droplets (~ 42 μL in total volume) for each run.
Reduction of amplification bias
To test whether ddMDA alleviates amplification bias, we amplified E. coli genomes by performing tube MDA (20 μL total volume) and ddMDA (42 μL total volume, partitioned into ~300,000 droplets of 150 pL volume), with template DNA concentrations varying from 0.1 pg/μL to 100 pg/μL. We then sequenced the same mass (1 ng) of amplicon prepared from each method. As a control, an amplification-free sequencing library was also prepared directly from 1 ng of E. coli genomic DNA. Fig 2A and 2B shows genome coverage and read depths from tube and droplet MDA with various initial DNA concentrations. Both the coverage and the uniformity of read depths deteriorate over the entire genome as the template concentration decreased from 100 pg/μL to 0.1 pg/μL. However, ddMDA had a markedly improved quality of amplification compared to tube MDA at the same concentrations, and this effect is most pronounced at low template DNA concentrations (Fig 2B).
(A) From the outermost circle, ddMDA for 100 pg/μL (dark purple), tube MDA for 100 pg/μL (bright purple), 10 pg/μL (dark brown), and 10 pg/μL (bright brown), respectively. GC contents across the genome were depicted in black in the innermost circle and reads from genomic DNA were illustrated in green as a reference. (B) From the outermost circle, ddMDA for 1 pg/μL (dark red), tube MDA for 1 pg/μL (bright red), 0.1 pg/μL (dark blue), and 0.1 pg/μL (bright blue), respectively. (C) GC contents (top) and amplification read depths over the entire E. coli genome for ddMDA (middle) and tube MDA (bottom) at the DNA concentration of 10 pg/μL. The right panels show zoomed-in plots of the dotted-line box region of the genome for close-up visualization.
In the coverage maps, empty disconnected regions show missing sequences of the genome where MDA completely failed to amplify DNA. Peaks and valleys reflect regions with high bias in amplification. Most of the coverage peaks appeared at the same positions regardless of the reaction volume and DNA concentrations. This implies that the preferential bias during MDA is inherent and systematic. Furthermore our results are consistent with previous observations that bias is correlated to GC content of template DNA [35–36]. This is illustrated in Fig 2C, with zoomed-in plots for extreme GC content regions of an E. coli genome showing explicit amplification peaks around GC poor regions and valleys around GC rich regions. Note that tube MDA was more sensitive to GC content while ddMDA produced more uniform amplification over the entire genome. While our data is most suggestive of a bias against GC-rich sequences, amplification bias is likely a complex phenomenon arising from multiple parameters such as repetitive regions, template size, GC content, method of denaturation, incubation temperature, and concentration of reactants.
Early literature suggested that pooling of replicate MDA reactions would substantially relieve the amplification bias by making the coverage depth average out statistically , but this was subsequently disproven . Since bias is systematic, pooling of identical microliter-scale MDA reactions with the same template DNA does not improve amplification coverage. By contrast, ddMDA uses hundreds of thousands of droplets, each with a very few fragments comprising different subsets of the total template DNA. Although we ultimately pool the droplets to prepare a single sequencing library, the droplet partitioning is different from pooling a small number of identical microliter-volume reactions.
Improvement of specificity in ddMDA
In an MDA reaction with limited template, small amounts of exogenous or contaminating DNA behaves similarly to highly amplified product and can outcompete template for amplification. In ddMDA, we reduce the impact of contaminating DNA by partitioning template into droplets, which limits the extent to which it can compete with template DNA. The specificity of amplification is determined from the fraction of the total sequencing reads that mapped to the E. coli genome (Table 1 and Fig 3A). We found that, at low concentrations of template DNA, the specificity greatly increased in ddMDA compared to tube MDA. Although ddMDA and tube MDA were not significantly different for high concentrations of template (1–10 pg/μL), our ddMDA showed 14-fold higher fraction of reads mapping to E. coli than the tube MDA for 1 pg/μL template, and 22.5-fold higher mapping at 0.1 pg/μL template.
(A) The fraction of reads correctly mapped to E. coli genome depending on the DNA concentration and the MDA reaction volume. Blue and red columns show ddMDA and tube MDA results, respectively. The green line shows the fold change of the % mapping from tube MDA to ddMDA (= ddMDA/tubeMDA). It stayed around 1 for high concentrations (10–100 pg/μL) while it considerably increased for low concentrations (0.1–1 pg/μL). (B) The fraction of an E. coli genome covered by one or more sequencing reads depending on the DNA concentration and the MDA reaction volume. Tube MDA and ddMDA showed little difference at high concentrations but the fold change significantly increased at low concentrations. (C) The fraction of an E. coli genome covered by contigs during de novo assembly. While the advantage of ddMDA over tube MDA was still limited at high concentrations but the fold change significantly increased at low concentrations. (D) Lorenz curves depict the amplification bias in read coverage across the E. coli genome. Each curve was calculated by evaluating the read depth for each base and using the resultant cumulative distribution function for read depth to determine the cumulative proportion of total genome coverage (y-axis) accounted for by the cumulative proportion of bases (x-axis). The ideal Lorentz curve (black dotted line) for a distribution in which all of the bases have the same coverage and a Lorenz curve for gDNA were plotted for comparison. Other solid curves show ddMDA curves while dotted curves indicate tube MDA. (Upper Left) For 100 pg/μL, ddMDA in dark purple and tube MDA in bright purple. (Bottom Left) For 10 pg/μL, ddMDA in dark brown and tube MDA in bright brown. (Upper Right) For 1 pg/μL, ddMDA in dark red and tube MDA in bright red. (Bottom Right) For 0.1 pg/μL, ddMDA in dark blue and tube MDA in bright blue.
Total number of reads included all sequencing reads mapped and unmapped to E. coli genome. % Reads mapped to genome corresponds to the percentage of sequencing reads that are specifically aligned to E. coli genome. % Genome covered by reads refers to the percentage of E. coli genome covered by one or more sequencing reads. % Genome covered by assembly indicated the percentage of E. coli genome covered by contigs. Gini indices were calculated based on the cumulative distribution of sequencing reads.
De novo assembly of ddMDA products
Quantitative metrics of sequencing coverage and bias are presented in Fig 3 and Table 1. As shown in Fig 3B, the sequencing coverage substantially improved as the concentration increased, which is expected. At high template concentrations (10–100 pg/μL), ddMDA and tube MDA showed almost the same level of coverage at high concentrations. However, ddMDA enabled approximately 13-fold higher coverage at 1 pg/μL and 65-fold higher coverage at 0.1 pg/μL than tube MDA.
Sequence data from both tube MDA and ddMDA libraries were used for de novo assembly of the E. coli genome. We assembled >98% of the E. coli genome from ddMDA when the starting template concentrations were 10 pg/μL and higher (Table 1, Fig 3C), with N50 contig sizes longer than 132 kbp and maximum contig lengths of >268 kbp (Table A in S1 File). The high assembly fraction and the long contig lengths indicate successful high quality amplification. Tube MDA with the same initial concentrations yielded slightly lower coverage levels (95.8–97.5%), which means that the partitioning offered only marginal improvement at such high concentrations. The effect of ddMDA is more dramatic at lower template concentrations, where the assembled coverages for ddMDA showed a 40–220 fold increase compared to those of tube MDA, and a larger number of contigs greater than 500 bases.
The Lorenz curve can be used for read depth distributions of sequencing results to represent uniformity of the sequencing read distribution7. A perfectly uniform read distribution would be one in which every base has the same number of reads, and would be plotted as the straight line y = x (line of perfect uniformity of reads). By contrast, a perfectly unequal distribution would be one in which one base has all the reads and all other bases have none. In that case, the curve would be at y = 0 for all x < 1, and y = 1 when x = 1. Fig 3D illustrates Lorenz curves for sequencing read distributions for different MDA methods with various DNA concentrations. Consistent with Fig 2, higher initial DNA concentrations showed more uniform distributions and our ddMDA yielded greater uniformity than tube MDA for all concentrations although the effect was more significant for lower initial DNA concentrations. The uniformity based on Lorenz curve analysis can be numerically represented as a Gini index (or a Gini coefficient). A perfectly uniform distribution gives a Gini index of 0 while a perfectly unequal distribution gives 1. Gini indices of the read distributions from our ddMDA and tube MDA experiments are listed in Table 1. At all concentrations, ddMDA results in a lower Gini index (meaning higher uniformity of coverage), and this becomes most pronounced at the lower template concentrations (1 pg/μL and 0.1 pg/μL). At these concentrations, tube MDA shows highly biased amplification with Gini coefficients approaching 1. The ddMDA Gini coefficients, while still showing evidence of uneven coverage, are significantly lower than 1, at 0.53 and 0.81.
We showed that performing ddMDA in water-in-oil droplets generated with a simple microfluidic device substantially improved the quality of whole genome amplification compared to a conventional MDA reaction, which was attributed to discretization of template DNA by partitioning into numerous small reaction volumes. The amplification gain depended upon the initial DNA concentration in droplets, and was overall lower than tube MDA where high gain is correlated with high bias. We applied ddMDA to single E. coli cells and accomplished sequencing of almost the whole genome. From de novo assembly, we found that >98% of the genome was correctly assembled via ddMDA at 10 pg/μL or higher concentrations, and significantly higher coverages were achieved than tube MDA at lower template concentrations as well. While tube MDA showed extreme deterioration of amplification quality for initial DNA concentrations of 1 pg/μL (~200 E. coli genomes/μL) or lower, our ddMDA significantly reduced the lower limit of initial DNA concentration for reliable amplification down to 0.1 pg/μL, which corresponds to ~20 E. coli genomes/μL, or only ~0.3% of an E. coli genome per droplet. Note that these results were achieved without using any of the stringent precautions that are often taken for low-template MDA, such as UV irradiation of all reagents or operation in a specialized clean environment. In summary, the benefits of using droplets with picoliter reaction volumes for MDA include improved specificity of amplification, reduction in amplification bias, and highly improved amplification coverage, which are attributed solely to partitioning the template into a large number of small reaction volumes. A most recent publication  showed a good agreement with our findings; yet our study further extended to explain the rationale behind the sequencing quality improvement by showing the preferential bias change depending on GC content and differential amplification gain in droplets.
These results suggest that applying ddMDA to DNA from unculturable organisms would increase the diversity of species amenable to genomic study. The improved specificity and coverage of ddMDA justifies its widespread use in sequencing clinical and environmental samples with limited amounts of DNA, so that novel genomes, chromosomes, genes, and viruses can be amplified and characterized in a high-throughput manner. We note one recent study that demonstrates an emulsion-based MDA technique improves performance with DNA derived from small amounts of human DNA, including high-accuracy detection of SNPs and copy number variation . Coupling ddMDA with on-demand droplet techniques  or droplet sorting methods  would further increase its potentials in metagenomics as well. The current ddMDA protocol leaves room for improvement by employing stringent sample preparation for complete elimination of exogenous DNA, which would further enhance the specificity of amplification. We are also exploring integration of droplet generation, denaturation and amplification into a microfluidic device, although this requires at least one reagent addition step which could be accomplished by picoinjection or droplet merging . Topics for further study and optimization include the effect of denaturation technique (the thermal denaturation used here, versus alkaline denaturation which is more common), the effect of template fragment size prior to partitioning, and the optimal number and volume of droplets for a given amount and complexity of template DNA.
S1 File. Discretization of amplification by ddMDA; Skewness of sequencing reads depth distribution.
Table A. Supplementary statistics of sequencing assembly. Table B. Supplementary statistics for determination of skewness of a distribution. Fig A. Average amplification gains and final concentrations of DNA amplicons depending on the initial DNA concentration. Fig B. The change of amplification gains at different initial DNA concentrations for ddMDA as a function of the reaction volume (= the volume of droplets). Fig C. Comparison of sequencing results over the whole E. coli genome between replicate runs for ddMDA and tube MDA. Fig D. Comparison of statistical representative metrics to determine the degree of skewness of the read depth distributions.
We acknowledge Haifeng Geng for discussions on statistics and Susan Yilmaz for advice on experiments.
Conceived and designed the experiments: MR RJM AKS. Performed the experiments: MR YKL. Analyzed the data: MR YKL. Contributed reagents/materials/analysis tools: MR YKL. Wrote the paper: MR YKL RJM AKS.
- 1. Gole J, Gore A, Richards A, Chiu Y- J, Fung H- L, Bushman D, et al. (2013) Massively parallel polymerase cloning and genome sequencing of single cells using nanoliter microwells. Nat Biotechnol 31: 1126–1132. pmid:24213699
- 2. Zhang K, Martiny AC, Reppas NB, Barry KW, Malek J, Chisholm SW, et al. (2006) Sequencing genomes from single cells by polymerase cloning. Nat Biotechnol 24: 680–686. pmid:16732271
- 3. Rodrigue S, Malmstrom RR, Berlin AM, Birren BW, Henn MR, Chisholm SW. (2009) Whole genome amplification and de novo assembly of single bacterial cells. PLoS ONE 4: e6864. pmid:19724646
- 4. Marcy Y, Ouverney C, Bik EM, Lösekann T, Ivanova N, Martin HG, et al. (2007) Dissecting biological “dark matter” with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth. Proc Natl Acad Sci USA 104: 11889–11894. pmid:17620602
- 5. Wang J, Fan HC, Behr B, Quake SR (2012) Genome-wide single-cell analysis of recombination activity and de novo mutation rates in human sperm. Cell 150: 402–412. pmid:22817899
- 6. Fitzsimons MS, Novotny M, Lo CC, Dichosa AE, Yee-Greenbaum JL, Snook JP, et al. (2013) Nearly finished genomes produced using gel microdroplet culturing reveal substantial intraspecies genomic diversity within the human microbiome. Genome Res 23: 878–888. pmid:23493677
- 7. Motley ST, Picuri JM, Crowder CD, Minich JJ, Hofstadler SA, Eshoo MW. (2014) Improved Multiple Displacement Amplification (iMDA) and Ultraclean Reagents. BMC Genomics 15: 443. pmid:24906487
- 8. Asiello PJ, Baeumner AJ (2011) Miniaturized isothermal nucleic acid amplification, a review. Lap Chip 11: 1420–1430.
- 9. Zong C, Lu S, Chapman AR, Xie S (2012) Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science 338: 1622–1626. pmid:23258894
- 10. Blainey PC, Quake SR (2011) Digital MDA for enumeration of total nucleic acid contamination. Nucleic Acids Res 39: e19. pmid:21071419
- 11. Esteban JA, Salas M, Blanco L (1993) Fidelity of phi 29 DNA polymerase comparison between protein-primed initiation and DNA polymerization. J Biol Chem 268: 2719–2726. pmid:8428945
- 12. Paez JG, Lin M, Beroukhim R, Lee JC, Zhao X, Richter DJ, et al. (2004) Genome coverage and sequence fidelity of phi29 polymerase-based multiple strand displacement whole genome amplification. Nucleic Acids Res 32: e71. pmid:15150323
- 13. Lizardi PM, Huangm X, Zhu Z, Bray-Ward P, Thomas DC, Ward DC. (1998) Mutation detection and single-molecule counting using isothermal rolling-circle amplification. Nat Genet 19: 225–232. pmid:9662393
- 14. Dean FB, Nelson JR, Giesler TL, Lasken RS (2011) Rapid amplification of plasmid and phage DNA using Phi 29 DNA polymerase and multiplyprimed rolling circle amplification. Genome Res 11: 1095–1099.
- 15. Hosono S, Faruqi AF, Dean FB, Du Y, Sun Z, Wu X, et al. (2003) Unbiased whole genome amplification directly from clinical samples. Genome Res 13: 954–964. pmid:12695328
- 16. Nelson JR, Cai YC, Giesler TL, Farchaus JW, Sundaram ST, Ortiz-Rivera M, et al. (2003) TempliPhi, phi29 DNA polymerase based rolling circle amplification of templates for DNA sequencing. Biotechniques 32: S44–S47.
- 17. Lage JM, Leamon JH, Pejovis T, Hamann S, Lacey M (2003) Whole genome analysis of genetic alterations in small DNA samples using hyperbranched strand displacement amplification and array-CGH. Genome Res 13: 294–307. pmid:12566408
- 18. Woyke T, Tighe D, Mavromatis K, Clum A, Copeland A, Schackwitz W, et al. (2010) One bacterial cell, one complete genome. PLoS ONE 5: e10314. pmid:20428247
- 19. Yilmaz S, Singh AK (2012) Single cell genome sequencing. Curr Opin Biotech 23: 437–443. pmid:22154471
- 20. Raghunathan A, Ferguson HR Jr, Bornarth C. Song W. Driscoll M, Lasken RS. (2005) Genomic DNA amplification from a single bacterium. Appl Environ Microbiol 71: 3342–3347. pmid:15933038
- 21. Hutchison CA III, Smith HO, Pfannkoch C, Venter JC (2005) Cell-free cloning using phi29 DNA polymerase. Proc Natl Acad Sci USA. 102: 17332–17336. pmid:16286637
- 22. Marcy Y, Ishoey T, Lasken RS, Stockkwell TB, Walenz BP, Halpern AL, et al. (2007) Nanoliter reactors improve multiple displacement amplification of genomes from single cells. PLoS Genet 3: e155.
- 23. Rhee M, Light YK, Yilmaz S, Adams PD, Saxena D, Meagher RJ, et al. (2014) Pressure stabilizer for reproducible picoinjection in droplet microfluidic systems. Lab Chip 14: 4533–4539. pmid:25270338
- 24. Champlot S, Berthelot C, Pruvost M, Bennett EA, Grange T, Geigl E-M. (2010) An efficient multistrategy DNA decontamination procedure of PCR reagents for hypersensitive PCR applications. PLoS One 5: e13042. pmid:20927390
- 25. Woyke T, Sczyrba A, Lee J, Rinke C, Tighe D, Clingenpeel S, et al. (2011) Decontamination of MDA reagents for single cell whole genome amplification. PLoS One 6: e26161. pmid:22028825
- 26. Bolger AM, Lohse M, Usadel B (2014) A flexible trimmer for Illumina Sequence Data. Bioinformatics 30: 2114–2120. pmid:24695404
- 27. Langmead B, Salzberm SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Meth 9: 357–359.
- 28. Alikhan N, Petty NK, Zakour NLB, Beatson SA (2012) BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics 12: 402.
- 29. Phanstiel DH (2014) Sushi: Tools for visualizing genomics data. R package version 1.4.0.
- 30. Quinlan A, Hail IM. (2010) BEDTools: a flexible suite of utilites for comparing genomic features. Bioinformatics 26: 1072–1075.
- 31. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19: 455–477. pmid:22506599
- 32. Gurevich A, Saveliev V, Vyahhi N, Tesler G. (2013) QUAST: quality assessment tool for genome assemblies, Bioinformatics 29: 1072–1075. pmid:23422339
- 33. Lasken RS (2009) Genomic DNA amplification by the multiple displacement amplification (MDA) method. Biochem Soc Trans 37: 450–453. pmid:19290880
- 34. de Bourcy CFA, Vlaminck ID, Kanbar JN, Wang J, Gawad C, Quake SR (2014) A Quantitative Comparison of Single-Cell Whole Genome Amplification Methods. PLoS ONE 9: e105585. pmid:25136831
- 35. Syvanen AC (2005) Toward genome-wide SNP genotyping. Nat Genet 37: S5–S10. pmid:15920530
- 36. Dietmaier W, Hartmann A, Wallinger S, Heinmöller E, Kerner T, Endl E, et al. (1999) Multiple mutation analyses in single tumor cells with improved whole genome amplification. J Am J Path 1999, 154: 83–95. pmid:9916922
- 37. Abulencia CB, Wyborski DL, Garcia JA, Podar M, Chen W, Chang SH, et al. (2006) Environmental whole-genome amplification to access microbial populations in contaminated sediments. Appl Environ Microbiol 72: 3291–3301. pmid:16672469
- 38. Marine R, McCarren C, Vorrasane V, Nasko D, Crowgey E (2014) Caught in the middle with multiple displacement amplification: the myth of pooling for avoiding multiple displacement amplification bias in a metagenome. Microbiome 2: 3. pmid:24475755
- 39. Sidore AM, Lan F, Lim SW, Abate AR (2015) Enhanced sequencing coverage with digital droplet multiple displacement amplification. Nucleic Acids Res.
- 40. Fu Y, Li C, Lu S, Zhou W, Tang F, Xie XS, Huang Y (2015) Uniform and accurate single-cell sequencing based on emulsion whole-genome amplification. Proc Natil Acad Sci USA 112: 11932–11928.
- 41. Rhee M, Liu P, Meagher RJ, Light YK, Singh AK (2014) Versatile on-demand droplet generation for controlled encapsulation. Biomicrofluidics 8: 034112. pmid:25379072
- 42. Niu X, deMello AJ (2012) Building droplet-based microfluidic systems for biological analysis. Biochem SocTrans 40: 615–623.