Hybrid De Novo Genome Assembly Using MiSeq and SOLiD Short Read Data

doi:10.1371/journal.pone.0126289

Fig 1.

Overview of the hybrid assembly pipeline.

Raw data generated by several NGS platforms are preprocessed into a common format and registered in the library. The data set used at each assembly stage can be specified separately. The assembly results are named according to the data supplied, as illustrated at the bottom of the figure.

Table 1.

Summary statistics of the input short-read data.

Table 2.

Characteristic indices of the strands from several assemblies.

Table 3.

K-mer size dependence of the R50 values^a of the A. oryzae^b unitigs.

Fig 2.

Dotplot alignments of assembled strands against the reference genome sequence of A. oryzae.

Alignments shorter than 4000 bp were omitted from the plots. Forward and reverse alignments are plotted in red and blue, respectively. The Roman numerals I-VIII on the abscissa indicate the chromosomes of A. oryzae. (a) The MSSH assembly, (b) the denovo2 assembly.
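The filtering and coloring rule stated in this caption can be sketched as follows. This is only an illustrative sketch, not the authors' plotting code: the `Alignment` record, the function names, and the convention that a reverse alignment has its query end before its query start are all assumptions made for the example. Only the 4000 bp threshold and the red/blue color assignment come from the caption.

```python
from typing import List, NamedTuple, Tuple

MIN_ALIGNMENT_BP = 4000  # threshold stated in the caption


class Alignment(NamedTuple):
    """Hypothetical alignment record (1-based, inclusive coordinates)."""
    ref_start: int
    ref_end: int
    qry_start: int
    qry_end: int  # assumed: qry_end < qry_start marks a reverse alignment


def plot_color(aln: Alignment) -> str:
    """Forward alignments are drawn in red, reverse alignments in blue."""
    return "red" if aln.qry_end >= aln.qry_start else "blue"


def filter_for_dotplot(
    alignments: List[Alignment], min_len: int = MIN_ALIGNMENT_BP
) -> List[Tuple[Alignment, str]]:
    """Drop alignments shorter than min_len and attach a plot color."""
    kept = []
    for aln in alignments:
        # Length is measured on the reference axis (an assumption).
        length = abs(aln.ref_end - aln.ref_start) + 1
        if length >= min_len:
            kept.append((aln, plot_color(aln)))
    return kept
```

An alignment of exactly 4000 bp is kept under this reading, because the caption omits only alignments shorter than 4000 bp.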

Table 4.

Characteristics of the A. oryzae^a contigs/scaffolds/strands from several assemblies^b.

Table 5.

Number of ORFs reproduced in the assemblies.

Table 6.

Number of SMB gene clusters reproduced in the assemblies.

Fig 3.

Dotplot alignments of assembled strands against the reference genome sequence of S. avermitilis.

Alignments shorter than 4000 bp were omitted from the plots. Forward and reverse alignments are plotted in red and blue, respectively. (a) The MSSH assembly, (b) the HHHH assembly.
