Skip to main content

Advertisement

PLOS Computational Biology

Browse
Publish
- Submissions
- Policies
- Manuscript Review and Publication
About

Search Search

advanced search

< Back to Article

Figure 1.

Overview of the components constituting the FSA alignment program.
The algorithms that are used in each component are highlighted in the accompanying boxes. The bold arrows show the simplest mode of use for FSA, where posterior probabilities are calculated directly using default parameters for all pairs of sequences and the optional steps of anchor finding and iterative refinement are omitted.

More »

Figure 2.

The default Pair HMM used by FSA.
By default FSA uses a Pair HMM with two sets of Insert (I) and Delete (D) states to generate a two-component geometric mixture distribution. FSA can optionally use a three-state HMM, which has only one set of Insert and Delete states. M is a Match state emitting aligned characters.

More »

Figure 3.

Two alignments (left and right) which make the same homology statements and therefore are both represented by the same POSET (center).
“The mathematics of distance-based alignment” in Text S1 discusses this view of alignments as POSETs. The alignment on the right minimizes the number of gap-open events and as such is appropriate for analyses such as inferring parsimonious indel frequencies across a clade. Alignments are displayed with TeXshade [63].

More »

Figure 4.

Schematic overview of FSA's parallelization strategy on a computer cluster.
For large input sizes, a disk-based database may be used to store some of the primary data structures and reduce memory usage.

More »

Figure 5.

The Java GUI allows users to visualize the estimated alignment accuracy under FSA's statistical model.
FSA's alignment is colored according the expected accuracy under FSA's statistical model (top) as well as according to the “true” accuracy (bottom) given from a comparison between FSA's alignment and the reference structural alignment. It is clear from inspection that accuracies estimated under FSA's statistical model correspond closely to the true accuracies. Sequences are from alignment BBS12030 in the RV12 dataset of BAliBASE 3 [24].

More »

Table 1 — Table 1.

Benchmarks against protein structural databases.

More »

Table 2 — Table 2.

Benchmarks against RNA structural databases.

More »

Table 3 — Table 3.

Benchmarks against simulated mammalian and fly genomic DNA.

More »

Table 4 — Table 4.

Benchmarks against simulated unrelated protein and DNA sequences.

More »

Table 5 — Table 5.

Benchmarks against simulated unrelated genomic DNA.

More »

Table 6 — Table 6.

Comparisons of alignments obtained in codon and amino acid space.

More »

Table 7 — Table 7.

Ablation analysis of FSA on protein structural databases.

More »

Table 8 — Table 8.

Ablation analysis of FSA on RNA structural databases.

More »

Table 9 — Table 9.

Ablation analysis of FSA on simulated mammalian genomic DNA.

More »

Table 10 — Table 10.

Ablation analysis of FSA on simulated unrelated protein and DNA sequences.

More »

Table 11 — Table 11.

Timing comparison of FSA and other methods on 16S sequences.

More »

Table 12 — Table 12.

Timing comparison of FSA in regular and parallelized modes.

More »

Table 13 — Table 13.

Timing comparison of FSA in parallelized mode with different numbers of processors.

More »

Publications
PLOS Aging and Health
PLOS Biology
PLOS Climate
PLOS Complex Systems
PLOS Computational Biology
PLOS Digital Health
PLOS Ecosystems
PLOS Genetics

PLOS Global Public Health
PLOS Medicine
PLOS Mental Health
PLOS Neglected Tropical Diseases
PLOS One
PLOS Pathogens
PLOS Sustainability and Transformation
PLOS Water

Home
Blogs
Collections
Give feedback
LOCKSS

Privacy Policy
Terms of Use
Advertise
Media Inquiries
Contact

PLOS is a nonprofit 501(c)(3) corporation, #C2354500, based in California, US