Bayesian Top-Down Protein Sequence Alignment with Inferred Position-Specific Gap Penalties
Log-log plots of program runtimes as a function of the total input length.
Each data point corresponds to one MSA generated by the program indicated. Estimated time complexities based on trendline slopes were for: GISMO, t ∝ N0.96; Clustal-Ω, t ∝ N1.6; Kalign, t ∝ N1.6; MUSCLE, t ∝ N2.1 and Dialign t ∝ N2.2, where N is the total number of residues in the aligned sequences. A trendline is not shown for MAFFT because (with the–auto option) it uses one of several different algorithms depending on the input sequence set; this produces a discontinuity in the data points.