A Discriminative Approach for Unsupervised Clustering of DNA Sequence Motifs
(A) Scatter plot of ED.sqr scores against alignment space values observed in inter-class alignments. The alignment space is the product of the lengths of the two aligned motifs, which is proportional to the number of possible alignments. Curves show the conditional mean and the band 2σ above and below it, with the conditional mean and variance estimated by non-parametric regression. (B) Histograms of adjusted ED.sqr scores for inter-class (light) and intra-class (dark) alignments. (C) Histograms of adjusted ED.sqr scores for inter-class (light) and intra-family (dark) alignments.
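
The adjustment implied by panels (A) to (C) can be illustrated with a minimal sketch. It rests on assumptions that the caption does not confirm: that an "adjusted" score is the raw score minus the conditional mean, divided by the conditional standard deviation, both estimated from inter-class alignments; and that the non-parametric regression is a Nadaraya-Watson smoother with a Gaussian kernel. The function names (`alignment_space`, `kernel_smooth`, `adjust_scores`) and the default bandwidth are hypothetical and chosen only for illustration.

```python
import numpy as np


def alignment_space(len_a, len_b):
    """Alignment space: product of the two aligned motif lengths."""
    return len_a * len_b


def kernel_smooth(x_train, y_train, x_query, bandwidth):
    """Nadaraya-Watson estimate of E[y | x] with a Gaussian kernel.

    Assumption: the source only says "non-parametric regression";
    this particular smoother is a stand-in.
    """
    x_train = np.asarray(x_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    x_query = np.atleast_1d(np.asarray(x_query, dtype=float))
    # Pairwise scaled distances, shape (n_query, n_train).
    diffs = (x_query[:, None] - x_train[None, :]) / bandwidth
    weights = np.exp(-0.5 * diffs ** 2)
    return (weights @ y_train) / weights.sum(axis=1)


def adjust_scores(space_ref, score_ref, space_new, score_new, bandwidth=20.0):
    """Standardize scores by the conditional mean and variance.

    The reference set plays the role of the inter-class alignments.
    Assumption: adjusted = (score - mean(space)) / sd(space).
    The estimated variance must be positive at every query point.
    """
    space_ref = np.asarray(space_ref, dtype=float)
    score_ref = np.asarray(score_ref, dtype=float)
    score_new = np.asarray(score_new, dtype=float)
    # Conditional mean on the reference set, then squared residuals.
    mean_ref = kernel_smooth(space_ref, score_ref, space_ref, bandwidth)
    resid_sq = (score_ref - mean_ref) ** 2
    # Conditional mean and variance evaluated at the new alignments.
    mean_new = kernel_smooth(space_ref, score_ref, space_new, bandwidth)
    var_new = kernel_smooth(space_ref, resid_sq, space_new, bandwidth)
    return (score_new - mean_new) / np.sqrt(var_new)
```

Under these assumptions, the inter-class scores would be passed as the reference set, and the intra-class or intra-family scores as the new alignments, so that both histograms in (B) and (C) are expressed on a common scale that no longer depends on motif length.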