
DPCfam: Unsupervised protein family classification by Density Peak Clustering of large sequence datasets

Fig 6

Histograms showing average overlap between Pfam families and their representative MCs.

Colors reflect the contribution of each MC category to each bin (equivalent, reduced, extended and shifted; see S1(B) Fig and Methods for definitions). A: Overlap between individual Pfam families and their representative MCs. B: Overlap between individual Pfam families or architectures and their representative MCs. Given a Pfam family and its representative MC (same pairs as in A), we search for a better overlap of the representative MC with any multi-family architecture featuring the original Pfam family and up to two additional families. The reported average overlap value is thus the best of the overlap with the original family and the overlap with any such Pfam architecture. Note that the MC category labels (equivalent/reduced/extended/shifted) are still assigned according to the overlap of the representative MC with the original Pfam family, so as to show the extent to which the overlap in each MC category increases with respect to A.
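
The selection rule used for panel B can be illustrated with a minimal sketch. It assumes the average overlaps have already been computed (the overlap measure itself is defined in Methods and is not reproduced here), and that an architecture is given as a tuple of family identifiers in which every entry other than the original family counts as an additional family. The function name `best_overlap`, the parameter `max_extra`, the identifiers and the numerical values are hypothetical and serve only as an illustration.

```python
def best_overlap(family, family_overlap, architecture_overlaps, max_extra=2):
    """Return the best overlap for one representative MC.

    family                -- identifier of the original Pfam family
    family_overlap        -- overlap of the MC with the original family (panel A)
    architecture_overlaps -- dict mapping an architecture (tuple of family
                             identifiers) to the overlap of the MC with it
    max_extra             -- maximum number of additional families allowed
    """
    best = family_overlap
    for architecture, overlap in architecture_overlaps.items():
        # Only architectures featuring the original family are candidates.
        if family not in architecture:
            continue
        # Count every entry other than the original family as additional.
        extra = sum(1 for member in architecture if member != family)
        if 1 <= extra <= max_extra:
            best = max(best, overlap)
    return best


# Hypothetical values, for illustration only.
overlaps = {
    ("PF_A", "PF_B"): 0.9,                   # one additional family: candidate
    ("PF_A", "PF_B", "PF_C", "PF_D"): 0.95,  # three additional families: skipped
    ("PF_B", "PF_C"): 0.99,                  # lacks the original family: skipped
}
result = best_overlap("PF_A", 0.6, overlaps)
```

Because the overlap with the original family is always a candidate, the value returned for panel B can never fall below the corresponding value in panel A.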

doi: https://doi.org/10.1371/journal.pcbi.1010610.g006