Efficient discovery of frequently co-occurring mutations in a sequence database with matrix factorization
Fig 3
The dissimilarity parameter (DSIM), which indicates the uniqueness of a set of factors in the H matrices, calculated for each r value across a range of r.
To better visualize the range of DSIM values, we plotted their logarithm. The light blue region beneath the curve highlights the r values that produced the most unique factors. We opted to keep factors for r values
that included mostly positive DSIM values.