Skip to main content
Advertisement

< Back to Article

Ranking microbial metabolomic and genomic links in the NPLinker framework using complementary scoring functions

Fig 2

The effect of size on strain correlation scoring.

(A) Size discrepancy in the strain correlation score for GCFs of varying sizes. Each box represents a strain, with filled boxes denoting that the strain is a member of the GCF or MF, and blank boxes that it is not. The top GCF-MF pair outscores the bottom pair by 30 to 26, despite the bottom pair having arguably stronger correspondence. (B) Expected value and variance of the strain correlation score for a population of 100 strains, as a function of GCF and MF sizes. Both the expected value and the variance have a considerable range, rendering comparison between links involving different sizes of GCFs and MFs difficult. For instance, a GCF and MF of size 80 could easily get a score of 500 or higher by chance, while for a GCF and MF of size 20, a score this high would be highly significant.

Fig 2

doi: https://doi.org/10.1371/journal.pcbi.1008920.g002