Novel comparison of evaluation metrics for gene ontology classifiers reveals drastic performance differences

doi:10.1371/journal.pcbi.1007419

Novel comparison of evaluation metrics for gene ontology classifiers reveals drastic performance differences

Fig 6

Semantic similarities with different summation methods.

We present every combination of three semantic similarity-based methods (Resnik, Lin, AJacc) in columns and six semantic summation methods (A-F, see Methods) in rows. We show again the results for CAFA dataset. The summation methods have a bigger impact on performance than the actual metric. The novel summation methods, E and F, outperform the previous standards, A and D. ADS signal is plotted on the x-axis and the evaluation metric combined with a given summation method is plotted on the y-axis.

doi: https://doi.org/10.1371/journal.pcbi.1007419.g006