Prediction of virus-host associations using protein language models and multiple instance learning
Fig 3
Performance of multi-class classification using ESM-1b and k-mer features.
A and B show the AUC and accuracy, respectively, for prokaryotic and eukaryotic hosts using four feature sets (ESM-1b, AA_2, PC_3 and DNA_5). The AUC and accuracy values are equivalent to those presented in Table 1. C and D show the results obtained by testing the trained models on prokaryotic and eukaryotic hosts associated with 5 to 30 viruses, using the same four feature sets.
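For readers unfamiliar with k-mer feature sets such as DNA_5, the following minimal Python sketch shows one common way to build such a vector. It assumes that DNA_5 denotes normalized frequencies of nucleotide 5-mers, which the caption does not state explicitly; the function name `kmer_frequencies` and the example sequence are hypothetical, and the code is not the authors' pipeline.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k, alphabet):
    """Return normalized k-mer frequencies as a fixed-length vector.

    Assumption: a feature set such as DNA_5 is taken to mean k = 5 over the
    nucleotide alphabet "ACGT". Windows containing symbols outside the
    alphabet (e.g. "N") are skipped.
    """
    sequence = sequence.upper()
    allowed = set(alphabet)
    counts = Counter(
        sequence[i:i + k]
        for i in range(len(sequence) - k + 1)
        if set(sequence[i:i + k]) <= allowed
    )
    total = sum(counts.values())
    # Enumerate every possible k-mer in a fixed order so that vectors from
    # different sequences are directly comparable.
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    return [counts[m] / total if total else 0.0 for m in kmers]


# Hypothetical toy sequence; a real input would be a viral genome.
dna_5 = kmer_frequencies("ACGTACGTAC", k=5, alphabet="ACGT")
```

The resulting vector has one entry per possible k-mer (4^5 = 1024 for nucleotide 5-mers), so every virus is represented by a vector of the same length regardless of genome size.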