End-to-end sequence-structure-function meta-learning predicts genome-wide chemical-protein interactions for dark proteins

Fig 2

Dark protein space in terms of statistics.

The plot shows, for each Pfam family, the fraction of proteins that have at least one known ligand. Each colored bubble represents a Pfam family, and the size of the bubble is proportional to the total number of proteins in that family. Only 1,734 Pfam families have at least one known small-molecule ligand. Most Pfam families have fewer than 1% of their proteins with known ligands. Furthermore, around 90.2% of the 17,772 Pfam families remain completely dark, with no known ligand-binding information. These “dark regions” represent a vast untapped resource for drug discovery.
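
The 90.2% figure follows directly from the two family counts given in the caption. The short sketch below reproduces that arithmetic; the function and variable names are illustrative assumptions and do not come from the article or its code.

```python
def dark_family_fraction(total_families: int, families_with_ligand: int) -> float:
    """Return the fraction of families with no known ligand.

    Hypothetical helper for illustration only; not part of the article's code.
    """
    if total_families <= 0:
        raise ValueError("total_families must be positive")
    if not 0 <= families_with_ligand <= total_families:
        raise ValueError("families_with_ligand must lie in [0, total_families]")
    return (total_families - families_with_ligand) / total_families


# Counts as stated in the caption.
TOTAL_PFAM_FAMILIES = 17_772
FAMILIES_WITH_LIGAND = 1_734

dark = dark_family_fraction(TOTAL_PFAM_FAMILIES, FAMILIES_WITH_LIGAND)
print(f"{dark:.1%}")  # prints "90.2%"
```

In other words, 16,038 of the 17,772 families have no annotated small-molecule ligand.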

doi: https://doi.org/10.1371/journal.pcbi.1010851.g002