Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts
Figure 3
The effect of different disease-related protein seeding situation on the specificity and sensitivity of AD drug identification.
In the text retrieval and information extraction component, the AD-related drugs are identified from the retrieved PubMed abstracts relevant to a list of AD proteins. We have an initial set of 49 AD seed proteins. To evaluate the effect of different seeding situations on AD drug identification, we sub-sampled the initial AD seed set into 8 data sets of varying sizes i.e., S5, S10, S15, S20, S25, S30, S35, S40 (the number indicating size) and also generated a random seed set with 50 proteins.. Given different seed sets, Panel (A) shows the specificity performances of AD-related drug identification at top N drugs determined by FDR (false discovery rate), and Panel (B) shows the sensitivity performances.