Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs
Figure 3
Statistical models for predicting cell cycle genes using Random Forest method.
(A) The ROC curves for 3 classification models that use TF-only, motif-only features or a combination of them as predictors. (B) The relative importance (measured as MDG, Mean Decrease in Gini coefficient) of TF features in the combined model (TF+Motif). (C) The relative importance of motif features in the combined model. (D) The change of prediction accuracy (measured as AUC scores) when remove the most important predictor from the full model one by one. Note that cell cycle genes in the training data are from data in Hela cells, and thus we use only TF binding data from the same cell line in our model.