Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Table 1.

Values of hyperparameters used in TabDEG model.

More »

Table 1 Expand

Fig 1.

Workflow of TabDEG.

More »

Fig 1 Expand

Fig 2.

UMAP workflow.

More »

Fig 2 Expand

Fig 3.

K-means clustering flowchart.

More »

Fig 3 Expand

Fig 4.

DA workflow diagram.

The data is divided into two parts: disease group (T) and normal group (N). UMAP, KMEANS and PCA methods is used with aim to get mixed feature data in the process of DA. And then the data is standardized to ensure data scale consistency.

More »

Fig 4 Expand

Fig 5.

TabNet structure chart.

(a) TabNet encoder, composed of a feature transformer, an attentive transformer and feature masking. A split block divides the processed representation and decomposition products obtained are used by the attentive transformer of the subsequent step as well as for the overall output. For each step, the feature selection mask provides interpretable information about the model’s functionality and the masks could be aggregated to obtain global feature important attribution. (b) TabNet decoder, composed of a feature transformer block at each step. (c) Attention transformer, the features of the n_a part enter the FC layer and expand to the same dimension as the input features to facilitate subsequent attention calculations, then enter the BN layer, and then multiply with “priors” before entering the sparsemax layer. The sparsemax layer is used for feature selection and “priors” is a constant vector with all entries being 1 in the initial step, which would change in each step. (d) Feature transformer, the output of the feature transformer is divided into two parts: n_d(n features for decision) and n_a(n features for attetion), where n_a is used for further calculation in subsequent steps and n_d is used for the final decision. (the output of the initial step has only n_a-dimensional features, rather than n_d-dimensional features to participate in the final decision).

More »

Fig 5 Expand

Table 2.

Dataset abbreviations for cancer datasets.

More »

Table 2 Expand

Fig 6.

ROC curves with scores for different methods across ten datasets.

More »

Fig 6 Expand

Table 3.

Performance of TabDEG against other five ML models on test data of all ten datasets with five-fold cross-validation being used.

More »

Table 3 Expand

Table 4.

Datasets GO ID/attribute p-value q-value.

More »

Table 4 Expand

Fig 7.

Pathways mapped from predicted UR and DR genes of BRCA.

More »

Fig 7 Expand