Skip to main content
Advertisement

< Back to Article

Fig 1.

This figure illustrates the architecture of Poly(A)-DG.

Each pair of inputs includes a raw DNA sequence and its label to show it contains PAS or not. The raw DNA sequences are encoded into the one-hot format and send into CNN and MLP sub-network. The outputs of CNN and MLP are concatenated and used HEX technique to minimize the differences among different species. The output of CNN is directly used in prediction after softmax and the output of HEX is used in loss calculation. The back-propagation algorithm is employed to tune the model with loss.

More »

Fig 1 Expand

Table 1.

Source domain: Omni human and BL mouse. Target domain: Rat and bovine.

More »

Table 1 Expand

Table 2.

Source domain: Human and bovine. Target domain: BL mouse and rat.

More »

Table 2 Expand

Table 3.

Source domain: BL mouse and bovine. Target domain: Human and rat.

More »

Table 3 Expand

Table 4.

Source domain: Omni human and rat. Target domain: BL mouse and bovine.

More »

Table 4 Expand

Table 5.

Source domain: Rat and BL mouse. Target domain: Omni human and bovine.

More »

Table 5 Expand

Table 6.

Source domain: Rat and bovine. Target domain: Human and BL mouse.

More »

Table 6 Expand

Fig 2.

The data from different species are in the same scale.

(a)training source domain is mixed by Omni Human and Mouse, target species is Rat. (b) training source domain is mixed by Omni Human and bovine, target species is Rat. (c) training source domain is mixed by Rat and Mouse, target species is Omni Human. (d) training source domain is mixed by bovine and Mouse, target species is Rat.

More »

Fig 2 Expand

Fig 3.

Imbalanced source domains: The number of data from the first source species is fixed.

(a)source domain: Omni Human and Mouse, target species: Rat. (b) source domain: Omni Human and bovine, target species: BL Mouse. (c) source domain: BL Mouse and Rat, target species: Omni Human. (d) source domain: BL Mouse and bovine, target species: Rat. (e) source domain: Omni Human and bovine, target species: Rat. (f) source domain: BL Mouse and Rat, target species: bovine.

More »

Fig 3 Expand

Fig 4.

Imbalanced source domains: The number of data from the second source species is fixed.

(a)source domain: Omni Human and Mouse, target species: Rat. (b) source domain: Omni Human and bovine, target species: Rat. (c) source domain: BL Mouse and Rat, target species: bovine. (d) source domain: BL Mouse and bovine, target species: Rat. (e) source domain: Omni Human and Rat, target species: BL Mouse. (f) source domain: BL Mouse and bovine, target species: Omni Human.

More »

Fig 4 Expand

Table 7.

Imbalanced PAS positive-negative ratio.

More »

Table 7 Expand

Table 8.

Imbalanced PAS positive-negative ratio for different species; 1:2 for the first source species and 1:3 for the second one.

More »

Table 8 Expand

Table 9.

Gene with single PAS V.S. gene with multiple PAS; target domain: Rat.

More »

Table 9 Expand

Fig 5.

Visualization of convolution filters of Poly(A)-DG.

(A)Source domain: BL Mouse and Rat. (B)Source domain: BL Mouse and bovine (C)Source domain: Rat and bovine. (D)Source domain: Omni Human and BL Mouse. (E)Source domain: Omni Human and bovine. (F)Source domain: Omni Human and Rat.

More »

Fig 5 Expand

Fig 6.

Visualization of position importance in PAS identification.

(a)Source domain: Omni Human and BL Mouse. (b)Source domain: Omni Human and BL Mouse. (c)Source domain: Omni Human and BL Mouse. (d)Source domain: Omni Human and BL Mouse. (e)Source domain: Omni Human and BL Mouse. (f)Source domain: Omni Human and BL Mouse.

More »

Fig 6 Expand

Table 10.

Predict PAS on chromosome 1 of rat; FPR: False positive rate; FNR: False negative rate.

More »

Table 10 Expand

Table 11.

Performance of Poly(A)-DG on standard cross-species PAS dataset and filtered cross-species PAS dataset.

More »

Table 11 Expand