Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1.

The overall architecture of the proposed CT-GateNet model.

More »

Fig 1 Expand

Fig 2.

Diagram of our Gated Channel-Spatial Attention (GCSA) mechanism.

More »

Fig 2 Expand

Fig 3.

The Adaptive Feature Fusion Gate (AFFG) mechanism.

More »

Fig 3 Expand

Table 1.

Experimental datasets overview.

More »

Table 1 Expand

Fig 4.

Representative spectrograms of music samples from the employed datasets: (a) Country genre from the GTZAN dataset, (b) Folk genre from the FMA-SMALL dataset, (c) Electronic genre from the FMA-Medium dataset.

More »

Fig 4 Expand

Table 2.

DDIM audio generation algorithm.

More »

Table 2 Expand

Fig 5.

An example of the generation model on the GTZAN dataset.

More »

Fig 5 Expand

Table 3.

Performance comparison with state-of-the-art methods on GTZAN dataset.

More »

Table 3 Expand

Table 4.

Performance comparison of deep learning backbones and proposed method on GTZAN dataset.

More »

Table 4 Expand

Fig 6.

Performance metrics of each target music genre on GTZAN dataset.

More »

Fig 6 Expand

Fig 7.

GTZAN normalized confusion matrix (test set).

More »

Fig 7 Expand

Table 5.

Performance comparison with state-of-the-art methods on FMA-SMALL dataset.

More »

Table 5 Expand

Table 6.

Performance comparison of deep learning backbones and proposed method on FMA-SMALL dataset.

More »

Table 6 Expand

Fig 8.

Performance metrics of each target music genre on FMA-SMALL dataset.

More »

Fig 8 Expand

Fig 9.

FMA-SMALL normalized confusion matrix (test set).

More »

Fig 9 Expand

Table 7.

Performance comparison with state-of-the-art methods on FMA-Medium dataset.

More »

Table 7 Expand

Table 8.

Performance comparison of deep learning backbones and proposed method on FMA-Medium dataset.

More »

Table 8 Expand

Fig 10.

Performance metrics of each target music genre on FMA-Medium dataset.

More »

Fig 10 Expand

Fig 11.

FMA-Medium normalized confusion matrix (test set).

More »

Fig 11 Expand

Table 9.

The ablation experiment results on the GTZAN dataset.

More »

Table 9 Expand

Table 10.

The ablation experiment results on the FMA-SMALL dataset.

More »

Table 10 Expand

Table 11.

The ablation experiment results on the FMA-Medium dataset.

More »

Table 11 Expand

Table 12.

Performance comparison of different data augmentation methods on GTZAN and FMA-SMALL datasets.

More »

Table 12 Expand