DAU-Net: Dual attention-aided U-Net for segmenting tumor in breast ultrasound images

doi:10.1371/journal.pone.0303670

Fig 1.

Sample breast ultrasound images of benign, malignant, and normal types.

More »

Expand

Fig 2.

Block diagram of the proposed DAU-Net model used for segmentation of tumor in breast ultrasound images.

An input image with dimensions 128 × 128 × 1 undergoes feature extraction through the encoder, and the decoder then performs upsampling on the encoded features to predict a binary mask of size 128 × 128 × 1. The in-between connections of the encoder and the decoder are accompanied by the addition of PCBAM and SWA attention mechanisms to enhance the performance.

More »

Expand

Fig 3.

An illustration of the PCBAM attention block.

CBAM and PAM are applied to the input feature F. The addition of the outputs of CBAM and PAM is the output of the PCBAM attention mechanism, F_PCBAM.

More »

Expand

Table 1.

Performance metrics of the segmentation models.

All values are in %. Bold values indicate superior performance. The results are in x(±y) format, where x is the mean and y is the standard deviation of the evaluation metric for the five runs of the model.

More »

Expand

Fig 4.

Results of the ablation study indicate the improvement in model performance with each experimental modification.

GT and PM are Ground Truth and Predicted Mask, respectively. F_c is the heatmap of the bottleneck layer and it demonstrates the improvement of the model’s performance in focusing on the region of interest after the addition of the SWA in the bottleneck layer. F_a and F_b are heatmaps of the features flowing from the first and second encoder layers to the first and second decoder layers via skip connections. It can be seen that F_a and F_b get more enriched with the use of attentions such as CBAM, PAM, and PCBAM.

More »

Expand

Table 2.

Results of the proposed DAU-Net model with 5-fold cross-validation on the BUSI dataset.

More »

Expand

Table 3.

Results of the Mann-Whitney U test of the proposed DAU-Net model used for segmenting tumor regions in breast images of the BUSI dataset.

More »

Expand

Table 4.

Performance metrics of the proposed model with different loss functions.

More »

Expand

Table 5.

Performance comparison with standard segmentation models.

All values are in %. Bold values indicate superior performance.

More »

Expand

Table 6.

Performance comparison with SOTA models.

All values are in %. Bold values indicate superior performance.

More »

Expand

Fig 5.

Results of the proposed segmentation model on images of the BUSI dataset and the heatmaps of SWA and PCBAM layers.

PCBAM₁ corresponds to the PCBAM layer just above the SWA layer, PCBAM₂ corresponds to the PCBAM layer just above PCBAM₁ layer, and PCBAM₃ corresponds to the PCBAM layer just above PCBAM₂ layer.

More »

Expand

Fig 6.

Illustration of some of the failed cases of our model.

The encircled regions are the misclassified segmented masks. GT and PM represent the Ground Truth and Predicted Mask, respectively.

More »

Expand

Fig 7.

Predicted mask and heatmap visualization of the proposed model on the UDIAT dataset.

GT and PM represent the Ground Truth and Predicted Mask, respectively. F _a, F _b, and F _c are the heatmaps of the features flowing from the first and second encoder layers to the first and second decoder layers via skip connections and the bottleneck layer, respectively.

More »

Expand

Table 7.

Performance comparison of the proposed model with past methods on UDIAT dataset.

All values are in %. Bold values indicate superior performance.

More »

Expand