Steel surface defect segmentation with SME-DeeplabV3+

doi:10.1371/journal.pone.0329628

Fig 1.

Schematic of the improved DeepLabv3+ model.

The ASPP module is replaced with a multiscale attention aggregation (MSAA) mechanism, and the ELA module is added to the shallow feature extraction part to enhance feature extraction.

Fig 2.

StarNet structure diagram.

StarNet follows a traditional hierarchical design: convolutional layers downsample the resolution directly, the number of channels doubles at each stage, and multiple star blocks are repeated to extract features.
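As a rough illustration of the hierarchy described in this caption, the sketch below tracks how resolution and channel count change from stage to stage. The input size (224), base width (32), stem stride (2), and stage count (4) are assumed values chosen for the example and are not taken from the figure.

```python
def stage_shapes(input_size, base_channels, num_stages, stem_stride=2):
    """Return (resolution, channels) per stage of a hierarchical backbone.

    Hypothetical helper: the stem stride and the per-stage stride of 2
    are assumptions, not values stated in the caption.
    """
    shapes = []
    resolution = input_size // stem_stride  # stem convolution reduces resolution first
    channels = base_channels
    for _ in range(num_stages):
        resolution //= 2                    # each stage downsamples with a strided convolution
        shapes.append((resolution, channels))
        channels *= 2                       # channel count doubles for the next stage
    return shapes


print(stage_shapes(224, 32, 4))
# [(56, 32), (28, 64), (14, 128), (7, 256)]
```

Under these assumptions, each stage halves the spatial resolution while doubling the channel width, so the deepest stage sees a 7 × 7 map with 256 channels.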

Fig 3.

Structure diagram of the star blocks.

A comparison between the star operation and the summation operation shows that the star operation consistently outperforms the summation operation, especially in narrower networks; this is attributed to its ability to map inputs to high-dimensional space without expanding the network width.
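One way to see why the star operation reaches a higher-dimensional space at a fixed width is to compare it with summation on a single output unit. The sketch below is a simplified, bias-free illustration with hypothetical function names: multiplying two linear projections of a d-dimensional input yields a combination of all pairwise products x_i * x_j, of which there are d(d+1)/2 distinct ones, whereas summing the two projections remains linear in the d inputs.

```python
from itertools import combinations_with_replacement


def project(x, w):
    """Linear projection w^T x (bias omitted for simplicity)."""
    return sum(wi * xi for wi, xi in zip(w, x))


def star(x, w1, w2):
    """Star operation: element-wise product of two linear branches."""
    return project(x, w1) * project(x, w2)


def summation(x, w1, w2):
    """Summation: the two branches are added, so the result stays linear in x."""
    return project(x, w1) + project(x, w2)


def implicit_dims_star(d):
    """Number of distinct pairwise terms x_i * x_j produced by one star operation."""
    return len(list(combinations_with_replacement(range(d), 2)))


x = [1.0, 2.0]
print(star(x, [1.0, 0.0], [0.0, 1.0]))       # 2.0, the cross term x_0 * x_1
print(summation(x, [1.0, 0.0], [0.0, 1.0]))  # 3.0, just x_0 + x_1
print(implicit_dims_star(4))                 # 10 implicit features from 4 channels
```

The implicit feature count grows quadratically with the channel count for a single star operation, which is consistent with the caption's remark that the benefit is most visible in narrower networks.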

Fig 4.

Structural diagram of the multiscale attention aggregation (MSAA) module.

This module uses different sizes of convolution kernels, such as 3 × 3, 5 × 5, and 7 × 7, to perform multiscale analysis on feature maps, with small convolution kernels capturing fine details of small defects and large convolution kernels focusing on the overall shape of large defects.
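The effect of kernel size can be sketched with a toy example. The code below is not the MSAA module: it replaces the learned convolutions with fixed mean filters and replaces the attention-based aggregation with a plain average, purely to show how 3 × 3, 5 × 5, and 7 × 7 windows respond to a one-pixel "defect". All function names are hypothetical.

```python
def box_filter(image, k):
    """Apply a k x k mean filter with zero padding (output size equals input size)."""
    h, w = len(image), len(image[0])
    r = k // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0.0
            for di in range(-r, r + 1):
                for dj in range(-r, r + 1):
                    y, x = i + di, j + dj
                    if 0 <= y < h and 0 <= x < w:  # pixels outside the image count as zero
                        total += image[y][x]
            out[i][j] = total / (k * k)
    return out


def multiscale(image, kernel_sizes=(3, 5, 7)):
    """Average the responses of several kernel sizes (stand-in for learned aggregation)."""
    branches = [box_filter(image, k) for k in kernel_sizes]
    h, w = len(image), len(image[0])
    return [[sum(b[i][j] for b in branches) / len(branches) for j in range(w)]
            for i in range(h)]


# A 9 x 9 image containing a single bright pixel at the centre.
img = [[0.0] * 9 for _ in range(9)]
img[4][4] = 1.0

for k in (3, 5, 7):
    print(k, box_filter(img, k)[4][4])  # response at the defect shrinks as the window grows
```

In this toy setting the small window keeps the strongest response at the isolated pixel, while the larger windows spread it over a wider neighbourhood, which mirrors the division of labour between small and large kernels described in the caption.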

Fig 5.

ELA module structure diagram.

The structure of the ELA module and the process of performing multiscale analysis on feature maps via convolution kernels and related operations are shown.

Fig 6.

Sample images of steel defects.

From left to right are sample images of inclusions, patches, and scratches.

Table 1.

Performance comparison of the model with other semantic segmentation models.

Fig 7.

Charts comparing the mIoU values.
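For readers unfamiliar with the metric, mIoU (mean intersection over union) is commonly computed as the per-class ratio of correctly predicted pixels to the union of predicted and ground-truth pixels, averaged over classes. The sketch below works on flattened label lists and skips classes that appear in neither prediction nor ground truth; that skipping rule is an assumption of this example, and the labels are made-up toy data rather than results from the paper.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection over union for flattened integer label sequences."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both prediction and ground truth
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes (illustrative labels only).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(mean_iou(pred, target, 3))
```

Here the per-class IoU values are 1/3, 2/3, and 1/2, so the printed mean is 0.5 up to floating-point rounding.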

Table 2.

Computational complexity and model size of StarNet with modules.

Table 3.

Comparison of StarNet as the backbone network combined with modules.

Fig 8.

Curve of the change in model loss function.

Fig 9.

Comparative visualization of defect segmentation results (from left to right: original image, predictions of the improved model, ground truth, MobileNetV2 baseline, Xception baseline).
