SCP-DETR: A efficient small-object-enhanced feature pyramid approach for PCB defect detection

doi:10.1371/journal.pone.0330039

Fig 1.

The overall architecture of SCP-DETR.

(Compared with RT-DETR-R18, the SCP-DETR replaces the second Fusion module in the neck CCFF with our proposed CO-Fusion, incorporates the additional S2 feature layer processed by SPDConv, and replaces the downsampling operation with PSConv.).

More »

Expand

Fig 2.

The detailed architecture of SPDConv with a scaling factor of 2.

More »

Expand

Fig 3.

Structural comparison between the original Fusion module (top) and our proposed CO-Fusion module (bottom).

In this paper, the number of RepBlocks is set to 3.

More »

Expand

Fig 4.

The structure diagram of CSPOKM.

Here, n represents the number of OKMs used, and to reduce the number of parameters, n is set to 1 in this paper.

More »

Expand

Fig 5.

The overall structure of the Omni-Kernel Module.

FFT and IFFT represent the Fast Fourier Transform and its inverse, respectively.

More »

Expand

Fig 6.

The overall architecture of the pinwheel-shaped convolutional module.

Here, k represents the size of both the surrounding padding and the convolutional kernel, which is set to 3 in this paper. s denotes the stride size, with a value of 2 in this work.

More »

Expand

Fig 7.

The effective receptive field of PSConv when k is 3.

The shades of red represent the effectiveness of the receptive field.

More »

Expand

Fig 8.

The six PCB defect categories in the PKU-Market-PCB dataset.

Since the target defects are too small, the images have been magnified and cropped for better visualization of the defects.

More »

Expand

Fig 9.

The label distribution of the PKU-Market-PCB dataset; (a) Statistical chart of the six defect categories.

(b) Distribution of bounding box sizes in the dataset. (c) Distribution of object center points relative to the entire image. (d) Aspect ratio distribution of target objects relative to the whole image.

More »