Enhancing UAV object detection with an efficient multi-scale feature fusion framework

doi:10.1371/journal.pone.0332408

Fig 1.

The architecture of the SRD-YOLOv5 model.

Fig 2.

The structure of the MSFE module.

Fig 3.

The structure of the SSFF module.

Fig 4.

The structure of the extremely small target detection module.

Table 1.

Detailed configurations and deployment considerations of the proposed MSFE, SSFF, and ESTDL modules.

Fig 5.

The structure of the decoupled head module.

Table 2.

Comparison of target detection accuracy with various state-of-the-art models on the VisDrone2019 dataset.

Results are reported as mean ± 95% confidence interval.

Table 3.

Comparison between SRD-YOLOv5 and YOLO series models on the VisDrone2019 dataset.

Results are reported as mean ± 95% confidence interval. Statistical significance relative to SRD-YOLOv5 is assessed with a paired t-test (*: p<0.05, **: p<0.01, ns: not significant).

Table 4.

Comparison experiments with various classic models on the RSOD dataset.

Table 5.

Comparison experiments with various classic models on the NWPU VHR-10 dataset.

Table 6.

Performance comparison of YOLOv5n with various modifications.

Table 7.

Effect of initial learning rate (LR) on mAP.

Table 8.

Performance comparison of YOLOv5n and its enhanced versions incorporating the SSFF, MSFE, and ESTDL modules across object sizes.

Fig 6.

Comparison of detection results of YOLOv5n (left) and SRD-YOLOv5 (right) in dense road scenes, high-altitude small objects, crowded pedestrian scenes, and multi-scale targets captured at night.
