Fig 1.
The architecture of SRD-YOLOv5 model.
Fig 2.
The structure of MSFE module.
Fig 3.
The structure of SSFF module.
Fig 4.
The structure of extremely small target detection module.
Table 1.
Detailed configurations and deployment considerations of the proposed MSFE, SSFF, and ESTDL modules.
Fig 5.
The structure of decoupled head module.
Table 2.
Comparison of target detection accuracy with various state-of-the-art models on the VisDrone2019 dataset.
Results are reported as mean ± 95% confidence interval.
Table 3.
Comparison between SRD-YOLOv5 and YOLO series models on the VisDrone2019 dataset.
Results are reported as mean ± 95% confidence interval. Statistical significance compared to SRD-YOLOv5 is tested by paired t-test (: p<0.05, *: p<0.01, ns: not significant).
Table 4.
Comparison experiments with various classic models on RSOD dataset.
Table 5.
Comparison experiments with various classic models on NWPU VHR-10 dataset.
Table 6.
Performance comparison of YOLOv5n with various modifications.
Table 7.
Effect of initial learning rate (LR) on mAP.
Table 8.
Performance comparison of YOLOv5n and its enhanced versions incorporating the SSFF, MSFE, and ESTDL modules across object sizes.
Fig 6.
Comparison of detection effects of YOLOv5n (left), and SRD-YOLOv5 (right) in dense road scenes, high-altitude small objects, crowded 522 pedestrian scenes, and multi-scale targets captured at night.