MFDA-YOLO: A multiscale feature fusion and dynamic alignment network for UAV small objects detection

doi:10.1371/journal.pone.0337810

Fig 1.

Structure of the YOLOv8 model.

More »

Expand

Fig 2.

MFDA-YOLO network structure.

More »

Expand

Fig 3.

Structural diagram of the AIFI module.

More »

Expand

Fig 4.

The SPD-Conv specific process when scale = 2.

More »

Expand

Fig 5.

Details of the C-OKM module.

(a): C-OKM. (b): Omni-Kernel module. (c): DCAM. (d): FASM.

More »

Expand

Fig 6.

The structure of the DADH.

More »

Expand

Fig 7.

The principle of the task decomposition.

More »

Expand

Table 1.

Ablation study on hyperparameters of WIoUv3.

More »

Expand

Table 2.

Ablation experiment results of modules on the VisDrone2019-DET-Test.

More »

Expand

Table 3.

Results of different models on the VisDrone2019-DET-Test.

More »

Expand

Fig 8.

Confusion matrix of YOLOv8n.

More »

Expand

Fig 9.

Confusion matrix of MFDA-YOLO.

More »

Expand

Table 4.

Results of different models on the HIT-UAV.

More »

Expand

Table 5.

Results of different models on the NWPU VHR-10.

More »

Expand

Fig 10.

Comparison of detection results across different models on the Visdrone2019 dataset. (The black box demonstrates the MFDA-YOLO’s ability to reduce missed and false detections).

More »

Expand

Fig 11.

Heat map comparison among different models on the HIT-UAV dataset. (The black bounding box highlights that MFDA-YOLO produces markedly more concentrated heat-maps on small objects).

More »

Expand