DEF-Net: A dual-modal feature enhancement and fusion network for infrared and visible object detection | PLOS One

Advertisement

Browse Subject Areas

?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1 — Fig 1.

Dual-modal object detection based on deep learning.

More »

Fig 2 — Fig 2.

Overall framework of the proposed DEF-Net.

More »

Fig 3 — Fig 3.

The backbone network architecture of Darknet53.

More »

Fig 4 — Fig 4.

Dual-branch backbone structure.

More »

Fig 5 — Fig 5.

Feature interaction and enhancement structure.

More »

Fig 6 — Fig 6.

Cross attention fusion network structure.

More »

Fig 7 — Fig 7.

Illustration of Cross attention weight.

More »

Table 1 — Table 1.

Training Hyperparameter Configuration.

More »

Fig 8 — Fig 8.

SYUGV Datasets.

More »

Table 2 — Table 2.

Comparative experimental results of different models.

More »

Fig 9 — Fig 9.

Comparison of model training on the SYUGV dataset.

More »

Fig 10 — Fig 10.

Comparison of detection effects of different models on the SYUGV dataset.

More »

Fig 11 — Fig 11.

Comparison of model training on the LLVIP dataset.

More »

Table 3 — Table 3.

Comparative experimental results of different models.

More »

Fig 12 — Fig 12.

Comparison of detection effects of different models on the LLVIP dataset.

More »

Table 4 — Table 4.

Model detection performance of different modal inputs.

More »

Fig 13 — Fig 13.

P-R curves of different modal inputs.

More »

Fig 14 — Fig 14.

Grad-CAM heatmap of dual branch model.

More »

Table 5 — Table 5.

Model detection performance of different backbone networks.

More »

Table 6 — Table 6.

Ablation study of different module combinations on the SYUGV dataset.

More »

Table 7 — Table 7.

Ablation study of different module combinations on the LLVIP dataset.

More »

Fig 15 — Fig 15.

mAP@0.5 curves of ablation studies on the(a) SYUGV and (b) LLVIP datasets.

More »

Fig 16 — Fig 16.

Visualization of feature activation (heatmaps) for different model variants.

More »