Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

< Back to Article

Fig 1.

The comparison of DFMNet with SOTA stereo methods on KITTI 2012 [17]and KITTI 2015 [18].

The term 3 − all refers to the percentage of pixels with errors larger than the 3-pixel predictions across all regions, and a lower value is desirable. Similarly, D1 − all represents the percentage of stereo disparity outliers in the first frame across all regions, and a smaller value is preferred.

More »

Fig 1 Expand

Fig 2.

The prediction of Fast-GFM on Scene Flow [19].

More »

Fig 2 Expand

Fig 3.

The architecture of the dual-dimension feature modulation network (DFMNet).

The letter C represents the concatenation operation, while CFV stands for the cost filter volume. DAM, which stands for dual-attention modulation, is a technique that incorporates dual-attention mechanisms into a model to improve performance. DFM Block refers to the dual-dimension feature modulation block. The output of our DFMNet is the prediction map.

More »

Fig 3 Expand

Fig 4.

The structure of dual-attention modulation.

More »

Fig 4 Expand

Fig 5.

The structure of Fast-GFM.

More »

Fig 5 Expand

Fig 6.

The structure of SE block.

More »

Fig 6 Expand

Fig 7.

Evaluation results of DFMNet on Scene Flow [19] and ETH3D [33].

Bad1.0 and Bad2.0 represent the proportion of pixels whose prediction differs from the ground truth by more than 1.0 and 2.0, respectively. These metrics are used to evaluate the accuracy of the predictions, and lower values indicate better performance.

More »

Fig 7 Expand

Fig 8.

Evaluation results of Fast-GFM on Scene Flow [19].

EPE is used to evaluate the accuracy of the predictions, and lower values indicate better performance.

More »

Fig 8 Expand

Table 1.

Quantitative evaluation on KITTI [17, 18].

More »

Table 1 Expand

Fig 9.

Results on KITTI 2012 [17].

The matching results of interference area marked in red box.

More »

Fig 9 Expand

Fig 10.

Results on KITTI 2015 [18].

The matching results of interference area marked in red box.

More »

Fig 10 Expand

Fig 11.

Results of DFMNet on ETH3D [33].

More »

Fig 11 Expand

Table 2.

Quantitative evaluation of Fast-GFM on ETH3D [33] and Middlebury [34].

More »

Table 2 Expand

Fig 12.

Results of Fast-GFM on Middlebury [34].

More »

Fig 12 Expand

Fig 13.

Universal study of CFV on Scene Flow [19].

CFV represents the cost filter volume used in our method. PSMNetCFV, which replaces the concatenated volume with CFV, and GwcNetCFV, which replaces the combined volume with CFV.

More »

Fig 13 Expand

Fig 14.

Comparisons between CFV and CAS [41] on Scene Flow [19].

It can be made by evaluating the performance of two different approaches: GwcNetCAS, which utilizes the Cascaded Cost Volume (CAS) [41] as a replacement for the combined volume, and GwcNetCFV, which replaces the volume with our CFV.

More »

Fig 14 Expand

Table 3.

Performance of TOP-K when using different K on Scene Flow [19].

More »

Table 3 Expand

Table 4.

Analysis of performance with different numbers of hourglasses on Scene Flow [19].

More »

Table 4 Expand

Table 5.

Analysis of accuracy and runtime with different numbers of hourglass structures on Scene Flow [19].

More »

Table 5 Expand

Table 6.

Ablation study of DFMNet on Scene Flow [19].

More »

Table 6 Expand

Table 7.

Ablation study of DFMNet on Scene Flow [19].

More »

Table 7 Expand