MPLNet: Mamba prompt learning network for semantic segmentation of remote sensing images of traditional villages

doi:10.1371/journal.pone.0341130

Fig 1.

Representative samples from the TV-RSI dataset.

All sampling, annotations, panel composition, and graphics are original works by the authors and are released under CC BY 4.0.

More »

Expand

Fig 2.

Regional distribution of semantic objects.

More »

Expand

Fig 3.

Heat map of the object’s position.

More »

Expand

Fig 4.

Overall architecture diagram of MPLNet.

More »

Expand

Fig 5.

Structure of the Mamba Fusion Module (MFM).

More »

Expand

Table 1.

Experimental results on the TV-RSI dataset (pixel accuracy Acc and IoU per class).

More »

Expand

Fig 6.

Qualitative comparisons on TV-RSI.

Ground truth masks and model outputs are shown for representative tiles. Base imagery (where visible) was obtained from the Geospatial Data Cloud (GF-2) for non-commercial academic use. All overlays, annotations, and panel layouts are original works by the authors and are released under CC BY 4.0.

More »

Expand

Fig 7.

Ablation of the Mamba Fusion Module (MFM).

Visual comparisons among Backbone, MPLNet (w/o MFM), and MPLNet demonstrate sharper boundaries and reduced omissions with MFM. Base imagery (where visible) was obtained from the Geospatial Data Cloud (GF-2) for non-commercial academic use. All derived visualizations are author-original and released under CC BY 4.0.

More »

Expand

Table 2.

Ablation results of MFM.

More »

Expand

Fig 8.

Ablation of prompt learning (PL).

Compared with the student baseline, MPLNet with PL recovers fine structures and reduces confusion in mixed land-cover zones. Base imagery (where visible) was obtained from the Geospatial Data Cloud (GF-2) for non-commercial academic use. All annotations and compositions are original works by the authors and are released under CC BY 4.0.

More »