DE-HRNet: Detail enhanced high-resolution network for human pose estimation

doi:10.1371/journal.pone.0325540

Fig 1.

The HRNet architecture.

The multi-resolution stage modules are marked with blue color areas. The remained three stages module consists of parallel multi-resolution subnetworks with multi-resolution information interactions.

More »

Expand

Fig 2.

The channel attention mechanism.

The top block (a) is ECA block, which consists of average pooling, a fast 1D convolution of size k and a sigmoid activation. The bottom block (b) is SE block, which consists of average pooling, two fully-connected (FC) layers and a sigmoid activation.

More »

Expand

Fig 3.

The DE-HRNet structure.

The dySample and the Detail Enhancement Module (DEM) are applied to the HRNet [5,9], further implementation details are provided in Section 3.1.

More »

Expand

Fig 4.

The structure of Detail Enhancement Module.

The module is designed based on SE block. Additionally, the global average pooling and dropout [25] technology are located before and after the SE block, respectively. Dashed lines denote selective identity mappings.

More »