Contrastive learning for passive acoustic monitoring: A framework for sound source discovery and cross-site comparison in marine soundscapes
Fig 3
Architecture of the proposed PAM-SimCLR framework.
Multiple augmented views (two global and local crops) are processed by a shared ResNet-18 encoder and projection head, with an EMA teacher providing soft multi-positive/negative targets.