Fig. 2
From: Multi-encoder attention-based architectures for sound recognition with partial visual assistance
Diagram of conformer encoder model for sound recognition