Fig. 3From: Multi-encoder attention-based architectures for sound recognition with partial visual assistanceSimplified diagram of encoder-decoder conformer model for sound recognitionBack to article page