Fig. 4
From: Multi-encoder attention-based architectures for sound recognition with partial visual assistance
Simplified diagram of multi-encoder framework