Fig. 2
From: Transformer-based ensemble method for multiple predominant instruments recognition in polyphonic music
(a) Block diagram of the proposed method of Vision transformer, (b) Internal architecture of transformer encoder