Fig. 2From: A new joint CTC-attention-based speech recognition model with multi-level multi-head attentionHigh-level features extraction using CNMFBack to article page