From: A new joint CTC-attention-based speech recognition model with multi-level multi-head attention
System | CTC encoder layers | Attention encoder layers | Decoder layers | MTL λ
P1 | 2 | 3 | 1 | 0.3
P2 | | | | 0.2
P3
4