Deep neural networks for automatic speech processing: a survey from large corpora to limited data

EURASIP Journal on Audio, Speech, and Music Processing

Table 3 State-of-the-art (SOTA) results on IEMOCAP using four emotions (happiness, neutral, anger, and sadness), along with the quantity of data used. Self-supervised results come from the experiments in [7].