From: End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural network
CNN-LSTM [11]
DiCCOSER-CS RECOLA
DiCCOSER-CS IEMOCAP
Number of parameters
≈1300·103
≈475·103
≈430·103