Fig. 4From: End-to-end speech emotion recognition using a novel context-stacking dilated convolution neural networkThe proposed model variations. a DiCCOSER. b DiCCOSER-CS-V2. c DiCCOSER-CS max. d DiCCOSER-CS rms. “RMS Aggr.” indicates using the RMS aggregationBack to article page