Figure 1
From: Sparse coding of the modulation spectrum for noise-robust automatic speech recognition

Feature extraction. The magnitude envelope at the output of each of the 15 gammatone filters is decomposed into nine modulation frequency bands. The speech is thus represented by 9 × 15 = 135-dimensional feature vectors, which are computed every 2.5 ms.
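
The dimension arithmetic in the caption can be illustrated with a minimal sketch. It assumes the modulation-band envelopes are already available as an array of shape (gammatone channel, modulation band, frame); the function name `stack_features`, the array layout, and the random placeholder data are hypothetical and are not taken from the article. Only the counts (15 filters, nine bands, 2.5 ms frame shift) come from the caption.

```python
import numpy as np

N_GAMMATONE = 15         # gammatone filters (from the caption)
N_MOD_BANDS = 9          # modulation frequency bands (from the caption)
FRAME_SHIFT_S = 0.0025   # one feature vector every 2.5 ms (from the caption)


def stack_features(mod_envelopes):
    """Flatten (n_gammatone, n_mod_bands, n_frames) into (n_frames, 135).

    Hypothetical helper: the ordering of the 135 dimensions
    (channel-major, then modulation band) is an assumption.
    """
    n_g, n_m, n_frames = mod_envelopes.shape
    return mod_envelopes.reshape(n_g * n_m, n_frames).T


# Placeholder data standing in for one second of modulation-band envelopes.
rng = np.random.default_rng(0)
n_frames = int(round(1.0 / FRAME_SHIFT_S))  # 400 frames per second
placeholder = rng.standard_normal((N_GAMMATONE, N_MOD_BANDS, n_frames))

features = stack_features(placeholder)
print(features.shape)  # (400, 135)
```

With a 2.5 ms frame shift, one second of speech yields 400 feature vectors, each of dimension 9 × 15 = 135.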