Figure 1 | EURASIP Journal on Audio, Speech, and Music Processing

Figure 1

From: Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions

Performance of an ideal binary mask, tested on 12 pairs of same- and mixed-gender speakers. Performance is shown in terms of SDR and SIR improvement for frame lengths (NFFT) of 256, 512, 1024, and 2048 samples. As the SNR threshold increases, the SDR curves (red) decrease monotonically, while the SIR improvement (green) shows a more pronounced monotonic increase.
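To illustrate the role of the SNR threshold in the caption, the following is a minimal sketch of an ideal binary mask. It assumes the usual definition, in which a time-frequency cell is kept when its local SNR in dB exceeds the threshold; the exact definition used in the article is not given on this page. The function name `ideal_binary_mask`, its parameters, and the toy power values are hypothetical and chosen only for illustration.

```python
import math


def ideal_binary_mask(target_power, interferer_power, threshold_db=0.0):
    """Return a 0/1 mask over time-frequency cells.

    Assumed rule (not taken from the article): a cell is set to 1 when the
    local SNR, 10*log10(target / interferer), is strictly above threshold_db.
    Inputs are nested lists of per-cell power values with identical shapes.
    """
    eps = 1e-12  # guards against division by zero and log of zero
    mask = []
    for target_row, interferer_row in zip(target_power, interferer_power):
        row = []
        for t, i in zip(target_row, interferer_row):
            local_snr_db = 10.0 * math.log10((t + eps) / (i + eps))
            row.append(1 if local_snr_db > threshold_db else 0)
        mask.append(row)
    return mask


# Toy 2x2 "spectrograms" (hypothetical power values, not data from the figure).
target = [[4.0, 1.0], [0.5, 9.0]]
interferer = [[1.0, 1.0], [2.0, 1.0]]

mask_0db = ideal_binary_mask(target, interferer, threshold_db=0.0)
mask_8db = ideal_binary_mask(target, interferer, threshold_db=8.0)
```

In this toy case, raising the threshold from 0 dB to 8 dB retains fewer cells. That fits the trend described in the caption: a stricter mask passes less interference (higher SIR improvement) but also discards more of the target signal (lower SDR).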
