From: Performance vs. hardware requirements in state-of-the-art automatic speech recognition
ASR task
Speech type
Size [h]
# of speakers
Framework
K
P
W
R
N
LibriSpeech [72]
read speech
960
∼2400
✓
WSJ [73]
80
284
TED-LIUM2 [74]
TED talks
207
1242
Switchboard [75]
conversational telephone speech
300
543
Fisher [76]
2742
∼12400