Skip to main content

Table 3 WER (%) of the baseline models for all languages

From: Advanced recurrent network-based hybrid acoustic models for low resource speech recognition

Model

WER

 
 

101 Cantonese

104 Pashto

107 Vietnamese

202 Swahili

204 Tamil

302 Kazakh

404 Georgian

DNN-fbank

44.8

51.2

53.1

46.2

66.7

54.1

50.5

LSTM-fbank

40.7

50.5

47.8

42.5

65

52.9

48.9

DNN-MBN

36.1

44.2

44.7

38.9

61.3

48.8

45

LSTM-MBN

35.7

44.9

45

39.6

61.3

49

45.2