Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling

EURASIP Journal on Audio, Speech, and Music Processing

Table 1 Speech corpus of Lhasa dialect

Datasets	#Speakers	#Utterances	Hours
Training (Lhasa-TRN)	10M + 7F	36,090	31.9
Development (Lhasa-DEV)	3M + 3F	1,700	1.5
Testing (Lhasa-TST)	3M + 3F	2,664	2.4