Skip to main content

Table 1 Speech corpus of Lhasa dialect

From: Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling

Datasets #Speakers #Utterances Hours
Training (Lhasa-TRN) 10M + 7F 36,090 31.9
Development (Lhasa-DEV) 3M + 3F 1,700 1.5
Testing (Lhasa-TST) 3M + 3F 2,664 2.4