From: Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling
Datasets | #Speakers | #Utterances | Hours |
---|---|---|---|
Training (Lhasa-TRN) | 10M + 7F | 36,090 | 31.9 |
Development (Lhasa-DEV) | 3M + 3F | 1,700 | 1.5 |
Testing (Lhasa-TST) | 3M + 3F | 2,664 | 2.4 |