Skip to main content

Table 1 Speech corpus of Lhasa dialect

From: Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling

Datasets

#Speakers

#Utterances

Hours

Training (Lhasa-TRN)

10M + 7F

36,090

31.9

Development (Lhasa-DEV)

3M + 3F

1,700

1.5

Testing (Lhasa-TST)

3M + 3F

2,664

2.4