From: Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling
Method
Example
Segment count
Word
3
Morfessor
6
BPE
Unigram
5
Syllable
9
S-BPE
4