From: Multimodal voice conversion based on non-negative matrix factorization
Phoneme      Audio NMF    Multimodal NMF
Vowels       1.508        1.620
Consonants   1.477        1.676