Fig. 3
From: Emotional voice conversion using neural networks with arbitrary scales F0 based on wavelet transform
Gaussian distributions of duration in sentence, phrase, and word