Figure 1
From: Speaker-dependent model interpolation for statistical emotional speech synthesis

The block diagram of a general HMM-based speech synthesis system. In the training phase we train the parameters in the HMMs given a labeled speech data set. In the synthesis phase, we use the model parameters to generate speech given text.