Fig. 5From: Introducing phonetic information to speaker embedding for speaker verificationThe simplified c-vector architecture. The additional acoustic model is removed and the phonetic vectors come from the BN layer of the phonetic-discriminant network. The gradient-based training is stopped at the interconnected link between the two sub-networksBack to article page