Fig. 6
From: U2-VC: one-shot voice conversion using two-level nested U-structure

Comparison among source, target, and the converted spectrograms in cross-lingual scenario(VCC2VCTK). a Source speech. b Target speech. c The converted speech of AGAIN-VC. d The converted speech of U2-VC