U2-VC: one-shot voice conversion using two-level nested U-structure

EURASIP Journal on Audio, Speech, and Music Processing

Table 2 Objective evaluation results of the ablation study on architecture in seen-to-seen conversion scenario. “AGAIN-VC” represents the network has neither U²-Net structure nor SaAdaIN. “U²-VC” represents the network has both U²-Net structure and SaAdaIN