Skip to main content

Table 8 Objective comparison results of mono-lingual conversion in unseen-to-unseen scenario

From: U2-VC: one-shot voice conversion using two-level nested U-structure

  MCD (dB) Predicted MOS by NISQA
  SF2TF SF2TM SM2TF SM2TM Average SF2TF SF2TM SM2TF SM2TM Average
AdaIN-VC 6.53 6.57 6.59 6.84 6.63 2.97 2.54 2.81 2.98 2.83
AGAIN-VC 5.95 6.03 5.96 6.02 5.99 3.71 3.75 3.82 3.93 3.80
U2-VC 6.01 6.09 6.02 6.03 6.04 4.00 3.95 3.85 3.97 3.94