Skip to main content

Table 11 Statistical significance of the MOS results of mono-lingual conversion in unseen-to-unseen scenario. “Overall” represents the overall statistical analysis of all the four conversion cases

From: U2-VC: one-shot voice conversion using two-level nested U-structure

   Statistical significance of similarity Statistical significance of naturalness
   SF2TF SF2TM SM2TF SM2TM Overall SF2TF SF2TM SM2TF SM2TM Overall
AdaIN-VC AGAIN-VC 0.005 0.003 0.000 0.002 0.000 0.003 0.000 0.009 0.000 0.000
  U2-VC 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
AGAIN-VC AdaIN-VC 0.005 0.003 0.000 0.002 0.000 0.003 0.000 0.009 0.000 0.000
  U2-VC 0.215 0.063 0.051 0.604 0.009 0.023 0.007 0.045 0.037 0.000
U2-VC AdaIN-VC 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
  AGAIN-VC 0.215 0.063 0.051 0.604 0.009 0.023 0.007 0.045 0.037 0.000