Skip to main content

Table 11 Statistical significance of the MOS results of mono-lingual conversion in unseen-to-unseen scenario. “Overall” represents the overall statistical analysis of all the four conversion cases

From: U2-VC: one-shot voice conversion using two-level nested U-structure

  

Statistical significance of similarity

Statistical significance of naturalness

  

SF2TF

SF2TM

SM2TF

SM2TM

Overall

SF2TF

SF2TM

SM2TF

SM2TM

Overall

AdaIN-VC

AGAIN-VC

0.005

0.003

0.000

0.002

0.000

0.003

0.000

0.009

0.000

0.000

 

U2-VC

0.001

0.000

0.000

0.000

0.000

0.000

0.000

0.000

0.000

0.000

AGAIN-VC

AdaIN-VC

0.005

0.003

0.000

0.002

0.000

0.003

0.000

0.009

0.000

0.000

 

U2-VC

0.215

0.063

0.051

0.604

0.009

0.023

0.007

0.045

0.037

0.000

U2-VC

AdaIN-VC

0.001

0.000

0.000

0.000

0.000

0.000

0.000

0.000

0.000

0.000

 

AGAIN-VC

0.215

0.063

0.051

0.604

0.009

0.023

0.007

0.045

0.037

0.000