From: U2-VC: one-shot voice conversion using two-level nested U-structure
Statistical significance of similarity | Statistical significance of naturalness | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
SF2TF | SF2TM | SM2TF | SM2TM | Overall | SF2TF | SF2TM | SM2TF | SM2TM | Overall | ||
AdaIN-VC | AGAIN-VC | 0.005 | 0.003 | 0.000 | 0.002 | 0.000 | 0.003 | 0.000 | 0.009 | 0.000 | 0.000 |
U2-VC | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | |
AGAIN-VC | AdaIN-VC | 0.005 | 0.003 | 0.000 | 0.002 | 0.000 | 0.003 | 0.000 | 0.009 | 0.000 | 0.000 |
U2-VC | 0.215 | 0.063 | 0.051 | 0.604 | 0.009 | 0.023 | 0.007 | 0.045 | 0.037 | 0.000 | |
U2-VC | AdaIN-VC | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
AGAIN-VC | 0.215 | 0.063 | 0.051 | 0.604 | 0.009 | 0.023 | 0.007 | 0.045 | 0.037 | 0.000 |