From: U2-VC: one-shot voice conversion using two-level nested U-structure
Statistical significance of MOS (similarity and naturalness)

System | Compared with | Similarity: SF2TF | Similarity: SF2TM | Similarity: SM2TF | Similarity: SM2TM | Similarity: Overall | Naturalness: SF2TF | Naturalness: SF2TM | Naturalness: SM2TF | Naturalness: SM2TM | Naturalness: Overall |
---|---|---|---|---|---|---|---|---|---|---|---|
AdaIN-VC | AGAIN-VC | 0.000 | 0.015 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 |
AdaIN-VC | U2-VC | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
AGAIN-VC | AdaIN-VC | 0.000 | 0.015 | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 |
AGAIN-VC | U2-VC | 0.380 | 0.094 | 0.083 | 0.007 | 0.003 | 0.086 | 0.014 | 0.018 | 0.038 | 0.000 |
U2-VC | AdaIN-VC | 0.001 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
U2-VC | AGAIN-VC | 0.380 | 0.094 | 0.083 | 0.007 | 0.003 | 0.086 | 0.014 | 0.018 | 0.038 | 0.000 |