
Table 6 Objective comparison results of mono-lingual conversion in seen-to-seen scenario

From: U2-VC: one-shot voice conversion using two-level nested U-structure

 

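| Method | MCD (dB): SF2TF | MCD (dB): SF2TM | MCD (dB): SM2TF | MCD (dB): SM2TM | MCD (dB): Average | NISQA MOS: SF2TF | NISQA MOS: SF2TM | NISQA MOS: SM2TF | NISQA MOS: SM2TM | NISQA MOS: Average |
|---|---|---|---|---|---|---|---|---|---|---|
| AdaIN-VC | 7.11 | 6.62 | 7.09 | 6.97 | 6.95 | 3.01 | 2.94 | 3.07 | 3.53 | 3.14 |
| AGAIN-VC | 6.33 | 6.07 | 6.32 | 6.33 | 6.26 | 3.87 | 3.63 | 3.93 | 4.02 | 3.86 |
| U2-VC | 6.36 | 6.11 | 6.32 | 6.39 | 6.29 | 4.13 | 3.93 | 4.14 | 4.05 | 4.06 |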
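
As a quick sanity check on the transcribed numbers, the sketch below recomputes each "Average" entry from the four per-scenario values. It assumes the Average column is the unweighted mean of SF2TF, SF2TM, SM2TF and SM2TM, which the table itself does not state; the names `TABLE_6`, `mean_gap` and `check_averages` are hypothetical and used only for illustration.

```python
# Values transcribed from Table 6. Each list holds SF2TF, SF2TM, SM2TF, SM2TM,
# followed by the reported Average.
TABLE_6 = {
    "AdaIN-VC": {"mcd": [7.11, 6.62, 7.09, 6.97, 6.95],
                 "mos": [3.01, 2.94, 3.07, 3.53, 3.14]},
    "AGAIN-VC": {"mcd": [6.33, 6.07, 6.32, 6.33, 6.26],
                 "mos": [3.87, 3.63, 3.93, 4.02, 3.86]},
    "U2-VC":    {"mcd": [6.36, 6.11, 6.32, 6.39, 6.29],
                 "mos": [4.13, 3.93, 4.14, 4.05, 4.06]},
}


def mean_gap(values):
    """Absolute gap between the mean of the scenario values and the reported average."""
    *scenarios, reported = values
    return abs(sum(scenarios) / len(scenarios) - reported)


def check_averages(table, tol=0.01):
    """Map (method, metric) to True when the reported average is within tol.

    Assumption: the Average column is a plain unweighted mean. The tolerance
    allows for the published values being rounded to two decimals.
    """
    return {
        (method, metric): mean_gap(values) <= tol
        for method, row in table.items()
        for metric, values in row.items()
    }


if __name__ == "__main__":
    for key, ok in check_averages(TABLE_6).items():
        print(key, "consistent" if ok else "mismatch")
```

Under that assumption, every reported average agrees with the recomputed mean to within rounding, so the reconstructed rows and columns appear to be aligned correctly.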