Skip to main content

Table 8 Objective comparison results of mono-lingual conversion in unseen-to-unseen scenario

From: U2-VC: one-shot voice conversion using two-level nested U-structure

 

MCD (dB)

Predicted MOS by NISQA

 

SF2TF

SF2TM

SM2TF

SM2TM

Average

SF2TF

SF2TM

SM2TF

SM2TM

Average

AdaIN-VC

6.53

6.57

6.59

6.84

6.63

2.97

2.54

2.81

2.98

2.83

AGAIN-VC

5.95

6.03

5.96

6.02

5.99

3.71

3.75

3.82

3.93

3.80

U2-VC

6.01

6.09

6.02

6.03

6.04

4.00

3.95

3.85

3.97

3.94