Skip to main content

Table 8 Relative improvement depending on the content and style layers in cases of ASR model-based evaluation using 10% UME-ERJ subset

From: Accent modification for speech recognition of non-native speakers using neural style transfer

Style layers Content layers RI(CER)
1–10 6–12 32%
1–8 8–12 29.6%
1–10 10–12 30.1%
1–5 5–12 26.7%
1–4 4–12 15.6%