Skip to main content

Table 4 Summary and splits of the utilized datasets

From: Accent modification for speech recognition of non-native speakers using neural style transfer

Trained network Utilized dataset
Autoencoder UMEERJ (18,662 samples subset)
Style transfer network UMEERJ (split (0.8/0.1/0.1))
Loss network LibriSpeech (train-clean-360)
CNN-RNN-based ASR LibriSpeech (train-clean-360)
TDNN-based ASR LibriSpeech (train-clean-360)