EURASIP Journal on Audio, Speech, and Music Processing

Table 4 Speech quality in terms of SRMR for simulated and real reverberated speech samples as a function of architecture depth on the REVERB-Dev dataset. The last rows represent the mean and standard deviation over the experiments presented in each column

From: Progressive loss functions for speech enhancement with deep neural networks

		Reference systems		Progressive systems
Condition	Blocks depth	CNN	ResNet	P-CNN with WP	P-CNN with UP	P-ResNet with WP	P-ResNet with UP
Simulated	8	7.33	8.23	6.49	7.53	8.31	7.91
	16	7.60	8.27	8.96	7.70	8.41	8.05
	24	8.87	8.14	6.18	8.09	8.03	8.02
	32	7.01	8.56	7.65	7.41	7.98	7.78
Real	8	6.05	6.82	4.90	6.32	7.06	6.91
	16	5.98	5.81	3.74	7.26	7.14	6.85
	24	4.76	5.77	2.07	6.90	6.53	6.91
	32	3.35	6.33	2.33	6.34	5.97	6.62
AVG5 ±STD	8	6.69 ±0.64	7.52 ±0.70	5.69 ±0.79	6.92 ±0.60	7.68 ±0.62	7.41 ±0.50
	16	6.79 ±0.81	7.04 ±1.23	6.35 ±2.61	7.48 ±0.22	7.77 ±0.63	7.45 ±0.60
	24	6.81 ±2.05	6.97 ±1.16	4.12 ±2.05	7.49 ±0.59	7.28 ±0.75	7.46 ±0.55
	32	5.18 ±1.83	7.44 ±1.11	4.99 ±2.66	6.87 ±0.53	6.97 ±1.00	7.20 ±0.58

Bold values show the best result for each condition
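
The summary rows can be checked against the per-condition rows. The following is a minimal sketch, assuming (the source does not state this explicitly) that each AVG ±STD entry is the mean and population standard deviation over the simulated and real SRMR values at the same depth. The dictionary name `srmr_depth8` and the helper `summarize` are illustrative, not from the article; the numbers are copied from the depth-8 rows above.

```python
from statistics import mean, pstdev

# SRMR values from the depth-8 rows of the table: (simulated, real).
srmr_depth8 = {
    "CNN": (7.33, 6.05),
    "ResNet": (8.23, 6.82),
    "P-CNN with WP": (6.49, 4.90),
    "P-CNN with UP": (7.53, 6.32),
    "P-ResNet with WP": (8.31, 7.06),
    "P-ResNet with UP": (7.91, 6.91),
}


def summarize(values):
    """Return (mean, population standard deviation) of the given values.

    Assumption: the table's STD is the population form (divide by N),
    which is what matches the printed values for two conditions.
    """
    return mean(values), pstdev(values)


summary = {name: summarize(vals) for name, vals in srmr_depth8.items()}

for name, (avg, std) in summary.items():
    print(f"{name}: {avg:.2f} ±{std:.2f}")
```

Under this assumption the computed values agree with the depth-8 AVG ±STD row to within 0.01; the small differences in the last digit are consistent with the table truncating rather than rounding.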
