Multimicrophone Speech Dereverberation: Experimental Validation

Eneman, Koen; Moonen, Marc

doi:10.1155/2007/51831

Research Article
Open access
Published: 02 May 2007

Multimicrophone Speech Dereverberation: Experimental Validation

Koen Eneman^1,2 &
Marc Moonen³

EURASIP Journal on Audio, Speech, and Music Processing volume 2007, Article number: 051831 (2007) Cite this article

1658 Accesses
10 Citations
Metrics details

Abstract

Dereverberation is required in various speech processing applications such as handsfree telephony and voice-controlled systems, especially when signals are applied that are recorded in a moderately or highly reverberant environment. In this paper, we compare a number of classical and more recently developed multimicrophone dereverberation algorithms, and validate the different algorithmic settings by means of two performance indices and a speech recognition system. It is found that some of the classical solutions obtain a moderate signal enhancement. More advanced subspace-based dereverberation techniques, on the other hand, fail to enhance the signals despite their high-computational load.

[1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32]

References

Bees D, Blostein M, Kabal P: Reverberant speech enhancement using cepstral processing. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '91), May 1991, Toronto, Ontario, Canada 2: 977-980.
Google Scholar
Liu Q-G, Champagne B, Kabal P: A microphone array processing technique for speech enhancement in a reverberant space. Speech Communication 1996,18(4):317-334. 10.1016/0167-6393(96)00011-8
Article Google Scholar
Oppenheim A, Schafer R: Digital Signal Processing. Prentice-Hall, Englewood Cliffs, NJ, USA; 1975. chapter 10
MATH Google Scholar
Petropulu AP, Subramaniam S: Cepstrum based deconvolution for speech dereverberation. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '94), April 1994, Adelaide, Australia 1: 9-12.
Google Scholar
Allen JB, Berkley DA, Blauert J: Multimicrophone signal-processing technique to remove room reverberation from speech signals. The Journal of the Acoustical Society of America 1977,62(4):912-915. 10.1121/1.381621
Article Google Scholar
Wittkop T, Hohmann V: Strategy-selective noise reduction for binaural digital hearing aids. Speech Communication 2003,39(1-2):111-138. 10.1016/S0167-6393(02)00062-6
Article MATH Google Scholar
Miyoshi M, Kaneda Y: Inverse filtering of room acoustics. IEEE Transactions on Acoustics, Speech, and Signal Processing 1988,36(2):145-152. 10.1109/29.1509
Article Google Scholar
Nelson PA, Orduna-Bustamante F, Hamada H: Inverse filter design and equalization zones in multichannel sound reproduction. IEEE Transactions on Speech and Audio Processing 1995,3(3):185-192. 10.1109/89.388144
Article Google Scholar
Grenier Y: A microphone array for car environments. Speech Communication 1993,12(1):25-39. 10.1016/0167-6393(93)90016-E
Article MathSciNet Google Scholar
Sydow C: Broadband beamforming for a microphone array. The Journal of the Acoustical Society of America 1994,96(2):845-849. 10.1121/1.410323
Article Google Scholar
van Veen BD, Buckley KM: Beamforming: a versatile approach to spatial filtering. IEEE ASSP Magazine 1988,5(2):4-24.
Article Google Scholar
Affes S, Grenier Y: A signal subspace tracking algorithm for microphone array processing of speech. IEEE Transactions on Speech and Audio Processing 1997,5(5):425-437. 10.1109/89.622565
Article Google Scholar
Moulines E, Duhamel P, Cardoso J-F, Mayrargue S: Subspace methods for the blind identification of multichannel FIR filters. IEEE Transactions on Signal Processing 1995,43(2):516-525. 10.1109/78.348133
Article Google Scholar
van der Veen A-J, Talwar S, Paulraj A: Blind identification of FIR channels carrying multiple finite alphabet signals. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '95), May 1995, Detroit, Mich, USA 2: 1213-1216.
Google Scholar
Gannot S, Moonen M: Subspace methods for multi-microphone speech dereverberation. Proceedings of the 7th IEEE/EURASIP International Workshop on Acoustic Echo and Noise Control (IWAENC '01), September 2001, Darmstadt, Germany 47-50.
Google Scholar
Kuttruff H: Room Acoustics. 2nd edition. Applied Science Publishers, Essex, England; 1979.
Google Scholar
Johnson D, Dudgeon D: Array Signal Processing. Prentice-Hall, Englewood Cliffs, NJ, USA; 1993.
MATH Google Scholar
van Gerven D, van Compernolle S, Wauters P, Verstraeten W, Eneman K, Delaet K: Multiple beam broadband beamforming: filter design and real-time implementation. Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA '95), October 1995, New Paltz, NY, USA 173-176.
Google Scholar
Giannakis GB, Halford SD: Blind fractionally spaced equalization of noisy FIR channels: direct and adaptive solutions. IEEE Transactions on Signal Processing 1997,45(9):2277-2292. 10.1109/78.622950
Article Google Scholar
Eneman K, Moonen M: Ambiguity elimination in frequency-domain subspace identification. In Internal Report ESAT-SCD 06.151. Katholieke Universiteit Leuven, Leuven, Belgium; 2007:12. https://gilbert.med.kuleuven.be/~koen/reports/06-151.pdf
Google Scholar
Eneman K, Moonen M: DFT modulated filter bank design for oversampled subband systems. Signal Processing 2001,81(9):1947-1973. 10.1016/S0165-1684(01)00084-6
Article MATH Google Scholar
Vaidyanathan P: Multirate Systems and Filter Banks. Prentice-Hall, Englewood Cliffs, NJ, USA; 1993.
MATH Google Scholar
Rahbar K, Reilly JP, Manton JH: A frequency domain approach to blind identification of MIMO FIR systems driven by quasi-stationary signals. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '02), May 2002, Orlando, Fla, USA 2: 1717-1720.
Google Scholar
Eneman K, Moonen M: Hybrid subband/frequency-domain adaptive systems. Signal Processing 2001,81(1):117-136. 10.1016/S0165-1684(00)00197-3
Article MATH Google Scholar
Doclo S, Moonen M: Combined frequency-domain dereverberation and noise reduction technique for multi-microphone speech enhancement. Proceedings of the 7th IEEE/EURASIP International Workshop on Acoustic Echo and Noise Control (IWAENC '01), September 2001, Darmstadt, Germany 31-34.
Google Scholar
Allen JB, Berkley DA: Image method for efficiently simulating small-room acoustics. The Journal of the Acoustical Society of America 1979,65(4):943-950. 10.1121/1.382599
Article Google Scholar
Eneman K, Duchateau J, Moonen M, van Compernolle D, van Hamme H: Assessment of dereverberation algorithms for large vocabulary speech recognition systems. Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech '03), September 2003, Geneva, Switzerland 2689-2692.
Google Scholar
Duchateau J, Demuynck K, van Compernolle D, Wambacq P: Class definition in discriminant feature analysis. Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech '01), September 2001, Aalborg, Denmark 3: 1621-1624.
Google Scholar
Demuynck K, Duchateau J, van Compernolle D, Wambacq P: Improved feature decorrelation for HMM-based speech recognition. Proceedings of the 5th International Conference on Spoken Language Processing (ICSLP '98), November-December 1998, Sydney, Australia 7: 2907-2910.
Google Scholar
Duchateau J, Demuynck K, van Compernolle D: Fast and accurate acoustic modelling with semi-continuous HMMs. Speech Communication 1998,24(1):5-17. 10.1016/S0167-6393(98)00002-8
Article Google Scholar
Demuynck K, Duchateau J, van Compernolle D, Wambacq P: An efficient search space representation for large vocabulary continuous speech recognition. Speech Communication 2000,30(1):37-53. 10.1016/S0167-6393(99)00030-8
Article Google Scholar
Bees D: Enhancement of acoustically reverberant speech using cepstral methods, Ph.D. thesis. McGill University, Montreal, Canada; 1990.
Google Scholar

Download references

Author information

Authors and Affiliations

ExpORL, Department of Neurosciences, Katholieke Universiteit Leuven, O & N 2, Herestraat 49 bus 721, Leuven, 3000, Belgium
Koen Eneman
GroupT Leuven Engineering School, Vesaliusstraat 13, Leuven, 3000, Belgium
Koen Eneman
SCD, Department of Electrical Engineering (ESAT), Faculty of Engineering, Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, Leuven, 3001, Belgium
Marc Moonen

Authors

Koen Eneman
View author publications
You can also search for this author in PubMed Google Scholar
Marc Moonen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Koen Eneman.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Eneman, K., Moonen, M. Multimicrophone Speech Dereverberation: Experimental Validation. J AUDIO SPEECH MUSIC PROC. 2007, 051831 (2007). https://doi.org/10.1155/2007/51831

Download citation

Received: 06 September 2006
Revised: 09 January 2007
Accepted: 10 April 2007
Published: 02 May 2007
DOI: https://doi.org/10.1155/2007/51831

Multimicrophone Speech Dereverberation: Experimental Validation

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords