Article type: Research Article
Authors: Wang, Jie [a, b] | Yan, Linhuang [b, c] | Tian, Jiayi [b, c] | Yuan, Minmin [d, *]
Affiliations: [a] Huangpu Research Institute, Guangzhou University, Guangzhou, China | [b] School of Electronics and Communication Engineering, Guangzhou University, Guangzhou, China | [c] Linköping University - Guangzhou University Research Center on Urban Sustainable Development, Guangzhou University, Guangzhou, China | [d] Research Institute of Highway Ministry of Transport, Beijing, China
Correspondence: [*] Corresponding author. Minmin Yuan, Research Institute of Highway Ministry of Transport, Beijing, China. E-mail: mm.yuan@rioh.cn.
Abstract: This paper proposes a bilateral spectrogram filtering (BSF)-based optimally modified log-spectral amplitude (OMLSA) estimator for single-channel speech enhancement. The method preprocesses the spectrogram of the noisy speech with bilateral filtering (BF), a technique widely used in image and visual processing, and significantly improves the performance of OMLSA, especially in highly non-stationary noise environments. Treating the speech spectrogram as an image, BSF sharpens details and removes unwanted textures or background noise while preserving edges. The a posteriori signal-to-noise ratio (SNR) of the OMLSA algorithm is then estimated after applying BSF to the noisy speech. To reduce computational cost, a fast and accurate BF is adopted, which reduces the algorithm complexity to O(1) per time-frequency bin. Finally, the proposed algorithm is compared with the original OMLSA and other classic denoising methods under various noise types and SNRs, using objective evaluation metrics such as segmental SNR improvement and perceptual evaluation of speech quality. The results show the validity of the improved BSF-based OMLSA algorithm.
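The idea of treating a spectrogram as an image and smoothing it with a bilateral filter can be illustrated with a minimal sketch. This is not the authors' implementation: it is a brute-force bilateral filter (cost grows with the window size, unlike the fast O(1) variant mentioned in the abstract), and the function name `bilateral_spectrogram_filter` and the parameters `radius`, `sigma_s`, and `sigma_r` are assumptions chosen for illustration only.

```python
import numpy as np


def bilateral_spectrogram_filter(spec, radius=2, sigma_s=1.5, sigma_r=0.5):
    """Brute-force bilateral filter over a 2-D (frequency x time) array.

    Hypothetical helper for illustration; names and defaults are assumptions.
    spec    : 2-D array, e.g. a log-magnitude spectrogram
    radius  : half-width of the square neighbourhood, in bins
    sigma_s : spread of the spatial (time-frequency distance) Gaussian
    sigma_r : spread of the range (value difference) Gaussian
    """
    spec = np.asarray(spec, dtype=float)
    rows, cols = spec.shape
    # Replicate border values so every bin has a full neighbourhood.
    padded = np.pad(spec, radius, mode="edge")
    num = np.zeros_like(spec)
    den = np.zeros_like(spec)
    for dk in range(-radius, radius + 1):
        for dl in range(-radius, radius + 1):
            # Neighbour at offset (dk, dl) for every bin at once.
            shifted = padded[radius + dk:radius + dk + rows,
                             radius + dl:radius + dl + cols]
            # Spatial weight: depends only on the offset.
            w_spatial = np.exp(-(dk * dk + dl * dl) / (2.0 * sigma_s ** 2))
            # Range weight: small when the neighbour differs strongly in
            # value, which is what keeps edges from being blurred.
            w_range = np.exp(-((shifted - spec) ** 2) / (2.0 * sigma_r ** 2))
            w = w_spatial * w_range
            num += w * shifted
            den += w
    # The centre bin always contributes weight 1, so den is never zero.
    return num / den
```

Calling `bilateral_spectrogram_filter(S)` on a 2-D array `S` returns an array of the same shape. In a pipeline like the one the abstract describes, the filtered spectrogram would feed the a posteriori SNR estimate, but how that coupling is done is not specified here and is left out of the sketch.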
Keywords: Speech enhancement, bilateral filtering, optimally modified log-spectral amplitude, bilateral spectrogram filtering, spectrogram
DOI: 10.3233/JIFS-192088
Journal: Journal of Intelligent & Fuzzy Systems, vol. 39, no. 5, pp. 6881-6889, 2020