Speech Enhancement Based on the Combination of Deep Learning and Wavelet Algorithm

Yue, Li; Ji, Qiu

doi:10.1007/978-981-97-0126-1_16

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 1141)

Included in the following conference series:

International Symposium on Automatic Control and Emerging Technologies

Abstract

This paper proposes a method that improves the signal-to-noise ratio (SNR) of speech by combining a wavelet algorithm with deep learning. First, wavelet threshold denoising is introduced for speech enhancement. Second, a deep neural network that uses the Ideal Binary Mask (IBM) is proposed to raise the SNR. To improve performance, the speech signal is analyzed under both methods using characteristic parameters. Third, the two methods are combined into a new method that refines the speech enhancement process. Numerical experiments compare design efficiency and effectiveness analytically and computationally, and the results support the superiority of the combined method.
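
The abstract names two building blocks, wavelet threshold denoising and the Ideal Binary Mask, without giving their details. The following is a minimal sketch of both ideas, not the authors' implementation. Everything specific in it is an assumption made for illustration: the one-level Haar wavelet, soft thresholding with the universal threshold, the noise estimate from the median absolute detail coefficient, the local criterion `lc_db`, and all function names.

```python
import math
import random


def snr_db(clean, estimate):
    """SNR in dB of `estimate`, measured against the reference `clean`."""
    signal_power = sum(c * c for c in clean)
    error_power = sum((c - e) ** 2 for c, e in zip(clean, estimate))
    return 10.0 * math.log10(signal_power / error_power)


def haar_dwt(x):
    """One-level orthonormal Haar transform of an even-length sequence."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend(((a + d) / s, (a - d) / s))
    return out


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; zero those below t."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def wavelet_denoise(noisy):
    """Soft-threshold the Haar detail coefficients (assumed scheme)."""
    approx, detail = haar_dwt(noisy)
    # Noise level estimated from the median absolute detail coefficient.
    sigma = sorted(abs(d) for d in detail)[len(detail) // 2] / 0.6745
    t = sigma * math.sqrt(2.0 * math.log(len(noisy)))  # universal threshold
    return haar_idwt(approx, soft_threshold(detail, t))


def ideal_binary_mask(speech_energy, noise_energy, lc_db=0.0, eps=1e-12):
    """1 where the local SNR of a time-frequency unit exceeds lc_db, else 0."""
    return [
        [1 if 10.0 * math.log10((s + eps) / (n + eps)) > lc_db else 0
         for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(speech_energy, noise_energy)
    ]


if __name__ == "__main__":
    # Synthetic stand-in for speech: a slow sine with additive Gaussian noise.
    random.seed(0)
    clean = [math.sin(2.0 * math.pi * i / 128.0) for i in range(1024)]
    noisy = [c + random.gauss(0.0, 0.3) for c in clean]
    print("SNR before:", snr_db(clean, noisy))
    print("SNR after: ", snr_db(clean, wavelet_denoise(noisy)))
```

The mask needs the clean speech and noise energies separately, so it can only be computed on training data. In a mask-based enhancer a network would typically be trained to predict it from noisy features. How the chapter couples that network with the wavelet stage is not stated in the abstract, so the sketch leaves the two parts independent.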

Author information

Authors and Affiliations

Guangxi Minzu University, Nanning, China
Li Yue & Qiu Ji

Authors

Li Yue
Qiu Ji

Corresponding author

Correspondence to Qiu Ji.

Editor information

Editors and Affiliations

National School of Applied Sciences, Ibn Tofail University, Kénitra, Morocco
Hassan El Fadil
School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
Weicun Zhang

About this paper

Cite this paper

Yue, L., Ji, Q. (2024). Speech Enhancement Based on the Combination of Deep Learning and Wavelet Algorithm. In: El Fadil, H., Zhang, W. (eds) Automatic Control and Emerging Technologies. ACET 2023. Lecture Notes in Electrical Engineering, vol 1141. Springer, Singapore. https://doi.org/10.1007/978-981-97-0126-1_16

DOI: https://doi.org/10.1007/978-981-97-0126-1_16
Published: 30 January 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0125-4
Online ISBN: 978-981-97-0126-1
eBook Packages: Intelligent Technologies and Robotics (R0)
