Multi-order Replay Attack Detection Using Enhanced Feature Extraction and Deep Learning Classification

Joshi, Sanil; Dua, Mohit

doi:10.1007/978-981-19-8825-7_63

Sanil Joshi¹³ &
Mohit Dua¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 600))

339 Accesses
7 Citations

Abstract

The authenticated users are identified and verified using automatic speaker verification (ASV) technologies. An automatic speaker verification (ASV) system, like any other user identification system, is also sensitive to spoofing. In order to make the ASV systems robust against spoofing, these systems are alienated into two different phases, i.e., frontend feature extraction and backend classification model. The main emphasis of the paper is on the development of the system against multi-order replay attacks. The joint frequency-domain linear prediction (FDLP) and mel-frequency cepstral coefficients (MFCC) is used at frontend to extract the features from the audio samples. At backend, gated recurrent unit (GRU) classification model is used. The proposed system is achieving 2.99% equal error rate (ERR) and 1.6% ERR under 1PR and 2PR spoofing attacks, respectively, and also provides 97.7% and 97.9% accuracy under the same environment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks

Deep Learning Approach: Detection of Replay Attack in ASV Systems

Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection

References

Mittal A, Dua M (2021) Automatic speaker verification system using three dimensional static and contextual variation-based features with two-dimensional convolutional neural network. Int J Swarm Intell 6(2):143–153
Article Google Scholar
Mittal A, Dua M (2021) Automatic speaker verification systems and spoof detection techniques: review and analysis. Int J Speech Technol 1–30
Google Scholar
Lavrentyeva G, Novoselov S, Malykh E, Kozlov A, Kudashev O, Shchemelinin V (2017) Audio replay attack detection with deep learning frameworks. In: Interspeech, pp 82–86
Google Scholar
Campbell JP (1995) Testing with the YOHO CD-ROM voice verification corpus. In 1995 international conference on acoustics, speech, and signal processing, vol 1, IEEE, pp 341–344
Google Scholar
Mittal A, Dua M (2021) Static–dynamic features and hybrid deep learning models-based spoof detection system for ASV. Complex Intell Syst 1–14
Google Scholar
Delgado H et al (2021) ASVspoof 2021: Automatic speaker verification spoofing and countermeasures challenge evaluation plan. arXiv Prepr. arXiv2109.00535
Google Scholar
Malik KM, Javed A, Malik H, Irtaza A (2020) A light-weight replay detection framework for voice controlled IoT devices. IEEE J Sel Top Signal Process 14(5):982–996
Article Google Scholar
Dua M, Jain C, Kumar S (2021) LSTM and CNN based ensemble approach for spoof detection task in automatic speaker verification systems. J Ambient Intell Humaniz Comput 1–16
Google Scholar
Mittal A, Dua M, Dua S (2021) Classical and deep learning data processing techniques for speech and speaker recognitions. In: Deep learning approaches for spoken and natural language processing. Springer, Cham, pp 111–126
Google Scholar
Dua M, Aggarwal RK, Biswas M (2019) GFCC based discriminatively trained noise robust continuous ASR system for Hindi language. J Ambient Intell Humaniz Comput 10(6):2301–2314
Article Google Scholar
Biau G, Scornet E (2016) A random forest guided tour. TEST 25(2):197–227
Article MathSciNet MATH Google Scholar
Mittal A, Dua M (2021) Constant Q cepstral coefficients and long short-term memory model-based automatic speaker verification system. In: Proceedings of international conference on intelligent computing, information and control systems. Springer, Singapore, pp 895–904
Google Scholar
Ganapathy S, Pelecanos J, Omar MK (2011) Feature normalization for speaker verification in room reverberation. In: 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4836–4839
Google Scholar
Dua M, Sadhu A, Jindal A, Mehta R (2022) A hybrid noise robust model for multireplay attack detection in automatic speaker verification systems. Biomed Signal Process Control 74:103517
Article Google Scholar
Shukla S, Prakash J, Guntur RS (2019) Replay attack detection with raw audio waves and deep learning framework. In: 2019 international conference on data science and engineering (ICDSE), IEEE, pp 66–70
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, National Institute of Technology, Kurukshetra, India
Sanil Joshi & Mohit Dua

Authors

Sanil Joshi
View author publications
You can also search for this author in PubMed Google Scholar
Mohit Dua
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sanil Joshi .

Editor information

Editors and Affiliations

SRM Institute of Science and Technology, Ghaziabad, Uttar Pradesh, India
Rajendra Prasad Mahapatra
Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, India
Sateesh K. Peddoju
Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Sudip Roy
SRM Institute of Science and Technology, Ghaziabad, Uttar Pradesh, India
Pritee Parwekar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Joshi, S., Dua, M. (2023). Multi-order Replay Attack Detection Using Enhanced Feature Extraction and Deep Learning Classification. In: Mahapatra, R.P., Peddoju, S.K., Roy, S., Parwekar, P. (eds) Proceedings of International Conference on Recent Trends in Computing. Lecture Notes in Networks and Systems, vol 600. Springer, Singapore. https://doi.org/10.1007/978-981-19-8825-7_63

Download citation

DOI: https://doi.org/10.1007/978-981-19-8825-7_63
Published: 21 March 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-8824-0
Online ISBN: 978-981-19-8825-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Multi-order Replay Attack Detection Using Enhanced Feature Extraction and Deep Learning Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks

Deep Learning Approach: Detection of Replay Attack in ASV Systems

Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Multi-order Replay Attack Detection Using Enhanced Feature Extraction and Deep Learning Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks

Deep Learning Approach: Detection of Replay Attack in ASV Systems

Light CNN Architecture Enhancement for Different Types Spoofing Attack Detection

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation