Abstract
Digital media forensics can exploit the electric network frequency of audio signals to detect tampering. However, current electric network based audio forensic schemes are limited by their inability to obtain concurrent electric network frequency reference datasets from power grids. In addition, most forensic algorithms do not provide high detection precision in adverse signal-to-noise conditions.
This chapter proposes an automated electric network frequency based audio forensic scheme that monitors abrupt mutations of tampered frames and discontinuities in the variations of electric network frequency features. Specifically, the scheme utilizes the multiple signal classification, Hilbert linear prediction and Welch algorithms to extract electric network frequency features from audio signals; the extracted features are passed to a convolutional neural network classifier to detect audio tampering. The negative effects of low signal-to-noise ratios on electric network frequency extraction are addressed by employing extra low-rank filtering that removes voice activity and noise interference. Simulation results demonstrate that the proposed scheme provides better audio tampering detection accuracy compared with a benchmark method, especially under adverse signal-to-noise conditions.
Chapter PDF
Similar content being viewed by others
References
M. Arnold, Audio watermarking: Features, applications and algorithms, Proceedings of the IEEE International Conference on Multimedia and Exposition – Latest Advances in the Fast-Changing World of Multimedia, vol. 2, pp. 1013–1016, 2000.
J. Chen, X. Kang, Y. Liu and Z. Wang, Median filtering forensics based on convolutional neural networks, IEEE Signal Processing Letters, vol. 22(11), pp. 1849–1853, 2015.
P. Esquef, J. Apolinario and L. Biscainho, Edit detection in speech recordings via instantaneous electric network frequency variations, IEEE Transactions on Information Forensics and Security, vol. 9(12), pp. 2314–2326, 2014.
P. Esquef, J. Apolinario and L. Biscainho, Improved edit detection in speech via ENF patterns, Proceedings of the IEEE International Workshop on Information Forensics and Security, 2015.
R. Garg, A. Varna, A. Hajj-Ahmad and M. Wu, “Seeing” ENF: Power-signature-based timestamps for digital multimedia via optical sensing and signal processing, IEEE Transactions on Information Forensics and Security, vol. 8(9), pp. 1417–1432, 2013.
A. Hajj-Ahmad, R. Garg and M. Wu, Instantaneous frequency estimation and localization for ENF signals, Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012.
A. Hajj-Ahmad, R. Garg and M. Wu, ENF-based region-of-recording identification for media signals, IEEE Transactions on Information Forensics and Security, vol. 10(6), pp. 1125–1136, 2015.
G. Hua, G. Bi and V. Thing, On practical issues of electric network frequency based audio forensics, IEEE Access, vol. 5, pp. 20640–20651, 2017.
G. Hua, Y. Zhang, J. Goh and V. Thing, Audio authentication by exploring the absolute error map of ENF signals, IEEE Transactions on Information Forensics and Security, vol. 11(5), pp. 1003–1016, 2016.
M. Imran, Z. Ali, S. Bakhsh and S. Akram, Blind detection of copy-move forgery in digital audio forensics, IEEE Access, vol. 5, pp. 12843–12855, 2017.
G. Karantaidis and C. Kotropoulos, Assessing spectral estimation methods for electric network frequency extraction, Proceedings of the Twenty-Second Pan-Hellenic Conference on Informatics, pp. 202–207, 2018.
X. Lin and X. Kang, Robust electric network frequency estimation with rank reduction and linear prediction, ACM Transactions on Multimedia Computing, Communications and Applications, vol. 14(4), article no. 84, 2018.
X. Lin, J. Liu and X. Kang, Audio recapture detection with convolutional neural networks, IEEE Transactions on Multimedia, vol. 18(8), pp. 1480–1487, 2016.
Y. Liu, Z. Yuan, P. Markham, R. Conners and Y. Liu, Application of power system frequency for digital audio authentication, IEEE Transactions on Power Delivery, vol. 27(4), pp. 1820–1828, 2012.
A. Martin, G. Doddington, T. Kamm, M. Ordowski and M. Przybocki, The DET curve in assessments of detection task performance, Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997.
D. Nicolalde Rodriguez and J. Apolinario, Evaluating digital audio authenticity with spectral distances and ENF phase change, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1417–1420, 2009.
D. Nicolaide Rodriguez, J. Apolinario and L. Biscainho, Audio authenticity: Detecting ENF discontinuity with high precision phase analysis, IEEE Transactions on Information Forensics and Security, vol. 5(3), pp. 534–543, 2010.
X. Pan, X. Zhang and S. Lyu, Detecting splicing in digital audios using local noise level estimation, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 1841–1844, 2012.
P. Reis, J. da Costa, R. Miranda and G. Del Galdo, ESPRIT-Hilbert-based audio tampering detection with SVM classifier for forensic analysis via electrical network frequency, IEEE Transactions on Information Forensics and Security, vol. 12(4), pp. 853–864, 2016.
H. Su, A. Hajj-Ahmad, M. Wu and D. Oard, Exploring the use of ENF for multimedia synchronization, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4613–4617, 2014.
S. Vatansever, A. Dirik and N. Memon, Factors affecting ENF-based time-of-recording estimation for video, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 2497–2501, 2019.
Z. Wang, J. Wang, C. Zeng, Q. Min, Y. Tian and M. Zuo, Digital audio tampering detection based on ENF consistency, Proceedings of the International Conference on Wavelet Analysis and Pattern Recognition, pp. 209–214, 2018.
Q. Zhao, D. Meng, Z. Xu, W. Zuo and L. Zhang, Robust principal component analysis with complex noise, Proceedings of the Thirty-First International Conference on Machine Learning, vol. II, pp. 55–63, 2014.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 IFIP International Federation for Information Processing
About this paper
Cite this paper
Mao, M., Xiao, Z., Kang, X., Li, X., Xiao, L. (2020). Electric Network Frequency Based Audio Forensics Using Convolutional Neural Networks. In: Peterson, G., Shenoi, S. (eds) Advances in Digital Forensics XVI. DigitalForensics 2020. IFIP Advances in Information and Communication Technology, vol 589. Springer, Cham. https://doi.org/10.1007/978-3-030-56223-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-56223-6_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-56222-9
Online ISBN: 978-3-030-56223-6
eBook Packages: Computer ScienceComputer Science (R0)