Abstract
Neural style transfer (NST) was created to give a new look for images, audios and videos through optimization and manipulation techniques. Nowadays, this specific field has picked up pace amongst various techniques that deal with neural networks and it has emerged as one of the most efficient means of producing style transfer. In order to address the shortcomings in the existing system, multi-model neural style transfer (MMNST) approach for image and audio is proposed. It focuses on two kinds of data: audio and image. The main objective of this proposed system is to create artistic imagery by separating and recombining image content and style. For the audio style transfer, we have two inputs which are broken down, optimized and enhanced and finally combined together in a fulfilling manner. Specifically, local and global features can be transferred using both parametric and non-parametric neural style transfer algorithms, which result in an outcome that has equal portions of both—content and style input as they coalesce perfectly. For experimentation, VGG-19 (CNN) and TensorFlow Lite models are used. The proposed model outperforms the existing models in terms of accuracy, execution speed and the total loss incurred during the process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
M.-M. Cheng, X.-C. Liu, J. Wang, S.-P. Lu, Y.-K. Lai, P.L. Rosin, Structure-preserving neural style transfer, in IEEE Transactions on Image Processing, vol. 29 (2020)
M.-C. Yeh, S. Tang, A. Bhattad, C. Zou, D. Forsyth, Improving style transfer with calibrated metrics, in 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) (2020)
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A.L. Yuille, Deeplab: semantic image segmentation with deep convolutional nets, Atrous convolution, and fully connected (2016)
P. Rathi, P. Adarsh, M. Kumar, Deep learning approach for arbitrary image style fusion and transformation using SANET model, in 2020 4th International Conference Trends in Electronics and Informatics (ICOEI) (2020)
C. Khosla, B.S. Saini, Enhancing performance of deep learning models with different data augmentation techniques: a survey, in 2020 International Conference on Intelligent Engineering and Management (ICIEM) (2020)
E. Grinstein, N.Q.K. Duong, A. Ozerov, P. Perez, Audio style transfer, in ASSP—IEEE International Conference on Acoustics, Speech and Signal Processing (2018)
Z. Huang, S. Chen, B. Zhu, Deep leaning for audio style transfer
F. Luan, S. Paris, E. Schechtman, Deep photo style transfer, in 2017 IEEE Conference on CVPR (July, 2017)
Y. Jing, Y. Yang, Z. Feng, J. Ye, Y. Yu, M. Song, Neural style transfer: a review. IEEE Trans. Vis. Comp. Graphics 26(11) (2020)
P. Li, D. Zhang, L. Zhao, D. Xu, D. Lu, Style permutation for diversified arbitrary style transfer. IEEE Access 8 (2020)
A.J. Champandard, Semantic style transfer and turning two-bit doodles into fine artworks, in nucl.ai Conference (Mar, 2016.)
Y. Zhu, Y. Niu, F. Li, C. Zou, G. Shi, Channel-grouping based patch swap for arbitrary style transfer, in 2020 IEEE International Conference on Image Processing (ICIP) (2020)
W. Ma, Z. Chen, C. Ji, Block shuffle: a method for high-resolution fast style transfer with limited memory. IEEE Access 8 (2020)
A. Levin, D. Lischinski, Y. Weiss, A closed-form solution to natural image matting. IEEE Trans. Pattern Anal. Mach. Intell. (2008)
M. Pasini, MelGAN-VC: voice conversion and audio style transfer on arbitrarily long samples using Spectrograms (2019)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vishal, B., Sriram, K.G., Sujithra, T. (2022). Multi-model Neural Style Transfer (MMNST) for Audio and Image. In: Reddy, V.S., Prasad, V.K., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2021. Advances in Intelligent Systems and Computing, vol 1413. Springer, Singapore. https://doi.org/10.1007/978-981-16-7088-6_18
Download citation
DOI: https://doi.org/10.1007/978-981-16-7088-6_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-7087-9
Online ISBN: 978-981-16-7088-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)