Multi-model Neural Style Transfer (MMNST) for Audio and Image

Vishal, B.; Sriram, K. G.; Sujithra, T.

doi:10.1007/978-981-16-7088-6_18

B. Vishal¹⁸,
K. G. Sriram¹⁸ &
T. Sujithra¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1413))

Included in the following conference series:

International Conference on Soft Computing and Signal Processing

826 Accesses

Abstract

Neural style transfer (NST) was created to give a new look for images, audios and videos through optimization and manipulation techniques. Nowadays, this specific field has picked up pace amongst various techniques that deal with neural networks and it has emerged as one of the most efficient means of producing style transfer. In order to address the shortcomings in the existing system, multi-model neural style transfer (MMNST) approach for image and audio is proposed. It focuses on two kinds of data: audio and image. The main objective of this proposed system is to create artistic imagery by separating and recombining image content and style. For the audio style transfer, we have two inputs which are broken down, optimized and enhanced and finally combined together in a fulfilling manner. Specifically, local and global features can be transferred using both parametric and non-parametric neural style transfer algorithms, which result in an outcome that has equal portions of both—content and style input as they coalesce perfectly. For experimentation, VGG-19 (CNN) and TensorFlow Lite models are used. The proposed model outperforms the existing models in terms of accuracy, execution speed and the total loss incurred during the process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 189.00; Price excludes VAT (USA)

Softcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Review: The Study and Analysis of Neural Style Transfer in Image

Fast Image Multi-style Transfer and Its Quality Assessment

NCCNet: Arbitrary Neural Style Transfer with Multi-channel Conversion

References

M.-M. Cheng, X.-C. Liu, J. Wang, S.-P. Lu, Y.-K. Lai, P.L. Rosin, Structure-preserving neural style transfer, in IEEE Transactions on Image Processing, vol. 29 (2020)
Google Scholar
M.-C. Yeh, S. Tang, A. Bhattad, C. Zou, D. Forsyth, Improving style transfer with calibrated metrics, in 2020 IEEE Winter Conference on Applications of Computer Vision (WACV) (2020)
Google Scholar
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A.L. Yuille, Deeplab: semantic image segmentation with deep convolutional nets, Atrous convolution, and fully connected (2016)
Google Scholar
P. Rathi, P. Adarsh, M. Kumar, Deep learning approach for arbitrary image style fusion and transformation using SANET model, in 2020 4th International Conference Trends in Electronics and Informatics (ICOEI) (2020)
Google Scholar
C. Khosla, B.S. Saini, Enhancing performance of deep learning models with different data augmentation techniques: a survey, in 2020 International Conference on Intelligent Engineering and Management (ICIEM) (2020)
Google Scholar
E. Grinstein, N.Q.K. Duong, A. Ozerov, P. Perez, Audio style transfer, in ASSP—IEEE International Conference on Acoustics, Speech and Signal Processing (2018)
Google Scholar
Z. Huang, S. Chen, B. Zhu, Deep leaning for audio style transfer
Google Scholar
F. Luan, S. Paris, E. Schechtman, Deep photo style transfer, in 2017 IEEE Conference on CVPR (July, 2017)
Google Scholar
Y. Jing, Y. Yang, Z. Feng, J. Ye, Y. Yu, M. Song, Neural style transfer: a review. IEEE Trans. Vis. Comp. Graphics 26(11) (2020)
Google Scholar
P. Li, D. Zhang, L. Zhao, D. Xu, D. Lu, Style permutation for diversified arbitrary style transfer. IEEE Access 8 (2020)
Google Scholar
A.J. Champandard, Semantic style transfer and turning two-bit doodles into fine artworks, in nucl.ai Conference (Mar, 2016.)
Google Scholar
Y. Zhu, Y. Niu, F. Li, C. Zou, G. Shi, Channel-grouping based patch swap for arbitrary style transfer, in 2020 IEEE International Conference on Image Processing (ICIP) (2020)
Google Scholar
W. Ma, Z. Chen, C. Ji, Block shuffle: a method for high-resolution fast style transfer with limited memory. IEEE Access 8 (2020)
Google Scholar
A. Levin, D. Lischinski, Y. Weiss, A closed-form solution to natural image matting. IEEE Trans. Pattern Anal. Mach. Intell. (2008)
Google Scholar
M. Pasini, MelGAN-VC: voice conversion and audio style transfer on arbitrarily long samples using Spectrograms (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science Engineering, SRMIST, SRM Nagar, Chennai, India
B. Vishal, K. G. Sriram & T. Sujithra

Authors

B. Vishal
View author publications
You can also search for this author in PubMed Google Scholar
K. G. Sriram
View author publications
You can also search for this author in PubMed Google Scholar
T. Sujithra
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electronics and Communication Engineering, Malla Reddy College of Engineering and Technology, Hyderabad, Telangana, India
V. Sivakumar Reddy
Department of Computer Science and Engineering, Jawaharlal Nehru Technological University Hyderabad, Hyderabad, Telangana, India
V. Kamakshi Prasad
Department of Computer Science and Software Engineering, Monmouth University, New Jersey, NJ, USA
Jiacun Wang
Department of Electronics and Communication Engineering, Sir Visvesvaraya Institute of Technology, Nashik, Maharashtra, India
K.T.V. Reddy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vishal, B., Sriram, K.G., Sujithra, T. (2022). Multi-model Neural Style Transfer (MMNST) for Audio and Image. In: Reddy, V.S., Prasad, V.K., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2021. Advances in Intelligent Systems and Computing, vol 1413. Springer, Singapore. https://doi.org/10.1007/978-981-16-7088-6_18

Download citation

DOI: https://doi.org/10.1007/978-981-16-7088-6_18
Published: 15 February 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-7087-9
Online ISBN: 978-981-16-7088-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Multi-model Neural Style Transfer (MMNST) for Audio and Image

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Review: The Study and Analysis of Neural Style Transfer in Image

Fast Image Multi-style Transfer and Its Quality Assessment

NCCNet: Arbitrary Neural Style Transfer with Multi-channel Conversion

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Multi-model Neural Style Transfer (MMNST) for Audio and Image

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Review: The Study and Analysis of Neural Style Transfer in Image

Fast Image Multi-style Transfer and Its Quality Assessment

NCCNet: Arbitrary Neural Style Transfer with Multi-channel Conversion

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation