Abstract
A long-standing problem of monotonicity in naturalness has been solved using a well-founded model, namely the Speech Hierarchy Model. This model is based on the fact that all natural speech signals have infinite variations. For example, red light is present in an infinite number of frequencies in nature, whereas a computer has only a few numbers within a finite range to create red color. Paralinguistic content, which is a part of a speech signal, also varies infinitely. Using the concept of paralinguistic content expression, which can be used to express any form of variation onto a speech signal, the present methods of synthesizing speech are enhanced and will lead to technology which is more natural in the human sense. This paper implements the method and results in a tool for synthesizing Hindi speech which gave high intelligibility in 81% of input text samples.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Klatt, D.H.: Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Am. 67(3), 971–995 (1980)
Lemmetty, S.: Review of Speech Synthesis Technology, Master’s Thesis, Helsinki University of Technology, Helsinki, Finland (1999)
Bengio, Y.: Learning deep architectures for AI. Found. Trends Mach. Learn. 2(1), 1–127 (2009)
Lakra, S., Prasad, T.V., Ramakrishna, G.: Using Fuzzy sets to model paralinguistic content in speech as a generic solution for current problems in speech recognition and speech synthesis. J. Theoret. Appl. Info. Tech. 78(3), 441–446 (2015)
Dutoit, T.: High Quality Text-to-Speech Synthesis of the French Language. Ph.D. thesis, The Austrian Research Institute for Artificial Intelligence (OFAI), Vienna, Austria (1993)
Lakra, S., Prasad, T.V., Ramakrishna, G.: Modelling and Simulating response generation by a computer using a rule-based approach. Int. J. Soft Comput. 11(5), 299–304 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Prasad, T.V. (2019). Hindi Speech Synthesis Using Paralinguistic Content Expression. In: Bansal, J., Das, K., Nagar, A., Deep, K., Ojha, A. (eds) Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 816. Springer, Singapore. https://doi.org/10.1007/978-981-13-1592-3_7
Download citation
DOI: https://doi.org/10.1007/978-981-13-1592-3_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1591-6
Online ISBN: 978-981-13-1592-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)