Hindi Speech Synthesis Using Paralinguistic Content Expression

Prasad, T. V.

doi:10.1007/978-981-13-1592-3_7

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 816)

Abstract

A long-standing problem of monotonicity in naturalness has been solved using a well-founded model, the Speech Hierarchy Model. The model rests on the observation that all natural speech signals vary infinitely. By analogy, red light occurs in nature at an infinite number of frequencies, whereas a computer can represent red only with a few numbers from a finite range. Paralinguistic content, which is part of a speech signal, also varies infinitely. Paralinguistic content expression, a concept that can impose any form of variation on a speech signal, is used to enhance present speech synthesis methods and should lead to technology that sounds more natural to human listeners. This paper implements the method in a tool for synthesizing Hindi speech, which gave high intelligibility in 81% of input text samples.

Author information

Authors and Affiliations

Godavari Institute of Engineering and Technology, Rajahmundry, 533296, Andhra Pradesh, India
T. V. Prasad

Corresponding author

Correspondence to T. V. Prasad.

Editor information

Editors and Affiliations

Department of Mathematics, South Asian University, New Delhi, India
Jagdish Chand Bansal
Department of Mathematics, National Institute of Technology Silchar, Silchar, Assam, India
Kedar Nath Das
Department of Mathematics and Computer Science, Faculty of Science, Liverpool Hope University, Liverpool, UK
Atulya Nagar
Department of Mathematics, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India
Kusum Deep
School of Basic Sciences, Indian Institute of Technology Bhubaneswar, Bhubaneswar, Odisha, India
Akshay Kumar Ojha

About this paper

Cite this paper

Prasad, T.V. (2019). Hindi Speech Synthesis Using Paralinguistic Content Expression. In: Bansal, J., Das, K., Nagar, A., Deep, K., Ojha, A. (eds) Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 816. Springer, Singapore. https://doi.org/10.1007/978-981-13-1592-3_7

DOI: https://doi.org/10.1007/978-981-13-1592-3_7
Published: 14 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1591-6
Online ISBN: 978-981-13-1592-3
eBook Packages: Intelligent Technologies and Robotics (R0)
