Abstract
Traditional concatenative speech synthesizers that rely on a small number of speech segments suffer from a lack of naturalness. Corpus-based speech synthesizers, by contrast, are able to produce much more natural speech. This paper presents two new unit-selection methods for corpus-based speech synthesis and compares them experimentally with one widely used unit-selection method. All three approaches are evaluated for comprehensibility and naturalness.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Batůšek, R., Gaura, P. (2003). A Comparison of Unit Selection Techniques in Limited Domain Speech Synthesis. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science, vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_35
DOI: https://doi.org/10.1007/978-3-540-39398-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive