Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching

Villavicencio, Fernando

doi:10.1007/978-3-642-38847-7_6

Fernando Villavicencio²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7911))

Included in the following conference series:

International Conference on Nonlinear Speech Processing

1063 Accesses

Abstract

In this paper a preliminary study on voice excitation modeling by single glottal shape parameter selection is presented. A strategy for direct model selection by matching derivative glottal source estimates with LF-based candidates driven by the Rd parameter is explored by means of two state-of-the-art similarity measures and a novel one considering spectral envelope information. An experimental study on synthetic singing-voice was carried out aiming to compare the performance of the different measures and to observe potential relations with respect to different voice characteristics (e.g. vocal effort, pitch range, amount of aperiodicities and aspiration noise). The results of this study allow us to claim competitive performance of the proposed strategy and suggest us preferable source modeling conditions for stationary singing-voice.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Objective Description of Choral Singers Voice Quality Using Glottal-to-Noise Excitation Ratio

Speech synthesis for glottal activity region processing

Article 03 December 2018

Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Alku, P.: Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering. Speech Communication 11, 109–118 (1992)
Article Google Scholar
Drugman, T., Bozkurt, B., Dutoit, T.: Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation. Speech Communication 53, 855–866 (2011)
Article Google Scholar
Degottex, G., Röbel, A., Rodet, X.: Joint estimate of shape and time-synchronization of a glottal source model by phase flatness. In: Proc. of ICASSP, Dallas, USA, pp. 5058–5061 (2010)
Google Scholar
Kane, J., Yanushevskaya, I., Chasaide, A.N., Gobl, C.: Exploiting time and frequency domain measures for precise voice source parameterisation. In: Proc. of Speech Prosody, Shanghai, China, pp. 143–146 (May 2012)
Google Scholar
Fant, G.: The lf-model revisited. transformations and frequency domain analysis. STL-QPSR Journal 36(2-3), 119–156 (1995)
Google Scholar
Lu, H.-L.: Toward a High-Quality Singing-Voice Synthesizer with Vocal Texture Control, Ph.D. thesis, Stanford University (2002)
Google Scholar
Henrich, N.: Etude de la source glottique en voix parlée et chantée, Ph.d. thesis, Université Paris 6, France (2001)
Google Scholar
Röbel, A., Rodet, X.: Efficient spectral envelope estimation and its application to pitch shifting and envelope preservation. In: Proc. of DAFx, Spain (2005)
Google Scholar
Villavicencio, F., Röbel, A., Rodet, X.: Improving lpc spectral envelope extraction of voiced speech by true-envelope estimation. In: Proc. of ICASSP (2006)
Google Scholar
Kreiman, J., Gerratt, B.R.: Perception of aperiodicity in pathological voice. Journal of the Acoustical Society of America 117, 2201–2211 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Corporate Research & Development Center, Yamaha Corporation, 203 Matsunokijima, Iwata, Shizuoka, Japan
Fernando Villavicencio

Authors

Fernando Villavicencio
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

TCTS Lab, University of Mons, 31, Bouldevard Bolez, 7000, Mons, Belgium
Thomas Drugman
TCTS Lab, University of Mons, 31, Boulevard Dolez, 7000, Mons, Belgium
Thierry Dutoit

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Villavicencio, F. (2013). Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching. In: Drugman, T., Dutoit, T. (eds) Advances in Nonlinear Speech Processing. NOLISP 2013. Lecture Notes in Computer Science(), vol 7911. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38847-7_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-38847-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38846-0
Online ISBN: 978-3-642-38847-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching

Abstract

Chapter PDF

Similar content being viewed by others

Objective Description of Choral Singers Voice Quality Using Glottal-to-Noise Excitation Ratio

Speech synthesis for glottal activity region processing

Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Glottal Source Model Selection for Stationary Singing-Voice by Low-Band Envelope Matching

Abstract

Chapter PDF

Similar content being viewed by others

Objective Description of Choral Singers Voice Quality Using Glottal-to-Noise Excitation Ratio

Speech synthesis for glottal activity region processing

Commonalities of Glottal Sources and Vocal Tract Shapes Among Speakers in Emotional Speech

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation