Abstract
Information about the age of the speaker is always present in speech. It is used as perceptual cues to age by human listeners, and can be measured acoustically and used by automatic age estimators. This chapter offers an introduction to the phonetic study of speaker age, with focus on what is known about the acoustic features which vary with age. The age-related acoustic variation in temporal as well as in laryngeally and supralaryngeally conditioned aspects of speech has been well documented. For example, features related to speech rate, sound pressure level (SPL) and fundamental frequency (F0) have been studied extensively, and appear to be important correlates of speaker age. However, the relationships among the correlates appear to be rather complex, and are influenced by several factors. For instance, differences have been reported between correlates of female and male age, between speakers of good and poor physiological condition, between chronological age and perceived age, and also between different speech sample types (e.g. sustained vowels, read or spontaneous speech). More research is thus needed in order to build reliable automatic classifiers of speaker age.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Beck, J.M.: Organic variation of the vocal apparatus. In: Hardcastle, W.J., Laver, J. (eds.) The Handbook of Phonetic Sciences, pp. 256–297. Blackwell Publ., Oxford (1997)
Linville, S.E.: Vocal Aging. Singular Thomson Learning, San Diego, CA (2001)
Ramig, L.A., Ringel, R.L.: Effects of physiological aging on selected acoustic characteristics of voice. Journal of Speech and Hearing Research 26, 22–30 (1983)
Linville, S.E.: The aging voice. In: Kent, R.D., Ball, M.J. (eds.) Voice Quality Measurement. Singular Thomson Learning, San Diego, CA, pp. 359–376 (2000)
Linville, S.E.: The aging voice. The American Speech-Language-Hearing Association (ASHA) Leader 12, 21 (October 19, 2004)
Jurik, A.: Ossification and calcification of the laryngeal skeleton. Acta Radiol Diagn. 25, 17–22 (1984)
Lindblad, P.: Rösten. Studentlitteratur, Lund (1992)
Dedivitis, R.A.: Abrahão, M., Simães, M.J., Mora, O.A., Cervantes, O.W.: Aging histological changes in the cartilages of the cricoarytenoid joint. Acta Cir Bras [serial online] 19 (2004) (retrieved August 16, 2006), from http://www.scielo.br/acb
Mupparapu, M., Vuppalapati, A.: Ossification of laryngeal cartilages on lateral cephalometric radiographs. The Angle Orthodontist 75, 196–201 (2005)
Ptacek, P.H., Sander, E.K.: Age recognition from voice. Journal of Speech and Hearing Research 9, 273–277 (1966)
Ryan, W.J., Burk, K.W.: Perceptual and acoustic correlates of aging in the speech of males. Journal of Communication Disorders 7, 181–192 (1974)
Huntley, R., Hollien, H., Shipp, T.: Influences of listener characteristics on perceived age estimations. Journal of Voice 1, 49–52 (1987)
Shipp, T., Hollien, H.: Perception of the aging male voice. Journal of Speech and Hearing Research 12, 703–710 (1969)
Ramig, L.: Aging speech: Physiological and sociological aspects. Language and Communication 6, 25–34 (1986)
Schötz, S.: Perception, Analysis and Synthesis of Speaker Age. PhD thesis, Travaux de l’Institut de linguistique de Lund 47. Lund: Dept. of Linguistics and Phonetics, Lund University (2006)
Shafran, I., Riley, M., Mohri, M.: Voice signatures. In: Proc. of The 8th IEEE Automatic Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands (2003)
Minematsu, N., Sekiguchi, M., Hirose, K.: Automatic estimation of one’s age with his/her speech based upon acoustic modeling techniques of speakers. In: Proc. of ICASSP 2002, Orlando, FL, pp. 137–140 (2002)
Minematsu, N., Sekiguchi, M., Hirose, K.: Performance improvement in estimating subjective agedness with prosodic features. In: Proc. of Speech Prosody 2002, Aix-en- Provence, pp. 507–510 (2002)
Minematsu, N., Yamauchi, K., Hirose, K.: Automatic estimation of perceptual age using speaker modeling techniques. In: Proc. of Eurospeech, Geneva, pp. 3005–3008 (2003)
Müller, C., Wittig, F., Baus, J.: Exploiting speech for recognizing elderly users to respond to their special needs. In: Proc. of Eurospeech 2003, Geneva, pp. 1305–1308 (2003)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kauffman, San Mateo (1993)
Müller, C.: Zweistufige kontextsensitive Sprecherklassifikation am Beispiel von Alter und Geschlecht. PhD thesis, Computer Science Institute, Saarland University (2005)
Müller, C.: Automatic recognition of speakers’ age and gender on the basis of empirical studies. In: Proc. of Interspeech 2006, Pittsburgh, PA (2006)
Hollien, H.: Old voices: What do we really know about them? Journal of Voice 1, 2–13 (1987)
Morris, R.J., Brown, W.S.: Age-related differences in speech variability among women. Journal of Communication Disorders 27, 49–64 (1994)
Decoster, W.: Akoestische kenmerken van de ouder wordene stem. PhD thesis, Leuwen: Leuwen University Press (Summary in English) (1998)
Higgins, M.B., Saxman, J.H.: A comparison of selected phonatory behaviours of healthy aged and young adults. Journal of Speech and Hearing Research 13, 1000–1010 (1991)
Brückl, M., Sendlmeier, W.: Aging female voices: An acoustic and perceptive analysis. In: Proc. of VOQUAL 2003, Geneva, pp. 163–168 (2003)
Benjamin, B.: Phonological performance in gerontological speech. Journal of Psycholinguistic Research 11, 159–167 (1982)
Oyer, E., Deal, L.: Temporal aspects of speech and the aging process. Folia Phoniatrica (Basel) 37, 109–112 (1985)
Morris, R.J., Brown, W.S.: Age-related voice measures among adult women. Journal of Voice 1, 38–43 (1987)
Brown, W.S., Morris, R.J., Michel, J.F.: Vocal jitter in young adult and aged female voices. Journal of Voice 3, 113–119 (1989)
Shipp, T., Qi, Y., Huntley, R., Hollien, H.: Acoustic and temporal correlates of perceived age. Journal of Voice 6, 211–216 (1992)
Amerman, J.D., Parnell, M.M.: Speech timing strategies in elderly adults. Journal of Voice 20, 65–67 (1992)
Slawinski, E.B.: Acoustic correlates of [b] and [w] produced by normal young to elderly adults. Journal of the Acoustical Society of America 95(4), 2221–2230 (1994)
Schötz, S., Müller, C.: A study of acoustic correlates of speaker age. In: Speaker Classification II. LNCS(LNAI), vol. 4441, Springer, Heidelberg (2007)
Hoit, J., Hixon, K., Altman, M., Morgan, W.: Speech breathing in women. Journal of Speech and Hearing Research 32, 353–365 (1989)
Stölten, K., Engstrand, O.: Effects of sex and age in the Arjeplog dialect: A listening test and measurements of preaspiration and vot. In: Proc. of Fonetik 2002, vol. 44, TMH-QPSR, pp. 29–32 (2002)
Petrosino, L., Colcord, R.D., Kurcz, K.B., Yonker, R.J.: Voice onset time of velar stop productions in aged speakers. Journal of Perceptual and Motor Skills 76, 83–88 (1993)
Neiman, G., Kluch, R., Shuey, E.: Voice onset time in young and 70-year-old women. Journal of Speech and Hearing Research 26, 118–123 (1983)
Ryan, W.J.: Acoustic aspects of the aging voice. Journal of Gerontology 27, 256–268 (1972)
Morris, R.J., Brown, W.S.: Age-related differences in speech intensity among adult females. Folia Phoniatrica (Basel) 46, 64–69 (1994)
Xue, S.A., Deliyski, D.: Effects of aging on selected acoustic voice parameters: Preliminary normative data and educational implications. Educational Gerontology 21, 159–168 (2001)
Brown, W.S., Morris, R.J., Hollien, H., Howell, E.: Speaking fundamental frequency characteristics as a function of age and professional singing. Journal of Voice 5, 310–315 (1991)
Hollien, H., Shipp, T.: Speaking fundamenal frequency and chronological age in males. Journal of Speech and Hearing Research 15, 155–159 (1972)
Kitzing, P.: Glottografisk frekvensindikering: En undersökningsmetod för mätning av röstläge och röstomfång samt framställning av röstfrekvensdistributionen. PhD thesis, Lund University, Malmö (1979)
Traunmüller, H., Eriksson, A.: The frequency range of the voice fundamental in the speech of male and female adults [manuscript] (1995) (retrieved January 2, 2006), from http://www.ling.su.se/staff/hartmut/aktupub.htm
Mysak, E.: Pitch and duration characteristics of older males. Journal of Speech and Hearing Research 2, 46–54 (1959)
Linville, S.E.: Acoustic-perceptual studies of aging voice in women. Journal of Voice 1, 44–48 (1987)
Ptacek, P.H., Sander, E.K., Maloney, W.H., Jackson, C.C.R.: Phonatory and related changes with advanced age. Journal of Speech and Hearing Research 9, 350–360 (1966)
Ringel, R.L., Chodzko-Zajko, W.J.: Vocal indices of biological age. Journal of Voice 1, 31–37 (1987)
Linville, S.E.: The sound of senescence. Journal of Voice 10, 190–200 (1996)
Orlikoff, R.: The relationship of age and cardiovascular health to certain acoustic characteristics of male voices. Journal of Speech and Hearing Research 33, 450–457 (1990)
Ferrand, C.T.: Harmonics-to-noise ratio: An index of vocal aging. Journal of Voice 16, 480–487 (2002)
Shuey, E., Herr-McCauley, J., Prohaska, C., Martin, K.: Perturbation measures and chronologic age. Presented at the annual convention of the American Speech-Language-Hearing Association (ASHA), Chicago, IL (November 13–15, 2003)
Campbell, N.: Loudness, spectral tilt, and perceived prominence in dialogues. In: Proc. of ICPhS 1995. vol. 3, pp. 676–679. Stockholm (1995)
Heldner, M.: Spectral emphasis as an additional source of information in accent detection. In: Bacchiani, M., Hirschberg, J., Litman, D., Ostendorf, M. (eds.) Prosody 2001: ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, ISCA, Red Bank, NJ, pp. 57–60 (2001)
Sluijter, A.M.C., van Heuven, V.J.: Spectral balance as an acoustic correlate of linguistic stress. Journal of the Acoustical Society of America 100, 2471–2485 (1996)
Traunmüller, H.: Perception of speaker sex, age, and vocal effort. In: PHONUM, Reports in Phonetics 4. Umeå University, pp. 183–186 (1997)
O’Leidhin, E., Murphy, P.: Analysis of Spectral Measures for Voiced Speech with Varying Noise and Pertubation Levels. Proc. of ICASSP 1, 869–872 (2005)
Decoster, W., Debruyne, F.: Changes in spectral measures and voice onset time with age: A cross-sectional and a longitudinal study. Folia Phoniatrica et Logopaedica 49, 269–280 (1997)
Linville, S.E.: Source characteristics of aged voice assessed from long-term average spectra. Journal of Voice 16, 472–479 (2002)
Winkler, R., Brückl, M., Sendlmeier, W.: The aging voice: an acoustic, electroglottographic and perceptive analysis of male and female voices. In: Proc. of ICPhS 2003, Barcelona, pp. 2869–2872 (2003)
McAllister, A., Sundberg, J., Hibi, S.: Acoustic measurements and perceptual evaluation of hoarseness in children’s voices. Logopedics Phoniatrics Vocology 23, 27–38 (1998)
Kreiman, J., Gerratt, B.R.: Perception of aperiodicity in pathological voice. Journal of the Acoustical Society of America 117, 2201–2211 (2005)
Ramig, L.A.: Effects of physiological aging on vowel spectral noise. Journal of Gerontology 38, 223–225 (1983)
de Krom, G.: A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. Journal of Speech and Hearing Research 36, 224–266 (1993)
Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In: Proc. of the Institute of Phonetic Sciences, vol. 17, pp. 97–110 (1993)
Wang, C.C., Huang, H.T.: Voice acoustic analysis of normal Taiwanese adults. J. Chin. Med. Assoc. 67, 179–184 (2004)
Schötz, S.: Prosodic cues in human and machine estimation of female and male speaker age. In: Bruce, G., Horne, M. (eds.) Nordic Prosody. Proc. of the IXth Conference, Lund 2004. Frankfurt am Main: P. Lang, pp. 215–223 (2006)
Deliyski, D., Gress, C.: Intersystem reliability of MDVP for DOS and Windows 95/98. Paper presented at the 1998 Annual Convention of American Speech-Language-Hearing Association, San Antonio, Texas (1998)
Pereira Jotz, G., Cervantes, O., Abrahao, M., Settanni, F.A.P., de Angelis, E.C.: Noise-to-harmonics ratio as an acoustic measure of voice disorders in boys. Journal of Voice 16, 28–31 (2002)
Shuey, E., Herr-McCauley, J., Anders, M.: Indices of turbulence in aging voice. Presented at the annual convention of the American Speech-Language-Hearing Association (ASHA), Philadelphia, PA (November 17-20, 2004)
Jacques, R., Rastatter, M.: Recognition of speaker age from selected acoustic features as perceived by normal young and older listeners. Folia Phoniatrica (Basel) 42, 118–124 (1990)
Endres, W., Bambach, W., Flösser, G.: Voice spectrograms as a function of age, voice disguise, and voice imitation. Journal of the Acoustical Society of America 49, 1842–1848 (1971)
Linville, S.E., Fisher, H.: Acoustic characteristics of perceived versus actual vocal age in controlled phonation by adult females. Journal of the Acoustical Society of America 78, 40–48 (1985)
Rastatter, M., Jacques, R.: Formant frequency structure of the aging male and female vocal tract. Folia Phoniatrica (Basel) 42, 118–124 (1990)
Rastatter, M., McGuire, R., Kalinowski, J., Stuart, A.: Formant frequency characteristics of elderly speakers in contextual speech. Folia Phoniatrica et Logopaedica 49, 1–8 (1997)
Linville, S.E., Rens, J.: Vocal tract resonance analysis of aging voice, using long-term average spectra. Journal of Voice 15, 323–330 (2001)
Klatt, D., Klatt, L.: Analysis, synthesis, and perception of voice quality variations among female and male talkers. Journal of the Acoustical Society of America 87, 820–857 (1990)
Orlikoff, R.: Heartbeat-related fundamental frequency and amplitude variation in healthy young and elderly male voices. Journal of Voice 4, 322–328 (1990)
Sataloff, R.T., Rosen, D.C., Hawksha, M., Spiegel, J.R.: The three ages of voice: the aging adult voice. Journal of Voice 11, 156–160 (1997)
González, J.: Formant frequencies and body size of speaker: a weak relationship in adult humans. Journal of Phonetics 32, 277–287 (2004)
Braun, A., Rietveld, T.: The influence of smoking habits on perceived age. In: Proc. of ICPhS 1995. vol. 2, pp. 294–297, Stockholm (1995)
González, J., Carpi, A.: Early effect of smoking on voice: A multidimensional study. Medical Science Monitor 10, 649–656 (2004)
Wagner, A., Braun, A.: Is voice quality language-dependent? Acoustic analyses based on speakers of three different languages. In: Proc. of ICPhS 2003, Barcelona, pp. 651–654 (2003)
Roach, P.: Phonetics. Oxford University Press, Oxford (2001)
Andersson, L.G.: Språket, Vetenskapsradion [radio programme]. Article (2006) (retrieved August 24, 2006), from http://www.sr.se
Traunmüller, H., Eriksson, A.: Acoustic effects of variation in vocal effort by men, women, and children. Journal of the Acoustical Society of America 107, 3438–3451 (2000)
Pemberton, C., McCormack, P., Russell, A.: Have women’s voices lowered across time? A cross sectional study of Australian women’s voices. Journal of Voice 12, 208–213 (1998)
Decoster, W., Debruyne, F.: Longitudinal voice changes: facts and interpretation. Journal of Voice 14, 184–193 (2000)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Schötz, S. (2007). Acoustic Analysis of Adult Speaker Age. In: Müller, C. (eds) Speaker Classification I. Lecture Notes in Computer Science(), vol 4343. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74200-5_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-74200-5_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74186-2
Online ISBN: 978-3-540-74200-5
eBook Packages: Computer ScienceComputer Science (R0)