Abstract
Summerfield et al. [22] described a perceptual effect of temporal spectral contrast which they called the ‘Flat-spectrum Vowels’ effect. They summed the first 50 harmonics of 100 Hz with equal amplitudes, creating a signal with a flat spectrum. Three sets of adjacent harmonics were then omitted at frequencies centred on the first three formants of a vowel, creating the ‘spectral complement’ of the vowel. Stimuli were constructed in which 500-ms segments of a vowel complement preceded and followed a 500-ms segment of the flat spectrum. Listeners identified the flat spectrum as the vowel whose complement surrounded it, despite the absence of formant peaks. The effect reveals the existence of processes that enhance newly-arriving acoustical energy in relation to pre-existing energy. These processes could improve the effective signal-to-noise ratio of intermittent signals in background noises. The aims of this paper are to discuss factors that may contribute to this effect and to illustrate roles it may play in the perception of speech.
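The stimulus construction described above can be sketched in a few lines of code. This is a minimal illustration under stated assumptions, not the authors' synthesis procedure: the abstract gives the fundamental (100 Hz), the number of harmonics (50), and the segment duration (500 ms), but the sample rate, the sine starting phase, the number of harmonics omitted per formant (`half_width`), and the formant frequencies in `example_formants_hz` are all hypothetical choices made here for illustration.

```python
import math

F0_HZ = 100.0        # fundamental frequency (from the abstract)
N_HARMONICS = 50     # first 50 harmonics, equal amplitudes (from the abstract)
SEGMENT_S = 0.5      # each segment lasts 500 ms (from the abstract)
SAMPLE_RATE = 16000  # assumption: any rate above 10 kHz keeps 5 kHz below Nyquist


def complement_harmonics(formants_hz, half_width=1):
    """Harmonic numbers that remain after notching around each formant.

    half_width is an assumption: the harmonic nearest each formant is
    omitted together with half_width neighbours on either side.
    """
    omitted = set()
    for formant in formants_hz:
        centre = round(formant / F0_HZ)
        omitted.update(range(centre - half_width, centre + half_width + 1))
    return [h for h in range(1, N_HARMONICS + 1) if h not in omitted]


def synthesize(harmonics, duration_s=SEGMENT_S):
    """Sum equal-amplitude sine-phase harmonics (the phase is an assumption)."""
    n_samples = int(duration_s * SAMPLE_RATE)
    amplitude = 1.0 / N_HARMONICS  # same level per harmonic in every segment
    return [
        amplitude * sum(math.sin(2 * math.pi * h * F0_HZ * n / SAMPLE_RATE)
                        for h in harmonics)
        for n in range(n_samples)
    ]


# Hypothetical formant frequencies, chosen only to illustrate the notching.
example_formants_hz = [300.0, 2300.0, 3000.0]

kept = complement_harmonics(example_formants_hz)
complement = synthesize(kept)                      # vowel complement
flat = synthesize(range(1, N_HARMONICS + 1))       # flat spectrum
stimulus = complement + flat + complement          # complement, flat, complement
```

Keeping the per-harmonic amplitude identical across segments means that, at the transition into the flat-spectrum segment, the only components that change are those in the notches: they are the newly arriving energy to which the abstract attributes the effect.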
References
Assmann, P.F. (1985). The role of harmonics and formants in the perception of vowel quality. Unpublished Ph.D. Thesis, University of Alberta.
Cardozo, B.L. (1967). Ohm’s Law and masking. IPO Annual Progress Report, No. 2, 59–64 (Institute for Perception Research, Eindhoven, Netherlands).
Chistovich, L.A., Lublinskaya, V.V., Malinnikova, T.G., Ogorodnikova, E.A., Stoljarova, E.I., and Zhukov, S.Ja. (1982). Temporal processing of peripheral auditory patterns of speech. In The Representation of Speech in the Peripheral Auditory System, R. Carlson and B. Granström (Eds.). Elsevier, Netherlands. Pp. 165–180.
Delgutte, B. (1982). Some correlates of phonetic distinctions at the level of the auditory nerve. In The Representation of Speech in the Peripheral Auditory System, R. Carlson and B. Granström (Eds.). Elsevier, Netherlands. Pp. 131–149.
Eggermont, J.J. and Odenthal, D.W. (1974). Action potentials and summating potentials in the normal human cochlea. Acta Otolaryngologica, Supplement, 316, 39–61.
Frisina, R.D. (1983). Enhancement of responses to amplitude modulation in the gerbil cochlear nucleus: Single-unit recordings using an improved surgical approach. Special Report. ISR-S-23. (Institute for Sensory Research, Syracuse University, NY.)
Green, D.M. and Kidd, G. (1983). Further studies of auditory profile analysis. Journal of the Acoustical Society of America, 73, 1260–1265.
Haggard, M.P. (1977). Mechanisms of formant frequency discrimination. In Psychophysics and physiology of hearing, E.F. Evans and J.P. Wilson (Eds.). Academic Press. Pp. 499–507.
Kiang, N.Y.S., Watanabe, T., Thomas, E.C., and Clark, L.F. (1965). Response patterns of single fibers in the cat’s auditory nerve. MIT Press, Cambridge, MA.
Klatt, D.H. (1980). Software for a cascade/parallel formant synthesizer. Journal of the Acoustical Society of America, 67, 971–995.
Klatt, D.H. (1982). Speech processing strategies based on auditory models. In The Representation of Speech in the Peripheral Auditory System, R. Carlson and B. Granström (Eds.). Elsevier, Netherlands. Pp. 181–196.
Kubovy, M. and Jordan, R. (1979). Tone segregation by phase: On the phase sensitivity of the single ear. Journal of the Acoustical Society of America, 66, 100–106.
Moller, A. (1975). Dynamic properties of excitation and inhibition in the cochlear nucleus. Acta Physiologica Scandinavica, 93, 442–454.
Moore, B.C.J., and Glasberg, B.R. (1983). Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. Journal of the Acoustical Society of America, 74, 750–753.
Nocerino, N., Soong, F.K., Rabiner, L.R., and Klatt, D.H. (1985). Comparative study of several distortion measures for speech recognition. Speech Communication, 4, 317–331.
Plomp, R. (1976). Aspects of Tone Sensation. Academic, London.
Scheffers, M.T.M. (1979). The role of pitch in perceptual separation of simultaneous vowels. IPO Annual Progress Report, 14, 51–54 (Institute for Perception Research, Eindhoven, Netherlands).
Scheffers, M.T.M. (1983). Identification of synthesized vowel sounds in a noise background. IPO Manuscript No. 450/11 (Institute for Perception Research, Eindhoven, Netherlands).
Smith, R.L. (1979). Adaptation, saturation, and physiological masking in single auditory nerve fibres. Journal of the Acoustical Society of America, 65, 166–178.
Smith, R.L., Brachman, M.L., and Frisina, R.D. (1985). Sensitivity of auditory-nerve fibres to changes in intensity: A dichotomy between decrements and increments. Journal of the Acoustical Society of America, 78, 1310–1316.
Spiegel, M.F., Picardi, M.C. and Green, D.M. (1981). Signal and masker uncertainty in intensity discrimination. Journal of the Acoustical Society of America, 70, 1015–1019.
Summerfield, Q., Haggard, M.P., Foster, J. and Gray, S. (1984). Perceiving vowels from uniform spectra: phonetic exploration of an auditory after-effect. Perception and Psychophysics, 35, 203–213.
Summerfield, Q., Sidwell, A., and Nelson, T. (1986). Auditory enhancement of changes in spectral amplitude. Journal of the Acoustical Society of America (in press).
Viemeister, N.F. (1980). Adaptation of masking. In Psychophysical, Physiological, and Behavioural Studies in Hearing, G. v.d. Brink and F.A. Bilsen (Eds.). Delft University Press. Pp. 190–199.
Viemeister, N.F. and Bacon, S. (1982). Forward masking by enhanced components in harmonic complexes. Journal of the Acoustical Society of America, 71, 1502–1507.
Westerman, L.A. (1985). Adaptation and recovery of auditory nerve responses. Special Report. ISR-S-24. (Institute for Sensory Research, Syracuse University, NY).
Wilson, J.P. (1970). An auditory after-image. In Frequency Analysis and Periodicity Detection in Hearing, R. Plomp and G.F. Smoorenburg (Eds.). A.W. Sijthoff, Leiden, Netherlands.
Zwicker, E. (1964). ‘Negative afterimage’ in hearing. Journal of the Acoustical Society of America, 36, 2413–2415.
Zwicker, E. (1965). Temporal effects in simultaneous masking and loudness. Journal of the Acoustical Society of America, 38, 132–141.
Zwicker, U.T. (1984). Auditory recognition of diotic and dichotic vowel pairs. Speech Communication, 3, 265–277.
Copyright information
© 1987 Martinus Nijhoff Publishers, Dordrecht
About this chapter
Cite this chapter
Summerfield, Q., Assmann, P. (1987). Auditory Enhancement in Speech Perception. In: Schouten, M.E.H. (eds) The Psychophysics of Speech Perception. NATO ASI Series, vol 39. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-3629-4_10
DOI: https://doi.org/10.1007/978-94-009-3629-4_10
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-8123-8
Online ISBN: 978-94-009-3629-4
eBook Packages: Springer Book Archive