Measuring temporal compensation effect in speech perception

Kato, Hiroaki; Tsuzaki, Minoru; Sagisaka, Yoshinori

doi:10.1007/978-1-4612-2258-3_16

Hiroaki Kato,
Minoru Tsuzaki &
Yoshinori Sagisaka

Abstract

The perceptual compensation effect between neighboring speech segments is measured in various word contexts to explore the following two problems: (1) whether temporal modifications of multiple segments perceptually affect each other, and (2) which aspect of the stimulus correlates with the perceptually salient temporal markers. Experiment 1 utilizes an acceptability rating of temporal unnaturalness for words with temporal modifications. It shows that a vowel (V) duration and its adjacent consonant (C) duration can perceptually compensate each other. This finding demonstrates the presence of a time perception range wider than a single segment (V or C). The results of the first experiment also show that rating scores for compensatory modification between C and V do not depend on the temporal order of modified pairs (C-to-V or V-to-C) but rather on the loudness difference between V and C; acceptability decreases when the loudness difference between V and C becomes high. This suggests that perceptually salient markers locate around major loudness jumps. Experiment 2 further investigates the influence of the temporal order of V and C by utilizing a detection task instead of the acceptability rating.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Rhythmic and speech rate effects in the perception of durational cues

Article 12 July 2021

Rate dependent speech processing can be speech specific: Evidence from the perceptual disappearance of words under changes in context speech rate

Article 22 September 2015

Natural fast speech is perceived as faster than linearly time-compressed speech

Article 09 February 2016

References

W. N. Campbell. Multi-level timing in speech. PhD thesis, University of Sussex, Department of Experimental Psychology, 1992. Available as ATR Technical Report TR-IT-0035.
Google Scholar
R. Carlson and B. Granström. Perception of segmental duration. In A. Cohen and S. G. Nooteboom, editors, Structure and process in speech perception, pp. 90–106. Heidelberg: Springer-Verlag, 1975.
Chapter Google Scholar
W. N. Campbell and Y. Sagisaka. Moraic and syllable-level effects on speech timing. Technical Report SP 91–107, IEICE, 1991.
Google Scholar
G. Fant and A. Kruckenberg. Preliminaries to the study of Swedish prose reading and reading style. Technical Report 2, Royal Institute of Technology, 1989.
Google Scholar
H. Fujisaki, K. Nakamura, and T. Imoto. Auditory perception of duration of speech and non-speech stimuli. In G. Fant and M. A. A. Tatham, editors, Auditory Analysis and Perception of Speech, pp. 197–219. London: Academic Press, 1975.
Google Scholar
D. M. Green and J. A. Swerts. Signal Detection Theory and Psychophysics. New York: John Wiley, 1966.
Google Scholar
N. Higuchi and H. Fujisaki. Durational control of segmental features in connected speech. Technical Report S80–40, Acoust. Soc. Jpn., 1980. in Japanese with English abstract..
Google Scholar
M. Hoshino and H. Fujisaki. A study on perception of changes in segmental durations. Technical Report H83–8/S82–75, 1983.
Google Scholar
S Hiki, Y. Kanamori, and J. Oizumi. On the duration of phonemes in running speech. Journal of the Institute of Electrical Communication Engineers of Japan, 50:849–856, 1967. in Japanese.
Google Scholar
A. W. F. Huggins. Just noticeable differences for segment duration in natural speech. J. Acoust Soc. Am., 51 (4): 1270–1278, 1972.
Article ADS Google Scholar
A. W. F. Huggins. On the perception of temporal phenomena in speech. J. Acoust Soc. Am., 51(4):1279–1290, 1972.
Article ADS Google Scholar
S. Imai and T. Kitamura. Speech analysis synthesis system using the log magnitude approximation filter. Trans. Institute of Electronics and Communication Engineers, J61-A:527–534, 1978. in Japanese with English figure captions.
Google Scholar
ISO. Acoustics-method for calculating loudness level. International Organization for Standardization, ISO 532–1975(E), 1975.
Google Scholar
D. H. Klatt. Linguistic uses of segmental duration in English: acoustic and perceptual evidence. J. Acoust Soc. Am., 59:1208–1221, 1976.
Article ADS Google Scholar
N. Kaiki and Y. Sagisaka. The control of segmental duration in speech synthesis using statistical methods. In E Vatikotis-Bateson, Y Tohkura, and Y Sagisaka, editors, Speech Perception, Production and Linguistic Structure, pp. 391–402. Ohmsha (Tokyo)/ IOS Press (Amsterdam), 1992.
Google Scholar
H. Kato and M. Tsuzaki. Intensity effect on discrimination of auditory duration flanked by preceding and succeeding tones. J. Acoust. Soc. Japan (E), 15(5):349–351, 1994.
Google Scholar
H. Kato and M. Tsuzaki. Temporal discrimination of part of tone marked by two amplitude changes — comparison among on-marker and off-marker, and their combinations. Proceedings of the Fall meeting of Acoustics Society Japan, pp. 555–556, 1994.
Google Scholar
N. Kaiki, K. Takeda, and Y. Sagisaka. Linguistic properties in the control of segmental duration for speech synthesis. In G. Bailly, C. Benoît, and T. R. Sawallis, editors, Talking Machines: Theories, Models, and Designs, pp. 255–263. Amsterdam: Elsevier Science, 1992.
Google Scholar
SAS Institute Inc. The GLM procedure, SAS/STAT User’s Guide edition, 1990.
Google Scholar
H. Sato. Segmental duration and timing location in speech. Technical Report S77–31, 1977. in Japanese with English abstract and English figure captions.
Google Scholar
H. Sato. Some properties of phoneme duration in Japanese nonsense words. Proceedings of the Fall Meeting of Acoustics Society Japan, pp. 43–44, 1977. in Japanese with English figure captions.
Google Scholar
H. H. Schulze. The detectability of local and global displacements in regular rhythmic patterns. Psychological Research, 40:173–181, 1978.
Article Google Scholar
Y. Sagisaka and Y. Tohkura. Phoneme duration control for speech synthesis by rule. Transactions of the Institute of Electronics, Information and Communication Engineers of Japan, J67-A(7):629–636, 1984.
Google Scholar
Y. Sagisaka, K. Takeda, M. Abe, S. Katagiri, T. Umeda, and H. Kuwabara. A large-scale Japanese speech database. In Proceedings of the International Conference on Spoken Language Processing, Kobe, Japan, pp. 1089–1092, 1990.
Google Scholar
W. S. Torgerson. Theory and Methods of Scaling. New York: John Wiley, 1958.
Google Scholar
K. Takeda, Y. Sagisaka, and H. Kuwabara. On sentence-level factors governing segmental duration in Japanese. J. Acoust Soc. Am., 89:2081–2087, 1989.
Article ADS Google Scholar
J. P. H. van Santen. Contextual effects on vowel duration. Speech Communication, 11:513–546, 1992.
Article Google Scholar
E. Zwicker, H. Fastl, U. Widmann, K. Kurakata, S. Kuwano, and S. Namba. Program for calculating loudness according to DIN 45631 (ISO 532b). J. Acoust Soc. Japan (E), 12(l):39–42, 1991.
Google Scholar

Download references

Authors

Hiroaki Kato
View author publications
You can also search for this author in PubMed Google Scholar
Minoru Tsuzaki
View author publications
You can also search for this author in PubMed Google Scholar
Yoshinori Sagisaka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ATR Interpreting Telecommunications Research Labs, 2-2, Hikaridai, Seika-cho, Soraku-gun, 619-02, Kyoto, Japan
Yoshinori Sagisaka , Nick Campbell & Norio Higuchi , &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kato, H., Tsuzaki, M., Sagisaka, Y. (1997). Measuring temporal compensation effect in speech perception. In: Sagisaka, Y., Campbell, N., Higuchi, N. (eds) Computing Prosody. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2258-3_16

Download citation

DOI: https://doi.org/10.1007/978-1-4612-2258-3_16
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4612-7476-6
Online ISBN: 978-1-4612-2258-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Measuring temporal compensation effect in speech perception

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Rhythmic and speech rate effects in the perception of durational cues

Rate dependent speech processing can be speech specific: Evidence from the perceptual disappearance of words under changes in context speech rate

Natural fast speech is perceived as faster than linearly time-compressed speech

References

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Measuring temporal compensation effect in speech perception

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Rhythmic and speech rate effects in the perception of durational cues

Rate dependent speech processing can be speech specific: Evidence from the perceptual disappearance of words under changes in context speech rate

Natural fast speech is perceived as faster than linearly time-compressed speech

References

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation