Abstract
As you will see, this chapter is a natural continuation of the preceding one. That is, the processing techniques described in Chapter 6 were presented in order that you could better understand (1) how speech on a noisy or distorted tape recording is processed for improved message intelligibility and (2) how speech enhancement techniques can be used in support of speech decoding. Thus, the materials contained in this chapter will overlap with those found in Chapter 6. They will focus on methods that can be used to decode heard speech and convert it to a reasonably accurate and complete written copy or transcript. Please remember that, in this chapter, we are considering the decoding of materials that have been degraded in some manner (sometimes quite seriously) and not simply the secretarial process of transferring ordinary spoken dialogue into text. That process is itself difficult, but it is not nearly as challenging as those to be described here.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aull, A. M. and Zue, V. W. (1985) Lexical Stress Determination and Its Application to Large Vocabulary Speech Recognition, IEEE-ICASSP CH218:1549–1552.
Bolinger, D. L. (1958) A Theory of Pitch Accent in English, Word 14:109–149.
Bolinger, D. and Sears, D. A. (1981) Aspects of Language, 3rd. ed., New York, Harcourt Brace Jovanovich.
Broadbent, D. E. (1954) The Role of Auditory Localization in Attention and Memory Span, Psychology 47:191–196.
Carhart, Raymond (1967) Binaural Reception of Meaningful Material, in Sensorineural Hearing Processes and Disorders, (A. B. Graham, Ed.), Boston, Little, Brown, 153–168.
Cutler, A. and Foss, D. J. (1977) On the Role of Sentence Stress in Sentence Processing, Lang., Speech 20:1–10.
Fonagy, I. (1966) Electrophysiological and Acoustic Correlates of Stress and Stress Perception, J. Speech, Hear Res. 9:231–244.
Francis, W. N. (1977) International Colloquium on Automatic Dialect Mapping: A Report, Computers and the Humanities 11:339–340.
Fry, D. B. (1955) Duration and Intensity as the Physical Correlates of Lingusitic Stress, J. Acoust. Soc. Amer. 27:765–768.
Fry, D. B. (1958) Experiments in the Perception of Stress Lang., Speech 1:126–152.
Hirsh, I. (1950) The Relation Between Localization and Intelligibility, J. Acoust. Soc. Amer. 22:196–200.
Hollien, H. (1980) Vocal Indicators of Psychological Stress, in Forensic Psychology and Psychiatry (F. Wright, C. Bahn and R. Rieber, Eds.), New York, New York Academy of Sciences, 47–72.
Hollien, H. and Fitzgerald, James T. (1977) Speech Enchancement Techniques for Crime Lab Use, Proceed Internat. Confer. Crime Countermeasures, Oxford, UK, 21-29.
Hollien, H., Geisson, L. and Hicks, J. W., Jr. (1987) Data on Psychological Stress Evaluators and Voice Lie Detection J. Forensic Sciences 32:405–418.
Hollien, P. A. (1983) Utilization of Blind Decoders in Phonetics, Abst., 10th Inter. Cong. Phonetic Sciences (A. Cohen and M. Broecke, Eds.), Dordrecht, Holland, Foris Publications, 532.
Hollien, P. A. (1984) An Update on Speech Decoding, Proceed. Inst. Acoustics, Part I: Police Applic. Speech and Tape Record. Analysis, London, 33-40.
Klatt, D. (1977) Review of the DARPA Speech Understanding Project, J. Acoust. Soc. Amer. 62:1345–1365.
Lea, W. (1980) Trends in Speech Recognition, Englewood Cliffs, NJ, Prentice-Hall.
Lehiste, I. and Peterson, G. E. (1959) Vowel Amplitude and Phonetic Stress in American Speech, J. Acoust. Soc. Amer. 31:428–435.
Mole, H. and Uhlenbeck, E. M. (1956) The Linguistic Relevance of Intensity in Stress, Lingua 5:205–213.
Osberger, M. J. (1979) A Comparison Between Procedures Used to Locate Speech Segment Boundaries, J. Acoust. Soc. Amer. 50:225–228.
Parsons, T. (1976) Seperation of Speech from Interfering Speech by Means of Harmonic Selection, J. Acoust. Soc. Amer. 60:911–919.
Peterson, G. E. and Lehiste, I. (1960) Duration of Syllable Nuclei in English, J. Acoust. Soc. Amer. 32:693–703.
Rothman, Howard B. (1977) Decoding Speech from Tape Recordings, Proceed. Carnahan Conf., Crime Countermeasures, Lexington, KY, 63-67.
Taylor, D. S. (1981) Non-Native Speakers and the Rhythm of English. Internat. Rev. Applied Ling. in Language Teach. 19:219–226.
Van Coestsem, F., Hendricks, R. and McCormick, S. (1981) Accent Typology and Sound Change, Lingua 53:293–315.
Warren, R. and Warren, P. (1970) Auditory Illusions and Confusions, Scientific American 223:30–36.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1990 Springer Science+Business Media New York
About this chapter
Cite this chapter
Hollien, H. (1990). Speech Decoding and Transcripts. In: The Acoustics of Crime. Applied Psycholinguistics and Communication Disorders. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-0673-1_7
Download citation
DOI: https://doi.org/10.1007/978-1-4899-0673-1_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-0675-5
Online ISBN: 978-1-4899-0673-1
eBook Packages: Springer Book Archive