A Decision Tree-Based Method for Speech Processing: Question Sentence Detection

Quang, Vũ Minh; Castelli, Eric; Yên, Phạm Ngọc

doi:10.1007/11881599_150

Vũ Minh Quang²³,
Eric Castelli²³ &
Phạm Ngọc Yên²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4223))

Included in the following conference series:

International Conference on Fuzzy Systems and Knowledge Discovery

1694 Accesses
5 Citations

Abstract

Retrieving pertinent parts of a meeting or a conversation recording can help for automatic summarization or indexing of the document. In this paper, we deal with an original task, almost never presented in the literature, which consists in automatically extracting questions utterances from a recording. In a first step, we have tried to develop and evaluate a question extraction system which uses only acoustic parameters and does not need any textual information from a speech-to-text automatic recognition system (called ASR system for Automatic Speech Recognition in the speech processing domain) output. The parameters used are extracted from the intonation curve of the speech utterance and the classifier is a decision tree. Our first experiments on French meeting recordings lead to approximately 75% classification rate. An experiment in order to find the best set of acoustic parameters for this task is also presented in this paper. Finally, data analysis and experiments on another French dialog database show the need of using other cues like the lexical information from an ASR output, in order to improve question detection performance on spontaneous speech.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Detecting Speech Disorders Using A Machine-Learning Guided Method in Spontaneous Tunisian Dialect Speech

Article 17 April 2024

Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications

Article 11 November 2019

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Ferrer, L., Shriberg, E., Stolcke, A.: A Prosody-Based Approach to End-of-Utterance Detection That Does Not Require Speech Recognition. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Hong Kong, vol. I, pp. 608–611 (2003)
Google Scholar
Shriberg, E., Bates, R., Stolcke, A.: A prosody-only decision-tree model for disfluency detection. In: Eurospeech 1997, Rhodes, Greece (1997)
Google Scholar
Standfpord, V., Garofolo, J., Galibert, O., Michel, M., Laprun, C.: The NIST Smart Space and Meeting Room Projects: Signal, Acquisition, Annotation and Metrics. In: Proc of ICASSP 2003, Hong-Kong, China, Mai (2003)
Google Scholar
Wang, D., Lu, L., Zhang, H.J.: Speech Segmentation Without Speech Recognition. In: IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), April 2003, vol. I, pp. 468–471 (2003)
Google Scholar
Mana, N., Burger, S., Cattoni, R., Besacier, L., Maclaren, V., McDonough, J., Metze, F.: The NESPOLE! VoIP Multilingual Corpora in Tourism and Medical Domains. In: Eurospeech 2003, Geneva, September 1-4 (2003)
Google Scholar
Marquez, L.: Machine learning and Natural Language processing, Technical Report LSI-00-45-R, Universitat Politechnica de Catalunya (2000)
Google Scholar
Witten, I.H., Frank, E.: Data mining: Pratical machine learning tools and techniques with Java implementations. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Besacier, L., Bonastre, J.F., Fredouille, C.: Localization and selection of speaker-specific information with statistical modeling. Speech Communication 31, 89–106 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

International research center MICA, IP Hanoi – CNRS/UMI-2954, INP Grenoble, 1, Dai Co Viet, Hanoi, Viet Nam
Vũ Minh Quang, Eric Castelli & Phạm Ngọc Yên

Authors

Vũ Minh Quang
View author publications
You can also search for this author in PubMed Google Scholar
Eric Castelli
View author publications
You can also search for this author in PubMed Google Scholar
Phạm Ngọc Yên
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, Nanyang Technological University,, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
Life Science Research Center, School of Electronic Engineering, Xidian University,, 710071, Xi’an, Shaanxi, China
Licheng Jiao
School of Electrical and Electronic Engineering, Xidian University, 710071, Xi’an, China
Guanming Shi
School of Information Technology and Electrical Engineering, The University of Queensland, 4072, Brisbane, Queensland, Australia
Xue Li
College of Mathematics and Information Science, Hebei Normal University, 050016, Shijiazhuang, Hebei, P.R. China
Jing Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Quang, V.M., Castelli, E., Yên, P.N. (2006). A Decision Tree-Based Method for Speech Processing: Question Sentence Detection. In: Wang, L., Jiao, L., Shi, G., Li, X., Liu, J. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2006. Lecture Notes in Computer Science(), vol 4223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881599_150

Download citation

DOI: https://doi.org/10.1007/11881599_150
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45916-3
Online ISBN: 978-3-540-45917-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Decision Tree-Based Method for Speech Processing: Question Sentence Detection

Abstract

Chapter PDF

Similar content being viewed by others

Detecting Speech Disorders Using A Machine-Learning Guided Method in Spontaneous Tunisian Dialect Speech

Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Decision Tree-Based Method for Speech Processing: Question Sentence Detection

Abstract

Chapter PDF

Similar content being viewed by others

Detecting Speech Disorders Using A Machine-Learning Guided Method in Spontaneous Tunisian Dialect Speech

Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation