Comparison of Acoustic Adaptation Methods in Multilingual Speech Recognition Environment

Žgank, Andrej; Kačič, Zdravko; Horvat, Bogomir

doi:10.1007/978-3-540-39398-6_34

Andrej Žgank⁷,
Zdravko Kačič⁷ &
Bogomir Horvat⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2807))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

431 Accesses
2 Citations

Abstract

This paper presents the comparison of different acoustic adaptation methods in a multilingual speech recognition environment. Baseline multilingual acoustic models were generated using the tree based clustering with common phonetic broad classes. After the expert based port to a new language was performed, the influence of several adaptation methods on speech recognition performance was investigated. The target language adaptation subset contained 2% of complete speech database. The best adapted ported system had significant improvement in the speech recognition performance and its results were close to the results of pure reference monolingual system. The relationship between languages used in the mapping configuration remained unchanged after the adaptation. ...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Literature Review

Multilingual Speech Recognition for Indian Languages

Cross-Language Acoustic Modeling for Macedonian Speech Technology Applications

References

Žgank, A., Imperl, B., Johansen, F.T., Kačič, Z., Horvat, B.: Crosslingual Speech Recognition with Multilingual Acoustic Models Based on Agglomerative and Tree-Based Triphone Clustering. In: Proc. Eurospeech 2001, Aalborg, Denmark (2001)
Google Scholar
Žgank, A., Imperl, B., Johansen, F.T., Kačič, Z., Horvat, B.: Crosslingual Adaptation of Multilingual Triphone Acoustic Models. In: Proc. MSLP 2001, Aalborg, Denmark (2001)
Google Scholar
Schultz, T.: Multilinguale Spracherkennung - Kombination akustischer Modelle zur Portierung auf neue Sprachen. PhD Thesis, University of Karlsruhe, Germany (2000)
Google Scholar
Nieuwoudt, C.: Cross-language acoustic adaptation for automatic speech recognition. PhD Thesis, University of Pretoria, South Africa (2000)
Google Scholar
Leggetter, C.J., Woodland, P.C.: Flexible Speaker Adaptation using Maximum Likelihood Linear Regression. In: Proc. ARPA Spoken Language Technology Workshop, Austin, USA (1995)
Google Scholar
van den Heuvel, H., Boves, L., Moreno, A., Omologo, M., Richard, G., Sanders, E.: Annotation in the SpeechDat Projects. International Journal of Speech Technology 4(2), 127–143 (2001)
Article MATH Google Scholar
Johansen, F.T., Warakagoda, N., Lindberg, B., Lehtinen, G., Kačič, Z., Žgank, A., Elenius, K., Salvi, G.: The COST 249 SpeechDat Multilingual Reference Recogniser. In: Proc. LREC 2000, Athens, Greece (2000)
Google Scholar
Young, S.: The HTK Book (for HTK version 3.1). Cambridge University (2001)
Google Scholar
Lindberg, B., Johansen, F.T., Warakagoda, N., Lehtinen, G., Kačič, Z., Žgank, A., Elenius, K., Salvi, G.: A noise robust multilingual reference recogniser based on SpeechDat(II). In: Proc. ICSLP 2000, Beijing, China (2000)
Google Scholar
Imperl, B., Kačič, Z., Horvat, B., Žgank, A.: Clustering of triphones using phoneme similarity estimation for the definition of a multilingual set of triphones. Speech Communication 39(3-4), 353–366 (2003)
Article MATH Google Scholar
IPA Homepage (2003), http://www2.arts.gla.ac.uk/IPA/ipa.html

Download references

Author information

Authors and Affiliations

Institute of Electronics, Faculty of EE & CS, University of Maribor, Smetanova 17, SI-2000, Maribor, Slovenia
Andrej Žgank, Zdravko Kačič & Bogomir Horvat

Authors

Andrej Žgank
View author publications
You can also search for this author in PubMed Google Scholar
Zdravko Kačič
View author publications
You can also search for this author in PubMed Google Scholar
Bogomir Horvat
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of West Bohemia in Pilsen, Univerzitni 8, 30614, Plzen, Czech Republic
Václav Matoušek & Pavel Mautner &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Žgank, A., Kačič, Z., Horvat, B. (2003). Comparison of Acoustic Adaptation Methods in Multilingual Speech Recognition Environment. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2003. Lecture Notes in Computer Science(), vol 2807. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39398-6_34

Download citation

DOI: https://doi.org/10.1007/978-3-540-39398-6_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20024-6
Online ISBN: 978-3-540-39398-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Comparison of Acoustic Adaptation Methods in Multilingual Speech Recognition Environment

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Literature Review

Multilingual Speech Recognition for Indian Languages

Cross-Language Acoustic Modeling for Macedonian Speech Technology Applications

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Comparison of Acoustic Adaptation Methods in Multilingual Speech Recognition Environment

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Literature Review

Multilingual Speech Recognition for Indian Languages

Cross-Language Acoustic Modeling for Macedonian Speech Technology Applications

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation