Automatic Alignment of Phonetic Transcriptions for Russian

Kocharov, Daniil

doi:10.1007/978-3-319-11581-8_15

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8773)

Included in the following conference series:

International Conference on Speech and Computer

Abstract

This paper presents a method for the automatic alignment of Russian phonetic transcriptions that uses information about the phonetic nature of the speech sounds in the aligned transcription sequences. The approach was tested on 24 hours of speech data and showed a significant reduction in alignment errors compared with the commonly used Levenshtein algorithm: the error rate fell from 1.1% to 0.27%.
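The contrast drawn in the abstract, between plain Levenshtein alignment and alignment informed by the phonetic nature of the sounds, can be illustrated with a minimal sketch. The code below is not the method from the paper. It assumes a toy two-class inventory (vowel vs. consonant) and hypothetical substitution costs (`VOWELS`, `sub_cost`, `unit_cost`, and `align` are illustrative names), and it only shows how a class-aware substitution cost can change which symbols get paired.

```python
# Hypothetical phoneme classes: a toy vowel set, not the paper's inventory.
VOWELS = set("aeiou")


def unit_cost(a, b):
    """Plain Levenshtein substitution cost: every mismatch costs 1."""
    return 0.0 if a == b else 1.0


def sub_cost(a, b):
    """Assumed class-aware cost: 0 if equal, 1 within a class, 2 across classes."""
    if a == b:
        return 0.0
    same_class = (a in VOWELS) == (b in VOWELS)
    return 1.0 if same_class else 2.0


def align(ref, hyp, gap=1.0, cost=sub_cost):
    """Align two symbol sequences by dynamic programming.

    Returns (total_cost, pairs); None in a pair marks an insertion or deletion.
    """
    n, m = len(ref), len(hyp)
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = i * gap
    for j in range(1, m + 1):
        d[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j - 1] + cost(ref[i - 1], hyp[j - 1]),  # substitute/match
                d[i - 1][j] + gap,                               # delete from ref
                d[i][j - 1] + gap,                               # insert from hyp
            )
    # Trace back one optimal path, preferring the diagonal move on ties.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and d[i][j] == d[i - 1][j - 1] + cost(ref[i - 1], hyp[j - 1]):
            pairs.append((ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and d[i][j] == d[i - 1][j] + gap:
            pairs.append((ref[i - 1], None))
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))
            j -= 1
    pairs.reverse()
    return d[n][m], pairs


if __name__ == "__main__":
    ref, hyp = ["k", "a", "t"], ["k", "o"]
    print(align(ref, hyp, cost=unit_cost))  # unit costs
    print(align(ref, hyp, cost=sub_cost))   # class-aware costs
```

With the toy input above, both cost functions yield the same total cost, but the unit-cost alignment pairs the consonant "t" with the vowel "o", while the class-aware version pairs "a" with "o" and treats "t" as deleted. The costs here are integers stored as floats, so the exact equality checks in the traceback are safe; real-valued costs would need a tolerance or stored back-pointers.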

Author information

Authors and Affiliations

Department of Phonetics, Saint-Petersburg State University, Universitetskaya Emb., 11, 199034, Saint-Petersburg, Russia
Daniil Kocharov

Editor information

Editors and Affiliations

Speech and Multimodal Interfaces Laboratory, St. Petersburg Institute of Informatics and Automation of the Russian Academy of Sciences, 39, 14th line, 199178, St. Petersburg, Russia
Andrey Ronzhin
Institute of Applied and Mathematical Linguistics, Moscow State Linguistic University, 38, Ostozhenka, 119034, Moscow, Russia
Rodmonga Potapova
Faculty of Technical Sciences, University of Novi Sad, 6, Trg Dositeja Obradovića, 21000, Novi Sad, Serbia
Vlado Delic

About this paper

Cite this paper

Kocharov, D. (2014). Automatic Alignment of Phonetic Transcriptions for Russian. In: Ronzhin, A., Potapova, R., Delic, V. (eds) Speech and Computer. SPECOM 2014. Lecture Notes in Computer Science, vol 8773. Springer, Cham. https://doi.org/10.1007/978-3-319-11581-8_15

DOI: https://doi.org/10.1007/978-3-319-11581-8_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11580-1
Online ISBN: 978-3-319-11581-8
eBook Packages: Computer Science, Computer Science (R0)
