Abstract
This paper presents a method for the automatic alignment of Russian phonetic transcriptions that uses information about the phonetic nature of the speech sounds in the sequences being aligned. The approach was tested on 24 hours of speech data and showed a significant reduction in alignment errors compared with the commonly used Levenshtein algorithm: the error rate fell from 1.1 % to 0.27 %.
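To illustrate the general idea of phonetically informed alignment, here is a minimal sketch of a weighted edit-distance aligner. It is not the paper's algorithm: the abstract does not give the cost scheme, so the two-class split (vowel vs. consonant), the cost values, the Latin stand-in symbols, and the names `VOWELS`, `sub_cost` and `align` are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

# Hypothetical toy inventory: Latin letters stand in for phone symbols.
VOWELS = {"a", "e", "i", "o", "u"}


def sub_cost(p: str, q: str) -> float:
    """Assumed substitution cost: free for identical symbols, cheap within
    a phonetic class, expensive across classes (vowel vs. consonant)."""
    if p == q:
        return 0.0
    same_class = (p in VOWELS) == (q in VOWELS)
    return 1.0 if same_class else 2.5


def align(ref: List[str], hyp: List[str],
          indel: float = 1.0) -> List[Tuple[Optional[str], Optional[str]]]:
    """Align two symbol sequences by dynamic programming.

    Returns (ref_symbol, hyp_symbol) pairs; None marks a gap.
    """
    n, m = len(ref), len(hyp)
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i * indel
    for j in range(1, m + 1):
        dist[0][j] = j * indel
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub_cost(ref[i - 1], hyp[j - 1]),
                dist[i - 1][j] + indel,   # symbol of ref left unmatched
                dist[i][j - 1] + indel,   # symbol of hyp left unmatched
            )

    # Trace back from the bottom-right corner to recover one optimal path.
    pairs: List[Tuple[Optional[str], Optional[str]]] = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                dist[i][j] == dist[i - 1][j - 1] + sub_cost(ref[i - 1], hyp[j - 1])):
            pairs.append((ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and (j == 0 or dist[i][j] == dist[i - 1][j] + indel):
            pairs.append((ref[i - 1], None))
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs


if __name__ == "__main__":
    for pair in align(["k", "a", "t"], ["k", "t", "a"]):
        print(pair)
```

With unit costs for every operation, plain Levenshtein alignment treats substituting a vowel for a consonant as no worse than any other edit, so it can pair sounds that are unlikely to correspond. Under the assumed costs above, a cross-class substitution (2.5) is more expensive than a deletion plus an insertion (2.0), so the aligner prefers gaps over pairing a vowel with a consonant.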
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Kocharov, D. (2014). Automatic Alignment of Phonetic Transcriptions for Russian. In: Ronzhin, A., Potapova, R., Delic, V. (eds) Speech and Computer. SPECOM 2014. Lecture Notes in Computer Science, vol 8773. Springer, Cham. https://doi.org/10.1007/978-3-319-11581-8_15
DOI: https://doi.org/10.1007/978-3-319-11581-8_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11580-1
Online ISBN: 978-3-319-11581-8
eBook Packages: Computer Science (R0)