Abstract
Speech corpus is an important and primary requirement for several speech tasks. Building a speech corpora is a lengthy, time consuming and expensive process, it typically involves collection of a large set of textual utterances and then selective distribution of these text utterances among a set of speakers, called speaker sheets. These speaker sheets are articulated by speakers to generate the speech corpora. Depending on the task at hand the speech corpora needs to satisfy certain criteria; For example, a phonetically balanced speech corpora is essential for building an automatic speech recognition (ASR) engine, while for a text dependent speaker recognition engine there is a need for several spoken repetition of the same text by several speakers. In this paper, we formulate a method that enables creation of speaker sheets from a predetermined set of text utterances such that the speech corpora satisfies the desired requirement.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
SPEECON, Speech-driven interfaces for consumer devices (2014). http://www.speechdat.org/speecon/index.html
Abushariah, M.A., Ainon, R.N., Zainuddin, R., Elshafei, M., Khalifa, O.O.: Phonetically rich and balanced text and speech corpora for Arabic language. Lang. Resour. Eval. 46(4), 601–634 (2012)
Pineda, L.A., Pineda, L.V., Cuétara, J., Castellanos, H., López, I.: DIMEx100: a new phonetic and speech corpus for Mexican Spanish. In: Lemaître, C., Reyes, C.A., González, J.A. (eds.) IBERAMIA 2004. LNCS (LNAI), vol. 3315, pp. 974–983. Springer, Heidelberg (2004)
Uraga, E., Gamboa, C.: VOXMEX speech database: design of a phonetically balanced corpus. In: Proceedings of the Fourth International Conference on Language Resources and Evaluation. LREC 2004, Lisbon, Portugal, May 26–28. European Language Resources Association (2004)
Asinovsky, A., Bogdanova, N., Rusakova, M., Ryko, A., Stepanova, S., Sherstinova, T.: The ORD speech corpus of Russian everyday communication “One Speaker’s Day”: creation principles and annotation. In: Matoušek, V., Mautner, P. (eds.) TSD 2009. LNCS, vol. 5729, pp. 250–257. Springer, Heidelberg (2009)
van Heerden, C., Davel, M.H., Barnard, E.: The semi-automated creation of stratified speech corpora (2013). http://www.nwu.ac.za/sites/www.nwu.ac.za/files/files/v-must/Publications/prasa2013-17.pdf
Tian, J., Nurminen, J., Kiss, I.: Optimal subset selection from text databases. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP 2005), vol. 1, pp. 305–308, March 2005
Wu, Y., Zhang, R., Rudnicky, A.: Data selection for speech recognition. In: IEEE Workshop on Automatic Speech Recognition Understanding, ASRU, pp. 562–565, December 2007. http://www.cs.cmu.edu/~yiwu/paper/asru07.pdf
Nagroski, A. Boves, L., Steeneken, H.: Optimal selection of speech data for automatic speech recognition systems. In: ICSLP, pp. 2473–2476 (2002)
Chitturi, R., Mariam, S.H., Kumar, R.: Rapid methods for optimal text selection. In: Recent Advances in Natural Language Processing, September 2005
Mandal, S., Das, B., Mitra, P., Basu, A.: Developing Bengali speech corpus for phone recognizer using optimum text selection technique. In: 2011 International Conference on Asian Language Processing (IALP), pp. 268–271, November 2011
Awaz, Y.P.: Data: Speaker sheet generation for building speech corpora (2015). https://sites.google.com/site/awazyp/data/speaker
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Patel, C., Kopparapu, S.K. (2015). A Multi-criteria Text Selection Approach for Building a Speech Corpus. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-24033-6_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)