Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio

Nouza, Jan; Blavka, Karel; Bohac, Marek; Cerva, Petr; Zdansky, Jindrich; Silovsky, Jan; Prazak, Jan

doi:10.1007/978-3-642-27978-2_3

Jan Nouza²,
Karel Blavka²,
Marek Bohac²,
Petr Cerva²,
Jindrich Zdansky²,
Jan Silovsky² &
…
Jan Prazak²

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 247))

Included in the following conference series:

International Workshop on Multimedia for Cultural Heritage

649 Accesses
13 Citations

Abstract

The Czech Radio archive of spoken documents is considered one of the gems of the Czech cultural heritage. It contains the largest collection (more than 100.000 hours) of spoken documents recorded during the last 90 years. We are developing a complex platform that should automatically transcribe a significant portion of the archive, index it and eventually prepare it for full-text search. The four-year project supported by the Czech Ministry of culture is challenging in the way that it copes with huge volumes of data, with historical as well as contemporary language, a rather low signal quality in case of old recordings, and also with documents spoken not only in Czech but also in Slovak. The technology used includes speech, speaker and language recognition modules, speaker and channel adaptation components, tools for data indexation and retrieval, and a web interface that allows for public access to the archive. Recently, a demo version of the platform is available for testing and searching in some 10.000 hours of already processed data.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks

Article Open access 09 August 2022

Evalita 2011: Automatic Speech Recognition Large Vocabulary Transcription

Grappling with Web Technologies: The Problems of Remote Speech Recording

Keywords

References

Hayashi, Y., et al.: Speech-based and video-supported indexing multimedia broadcast news. In: Proc. ACM SIGIR (2003)
Google Scholar
Ordelman, R., de Jong, F., Huijbregts, M., van Leeuwen, D.: Robust audio indexing for Dutch spoken word collections. In 16th Int. Conference of the Association for History and Computing, Humanities, Computers and Cultural Heritage, Amsterdam, pp. 215–223 (2005)
Google Scholar
Hansen, J.H.L., Huang, R., Zhou, B., Seadle, M., Deller, J.R., Gurijala, A.R., Kurimo, M., Angkititrakul, P.: SpeechFind: Advances in Spoken Document Retrieval for a National Gallery of the Spoken Word. IEEE Trans. on Speech and Audio Processing 13(5), 712–730 (2005)
Article Google Scholar
Byrne, W., et al.: Automatic recognition of spontaneous speech for access to multilingual oral history archives. IEEE Trans. Speech Audio Process. 12(4), 420–435 (2004)
Article Google Scholar
Nouza, J., Zdansky, J., Cerva, P., Kolorenc, J.: A System for Information Retrieval from Large Records of Czech Spoken Data. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS (LNAI), vol. 4188, pp. 485–492. Springer, Heidelberg (2006)
Chapter Google Scholar
Nouza, J., Zdansky, J., Cerva, P.: System for automatic collection, annotation and indexing of Czech broadcast speech with full-text search. In: 15th IEEE Mediterranean Electrotechnical Conference (MELECON 2010), Malta, pp. 202–205 (2010)
Google Scholar
Nouza, J., Zdansky, J.: Automatic Alignment between Speech Records and Their Text Transcriptions for Audio Archive Indexing and Searching. In: 6th IEEE Conference on Informatics and Systems, pp. MM6–MM12. IEEE, Egypt (2008)
Google Scholar
FFmpeg converter program, http://www.ffmpeg.org/
MySQL platform, http://www.mysql.com/
SPHINX platform, http://sphinxsearch.com/
Demo of APAP platform, http://ahmed.ite.tul.cz/demo/
Nouza, J., Silovsky, J., Zdansky, J., Cerva, P., Kroul, M., Chaloupka, J.: Czech-to-Slovak Adapted Broadcast News Transcription System. In: Proc. of Interspeech 2008, Australia, pp. 2683–2686 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Information Technology and Electronics, Technical Univesity of Liberec, Studentska 2, 461 17, Liberec, Czech Republic
Jan Nouza, Karel Blavka, Marek Bohac, Petr Cerva, Jindrich Zdansky, Jan Silovsky & Jan Prazak

Authors

Jan Nouza
View author publications
You can also search for this author in PubMed Google Scholar
Karel Blavka
View author publications
You can also search for this author in PubMed Google Scholar
Marek Bohac
View author publications
You can also search for this author in PubMed Google Scholar
Petr Cerva
View author publications
You can also search for this author in PubMed Google Scholar
Jindrich Zdansky
View author publications
You can also search for this author in PubMed Google Scholar
Jan Silovsky
View author publications
You can also search for this author in PubMed Google Scholar
Jan Prazak
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Ingegneria dell’Informazione, Università degli Studi di Modena e Reggio Emilia, Via Vignolese 905/b, 41125, Modena, Italy
Costantino Grana & Rita Cucchiara &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nouza, J. et al. (2012). Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio. In: Grana, C., Cucchiara, R. (eds) Multimedia for Cultural Heritage. MM4CH 2011. Communications in Computer and Information Science, vol 247. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27978-2_3

Download citation

DOI: https://doi.org/10.1007/978-3-642-27978-2_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27977-5
Online ISBN: 978-3-642-27978-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio

Abstract

Chapter PDF

Similar content being viewed by others

Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks

Evalita 2011: Automatic Speech Recognition Large Vocabulary Transcription

Grappling with Web Technologies: The Problems of Remote Speech Recording

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Voice Technology to Enable Sophisticated Access to Historical Audio Archive of the Czech Radio

Abstract

Chapter PDF

Similar content being viewed by others

Lahjoita puhetta: a large-scale corpus of spoken Finnish with some benchmarks

Evalita 2011: Automatic Speech Recognition Large Vocabulary Transcription

Grappling with Web Technologies: The Problems of Remote Speech Recording

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation