Abstract
Automatic speech recognition acknowledges the spoken words and converts them to a machine-readable format of text. By converting spoken audio into text, this technology allows users to control digital devices by speaking instead of using conventional tools like keystrokes and buttons. The challenges in speech recognition are the improvisation of the accuracy, varying user responsiveness, performance, reliability and fault tolerance. The audio signal quality affects the recognition accuracy rate. Delayed speech recognition is used to overcome the issues by user responsiveness. This is because the pronunciation of a word differs when used under different contexts. Since the world is moving at a rapid pace towards digitisation, new technologies are being developed to make lives easy. Interactive Voice Response System is an example. The Interactive Voice Response System allows the computer to interact with human by using their voices. We have proposed an Interactive Voice Response System for railway reservation system. The proposed approach uses LSTM with CTC to recognise the spoken word. The methods used in the creation of this model outperform other models where testing is done to arrive at the resultant with a better accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Dhanashri, D., Dhonde, S.B.: Speech recognition using neural networks: a review. Int. J. Multidiscip. Res. Dev. 2(6), 226–229 (2015)
Geetha, K., Dr. Vadivel, R.: Phoneme segmentation of Tamil speech signals using spectral transition measure. Orient. J. Comput. Sci. Technol. 10, 114–119 (2017)
Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural network. In: Proceedings of International Conference on Acoustics, Speech, and Signal Processing (2013)
Halageri, A., Bidappa, A., Arjun, C., Sarathy, M., Sultana, S.: Speech recognition using deep learning. Int. J. Comput. Sci. Inf. Technol. 6(3), 3206–3209 (2015)
Kim, S., Hori, T., Watanabe, S.: Joint CTC-attention based end-to-end speech recognition using multi-task learning (2017). arXiv:1609.06773v2
Lekshmi, K., Dr. Sherly, E.: Automatic speech recognition using different neural network architectures a survey. Int. J. Comput. Sci. Inf. Technol. 7(6), 2422–2427 (2016)
Liu, E.: Deep convolutional and LSTM neural networks for acoustic modelling in automatic speech recognition. In: vol. 8, no. 6. Pearson Education Inc. (2011)
Panzner, M., Cimiano, P: Comparing hidden Markov models and long short term memory neural networks for learning action representations. In: Proceedings of International Workshop on Machine Learning, Optimization, and Big Data, pp. 94–105 (2016)
Rubi, C.: Rana: review on speech recognition with deep learning method. Int. J. Comput. Sci. Mobile Comput. 4(8), 301–307 (2015)
Tebelskis, J.: Speech recognition using neural networks. In: Proceedings of CMU-CS-95-142 (1995)
Acknowledgements
We would like to thank the management of SSN College of Engineering for funding GPU system, which helps us to carry out the deep learning-related research work.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharmadha, S., Shivani, K., Shruthi, K., Bharathi, B., Kavitha, S. (2020). Automatic Speech Recognition Using Deep Neural Network. In: Reddy, V., Prasad, V., Wang, J., Reddy, K. (eds) Soft Computing and Signal Processing. ICSCSP 2019. Advances in Intelligent Systems and Computing, vol 1118. Springer, Singapore. https://doi.org/10.1007/978-981-15-2475-2_33
Download citation
DOI: https://doi.org/10.1007/978-981-15-2475-2_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2474-5
Online ISBN: 978-981-15-2475-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)