Abstract
Speech is one of the most natural forms of vocalized communication media. Nowadays with the advancement of machine learning, different doors are opened to us for finding several standard ways to step out in the real world. ASR is just like the door to explore the concept of communication through speech between human and digital devices that can recognize speech. In this paper, we have designed a Hidden Markov Model-based isolated Bangla numerals recognition system where the Short-Term Fourier Transform is used for collecting the feature vectors. The defined system achieved 91.50% accuracy for our own dataset of 2000 uttered samples for 10 classes, which gives a satisfied result for this Bangla numerals recognition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Karpagavalli, S., Chandra, E.: Phoneme and word based model for Tamil speech recognition using GMM-HMM. In: 2015 International Conference on Advanced Computing and Communication Systems, pp. 1–5. IEEE (2015)
Abdullah-al-MAMUN, M.D., Mahmud, F.: Performance analysis of isolated Bangla speech recognition system using Hidden Markov Model. Int. J. Sci. Eng. Res. 6(1)
Hammami, N., Bedda, M., Farah, N.: Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE Conference on Sustainable Utilization and Development in Engineering and Technology (STUDENT), pp. 160–163. IEEE (2012)
Chauhan, V., Dwivedi, S., Karale, P., Potdar, S.M.: Speech to text converter using Gaussian mixture model (GMM). Int Res J Eng Technol 3(5), 160–164 (2016)
Rosdi, F., Ainon, R.N.: Isolated Malay speech recognition using Hidden Markov Models. In: 2008 International Conference on Computer and Communication Engineering, pp. 721–725. IEEE (2008)
Ali, M.A., Hossain, M., Bhuiyan, M.N.: Automatic speech recognition technique for Bangla words. Int. J. Adv. Sci. Technol. 50 (2013)
Najkar, N., Razzazi, F., Sameti, H.: A novel approach to HMM-based speech recognition systems using particle swarm optimization. Math Comput Model 52(11–12), 1910–1920 (2010)
Paul, B., Bera, S., Paul, R., Phadikar, S.: Bengali spoken numerals recognition by MFCC and GMM technique. In: International Conference on Emerging Trends and Advances in Electrical Engineering and Renewable Energy (ETAEERE-2020) (2020)
https://kevinsprojects.wordpress.com/2014/12/13/short-time-fourier-transform-using-python-and-numpy/
Bansal, P., Kant, A., Kumar, S., Sharda, A., Gupta, S.: Improved hybrid model of HMM/GMM for speech recognition. (2008)
Chavan, R.S., Sable, G.S.: An overview of speech recognition using HMM. Int J Comput Sci Mob Comput 2(6), 233–238 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Paul, B., Adhikary, D., Dey, T., Guchhait, S., Bera, S. (2022). Bangla Spoken Numerals Recognition by Using HMM. In: Das, A.K., Nayak, J., Naik, B., Dutta, S., Pelusi, D. (eds) Computational Intelligence in Pattern Recognition . Advances in Intelligent Systems and Computing, vol 1349. Springer, Singapore. https://doi.org/10.1007/978-981-16-2543-5_8
Download citation
DOI: https://doi.org/10.1007/978-981-16-2543-5_8
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2542-8
Online ISBN: 978-981-16-2543-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)