Skip to main content

A Novel Approach for Spoken Language Identification and Performance Comparison Using Machine Learning-Based Classifiers and Neural Network

  • Conference paper
  • First Online:
Proceedings of the International e-Conference on Intelligent Systems and Signal Processing

Abstract

Spoken Language Identification (SLI) is the process of capturing a type of language of a speaker. In this research paper, the used database is created in three different languages, Gujarati, Hindi, and English. Language classification is performed using features like MFCC (Mel Frequency Cepstral Coefficients), Pitch, and average energy. Accuracy values of the created database are evaluated and compared using various pattern classifiers, namely, Fine Tree, Linear Discriminant, Gaussian Naïve Bayes, Linear SVM, Fine KNN, and feed-forward neural network in MATLAB 2019. Performance using individual speech features and hybrid features are compared. Training time of all the classifiers is also evaluated to decide the best among all classifiers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. D. Deshwal, P. Sangwan, D. Kumar, Feature extraction methods in language identification: a survey. Wirel. Pers. Commun. 107 (2019)

    Google Scholar 

  2. D. Nandi, Language Identification—A Brief Review (2015)

    Google Scholar 

  3. A.B. Nassif, I. Shahin, I. Attilim, Speech recognition using deep neural networks: a systematic review. IEEE Access 7(99), 19143–19165 (2019)

    Article  Google Scholar 

  4. V. Tank, S.K. Hadia, Creation of speech corpus for emotion analysis in Gujarati language and its evaluation by various speech parameters. IJECE 10(5), 4752–4758 (2020)

    Article  Google Scholar 

  5. J. Anjana, S.S. Poorna, Language identification from speech features using SVM and LDA (2018), pp. 1–4

    Google Scholar 

  6. M. Zampieri, A. Ciobanu, L. Dinu, Native language identification on text and speech (2017)

    Google Scholar 

  7. P. Athira, S.S. Poorna, Deep learning based language identification system from speech (2019), pp. 1094–1097

    Google Scholar 

  8. S. Ishimitsu, Speech recognition method and speech recognition apparatus. J. Acoust. Soc. Am. 94(109), 3538 (2018)

    Google Scholar 

  9. Z. Song, English speech recognition based on deep learning with multiple features. Computing 102, 663–682 (2020)

    Article  MathSciNet  Google Scholar 

  10. H. Mukherjee, A. Dhar, S.K. Obaidullah, K.C. Santosh, S. Phadikar, K. Roy, A recurrent neural network-based approach to automatic language identification from speech (2020)

    Google Scholar 

  11. I. Lopez-Moreno, J. Gonzalez-Dominguez, D. Martinez, O. Plchot, J. Gonzalez-Rodriguez, P. Moreno, On the use of deep feedforward neural networks for automatic language identification. Comput. Speech Lang. 40 (2016)

    Google Scholar 

  12. C. Chandak, Z. Raeesy, A.L. Rastrow, H. Yuzong, W. Xiangyang, J. Dong, M. Roland, Streaming language identification using combination of acoustic representations and ASR hypotheses (2020)

    Google Scholar 

  13. R. Latypov, R. Nigmatullin, E. Stolov, Automatic spoken language identification by digital signal processing methods. Tatar Russ. Lang. (2017)

    Google Scholar 

  14. J. Chakraborty, S. Nath, S. Nirmala, K. Samudravijaya, Language identification of Assamese, Bengali and English Speech (2018) pp. 177–181

    Google Scholar 

  15. S. Jothilakshmi, V. Ramalingam, Sengottayan, A hierarchical language identification system for Indian languages. Digit. Signal Process. 22, 544–553 (2012)

    Google Scholar 

  16. C. Madhu, A. George, L. Mary, Automatic language identification for seven Indian languages using higher level features, in IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) (2017), pp. 1–6

    Google Scholar 

  17. S. Koolagudi, D. Rastogi, Spoken language identification using spectral features. Commun. Comput. Inf. Sci. 306, 496–497 (2012)

    Google Scholar 

  18. A. Sangwan, M. Mehrabani, J.H.L. Ansen, Automatic language analysis and identification based on speech production knowledge, in IEEE International Conference on Acoustics, Speech and Signal Processing (2010), pp. 5006–5009

    Google Scholar 

  19. S. Maity, A.K. Vuppala, K.S. Rao, D. Nandi, IITKGP-MLILSC speech database for language identification, in IEEE National Conference on Communications (NCC) (2012)

    Google Scholar 

  20. H. Ali, S.N. Tran, E. Benetos, Speaker recognition with hybrid features from a deep belief network. Neural Comput. Appl. 29(6), 13–19 (2018)

    Article  Google Scholar 

  21. K. Hwamin, P. Jeong-Sik, Automatic language identification using speech rhythm features for multi-lingual speech recognition. Appl. Sci. 10, 2225 (2020)

    Google Scholar 

  22. M. Leen, B. Yegnanarayana, Prosodic features for language identification (2008), pp. 57–62

    Google Scholar 

  23. J. Fan, N. Zhou, J. Peng, L. Gao, Hierarchical learning of tree classifiers for large-scale plant species identification. IEEE Trans. Image Process. 24(11), 4172–4184 (2015). https://doi.org/10.1109/TIP.2015.2457337

    Article  MathSciNet  MATH  Google Scholar 

  24. A.N. Younis, F.M. Remo, Distinguish musical symbol printed using the linear discriminant analysis LDA and similarity scale. Int. J. Comput. Appl. 179(47), 20–24 (2018)

    Google Scholar 

  25. I.H. Sarker, A.S.M. Kayes, P. Watters, Effectiveness analysis of machine learning classification models for predicting personalized context aware smart phone usage. J. Big Data 6(57), 01–28 (2019)

    Google Scholar 

  26. Y. Zhang, Support vector machine classification algorithm and its application, in Information Computing and Applications. ICICA 2012., ed. by C. Liu, L. Wang, A. Yang. Communications in Computer and Information Science, vol. 308 (Springer, Berlin, Heidelberg, 2012). https://doi.org/10.1007/978-3-642-34041-3_27

  27. S. Taneja, C. Gupta, K. Goyal, D. Gureja, An enhanced K-nearest neighbor algorithm using information gain and clustering, in 2014 Fourth International Conference on Advanced Computing & Communication Technologies, Rohtak (2014), pp. 325–329. https://doi.org/10.1109/ACCT.2014.22.

  28. A. Jamatia, A. Das, B. Gambäck, Deep learning-based language identification in English-Hindi-Bengali code-mixed social media corpora. J. Intell. Syst. 28(3), 399–408 (2019). https://doi.org/10.1515/jisys-2017-0440

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vishal Tank .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tank, V., Manavadaria, M., Dudhat, K. (2022). A Novel Approach for Spoken Language Identification and Performance Comparison Using Machine Learning-Based Classifiers and Neural Network. In: Thakkar, F., Saha, G., Shahnaz, C., Hu, YC. (eds) Proceedings of the International e-Conference on Intelligent Systems and Signal Processing. Advances in Intelligent Systems and Computing, vol 1370. Springer, Singapore. https://doi.org/10.1007/978-981-16-2123-9_42

Download citation

Publish with us

Policies and ethics