A Deep Convolutional Neural Networks Approach for Word-Level Handwritten Script Identification Using a Large Dataset

El Bahy, Siham; Aboutabit, Noureddine; Ait Mait, Hind

doi:10.1007/978-3-031-29313-9_15

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 656)

Included in the following conference series:

International Conference of Machine Learning and Computer Science Applications

Abstract

In this work, we propose a convolutional neural network (CNN) approach to word-level handwritten script identification covering six scripts: Arabic, Latin, Chinese, Bangla, Devanagari and Telugu. A large dataset of 14k word images per script was constructed from several public handwritten datasets. Three architectures were then proposed and compared on standard performance metrics and execution time. Experiments on both the validation and test sets show high performance that exceeds state-of-the-art techniques. The best result was obtained by the CNN model with three convolution-pooling layer pairs, which achieved an average script identification accuracy of 97.67% and ran in 2 ms per frame during the test phase.
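The figures in the abstract can be made concrete with a little arithmetic. The sketch below is illustrative only: the script list, the 14k images per script, and the 2 ms per frame come from the abstract, while the input size (64×128), the 3×3 unpadded convolutions, and the 2×2 pooling are assumptions chosen for the example, not values reported in the chapter. The helper names `conv_out` and `trace_shapes` are likewise hypothetical.

```python
# Figures taken from the abstract.
SCRIPTS = ["Arabic", "Latin", "Chinese", "Bangla", "Devanagari", "Telugu"]
IMAGES_PER_SCRIPT = 14_000   # "14k word images per script"
MS_PER_FRAME = 2.0           # reported test-time cost per frame


def conv_out(size: int, kernel: int, stride: int = 1, padding: int = 0) -> int:
    """Spatial size after sliding a convolution or pooling window."""
    return (size + 2 * padding - kernel) // stride + 1


def trace_shapes(height: int, width: int, pairs: int = 3,
                 conv_kernel: int = 3, pool: int = 2):
    """Feature-map size after each convolution-pooling pair.

    ASSUMPTION: kernel size, pooling size, and the absence of padding are
    illustrative; the chapter's actual hyperparameters are not given here.
    """
    shapes = [(height, width)]
    for _ in range(pairs):
        # Unpadded convolution with stride 1.
        height = conv_out(height, conv_kernel)
        width = conv_out(width, conv_kernel)
        # Non-overlapping pooling.
        height = conv_out(height, pool, stride=pool)
        width = conv_out(width, pool, stride=pool)
        shapes.append((height, width))
    return shapes


total_images = len(SCRIPTS) * IMAGES_PER_SCRIPT   # size of the balanced dataset
frames_per_second = 1000.0 / MS_PER_FRAME         # throughput implied by 2 ms per frame
shapes = trace_shapes(64, 128)                    # ASSUMED input size of 64x128 pixels

print(total_images, frames_per_second, shapes)
```

Under these assumptions the balanced dataset holds 84,000 word images, 2 ms per frame corresponds to 500 frames per second, and three convolution-pooling pairs reduce a 64×128 input to a 6×14 feature map before classification into the six script classes.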

Author information

Authors and Affiliations

IPIM Laboratory, ENSA Khouribga, Sultan Moulay Slimane University, PO Box 523, 23000, Beni Mellal, Morocco
Siham El Bahy, Noureddine Aboutabit & Hind Ait Mait

Corresponding author

Correspondence to Siham El Bahy.

Editor information

Editors and Affiliations

National School of Applied Sciences of Khouribga, Sultan Moulay Slimane University, Khouribga, Morocco
Noureddine Aboutabit
ENSIAS, Mohammed V University, Rabat, Morocco
Mohamed Lazaar
National School of Applied Sciences of Khouribga, Sultan Moulay Slimane University, Khouribga, Morocco
Imad Hafidi

About this paper

Cite this paper

El Bahy, S., Aboutabit, N., Ait Mait, H. (2023). A Deep Convolutional Neural Networks Approach for Word-Level Handwritten Script Identification Using a Large Dataset. In: Aboutabit, N., Lazaar, M., Hafidi, I. (eds) Advances in Machine Intelligence and Computer Science Applications. ICMICSA 2022. Lecture Notes in Networks and Systems, vol 656. Springer, Cham. https://doi.org/10.1007/978-3-031-29313-9_15

DOI: https://doi.org/10.1007/978-3-031-29313-9_15
Published: 07 April 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28845-6
Online ISBN: 978-3-031-29313-9
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
