1 Introduction

Document image analysis is the process of interpreting digital documents. It may involve textual processing (OCR, layout analysis) and graphical processing (line, region, and shape processing), and the documents themselves may be synthetic (rendered) or scanned. The specialized technology named optical character recognition (OCR) is used to read the characters of rendered data, whether scanned or printed. Most languages are written in one of two styles, cursive or non-cursive. In a non-cursive style, character recognition is not a major problem, but in cursive text it is an evident challenge to separate the characters and have a classifier recognize them. Moreover, datasets for most cursive scripts are not publicly available for evaluation purposes. OCR is a significant area of document image analysis: it reads document images and translates them into searchable text. OCR systems are capable of recognizing characters, and thereby words and sentences, from a document image. Researchers have shown great interest over the past decade in addressing the potential problems that exist in this particular field [1,2,3,4,5,6,7,8].

In the field of image analysis, a dataset is a collection of document images. Proposing a dataset is a primary step in the development of a reliable OCR system, and the role of datasets is crucial in evaluating the performance of state-of-the-art techniques. Various Latin datasets exist, such as IAM offline [9], MNIST [3], and UNLV-ISRI [10], for Latin word and character recognition. Arabic-script-based languages such as Arabic, Urdu, Persian, Sindhi, Pashto, Balochi, Uighur, and Jawi are used by a considerable portion of the world's population. Character recognition for Arabic-script-based languages is still considered a challenging OCR task due to the complexity of the script: cursive writing, the multiplicity of character shapes (graphemes), the joiner and non-joiner property of letters, and overlap within a ligature and between adjacent ligatures. Moreover, there are various writing styles for Arabic-script-based languages, the most common being Naskh (for Arabic, Sindhi, and Pashto) and Nasta’liq (for Urdu, Persian, and Pakistani Punjabi).

The Nasta’liq writing style is more difficult than Naskh due to its additional complexity (diagonality, filled loops, false loops, the absence of a fixed baseline, and large variation of words/subwords) [11,12,13]. Work done for Arabic cannot be applied directly to the Nasta’liq writing style, and no significant OCR system specialized for the Urdu language has been reported to date. However, there has been increasing interest in the research community, especially from the Indian subcontinent, in addressing the Urdu Nasta’liq OCR problem. The rest of this paper is organized as follows. Section 2 summarizes datasets for Arabic-script-based languages, and Sect. 3 describes the different challenges that exist in the Urdu language; these challenges suggest research directions, especially for Urdu. Section 4 presents the acquisition of the proposed Urdu Nasta’liq dataset and the pre-processing applied to the acquired data. Section 5 elaborates the scenarios in which the proposed dataset can be evaluated. The dataset evaluation, followed by a discussion of the performed experiments, is detailed in Sect. 6. The results are presented in Sect. 7, while Sect. 8 concludes our effort and proposes recommendations for future work.

2 Related work

Datasets for Arabic-script-based languages [2, 14, 15] are limited and do not address the problems of cursive script in detail. Some efforts have been reported for Arabic character recognition [14, 16,17,18]. The different datasets available for Arabic scripts are summarized in Table 1, but unfortunately there is no publicly available handwritten Nasta’liq dataset for the research community. The character set is almost the same for both scripts (i.e., Naskh and Nasta’liq), but a Naskh dataset cannot serve as a substitute for Nasta’liq due to the complexity involved in the latter script. Efforts are being made to standardize a dataset of the Urdu language for the purpose of comparing different state-of-the-art techniques. One such effort was made by the Centre for Pattern Recognition and Machine Intelligence (CENPARMI) [19], which developed a handwritten database from different sources consisting of isolated digits, 44 isolated characters, numeral strings, 57 words (related to the finance field), five special symbols, and dates in Urdu Nasta’liq. Another effort was reported by the Image Understanding and Pattern Recognition group at the Technical University of Kaiserslautern, Germany, which generated synthetic data of the Urdu language whose contents were taken from Jang [20], a leading Urdu newspaper of Pakistan. The Jang newspaper prints its text, covering political, social, and religious issues, in the Alvi Nasta’liq script.

Table 1 Summary of Arabic, Persian, and Urdu databases

Another very useful dataset was prepared by [21], who reported cursive Urdu character recognition results using bidirectional long short-term memory (BLSTM) networks. They performed experiments on synthetic data gathered from one of the leading newspapers of Pakistan, applied a sliding-window approach to extract feature values that were fed to the classifier, and reported good accuracies both with and without position information of Urdu characters. Essoukri et al. [4] developed an Arabic relational database for Arabic OCR systems named ARABASE and evaluated two of their systems using it: one on printed Arabic writing using the generalized Hough transform, and the other on handwritten Arabic script using a planar hidden Markov model. Writers were asked to provide their names as authors; this information was also used in the preparation of the database for offline Arabic handwriting (AHDB) [22], an open-source database. Its samples were collected from 100 writers, and it contains the words used in writing legal amounts in banks, together with the most popular Arabic words. Another freely available dataset, CENPARMI, consists of 3000 handwritten cheque images [23] with 29,498 labeled subword images, 15,175 digit images, and 2499 legal-amount images. The database is designed to facilitate automatic cheque-reading research for the banking and finance sectors.

3 Challenges in Urdu language

The Urdu language is written from right to left, like Arabic and Persian, as represented in Fig. 1. Every character can occur in isolation or at the initial, middle, or final position in a word. The joining of letters reflects the cursive nature of such languages: owing to the joiner and non-joiner property of cursive text, a character may or may not join with its preceding and/or subsequent letter. A joiner character may appear at the initial, middle, isolated, or final position, as represented in Fig. 2. At the middle position it joins both the preceding and subsequent letters in a word; as an initial letter it joins its subsequent letter; and as the final letter of a word it joins its preceding letter. When joiner letters occur at the initial or middle position, they may completely change their shape, as depicted in Fig. 3. A non-joiner character appears in its full shape when isolated or at the final position in a word. An Urdu word is composed of ligatures that together make the word meaningful, and associating the ligatures of the same word poses a huge challenge for researchers. For such complicated scripts, context is important to learn. Numeric characters in Urdu are written from left to right, whereas Urdu text runs from right to left, as represented in Fig. 4. In the Naskh font each character has only four shapes, but in Nasta’liq we face many more context-dependent shapes than in Naskh. The common issues of the Urdu language are largely the same as those of its ancestor languages, e.g., context sensitivity, character overlap, word segmentation, and text line segmentation, but addressing these challenges requires considerably more effort in Nasta’liq than in Naskh. Diagonality, multiple baselines, the placement and number of associated dots, kerning, stretching, and false and filled loops are challenges unique to the Urdu language (see [12] for details of the challenges in Nasta’liq) that need to be tackled. Figure 4 depicts the normal format of Urdu characters and words. Databases in the Nasta’liq script are scarce, and the UNHD offline handwritten database strengthens work on the problem of Urdu OCR, particularly in the Nasta’liq script.

Fig. 1 Urdu handwritten sentence in Nasta’liq font

Fig. 2 Urdu words with the last character appearing in its actual shape

Fig. 3 Urdu characters with their isolated, initial, middle, and final shapes as they may occur in an Urdu word [21]

Fig. 4 Representation of Urdu text with numeric characters, words, and ligatures

4 UNHD database

To support the development and evaluation of Urdu handwritten OCR, we present a comprehensive handwritten Urdu dataset named UNHD, for Urdu Nasta’liq Handwritten Dataset. The UNHD database covers all Urdu characters and ligatures with different variations; in addition, Urdu numeric data were also taken from the writers. The dataset contains ligatures comprising up to five characters and can be used for handwritten character recognition as well as writer identification. The initial data were written in red ink to ensure the integrity of the samples, where integrity refers to maintaining pixel values during noise removal: noise is usually black, so it is easy to identify when the text is a color other than black. The vocabulary of the UNHD database spans digits, numbers, and parts of speech (nouns, verbs, pronouns, etc.), all in the Urdu Nasta’liq font. Later, in the enhanced version, we acquired samples from school and college students; we also collected data from working individuals to ensure variability across handwritten samples from individuals of every field and age. The data were collected from 500 writers (both male and female).

In this way we broadened the collected text from 48 unique text lines to 700 unique text lines, including Urdu numerals and constrained Urdu handwritten samples, giving more than 6000 Urdu handwritten text lines. The text lines were written with three variations: on the baseline, without a baseline, and slanted on the page. Each individual was briefed before sampling and asked to write the provided text in a natural way; thus, each individual wrote 48 text lines. Table 2 gives a complete depiction of the gathered data for UNHD. We gave six blank pages to each individual, on which their identification and the page numbers were mentioned. Moreover, to simplify processing and the correction of slant and skew, we also provided some writers with pages with marked baselines, as shown in Fig. 5. Each individual was asked to copy the provided printed text. The UNHD text is stored with the identification of each individual, which will help in writer identification. The dataset consists of 312,000 words written by 500 candidates in a total of 10,000 lines, i.e., approximately 624 words per writer. We have tried to cover Urdu text as fully as possible, with character variability according to position and writing style. There are approximately 187,200 characters in the UNHD dataset to date (Fig. 6).

Table 2 UNHD dataset details
Fig. 5 Pre-processing of an Urdu text document. a Actual data acquired using a scanner, with identification number and page number. b Noise and baseline removed. c Grayscale, deskewed image, which is eventually passed to the text line segmentation program. d Segmented text line

Fig. 6 Different samples of input images with and without noise

The UNHD dataset can be accessed at https://sites.google.com/site/researchonurdulanguage.

4.1 Image acquisition and pre-processing

The following pre-processing steps were applied to the UNHD database: removal of the baseline (if present in the document) and of noise, grayscale conversion, skew detection and correction, text line segmentation (shown in Fig. 5), and database labeling/annotation. Image acquisition is the first step in the character recognition workflow: the image is acquired using hardware such as scanners, cameras, or tablets. The pages were scanned with an HP Scanjet 200 flatbed scanner at 300 dpi (ideal for the subsequent steps of OCR processing). Each line is extracted automatically from a given page and labeled with nine characters comprising digits and the dash “-” symbol, i.e., ddd-dd-dd (d = 0, 1, 2, ..., 9). In this coding, the first three digits represent the writer’s id (e.g., 001… 300), the next two digits give the page id (e.g., 01… 06), and the last two digits give the serial number (e.g., 01… 08) of the text line on the page. This coding scheme can cope with the future growth of the database and also allows its contents to be sorted and searched efficiently.
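As an illustration of this coding scheme, a small helper (hypothetical; not part of the released dataset tooling) could decompose a label such as 001-03-05 as follows:

```python
# Hypothetical sketch: parsing the ddd-dd-dd line labels described above.
def parse_label(label: str) -> dict:
    """Split a UNHD line label into writer id, page id, and line serial."""
    writer_id, page_id, serial = label.split("-")
    return {"writer": int(writer_id),   # e.g., "001" -> 1
            "page": int(page_id),       # e.g., "03"  -> 3
            "line": int(serial)}        # e.g., "05"  -> 5

print(parse_label("001-03-05"))  # {'writer': 1, 'page': 3, 'line': 5}
```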

4.1.1 Noise and skew removal

The scanned pages are first processed to remove the drawn baseline using color information, and a median filter is applied to suppress noise while preserving sharp edges. In Urdu it is cumbersome to learn each shape of every character given the variations of a single character; we therefore work directly with pixel values to accommodate the different shape information. The original image is converted to grayscale. Skew detection and correction is a crucial prerequisite for the segmentation step in OCR, and different methods have been reported in the literature [5, 13, 24, 25]. In our dataset, 100 writers wrote on pages with a drawn baseline to lessen skew in the text lines, and 200 writers wrote without a baseline. For skew correction we used the horizontal projection method [24] on the data from all 500 writers. In this method the image is projected at different angles and the variance of the horizontal projection is calculated, where the horizontal projection is the sum of each row of the image; the projection of the unskewed image will likely have the maximum variance. The stepwise detail of the whole process is depicted in Algorithm 1.

Algorithm 1 Noise and skew removal
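Since the full pseudocode of Algorithm 1 is not reproduced here, the following is a minimal sketch of the horizontal projection method as described above, assuming a binarized page image (text pixels = 1) and using NumPy and SciPy; the angle range and step are illustrative assumptions:

```python
# Sketch of skew estimation by maximizing horizontal-projection variance.
import numpy as np
from scipy.ndimage import rotate

def estimate_skew(binary_img, angles=np.arange(-10.0, 10.5, 0.5)):
    best_angle, best_var = 0.0, -1.0
    for angle in angles:
        rotated = rotate(binary_img, angle, reshape=False, order=0)
        projection = rotated.sum(axis=1)   # sum of each row of the image
        var = projection.var()             # peaky text rows => high variance
        if var > best_var:
            best_angle, best_var = angle, var
    return best_angle

def deskew(binary_img):
    # Rotate by the estimated skew angle to straighten the text lines.
    return rotate(binary_img, estimate_skew(binary_img),
                  reshape=False, order=0)
```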

4.1.2 Text line segmentation

Text lines are segmented by a projection profile in the horizontal direction. First, we apply the projection profile method to obtain the positions of the text pixels; after obtaining these positions, we segment the lines from the grayscale image. The resulting text lines, with their ground truth, are used as input to the classifier for learning the character patterns. Algorithm 2 shows the pseudocode of text line segmentation.

Algorithm 2 Text line segmentation
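A minimal sketch of projection-profile line segmentation in the spirit of Algorithm 2 (the ink threshold min_ink is an illustrative assumption):

```python
# Sketch of text line segmentation by horizontal projection profile.
# Rows whose ink count exceeds a threshold belong to a text line;
# gaps between such runs separate consecutive lines.
import numpy as np

def segment_lines(binary_img, min_ink=1):
    projection = binary_img.sum(axis=1)      # ink pixels per row
    in_line, start, lines = False, 0, []
    for row, ink in enumerate(projection):
        if ink >= min_ink and not in_line:   # a text line starts here
            in_line, start = True, row
        elif ink < min_ink and in_line:      # the text line ends here
            in_line = False
            lines.append(binary_img[start:row])
    if in_line:                              # last line runs to page bottom
        lines.append(binary_img[start:])
    return lines
```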

4.1.3 Dataset labeling

In pattern recognition and machine learning, ground truth information is used together with the actual image in supervised learning, providing the standard against which patterns in the image are learned. It is considered the backbone of supervised learning tasks and an essential part of a database for OCR segmentation experiments. To evaluate the performance of a recognition system, the ground truth must be labeled correctly. Each text line is described by a ground truth text file, and the UTF-8 encoding of every non-Latin character is declared. Like other non-Latin languages, Urdu has one UTF-8 code per character regardless of its position, yet when reading Urdu, character positions are vital for the reader to determine the word. As specified in Fig. 3, suppose the character meem occurs at different positions: there is no need to declare different UTF-8 codes for each position of meem. We have only one code for meem for every position; graphically it may take different shapes, but its UTF-8 code remains the same. We therefore label every character with its four possible positions of occurrence in an Urdu word. With reference to Fig. 3, the character is labeled meem_iso when it appears in isolation, meem_i at the initial position, meem_m at the middle position, and meem_f at the final position.
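For illustration only, this positional labeling convention can be expressed as a tiny helper (hypothetical; the released ground-truth format may differ):

```python
# Hypothetical sketch: one code point per character, four positional labels.
POSITIONS = ("iso", "i", "m", "f")  # isolated, initial, middle, final

def positional_label(char_name: str, position: str) -> str:
    """e.g., positional_label('meem', 'm') -> 'meem_m'"""
    assert position in POSITIONS
    return f"{char_name}_{position}"
```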

5 Dataset research scenarios

Based on the UNHD offline dataset, several typical document analysis tasks can be performed. Our recommendations are as follows.

  1. The dataset is stored and tagged with writer identification, so writer identification experiments can be performed on samples from 500 individuals.

  2. Gathering data from writers is a cumbersome task. The UNHD offline database can be used to apply different techniques for segmenting text lines and words into subwords/ligatures.

  3. The proposed dataset can be used to evaluate the potential of state-of-the-art techniques on cursive scripts such as Urdu.

  4. Another use of the UNHD dataset is to extract the geometrical information of every character and maintain it in a separate table per character; information captured in this way can be used to train character recognizers.

  5. Extracting ligatures from Urdu words is another research area that can be pursued with the UNHD database.

6 Dataset evaluation

The acquired dataset is applicable to any type of classifier. The classifier we used is described below.

6.1 Recurrent neural network

The classifier we propose for our recognition system is the recurrent neural network (RNN). As learned from the literature [6, 25], the basic constraint of multilayer perceptrons (MLPs) is that they map an input vector to an output vector without considering previous computations at the output unit, while an RNN has the flexibility of tracing back previous computations: history also takes part in the computations at the hidden layer. The internal state of the network is retained by looped connections that influence the output level; RNNs are thus designed to retain previous sequence information [6]. Figure 7 shows a complete depiction of the proposed system. RNNs use the feedback connections in their hidden layer to retain the most recent computations, which contribute to the weight calculation of the current node in a sequence. When the given input is complex and large, however, retaining computations over long time lags becomes difficult and information can be lost: information required over a longer period of time vanishes from the cell’s memory. To address this problem, the concept of long short-term memory (LSTM) networks was introduced [26].

Fig. 7 Proposed recognition system

In LSTM, the memory cells of the hidden layer are replaced by memory blocks with multiplicative gating units. These multiplicative units are responsible for maintaining information over long periods and discarding it when it is no longer required by the sequence. In some situations we also need to predict using future points of a sequence: to consider future context at a given point, we applied LSTM in both the forward and backward directions. We therefore applied bidirectional long short-term memory networks in the evaluation of the proposed dataset; the experiments reported below show the strength of this technique.
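As an illustration (the paper names no framework, so PyTorch here is our assumption), a one-dimensional BLSTM line recognizer with a CTC output layer might look as follows; the feature dimension matches the 30 × 1 window and the hidden size matches the 100 units reported in Sect. 7:

```python
# Illustrative sketch, not the authors' implementation: a 1-D BLSTM
# line recognizer with a CTC output layer, built with PyTorch.
import torch
import torch.nn as nn

class BLSTMRecognizer(nn.Module):
    def __init__(self, n_features=30, n_hidden=100, n_classes=100):
        super().__init__()
        # The bidirectional LSTM reads the feature sequence forward and
        # backward, giving past and future context at every time step.
        self.blstm = nn.LSTM(input_size=n_features, hidden_size=n_hidden,
                             bidirectional=True, batch_first=True)
        # Concatenated forward/backward states are mapped to character
        # classes; index 0 is reserved for the CTC "blank" label.
        self.fc = nn.Linear(2 * n_hidden, n_classes)

    def forward(self, x):  # x: (batch, time, n_features)
        out, _ = self.blstm(x)
        return self.fc(out).log_softmax(dim=-1)  # CTC expects log-probs

model = BLSTMRecognizer()
ctc_loss = nn.CTCLoss(blank=0)  # CTC loss takes time-major log-probs, so
                                # permute model output to (time, batch, C)
```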

6.2 Text page segmentation

The given text page is segmented into text lines by the projection profile method. The features of an image play a crucial role in text line recognition: the image is segmented into candidate regions and features are extracted from each candidate region, as marked in Fig. 8. After pre-processing the UNHD handwritten dataset, we performed text line segmentation. A window of size 30 × 1 (normalized to x-height) traverses the given text line image; the corresponding pixel values serve as the feature values given to the classifier for learning, as shown in Fig. 7 and sketched below. To test the applicability of the UNHD handwritten dataset, we took 7200 text line images as the training set, 4800 text lines for validation, and 2400 text lines as the test set. An error rate of 6.04 to 7.93 percent was obtained on the UNHD offline dataset; these results are highly encouraging. We performed the experiments on text lines written by 500 authors to assess the potential of RNNs on handwritten cursive data, as detailed in Table 3.
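A sketch of this feature extraction, assuming the line image has already been height-normalized to 30 pixels (our reading of the x-height normalization):

```python
# Sketch: convert a height-normalized text line image (30 x width, grayscale)
# into a sequence of 30-dimensional feature vectors, one per pixel column,
# i.e., one 30 x 1 window per time step.
import numpy as np

def line_to_features(line_img: np.ndarray) -> np.ndarray:
    assert line_img.shape[0] == 30, "line must be normalized to 30 px height"
    # Transposing yields shape (time steps, 30) for the sequence classifier;
    # pixel values are scaled to [0, 1].
    return line_img.T.astype(np.float32) / 255.0
```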

Fig. 8 Window of size 30 × 1, normalized to x-height, shown in red (color figure online)

Table 3 Dataset contribution w.r.t. writers

7 Result and discussion

This section details the BLSTM evaluation performed on the UNHD offline handwritten dataset, i.e., the Urdu-Nasta’liq handwritten dataset. One-dimensional bidirectional long short-term memory (BLSTM) networks are used: a connectionist classification approach based on RNNs [25], the LSTM architecture [27], and the bidirectional recurrent neural network (BRNN) [28]. As mentioned before, BLSTM is an RNN approach considered ideal for sequence learning. The sequence is crucial to obtaining exactly the desired output, so a classifier is needed that can draw on past sequences to predict the output symbol at the current point in time. Because it manages context in this way, BLSTM is an attractive choice for learning sequential data that requires feedback from current or previous temporal sequences.

The dataset is divided into a training set, validation set, and test set in a ratio of 50% for training, 30% for validation, and the remaining 20% for testing. The numbers of authors contributing to the training, validation, and test sets are given in Table 3. The learning curve formed during the training of transcriptions is depicted in Fig. 9. As the curve shows, there is approximately a 5% difference between training and validation, and the difference gradually increases after the 52nd epoch. To avoid overtraining, we terminated learning at the 60th epoch.

Fig. 9 CTC error rate during training of transcriptions (at character level). The horizontal red line marks the optimal point, after which the gap between training and validation tends to increase (color figure online)

We trained the network with different BLSTM hidden layer sizes, as shown in Fig. 10. As learned from [5], the size of the hidden layer has an impact on learning. We obtained the best RNN performance by increasing the hidden layer to 100 neurons; the recognition error tends to increase beyond a hidden layer size of 100, meaning the optimal result is obtained with 100 hidden neurons. However, the time required to train the network grows as the number of hidden memory units increases; training time is roughly proportional to the number of units.

Fig. 10 Recognition error rate measured for different hidden layer sizes. The dotted red line indicates the best recognition (color figure online)

8 Conclusion and future work

In this paper, we proposed a new dataset for the handwritten Urdu language, the Urdu-Nasta’liq handwritten dataset (UNHD). Urdu being cursive in nature, no standard dataset for the Nasta’liq font was publicly available. The basic motive for preparing the UNHD offline database is to compile Urdu text and make it available to the research community free of cost. The data have been gathered from 500 individuals and will be extended to 1000 individuals. Currently, the UNHD database covers commonly used ligatures with different variations, in addition to Urdu numeric data. We achieved very good character accuracy, with an error rate of approximately 6.04–7.93 percent. The performance of the classifier can further be evaluated with two-dimensional BLSTM. Other future tasks include writer identification, applying different feature extraction approaches and different classifiers to recognize the text, and word recognition with the help of a dictionary and language modeling.