Comparison of HMM- and SVM-based stroke classifiers for Gurmukhi script

Verma, Karun; Sharma, Rajendra Kumar

doi:10.1007/s00521-016-2309-5

Comparison of HMM- and SVM-based stroke classifiers for Gurmukhi script

Original Article
Published: 20 April 2016

Volume 28, pages 51–63, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Neural Computing and Applications Aims and scope Submit manuscript

Comparison of HMM- and SVM-based stroke classifiers for Gurmukhi script

Download PDF

Karun Verma¹ &
Rajendra Kumar Sharma¹

506 Accesses
17 Citations
Explore all metrics

Abstract

With the evolution of touch-based devices, development of handwriting recognition systems has received attention from many researchers. An online handwriting recognition system for Gurmukhi script is proposed in this paper. In this work, 74 stroke classes have been identified and implemented for character recognition of Gurmukhi script. Seventy-two different combinations of SVM- and HMM-based stroke classifiers with five different features have been experimented. The results of recognition of 35 basic characters of Gurmukhi script on a data set of 1750 Gurmukhi characters written by 10 writers have been reported using three best classifiers and a voting-based classifier built with the help of these classifiers. A character recognition rate of 96.7 % has been achieved using the voting-based classifier, whereas a recognition rate of 96.4 % has been achieved with an HMM-based classifier.

Recognition of Multi-Stroke Based Online Handwritten Gurmukhi Aksharas

Article 31 December 2014

Multi-layer Classification Approach for Online Handwritten Gujarati Character Recognition

Recognition of online unconstrained handwritten Gurmukhi characters based on Finite State Automata

Article 23 October 2018

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Handwriting is the art of drawing letters, words, or symbols on a surface with pen or pencil. A digital handwriting system captures a handwritten character by sequence of strokes. Online handwriting recognition is the process of analyzing these strokes and determining a particular character from a selected character set with a requisite degree of confidence. With the invention of affordable pen/stylus-based devices, research in the area of online handwriting recognition has received attention from many researchers [1, 2]. A limited amount of work in online character recognition of Gurmukhi has been carried out in recent past [3, 4]. However, a lot of works in handwriting recognition have been done for simplified Chinese, traditional Chinese, English, Japanese, and Korean languages [5–7]. In the literature, isolated character recognition for many Indic languages like Bangla, Hindi, Tamil, and Telugu has been reported by many researchers [8–11]. In this paper, recognition of Gurmukhi script, a popular script in northern part of India to write Punjabi language, has been attempted. This script consists of 9 vowels (laga matras), 41 consonants, $bind\bar{\imath }$, ṭ$ipp\bar{\imath }$ as two symbols for nasal sounds and addak as a symbol to duplicate the sound of a consonant. It is written from left to right. Table 1 illustrates various characters of Gurmukhi script. A sequence of points captured or drawn during a pen-down and pen-up event, while writing, is called a stroke. Gurmukhi characters are written in various styles that magnifies the difficulty in recognizing a particular character. The strokes for writing characters in Gurmukhi script can be drawn in one of the three horizontal zones, namely, upper, middle and lower zones as depicted in Fig. 1. Most of the character symbols of Gurmukhi are written in middle zone. The region above the headline denotes the upper zone, where some of the vowels () and sub-parts of some other vowels () reside, while the middle zone represents the area below the headline where the consonants and some sub-parts of vowels () are present. The lower zone represents the area below middle zone where some vowels and certain half characters lie in the foot of consonants. A Gurmukhi character can be written with a single or a combination of strokes, with stroke considered as the smallest unit. A major problem in character recognition of Gurmukhi is detection of headline and baseline. Appearance of similar looking strokes in these three zones results in formation of different characters which escalates the issue. This paper focuses on creation of a single classifier for estimation of strokes in the three zones. The solution has further been augmented with a robust postprocessing mechanism to transform the set of strokes into Gurmukhi characters.

Table 1 Chart of Gurmukhi characters

Full size table

This paper is organized into seven sections. This section introduces basic concepts behind the work. Section 2 describes work done in the area of handwriting recognition for various languages. Section 3 describes features of Gurmukhi script and how handwritten data for Gurmukhi script are collected. This section also discusses various stages of a Gurmukhi character recognition system. Section 4 explains how various features have been calculated and then organized for recognition of strokes after applying various preprocessing steps on the collected data. Section 5 describes two classifiers, namely support vector machine (SVM) and hidden Markov model (HMM) that have been used for classification of strokes in this work. This section also explains how the models are constructed from the training sets of various features as obtained in Sect. 4. Section 6 describes the postprocessing steps performed to generate Gurmukhi characters from the identified strokes. Section 7 concludes the work done with salient findings.

2 Related work

In the computing environment, keyboards had been an effective interface for interacting with computers, but with advancements in computational technology, and advent of touch-based systems, a more natural way of interacting with the computers is being explored. Handwriting is one such natural way that was pioneered and patented by Yaeger et al. [12]. This section briefly reviews the literature on handwriting recognition systems. Plamondon and Srihari [13] in their survey on online and offline handwriting recognition systems explained various processes involved, starting from input to final understanding/recognition of characters for a language. They have discussed categories of recognition methods, namely structural/rule-based methods and statistical methods in their work in order to recognize handwriting with pen based devices. Structural/rule-based methods involve with defining robust and reliable rules for recognition purposes. In Statistical methods, shape of a graphical mark known as stroke is described by fixed number of features, and the classes of these graphical marks are described by multidimensional probability distribution. Various recognition methods have been experimented by researchers across the world for handwriting recognition of scripts. These methods include artificial neural network (ANN), hidden Markov model (HMM), support vector machine (SVM), dynamic time warping (DTW), elastic matching and rule-based methods. Table 2 illustrates the recognition methods used and accuracy achieved by various researchers for different languages.

Table 2 Recognition techniques used and accuracy achieved for different languages

Full size table

3 Data collection and steps in recognition process

Verma and Sharma [4] identified and grouped the strokes as per the zone in which they appear. The number of stoke classes considered by them was 102. They took 13 stroke classes in upper zone, 82 stroke classes in middle zone and 7 stroke classes in lower zone, and proposed zone-wise classifiers in their work. In this study, a different approach for classification has been tried where a single classifier is built with all the stroke classes, irrespective of their zone of appearance. In this process, similar looking strokes have been merged and a common strokeID has been assigned to them irrespective of their zones. For example, 123 () and 131 () have been merged and assigned a common strokeID 123; 122 () and 132 () have been merged and assigned a common strokeID 122; 133 () and 134 () have been merged and assigned a common strokeID 133. This resulted in a total of 74 stroke classes, which have been used for training of classifiers in this work.

3.1 Data collection

In this work, we have addressed the problem of identification of characters of Gurmukhi script. A character in Gurmukhi script can be written with one or more stroke combinations. One of the major steps in recognition of Gurmukhi characters is recognition of these strokes. Based on recognized strokes, and their combination with nearby strokes, the final Gurmukhi character is formed. To start with, a stroke classifier needs to be trained for recognition of strokes. For this purpose, a total of 44301 samples of Gurmukhi words have been collected from 124 writers using Tablet PC device.

For each word, all strokes along with their writing sequence have been recorded in an XML format. For each stroke, x–y traces of digital PEN on the touch screen between successive pen-down and pen-up events have been recorded. The writers selected here were well-versed in writing Gurmukhi script. In order to have variations in handwriting samples, the writers were selected from various regions of Punjab (INDIA). For the purpose of training of classifiers, the strokes that were written by writers as per their correct script formation in an unconstrained environment were selected and annotated. A total of 74 stroke classes, as mentioned above, have been identified. Table 3 contains these strokes, the strokeIDs associated with them and the number of samples of these strokes used for training.

The developed online Gurmukhi character recognition system has been tested on the data collected from 10 new users. These users wrote each Gurmukhi consonant five times, giving a set of 1750 Gurmukhi characters. This data set is referred as testdata in Sect. 6.

Table 3 Gurmukhi strokes used for classification

Full size table

3.2 Steps involved in recognition of characters

This section includes a brief description of the steps involved in recognition of Gurmukhi characters.

(a)
Stroke capturing: As the first step, we capture the x–y traces of each stroke written by the writers.
(b)
Preprocessing: After capturing a stroke, the x–y traces are preprocessed for extraction of features in the next step. During preprocessing, a noise or distortion which arises due to software limitations is removed from original stroke. Normalization and centering of the stroke, where the input stroke is fitted into a fixed size window and is moved to the center location, is then performed. During writing of a stroke, some points may be missed by the touch sensor. These missing points are interpolated using Bézier interpolation technique. Jitters are removed from the stroke by averaging each point with its neighbors. After performing the preprocessing steps, we obtain 64 preprocessed x–y traces of the stroke in three different-sized windows, i.e., $200 \times 200$, $300 \times 300$, and $400 \times 400$, to investigate the effect of windows size on the performance of stroke recognition.
(c)
Feature extraction: Features required for classification of a stroke as per model chosen are extracted from 64 preprocessed x–y traces, which are used for recognition of stroke. The features used in this work are elucidated in Sect. 4.
(d)
Classification: SVM- and HMM-based classifiers have been used for classification of strokes. The results for recognition of strokes using radial basis function kernel in SVM and HMM for different feature sets are presented in Sect. 5.
(e)
Postprocessing: Zone identification is other important step in character formation, as certain strokes based on their appearance in different zones produce different characters. Zone of a stroke is identified based on its relative position compared to other strokes. Certain other postprocessing steps like rearrangement of strokes, stroke grouping/merging are applied on these recognized strokes to form the characters.

4 Feature extraction

Five different features, namely normalized x–y traces ($N_{xy}$), region-based features ($R_{xy}$), curvature features ($C_{xy}$), curvature feature-based classes ($C^N_{xy}$), and directional features ($D_{xy}$), have been considered for classification models. This section now explains briefly how these features have been extracted from the preprocessed x-y traces.

Normalized x–y traces ( $N_{xy}$ ): The 64 preprocessed traces are considered as first feature set. This feature set contains 128 elements for a given stroke sample and is referred as $N_{xy}$ feature set.
Region-based features ( $R_{xy}$ ): As explained in Sect. 3.2, each stroke is normalized to windows of size $200 \times 200$, $300 \times 300$ and $400 \times 400$. The windows were further divided into 100 small equi-sized sub-windows and labeled as $w_1$, $w_2, \ldots, $ $w_{100}$. Each point in 64 preprocessed traces will lie in one of these 100 sub-windows. The window number where the point lies is taken as a feature value. As such, this feature set will have 64 elements and is referred as $R_{xy}$ feature set.
Curvature features ($C_{xy}$ ): Curvature is defined as the degree to which something is curved. The 64 normalized x–y traces have been used to calculate curvature at each point. The curvature at a point (x, y) for the curve $y = f(x)$ is given by Eq. (1) [34].
$$\begin{aligned} R_{\mathrm{curve}}=\frac{\left( 1+ \left( \frac{\mathrm {d} y}{\mathrm {d} x}\right) ^{2}\right) ^{\frac{3}{2}}}{\frac{\mathrm {d}^2 y}{\mathrm {d} x^2}} \end{aligned}$$
(1)
The curvature at a point is calculated using three adjacent points. Since we need at least three points to calculate second-order derivative $\frac{\mathrm {d}^2 y}{\mathrm {d} x^2}$, we shall get a feature vector of size 62 from preprocessed x–y traces. The features thus obtained are referred as $C_{xy}$.
Curvature feature-based classes ( $C^N_{xy}$ ): The values of curvature, $C_{xy}$, lie in the range (0, $\infty $). These values have further been divided into 20 discrete classes based on uniform frequency of curvature values, and these classes have been assigned a code. Table 4 illustrates the coding pattern of these discrete classes along with range of curvature values. This feature set will contain 62 such values representing the class in which the curvature value appears and is referred as $C^N_{xy}$.
Directional features ( $D_{xy}$ ): To calculate directional features, 64 preprocessed x–y traces ($P_1$, $P_2, \ldots, $ $P_{64}$) on the trajectory of a stroke as shown in Fig. 2 have been used. The angle between two points $P_i$ and $P_{i+2}$, $i=1, 2, \ldots, 62$ is calculated and coded using a quantized vector consisting of 12 different values as illustrated in Table 5. This feature is referred as $D_{xy}$.

Table 4 Coding of curvature features in $C^N_{xy}$

Full size table

Table 5 Coding of directional features

Full size table

5 Classification models

The features extracted from a stroke have now been used to build a classifier for recognition of the stroke. In this section, we explain two different classifiers experimented for classification of strokes. There are 9129 samples that have been used to train the recognition engine. The two classification approaches are given in following subsections.

5.1 Support vector machine (SVM)

For the classification process, SVM with radial basis function (RBF) kernel has been used. Other parameters of SVM that have empirically been optimized in this work are learning rate ($\gamma $) and penalty parameter (C). Three preprocessed data sets having window size $200 \times 200$, $300 \times 300$, and $400 \times 400$ have been used to obtain features sets. Five different features, explained in Sect. 4, have been obtained from these preprocessed data sets. These 15 data sets were experimented for classification using k-fold cross-validation for three different values of k as mentioned in Table 6. Table 6 also contains the values of other parameters used for training SVM-based classifiers. As such, 45 different combinations have been tested with these parameters.

Table 6 SVM parameters

Full size table

It is well established that scaling of inputs affects the performance of SVM. Experiments have also been conducted to find an efficient range for scaling the input parameters that yields better accuracy. These experiments suggest to fix the scale of interval as [1, 9] since highest accuracy was achieved for this scaling. The results obtained using five features with selected SVM parameters are illustrated in Table 7. Table 7 also gives the accuracy of the model on data consisting of 920 stroke samples. These 920 samples have not been used for training of SVM classifiers.

5.2 Hidden Markov model (HMM)

The features, namely $R_{xy}$, $C^N_{xy}$, and $D_{xy}$ as explained in Sect. 4, have been used to build HMM model $\lambda = (\pi , A, B)$ for each stroke as proposed by Rabiner and Juang [35]. The other two features, $N_{xy}$, and $C_{xy},$ have not been used for building HMM models due to their large observation space. In $N_{xy}$ feature set, the number of observations equals the dimension of window size used (i.e., 200 in case of $200 \times 200$), whereas in the case of $C_{xy}$ feature set, number of observations cannot be fixed as the feature is a floating point value. The parameters $\pi $, A, and B for each strokeID have been calculated for the feature sets. For $R_{xy}$ feature set, the observation set is O = $\{1, 2, \ldots, 100\}$ and the set of states is $S= \{1, 2, \ldots, 64\}$. For training HMM model for $C^N_{xy}$ features, the observation set is $O = \{1, 2, \ldots, 20\}$, and the set of states is $S = \{1, 2, \ldots, 62\}$. For building HMM model with $D_{xy}$, the observation set is $O = \{1, 2, \ldots, 12\}$, and the set of states is $S = \{1, 2, \ldots, 62\}$. Algorithm 1 has been used to create HMM models from the three features sets. Algorithm 2 has now been used to test a sample for its matching strokeID.

As such 27 different combinations have been tested. The results obtained using the HMM models with three features are presented in Table 7. Table 7 also shows the accuracy of these models on unseen data of 920 stroke samples, which were not used for training the models.

5.3 Summary of stroke classification results

In this work, two classification models, namely SVM and HMM, have been implemented. Five different features have been experimented with these classification models, resulting into eight feature–classifier combinations. Each of these combinations has been tested for three different preprocessed window sizes, i.e., $200 \times 200$, $300 \times 300$, and $400 \times 400$. As shown in Table 7, feature–classifier combinations in $300 \times 300$ window size are performing better than feature–classifier combinations in other two window sizes. In six of the eight feature–classifier combinations, the models resulted in a performance of more than 82.3 %. The features $C_{xy}$ and $C^N_{xy}$ have a low performance (${<}39.7$ %) for SVM-based classifier. The $N_{xy}$ feature yields the highest stroke recognition accuracy of 96.5 % among SVM-based classifiers. In HMM-based classifiers, features $R_{xy}$ and $C^N_{xy}$ achieved stroke recognition rates of 95.5 and 95.0 %, respectively. Direction feature $D_{xy}$ yields the stroke recognition accuracy of 85.3 and 90.4 %, respectively, when used with SVM and HMM. It has also been observed that a very large training set is required to train a HMM model having features with large observation set, to minimize sparsity in transition and observational probability matrices of HMM. The two features $N_{xy}$ and $C_{xy}$ could not be used in training owing to data set used in this work. The three best feature–classifier combinations obtained from these experiments are $C^N_{xy}$-HMM, $R_{xy}$-HMM, and $N_{xy}$-SVM with stroke recognition accuracies of 95.0, 95.5, and 96.5 %, respectively, for window size of $300 \times 300$.

Table 7 Feature-wise accuracy using SVM- and HMM-based classifiers

Full size table

6 Postprocessing and character generation

The three best feature–classifier combinations resulted from the experiments illustrated in Sect. 5 were further used to implement the character recognition system. A voting-based classifier using these three classifiers have also been tested. After recognition of strokes using a classifier, the strokeIDs are processed further to generate characters of Gurmukhi script in postprocessing phase. As defined in Sect. 1, middle zone of Gurmukhi script is the busiest zone and hence most of the stroke classes selected for recognition in Table 3 are middle zone strokes. Some stokes with similar shape appear in different zones as illustrated in Table 8. Based on appearance of these strokes in different zones, different characters are formed. A zone detection algorithm in order to detect the position of strokes relative to middle zone stroke has been implemented. In this algorithm, original x–y traces of the strokes are used for detecting their zone. The zonal information along with strokeID is passed to the character recognition process. The scheme proposed by Kumar et al. [3] has been used to recognize the Gurmukhi script characters.

Table 8 Some issues in character formation caused by zoning

Full size table

Four character recognition systems developed using three best feature–classifier combinations and a voting-based classifier have been tested on testdata. The results obtained for this data set are depicted in Table 9. The proposed systems could achieve an average accuracy of 83.5 % for Gurmukhi character and an average accuracy of 88.0 % for character . The reason behind such a low accuracy achieved in recognition of these characters is due to confusion of strokes having strokeID 164 (), 165 () with strokeID 155 () and 156 (), respectively. Due to misclassification of these strokes, the resultant character is different from the ground truth. The system achieved 88.0 % accuracy for characters . The reason behind this low recognition rate is due to the stroke combinations, as mentioned in Table 8. Misidentification of zone of stroke is another common reason for low recognition rate of certain characters, namely , and . The system achieved an average recognition accuracy of 100.0 % for the characters , and . The average character recognition time in milliseconds (ms) are also elucidated in Table 9. The recognition time is calculated as the time taken by character recognition system from time of submission of strokes to the final character generated on screen. It is evident that HMM-based classifiers are faster than SVM-based classifiers. It is also evident from Table 9 that voting-based classifier, as expected, performs better than other classifiers with an overall character recognition accuracy of 96.7 %.

Table 9 Character-wise recognition accuracies of different models

Full size table

7 Conclusion

We have implemented three models for recognition of online handwritten Gurmukhi characters. These three models are based on SVM and HMM classifiers. A voting-based classification model based on these three models has also been implemented. We have also experimented with five different features in this work. The writing styles of Gurmukhi script users give rise to different shapes of strokes using which the characters are formed. We have carried out a brief study on the issues that are prominent in dealing with formation of confusing characters.

In an earlier study on online Gurmukhi script recognition, an accuracy of 87.4 % has been achieved using elastic matching technique by Sharma et al. [27]. They experimented with HMM models using 130 samples of each Gurmukhi character and achieved a recognition rate of 91.9 %. The HMM model was built on direction code-based feature as explained in Sect. 4. The feature used by them has eight different values of direction, whereas in the current work, we have used 12 different directions. Verma and Sharma [4] in their work on zone-based features for online Gurmukhi script achieved a recognition rate of 92.1 % for normalized diagonal zone-based features. In this work, we have achieved a higher recognition accuracy of 96.7 % for Gurmukhi script character recognition using a voting-based classifier. As expected, the average recognition time taken by voting-based classifier is more than other three classifiers. These studies are summarized in Table 10.

As a further work in this direction, the low recognition accuracies of some of the confusing characters in Gurmukhi script can be improved by adding a more robust zone detection algorithm. Different classification techniques based on multilayer perceptron and linear discriminant analysis (LDA) for different feature combinations can also be tested. One can also explore the effect of training data on recognition rates of classification models.

Table 10 Comparison of results with earlier approaches

Full size table

References

Almuallim H, Yamaguchi S (1987) A method of recognition of Arabic cursive handwriting. IEEE Trans Pattern Anal Mach Intell 9(5):715–722
Article Google Scholar
Bellegarda EJ, Bellegarda JR, Nahamoo D, Nathan KS (1993) A continuous parameter hidden markov model approach to automatic handwriting recognition, 1993. EP Patent 0,550,865
Kumar R, Sharma RK (2013) An efficient post processing algorithm for online handwriting Gurmukhi character recognition using set theory. Int J Pattern Recognit Artif Intell 27(04):1–17
Article Google Scholar
Verma K, Sharma R (2015) Performance analysis of zone based features for online handwritten Gurmukhi script recognition using support vector machine. In: Selvaraj H, Zydek D, Chmaj G (eds) Progress in systems engineering. Volume 330 of advances in intelligent systems and computing. Springer International Publishing, Berlin, pp 747–753
Google Scholar
Jäger S, Liu C-L, Nakagawa M (2003) The state of the art in Japanese online handwriting recognition compared to techniques in western handwriting recognition. Doc Anal Recognit 6(2):75–88
Article Google Scholar
Liu C-L, Jaeger S, Nakagawa M (2004) Online recognition of Chinese characters: the state-of-the-art. IEEE Trans Pattern Anal Mach Intell 26(2):198–213
Article Google Scholar
Tappert CC, Suen CY, Wakahara T (1990) The state of the art in online handwriting recognition. IEEE Trans Pattern Anal Mach Intell 12(8):787–808
Article Google Scholar
Bharath A, Madhvanath S (2012) Hmm-based lexicon-driven and lexicon-free word recognition for online handwritten Indic scripts. IEEE Trans Pattern Anal Mach Intell 34(4):670–682
Article Google Scholar
Bhattacharya N, Pal U (2012) Stroke segmentation and recognition from Bangla online handwritten text. In: IEEE 2012 international conference on frontiers in handwriting recognition (ICFHR). pp 740–745
Biswas C, Bhattacharya U, Parui SK (2012) Hmm based online handwritten Bangla character recognition using dirichlet distributions. In: IEEE 2012 international conference on frontiers in handwriting recognition (ICFHR). pp 600–605
Sundaram S, Ramakrishnan A (2013) Attention-feedback based robust segmentation of online handwritten isolated Tamil words. ACM Trans Asian Lang Inf Process (TALIP) 12(1):4
Google Scholar
Yaeger LS, Richard WFI, Pagallo GM (2009) Method and apparatus for acquiring and organizing ink information in pen-aware computer systems, 21 July 2009. US Patent 7,564,995
Plamondon R, Srihari SN (2000) Online and off-line handwriting recognition: a comprehensive survey. IEEE Trans Pattern Anal Mach Intell 22(1):63–84
Article Google Scholar
Bhattacharya U, Gupta BK, Parui SK (2007) Direction code based features for recognition of online handwritten characters of Bangla. In: IEEE ninth international conference on document analysis and recognition, 2007. ICDAR 2007, vol 1. pp 58–62
Bhowmik TK, Ghanty P, Roy A, Parui SK (2009) Svm-based hierarchical architectures for handwritten Bangla character recognition. Int J Doc Anal Recognit 12(2):97–108
Article Google Scholar
Dutta A, Chaudhury S (1993) Bengali alpha-numeric character recognition using curvature features. Pattern Recognit 26(12):1757–1770
Article Google Scholar
Parui SK, Guin K, Bhattacharya U, Chaudhuri BB (2008) Online handwritten Bangla character recognition using hmm. In: IEEE 19th international conference on pattern recognition, 2008, ICPR 2008. pp 1–4
Garcia-Salicetti S, Doizzi B, Gallinari P, Mellouk A, Fanchon D (1995) A hidden markov model extension of a neural predictive system for online character recognition. In: IEEE proceedings of the third international conference on document analysis and recognition, 1995, vol 1. pp 50–53
Hu J, Brown MK, Turin W (1996) Hmm based online handwriting recognition. IEEE Trans Pattern Anal Mach Intell 18(10):1039–1045
Article Google Scholar
Connell SD, Sinha R, Jain AK (2000) Recognition of unconstrained online Devanagari characters. In: IEEE proceedings of 15th international conference on pattern recognition, 2000, vol 2. pp 368–371
Joshi N, Sita G, Ramakrishnan A, Deepu V, Madhvanath S (2005) Machine recognition of online handwritten Devanagari characters. In: IEEE proceedings of eighth international conference on document analysis and recognition, 2005. pp 1156–1160
Bhattacharya U, Parui S, Shaw B, Bhattacharya K et al (2006) Neural combination of ann and hmm for handwritten Devanagari numeral recognition. In: Tenth international workshop on frontiers in handwriting recognition
Takahashi K, Yasuda H, Matsumoto T (1997) A fast hmm algorithm for on-line handwritten character recognition. In: IEEE proceedings of the fourth international conference on document analysis and recognition, 1997, vol 1. pp 369–375
Prasad MM, Sukumar M, Ramakrishnan A (2009) Divide and conquer technique in online handwritten Kannada character recognition. In: Proceedings of the international workshop on multilingual OCR. pp 1–7
Lehal G, Singh C (2000) A Gurmukhi script recognition system. In: IEEE proceedings of the15th international conference on pattern recognition, 2000, vol 2. pp 557–560
Sharma A, Kumar R, Sharma R (2008) Online handwritten Gurmukhi character recognition using elastic matching. In: IEEE congress on image and signal processing, 2008. CISP’08, vol 2. pp 391–396
Sharma A (2009) Online handwritten Gurmukhi character recognition. Ph.D. thesis, Thapar University
Sharma A, Kumar R, Sharma R (2010) Hmm-based online handwritten Gurmukhi character recognition. Mach Gr Vis Int J 19(4):439–449
Google Scholar
Kumar R, Sharma RK, Sharma A (2015) Recognition of multi-stroke based online handwritten Gurmukhi aksharas. Proc Natl Acad Sci India Sect A Phys Sci 85(1):159–168
Article Google Scholar
Bharath A, Madhvanath S (2007) Hidden markov models for online handwritten Tamil word recognition. In: IEEE ninth international conference on document analysis and recognition, 2007. ICDAR 2007, vol 1. pp 506–510
Joshi N, Sita G, Ramakrishnan A, Madhvanath S (2004) Comparison of elastic matching algorithms for online Tamil handwritten character recognition. In: IEEE ninth international workshop on frontiers in handwriting recognition, 2004. IWFHR-9 2004. pp 444–449
Aparna K, Subramanian V, Kasirajan M, Prakash GV, Chakravarthy V, Madhvanath S (2004) Online handwriting recognition for Tamil. In: IEEE ninth international workshop on frontiers in handwriting recognition, 2004. IWFHR-9 2004. pp 438–443
Jayaraman A, Chandra SC, Srinivasa CV (2007) Modular approach to recognition of strokes in Telugu script. In: IEEE ninth international conference on document analysis and recognition, 2007. ICDAR 2007, vol 1. pp 501–505
Kline M (2013) Calculus: an intuitive and physical approach. Courier Corporation, Chelmsford
Google Scholar
Rabiner LR, Juang B-H (1986) An introduction to hidden markov models. IEEE ASSP Mag 3(1):4–16
Article Google Scholar

Download references

Acknowledgments

We take this opportunity to extend our special thanks to Technology Development for Indian Languages (TDIL), DeitY, MoCIT, Government of India, for sponsoring this research work.

Author information

Authors and Affiliations

Computer Science and Engineering Department, Thapar University, Patiala, 147004, Punjab, India
Karun Verma & Rajendra Kumar Sharma

Authors

Karun Verma
View author publications
You can also search for this author in PubMed Google Scholar
Rajendra Kumar Sharma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Karun Verma.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Verma, K., Sharma, R.K. Comparison of HMM- and SVM-based stroke classifiers for Gurmukhi script. Neural Comput & Applic 28 (Suppl 1), 51–63 (2017). https://doi.org/10.1007/s00521-016-2309-5

Download citation

Received: 06 October 2015
Accepted: 30 March 2016
Published: 20 April 2016
Issue Date: December 2017
DOI: https://doi.org/10.1007/s00521-016-2309-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Comparison of HMM- and SVM-based stroke classifiers for Gurmukhi script

Abstract

Similar content being viewed by others

Recognition of Multi-Stroke Based Online Handwritten Gurmukhi Aksharas

Multi-layer Classification Approach for Online Handwritten Gujarati Character Recognition

Recognition of online unconstrained handwritten Gurmukhi characters based on Finite State Automata

1 Introduction

2 Related work