1 Introduction

Research on the automatic recognition of constrained and unconstrained Devanagari script began in the early 1970s. The script has a complex composition of constituent symbols: 13 vowels and 34 consonants, along with 14 vowel modifiers (see Fig. 1) (Jayadevan et al. [1]).

Fig. 1
figure 1

a Vowels and modifiers of Devanagari script. b Consonants and their corresponding half form in Devanagari script

The Devanagari script has its own specified composition rules for combining vowels, consonants and modifiers; some symbols can also be combined with symbols of the same type (see Fig. 2) (Jayadevan et al. [1]).

Fig. 2
figure 2

Combination of consonants

Another distinctive feature of the Devanagari script is the presence of a horizontal header line on the top of all characters (see Fig. 3).

Fig. 3
figure 3

Three scripts of a word in the Devanagari script

The occurrence statistics of the 20 most frequent characters in the Devanagari script, reported by Jayadevan et al. [1], are reproduced in Table 1.

Table 1 Occurrence statistics of 20 frequent characters in Devanagari script

At present, owing to the complex structures and high variability of handwriting styles, recognition of words and characters from Devanagari offline handwritten script has become an active area of research. Automatic Devanagari offline handwriting recognition systems perform several preprocessing steps that reduce variation in the handwritten script while preserving the information needed for recognition.

The remainder of this article is organized as follows: Sect. 2 briefly reviews related work, Sect. 3 defines the mathematical notation used in the descriptive algorithms, Sect. 4 discusses the methodology of the Pixel Plot and Trace and Re-plot and Re-trace (PPTRPRT) technique, Sects. 5, 6 and 7 deal with word segmentation, the Feedforward Neural Network and character segmentation, respectively, Sect. 8 presents experiments and results, and Sect. 9 concludes the paper.

2 Related work

Numerous approaches have been proposed for improving the segmentation of Devanagari offline handwritten scripts. Jayadevan et al. [1] have surveyed various feature extraction techniques as well as training, classification and matching techniques. Sahu et al. [2] used an artificial neural network for preprocessing operations and designed a system for segmentation as well as recognition of Devanagari characters. Yadav et al. [3] proposed a hybrid system combining writer recognition and handwriting recognition for the Devanagari script. Wshah et al. [4] proposed a script-independent, line-based word spotting framework for offline handwritten documents based on hidden Markov models that handles a large vocabulary without the need for word or character segmentation. Wshah et al. [5] proposed a multilingual word spotting framework based on hidden Markov models that works on a corpus of multilingual handwritten documents, with two variants: a script identifier-based (IDB) system and a script identifier-free (IDF) system. Deshpande et al. [6] evolved regular expressions for segmentation without preprocessing or training, transformed the regular expressions directly into directed graphs and supplied the symbol strings generated by the corresponding expressions to finite-state automata. Rahul et al. [7] proposed Devanagari handwritten script recognition by applying a feature vector to an artificial neural network. Pal et al. [8] proposed a lexicon-driven method for multilingual (English, Hindi and Bangla) city name recognition for Indian postal automation. Doiphode et al. [9] proposed a technique that finds joint points in a word, identifies vertical and horizontal lines and finally dissects touching characters by taking into account the height and width of the bounding box. Sankaran et al. [10] proposed a formulation that minimizes expectations on Unicode generation and error correction, modeling the harder recognition task as learning an appropriate sequence-to-sequence translation scheme. Kumar et al. [11] presented a survey of OCR for the most popular Indian scripts. Wshah et al. [12] proposed a statistical, script-independent, line-based word spotting framework for offline handwritten documents based on hidden Markov models, with an exhaustive comparison of filler models and background models for better representation of background or non-keyword text. Basu et al. [13] proposed a method to recognize postal codes written in any of four popular scripts, viz. Latin, Devanagari, Bangla and Urdu. Hassan et al. [14] proposed a framework for applying multiple features to handwriting-based identity recognition and designed a scheme for multiple-feature-based identity establishment using multi-kernel learning with a genetic algorithm.

3 Mathematical terms

The standard mathematical symbols used throughout the descriptive algorithms are summarized in Table 2.

Table 2 Mathematical symbols

4 Methodology

This section briefly discusses descriptive algorithms of PPTRPRT technique for Devanagari offline handwritten script segmentation.

4.1 Overview of system design

Descriptive architecture of the PPTRPRT technique is given in Fig. 4.

Fig. 4
figure 4

Architecture of PPTRPRT technique

The pattern set used in the current study consists of Devanagari offline handwritten script images. A Feedforward Neural Network is used to obtain the desired outcomes.

For the evaluation of the PPTRPRT technique, a large database of 49,000 samples (i.e., the Center for Microprocessor Application for Training Education and Research (CMATER) database, ICDAR-2005, the Off-line Handwritten Devanagari Numeral Database, WCACOM ET0405A-U and the HP Labs India Indic Handwriting Datasets) is compiled as the training pattern set.

The PPTRPRT technique extracts the text region from Devanagari offline handwritten script images and passes it to iterative processes for segmentation of text lines. The segmented text lines undergo skew and de-skew operations, and the resulting images are provided to white-pixel-based word segmentation; the segmented words are then used in an iterative process for character segmentation. The PPTRPRT technique thus applies several processing stages to segment characters from Devanagari offline handwritten script, and the normalization steps accommodate variations in pen width and slant of the writing.

The PPTRPRT technique initiates the segmentation algorithm.

Algorithm 1: Segmentation ()

Step 1: Load Devanagari offline handwritten script file ɛ i

Step 2: Call Preprocessing (ɛ i )

The segmentation algorithm loads the Devanagari offline handwritten script ɛ i ; the outcome of the algorithm is shown in Fig. 5.

Fig. 5
figure 5

Original text image

4.2 Preprocessing for characters segmentation

The outcomes of the segmentation algorithm are passed to the preprocessing algorithm for the subsequent operations.

Algorithm 2: Preprocessing (ɛ i )

Step 1: Extract text segment

\(\beta_{i} = \varepsilon_{i} - \rho\)

Step 2: Segment text lines from β i

   (a) Define PEAK_FACTOR (Γ i ) = 5 and THRESHOLD (Ξ i ) = 1

   (b) Obtain black and white image (Δ i )

   (c) Generate Histogram (Π i )

   (d) Calculate upper peak (MaxΓ i ) and lower peak (MinΓ i )

   (e) Calculate text line segmentation points (P j )

   (f) Segment text line (S i ) and drop white parts (∃ k )

Step 3: Call SkewCorrection (S i )

The above-defined algorithm extracts the text segment β i from ɛ i by removing the noise ρ. The text segment β i is then used for text line segmentation by calculating MaxΓ i and MinΓ i with the PEAK_FACTOR Γ i , together with several supporting operations (see Fig. 6).
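For concreteness, the following Python sketch (using NumPy and SciPy; it is not the authors' implementation, and the global threshold and minimum component size are illustrative assumptions) shows one way the text segment β i might be obtained by binarizing the page and discarding small connected components as noise ρ.

```python
import numpy as np
from scipy import ndimage

def extract_text_segment(gray, threshold=128, min_size=20):
    """Binarize a grayscale page and drop tiny connected components (noise).

    gray      : 2-D uint8 array, 0 = black ink, 255 = white paper.
    threshold : illustrative global threshold separating ink from paper.
    min_size  : components smaller than this many pixels are treated as noise.
    Returns a boolean array whose True entries mark the ink pixels of the
    text segment (beta_i = epsilon_i - rho).
    """
    ink = gray < threshold                          # black-and-white image
    labels, n = ndimage.label(ink)                  # connected ink components
    sizes = ndimage.sum(ink, labels, range(1, n + 1))
    keep_labels = 1 + np.flatnonzero(np.asarray(sizes) >= min_size)
    return np.isin(labels, keep_labels)             # noise components removed
```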

Fig. 6
figure 6

Horizontal cuts of text image

Furthermore, using MaxΓ i , MinΓ i and the image size L, we calculate the text line segmentation points (P j ) and segment the text lines (S i ) from the text segment β i by eliminating the white parts (∃ k ) according to the THRESHOLD value (see Fig. 7).
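The line-cutting steps can be sketched as follows. The exact rule combining PEAK_FACTOR and THRESHOLD is not spelled out in the algorithm listing, so the cut level max(THRESHOLD, MaxΓ i /PEAK_FACTOR) used below is an assumption made only for illustration.

```python
import numpy as np

def segment_text_lines(ink, peak_factor=5, threshold=1):
    """Split a binary page (True = ink) into text-line images S_i.

    A row histogram (Pi_i) counts the ink pixels in every row; rows whose
    count falls below the cut level are treated as white gaps, and each
    maximal run of text rows becomes one segmented line.
    """
    hist = ink.sum(axis=1)                          # histogram Pi_i
    max_peak = hist.max()                           # upper peak Max Gamma_i
    cut = max(threshold, max_peak // peak_factor)   # assumed cut rule
    is_text_row = hist > cut
    lines, start = [], None
    for r, flag in enumerate(is_text_row):
        if flag and start is None:
            start = r                               # segmentation point P_j
        elif not flag and start is not None:
            lines.append(ink[start:r])              # drop white parts
            start = None
    if start is not None:
        lines.append(ink[start:])
    return lines
```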

Fig. 7
figure 7

Segmented text lines

4.3 Skew correction

The outcomes of the preprocessing algorithm are passed to the skew correction algorithm for skew correction operations.

Algorithm 3: SkewCorrection (S i )

Step 1: Improve intensity of the black pixels (Bk)

Step 2: Find (x, y) coordinates of the lower pixels (low_bk)

Step 3: Filter irrelevant pixels F i (low_bk)

Step 4: Calculate slope angle (θ i (S i )) of the text line using linear regression

Step 5: Skew or de-skew the text line (S i ) with the slope angle (θ i (S i ))

\(S_{i}^{1} = {\text{Skew}}\,\theta_{i} \,\,{\text{or}}\,\,{\text{De-skew}}\,\theta_{i}\)

Step 6: Remove black corners from skewed text line

\(S_{i}^{2} = S_{i}^{1} - \varLambda_{k}\)

Step 7: Call SlantDetection (S 2 i , θ i )

The above-defined algorithm finds the (x, y) coordinates of the lower pixels and filters irrelevant pixels F i (low_bk) from the segmented text line (S i ). The slope angle θ i (S i ) of the segmented text line is calculated using linear regression, the line is skewed or de-skewed accordingly, and the black corners are removed by S 2 i  = S 1 i  − Λ k . The outcomes of the algorithm are shown in Figs. 8 and 9.
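A minimal sketch of the skew step is given below (NumPy/SciPy; not the authors' code). The regression is fitted over the lowest ink pixel of each occupied column as a stand-in for low_bk, the outlier filtering F i (low_bk) is omitted, and the rotation sign may need flipping depending on the image origin convention.

```python
import numpy as np
from scipy import ndimage

def deskew_text_line(line):
    """Estimate the slope of one text line by linear regression and remove it.

    line : 2-D boolean array (True = ink) of a segmented text line S_i.
    Returns the de-skewed binary line and the fitted slope angle in degrees.
    """
    rows, cols = np.nonzero(line)
    occupied = np.unique(cols)
    # lowest (largest row index) ink pixel in every occupied column
    low_y = ndimage.maximum(rows, labels=cols, index=occupied)
    slope, _ = np.polyfit(occupied, low_y, 1)       # linear regression
    angle = np.degrees(np.arctan(slope))            # slope angle theta_i(S_i)
    rotated = ndimage.rotate(line.astype(float), angle, reshape=True, order=0)
    return rotated > 0.5, angle
```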

Fig. 8
figure 8

Linear regression of the text lines

Fig. 9
figure 9

Skew corrected text lines

4.4 Slant detection and correction

The outcomes of the skew correction algorithm (the skewed text line S 2 i along with the angle θ i ) are passed to the slant detection algorithm for slant detection.

Algorithm 4: SlantDetection (S 2 i , θ i )

Step 1: Trace regional boundaries of an image ω i (S 2 i )

Step 2: Define min_stroke_length and step_size

Step 3: Do Histogram count operations (Π i )

Step 4: Find indices and value of nonzero elements

Step 5: Calculate slant angle ϕ i

Step 6: Call SlantCorrection (S 2 i , ϕ i )

The slant detection algorithm traces the regional boundaries of the image ω i (S 2 i ) using the defined min_stroke_length and step_size. Moreover, it performs histogram count operations Π i over the text line S 2 i and finds the indices and values of the nonzero elements, from which the slant angle ϕ i is calculated. The outcomes of the algorithm are shown in Fig. 10.
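The listing below is not the authors' boundary-tracing procedure; it is a commonly used alternative (a shear search that maximizes the peakiness of the column histogram) included only to make the slant angle ϕ i concrete.

```python
import numpy as np

def estimate_slant(line, angles=np.arange(-45, 46)):
    """Return an estimate of the slant angle phi_i (degrees) of a text line.

    Each candidate shear is applied to the ink pixels; the angle whose
    column histogram is most sharply peaked (vertical strokes best aligned)
    is reported as the slant.
    """
    h, w = line.shape
    rows, cols = np.nonzero(line)
    best_angle, best_score = 0.0, -np.inf
    for a in angles:
        shift = np.tan(np.radians(a)) * (h - 1 - rows)   # shear about bottom
        sheared = np.clip((cols + shift).astype(int), 0, w + h)
        hist = np.bincount(sheared).astype(float)
        score = np.sum(hist ** 2)                        # peakiness measure
        if score > best_score:
            best_angle, best_score = float(a), score
    return best_angle
```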

Fig. 10
figure 10

Slant detection of the text lines

The slant angle ϕ i and the text line S 2 i are then used for the slant correction operations.

Algorithm 5: SlantCorrection (S 2 i , ϕ i )

Step 1: Calculate Height (ξ i (S 2 i )), Width (η i (S 2 i )) and Depth (ζ i (S 2 i )) of the text line (S 2 i )

Step 2: Define new text line matrix A i (S 2 i )

Step 3: Create spatial transformation structure of the text line

\(R_{i} \left( {A_{i} \left( {S_{i}^{2} } \right)} \right)\)

\(T_{i} \left( {A_{i} \left( {S_{i}^{2} } \right)} \right)\)

\(U_{i} \left( {A_{i} \left( {S_{i}^{2} } \right)} \right)\)

Step 4: Segment final skewed text line

\(S_{i}^{3} = \left( {R_{i} - T_{i} + U_{i} } \right)\)

Step 5: Call WordSegmentation (S 3 i )

The above-defined algorithm calculates the height ξ i (S 2 i ), width η i (S 2 i ) and depth ζ i (S 2 i ) of the text line S 2 i . These parameters define a new text line matrix A i (S 2 i ) and the spatial transformation structures R i (A i (S 2 i )), T i (A i (S 2 i )) and U i (A i (S 2 i )). The spatial transformation structures are used to obtain the slant-corrected text line S 3 i  = (R i  − T i  + U i ), which is passed to word segmentation. The outcomes of the algorithm are shown in Fig. 11.
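As an illustration only, the R i , T i and U i transformation structures can be approximated by a single horizontal shear that shifts every ink pixel in proportion to its height; this collapses Algorithm 5 into one operation and is an assumption, not the authors' formulation.

```python
import numpy as np

def correct_slant(line, phi_deg):
    """Shear a binary text line to cancel the detected slant phi_i.

    line    : 2-D boolean array (True = ink) of the de-skewed line S_i^2.
    phi_deg : slant angle in degrees from the slant detection step.
    Returns the slant-corrected binary line (playing the role of S_i^3).
    """
    h, _ = line.shape
    rows, cols = np.nonzero(line)
    shift = np.tan(np.radians(phi_deg)) * (h - 1 - rows)
    new_cols = np.round(cols - shift).astype(int)
    new_cols -= new_cols.min()                      # re-anchor at column 0
    out = np.zeros((h, new_cols.max() + 1), dtype=bool)
    out[rows, new_cols] = True                      # nearest-neighbour plot
    return out
```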

Fig. 11
figure 11

Slant correction of the text line

5 Word segmentation

The outcomes of the slant correction algorithm are then passed to the word segmentation algorithm.

Algorithm 6: WordSegmentation (S 3 i )

Step 1: Generate Histogram Π i

Step 2: Find Segmentation Points of Word Q ij

   (a) Find White Spaces λ ij  = ∑ ∃ k

   (b) Cluster Text Line to distinguish between white spaces

\(\varOmega_{ij} = \aleph_{i} \left( {S_{i}^{3} ,\lambda_{ij} } \right)\)

Step 3: Call CharactersSegmentation (Ω ij )

The above-defined algorithm calculates the histogram Π i and the white spaces λ ij  = ∑ ∃ k , which are used to trace the word segmentation points Q ij . The outcomes of the algorithm are shown in Figs. 12 and 13.
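A hedged sketch of the word-cutting idea follows: runs of empty columns wider than a gap threshold (an illustrative value, chosen so that narrow inter-character gaps are ignored and only inter-word white spaces λ ij trigger a cut) define the segmentation points Q ij .

```python
import numpy as np

def segment_words(line, gap_threshold=8):
    """Split a corrected binary text line into word images Omega_ij.

    line          : 2-D boolean array (True = ink) of the text line S_i^3.
    gap_threshold : minimum width (in columns) of a white space between words.
    """
    empty = line.sum(axis=0) == 0                   # empty-column mask
    words, start, gap = [], None, 0
    for c, is_empty in enumerate(empty):
        if not is_empty:
            if start is None:
                start = c                           # a word begins here
            gap = 0
        elif start is not None:
            gap += 1
            if gap > gap_threshold:                 # wide white space: cut
                words.append(line[:, start:c - gap + 1])
                start, gap = None, 0
    if start is not None:
        words.append(line[:, start:])
    return words
```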

Fig. 12
figure 12

White spaces between words

Fig. 13
figure 13

Segmented words from the text line

The Character Segmentation Algorithm uses segmented words (Ω ij ) for the segmentation as well as extraction of characters using a Feedforward Neural Network model.

6 Feedforward Neural Network

In a multilayer Feedforward Neural Network, the output of one layer of neurons feeds forward to the next layer. A multilayer Feedforward Neural Network can represent nonlinear functions and consists of one input layer, one or more hidden layers and one output layer. Each layer has associated weights: the weights w feed into the hidden layer and the weights v feed into the output layer. No backward or skip connections exist between layers. The weights are adjusted while training the network with the back-propagation algorithm. Typically, all input units are connected to the hidden layer units, and the hidden layer units are connected to the output units.

A.:

Input units

Data are fed into the system through the input units without any processing. The value of an input unit is x j , where j runs from 1 to the d input units; a special input unit x 0 holds the constant value 1 and provides the bias to the hidden nodes.

B.:

Hidden units

Each hidden node calculates the weighted sum of its inputs and determines the output of the hidden node with a threshold function. The weighted sum of the inputs for hidden node z h is calculated as:

$$\sum\limits_{j = 0}^{d} {w_{hj} x_{j} }$$
(1)

The threshold function applied at the hidden node is typically either a step function or a sigmoid function.

$${\text{sigmoid}}\left( a \right) = \frac{1}{{1 + {\text{e}}^{ - a} }}$$
(2)

The sigmoid function is a squashing function: it squashes its input into the range 0 to 1. Applied at the hidden node to the weighted sum of the inputs, it generates the output z h

$${\text{z}}_{h} {\text{ = sigmoid}}\left( {\sum\limits_{j = 0}^{d} {w_{hj} x_{j} } } \right) = \frac{1}{{1 + {\text{e}}^{{ - \sum\nolimits_{j = 0}^{d} {w_{hj} x_{j} } }} }}$$
(3)

for h = 1, …, H, where H is the total number of hidden nodes.
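Equations (1)–(3) can be written compactly as below (a NumPy sketch; the weight matrix is assumed to carry the bias weights w h0 in its first column).

```python
import numpy as np

def sigmoid(a):
    """Squashing function of Eq. (2)."""
    return 1.0 / (1.0 + np.exp(-a))

def hidden_activations(x, W):
    """Compute the hidden outputs z_h of Eq. (3) for all H units at once.

    x : input vector x_1..x_d; the bias input x_0 = 1 is prepended here.
    W : array of shape (H, d + 1), row h holding the weights w_h0..w_hd.
    """
    x_b = np.concatenate(([1.0], np.asarray(x, dtype=float)))   # x_0 = 1
    return sigmoid(W @ x_b)                                      # z_1..z_H
```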

C.:

Output units

The computation at an output node depends on the type of problem (i.e., regression or classification) and on the number of outputs. The weights going into the output units are v ih , and the bias input comes from the hidden unit z 0 , whose value is always 1. Output unit i therefore computes the weighted sum of its inputs as:

$$o_{i} = \sum\limits_{h = 0}^{H} {v_{ih} \,z_{h} }$$
(4)

With a single output unit, the weighted sum is

$$o = \sum\limits_{h = 0}^{H} {v_{h} \,z_{h} }$$
(5)
D.:

Functions

(1)

Regression for single and multiple outputs

For regression with a single output, the output value y is the weighted sum of the inputs to the output unit:

$$y = o = \sum\limits_{h = 0}^{H} {v_{h} \,z_{h} }$$
(6)

For multiple outputs, the value of output unit i is

$$y_{i} = o_{i} = \sum\limits_{h = 0}^{H} {v_{ih} \,z_{h} }$$
(7)
(2)

Classification for 2 classes

A single output node suffices: one class corresponds to 0 and the other to 1, and the node generates an output between 0 and 1.

$${\text{y = }}\;{\text{sigmoid}}\,\left( o \right) = {\text{sigmoid}}\left( {\sum\limits_{h = 0}^{H} {v_{h} \,z_{h} } } \right) = \frac{1}{{1 + {\text{e}}^{{ - \sum\nolimits_{h = 0}^{H} {v_{h} z_{h} } }} }}$$
(8)
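Continuing the sketch above, the 2-class output of Eq. (8), with the bias z 0  = 1 prepended to the hidden activations, is:

```python
import numpy as np

def output_two_class(z, v):
    """Eq. (8): single sigmoid output unit for a 2-class problem.

    z : hidden activations z_1..z_H from Eq. (3).
    v : weight vector v_0..v_H of the output unit (v_0 multiplies z_0 = 1).
    Returns a value in (0, 1) interpreted as the membership of class 1.
    """
    z_b = np.concatenate(([1.0], np.asarray(z, dtype=float)))   # z_0 = 1
    o = v @ z_b                                  # weighted sum of Eq. (5)
    return 1.0 / (1.0 + np.exp(-o))              # sigmoid squashing
```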
E.:

Back-propagation Algorithm

The back-propagation algorithm is used to train the Feedforward Neural Network: the gradient descent method updates the weights so as to minimize the squared error between the network output values and the target output values.

(1)

Online and offline learning

Online learning is performed using stochastic gradient descent, whereas offline learning uses batch gradient descent.

(2)

Online weight updates

The weight updates are computed for a single instance (x t , r t ), where x t is the input, r t is the target output and y t is the actual output of the network, with a positive constant learning rate ϑ.

(a)

Regression in a single output: the weight updates are

$$\Delta v_{h} = \vartheta \left( {r^{\text{t}} - y^{\text{t}} } \right)z_{h}^{\text{t}}$$
(9)
$$\Delta w_{hj} = \vartheta \left( {r^{\text{t}} - y^{\text{t}} } \right)v_{h} z_{h}^{\text{t}} \left( {1 - z_{h}^{\text{t}} } \right)x_{j}^{\text{t}}$$
(10)
(b)

Regression in multiple (i.e., K) outputs: the weight updates are

$$\Delta v_{ih} = \vartheta \left( {r_{i}^{t} - y_{i}^{t} } \right)z_{h}^{t}$$
(11)
$$\Delta w_{hj} = \vartheta \left( {\sum\limits_{i = 1}^{K} {\left( {r_{i}^{\text{t}} - y_{i}^{\text{t}} } \right)v_{ih} } } \right)z_{h}^{\text{t}} \left( {1 - z_{h}^{\text{t}} } \right)x_{j}^{\text{t}}$$
(12)
(c)

Classification for 2 classes: the weight updates are

$$\Delta v_{h} = \vartheta \left( {r^{\text{t}} - y^{\text{t}} } \right)z_{h}^{\text{t}}$$
(13)
$$\Delta w_{hj} = \vartheta \left( {r^{\text{t}} - y^{\text{t}} } \right)v_{h} z_{h}^{\text{t}} \left( {1 - z_{h}^{\text{t}} } \right)x_{j}^{\text{t}}$$
(14)
(d)

Classification of K > 2 classes: the weight updates are

$$\Delta v_{ih} = \vartheta \left( {r_{i}^{\text{t}} - y_{i}^{\text{t}} } \right)z_{h}^{\text{t}}$$
(15)
$$\Delta w_{hj} = \vartheta \left( {\sum\limits_{i = 1}^{K} {\left( {r_{i}^{\text{t}} - y_{i}^{\text{t}} } \right)v_{ih} } } \right)z_{h}^{\text{t}} \left( {1 - z_{h}^{\text{t}} } \right)x_{j}^{\text{t}}$$
(16)
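Equations (11)–(12) and (15)–(16) share the same algebraic form, so one NumPy sketch covers both cases (the learning rate value is illustrative, and the output weight matrix is assumed to carry the bias weights v i0 in its first column).

```python
import numpy as np

def online_weight_updates(x, z, y, r, V, learning_rate=0.1):
    """One online update following Eqs. (11)-(12) / (15)-(16).

    x : input vector x_1..x_d (bias x_0 = 1 is added internally)
    z : hidden activations z_1..z_H (bias z_0 = 1 is added internally)
    y : network outputs y_1..y_K for this instance
    r : target outputs r_1..r_K
    V : output-layer weights of shape (K, H + 1), column 0 holding v_i0
    Returns (delta_V, delta_W), with delta_W of shape (H, d + 1).
    """
    x_b = np.concatenate(([1.0], np.asarray(x, dtype=float)))
    z_b = np.concatenate(([1.0], np.asarray(z, dtype=float)))
    err = np.asarray(r, dtype=float) - np.asarray(y, dtype=float)
    delta_V = learning_rate * np.outer(err, z_b)        # Eq. (11) / (15)
    back = err @ V[:, 1:]                               # sum_i (r_i - y_i) v_ih
    z_h = z_b[1:]
    delta_W = learning_rate * np.outer(back * z_h * (1.0 - z_h), x_b)  # Eq. (12) / (16)
    return delta_V, delta_W
```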
F.:

Error calculation

The sum-of-squared-errors function is usually used to calculate the error; with one output unit, the error for one training example (x t , r t ) is

$$E\left( {W,v|x^{\text{t}} ,r^{\text{t}} } \right) = \frac{1}{2}\left( {r^{\text{t}} - y^{\text{t}} } \right)^{2}$$
(17)

With multiple output units, the error is simply the sum of the per-unit errors

$$E\left( {W,v|x^{\text{t}} ,r^{\text{t}} } \right) = \frac{1}{2}\sum\limits_{i = 1}^{K} {\left( {r_{i}^{\text{t}} - y_{i}^{\text{t}} } \right)^{2} }$$
(18)

To calculate the error for 1 epoch with one output node (y i)

$$E\left( {W,v|X} \right) = \frac{1}{2}\sum\limits_{{\left( {x^{\text{t}} ,r^{\text{t}} } \right) \in X}} {\left( {r_{i}^{\text{t}} - y_{i}^{\text{t}} } \right)^{2} }$$
(19)

To calculate the error in the entire network for 1 epoch

$$E\left( {W,v|X} \right) = \frac{1}{2}\sum\limits_{{\left( {x^{t} ,r^{t} } \right) \in X}} {\left( {\sum\limits_{i = 1}^{K} {\left( {r_{i}^{t} - y_{i}^{t} } \right)^{2} } } \right)}$$
(20)
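Equation (20), which reduces to Eqs. (17)–(19) for a single instance or a single output, reads in code as:

```python
import numpy as np

def epoch_error(R, Y):
    """Sum-of-squared-errors over one epoch, Eq. (20).

    R, Y : arrays of shape (N, K) holding the target outputs r_i^t and the
           actual outputs y_i^t for the N training instances in X.
    """
    R, Y = np.asarray(R, dtype=float), np.asarray(Y, dtype=float)
    return 0.5 * np.sum((R - Y) ** 2)
```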
G.:

Momentum

To speed up the learning process, a momentum term is added to the gradient descent updates.

$$v_{ih} = v_{ih} +\Delta v_{ih}$$
(21)
$$w_{hj} = w_{hj} +\Delta w_{hj}$$
(22)

The weight update equations with momentum then become

$$v_{ih}^{\text{t}} = v_{ih}^{\text{t}} + \varDelta v_{ih}^{\text{t}} + \alpha \,\Delta v_{ih}^{{{\text{t}} - 1}}$$
(23)
$$w_{hj}^{\text{t}} = w_{hj}^{\text{t}} +\Delta w_{hj}^{\text{t}} + \alpha \,\Delta w_{hj}^{{{\text{t}} - 1}}$$
(24)
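Equations (23)–(24) can be applied as below; the momentum factor α = 0.9 is an illustrative value, not one taken from the paper.

```python
def momentum_step(weights, delta, prev_delta, alpha=0.9):
    """Apply one momentum-augmented update, Eqs. (23)-(24).

    weights    : current weight array (v_ih or w_hj)
    delta      : update computed for the current step (Delta^t)
    prev_delta : update computed at the previous step (Delta^{t-1})
    Returns the new weights and Delta^t, to be passed back in as prev_delta
    on the next call.
    """
    new_weights = weights + delta + alpha * prev_delta
    return new_weights, delta
```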

6.1 Design of training set

In the simulation and design of network learning, a multilayer Feedforward Neural Network is designed and trained using 'gradient descent with momentum and adaptive learning rate back-propagation' as the training function, the 'gradient descent with momentum weight and bias' learning function, the 'mean squared normalized error' performance function and the 'log-sigmoid' transfer function; its structure is 100-5-5-5 (see Table 3).

Table 3 Neural network specifications

Figure 14 shows a training state graph which contains training, validation, test and best case. The best validation performance is 9.0905e−024 at epoch 2.

Fig. 14
figure 14

Feedforward back-propagation neural network training state

Figure 15 shows the neural network regression state, which contains the training, validation, test and overall graphs. The overall regression value is approximately 1, indicating an almost perfect linear fit. Table 4 shows the simulation results of the training.

Fig. 15
figure 15

Feedforward back-propagation neural network regression

Table 4 Simulation results

Since every input pattern consists of 100 distinct samples, Fig. 16 shows the performance graph of the simulation results for the neural network over the input samples.

Fig. 16
figure 16

Simulation results graph

7 Characters segmentation

For the evaluation of the PPTRPRT technique, the character segmentation stage uses the training database of 49,000 samples described in Sect. 4.1 (CMATER, ICDAR-2005, the Off-line Handwritten Devanagari Numeral Database, WCACOM ET0405A-U and the HP Labs India Indic Handwriting Datasets).

Algorithm 7: CharactersSegmentation (Ω ij )

Step 1: Calculate all black pixels

\(\sum {B_{k} \left( {\varOmega_{ij} } \right)}\)

Step 2: Plot N Pixels Image

\(P_{jk} \left( {\sum {N_{k} } \,,\sum {B_{k} \left( {\varOmega_{ij} } \right)} } \right)\)

Step 3: Input image to Feedforward Neural Network and Recall the Input image from Input samples

\(\psi_{jk} = ANN\left( {P_{jk} \left( {\sum {N_{k} } \,,\,\sum {B_{k} \left( {\varOmega_{ij} } \right)} } \right)} \right)\)

If (ψ jk  > 95 %)

go to step 2

else

\(\psi_{jk}^{'} = \psi_{jk} + P_{jk} \left( {\sum {N_{k} } \,,\sum {B_{k} \left( {\varOmega_{ij} } \right)} } \right)\)

\(\psi_{jk} = \psi_{jk}^{'}\)

Repeat step 3

Step 4: Return (ψ jk )

In the above algorithm, the histogram Π i is used to calculate the black pixels ΣB k (Ω ij ) of the segmented words, and N black pixels P jk (ΣN k , ΣB k (Ω ij )) are plotted from the calculated black pixels ΣB k (Ω ij ). Moreover, the P jk pixels are fed to the Feedforward Neural Network model to recall the image from the sample database. The PPTRPRT technique calculates ψ jk using the Feedforward Neural Network

$$\psi_{jk} = ANN\left( {P_{jk} \left( {\sum {N_{k} } \,,\,\sum {B_{k} \left( {\varOmega_{ij} } \right)} } \right)} \right)$$
(25)

If the recalled input image ψ jk matches a sample pattern to at least 99 %, the algorithm proceeds to the next N pixels. Otherwise, ψ′ jk is calculated by adding N more pixels to the earlier calculated ψ jk , and the operation is repeated

$$\psi_{jk}^{'} = \psi_{jk} + P_{jk} \left( {\sum {N_{k} } \,,\sum {B_{k} \left( {\varOmega_{ij} } \right)} } \right)$$
(26)
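One plausible reading of Algorithm 7 is sketched below. The left-to-right ordering of plotted pixels, the reset after a successful recall and the recall callable standing in for the trained Feedforward Neural Network are all assumptions made for illustration.

```python
import numpy as np

def plot_and_trace(word, recall, n_pixels=50, confidence=0.95):
    """Iterative pixel plot-and-trace over one word image Omega_ij.

    word       : 2-D boolean array (True = ink).
    recall     : callable standing in for the trained FFNN; given the partial
                 pixel plot P_jk it returns (psi_jk, label), where psi_jk in
                 [0, 1] is the recall confidence.
    n_pixels   : number of black pixels plotted per iteration (N).
    confidence : acceptance threshold for psi_jk.
    Returns the list of recognized character labels.
    """
    ys, xs = np.nonzero(word)
    order = np.argsort(xs)                      # plot black pixels left to right
    ys, xs = ys[order], xs[order]
    plot = np.zeros_like(word, dtype=bool)
    chars = []
    for start in range(0, len(xs), n_pixels):
        sl = slice(start, start + n_pixels)
        plot[ys[sl], xs[sl]] = True             # re-plot N more black pixels
        psi, label = recall(plot)               # psi_jk from the FFNN
        if psi >= confidence:                   # pattern recalled: emit char
            chars.append(label)
            plot[:] = False                     # start tracing the next one
    return chars
```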

The outcomes of the algorithm are shown in Fig. 17.

Fig. 17
figure 17

Segmented characters over words

8 Experiments and results

The PPTRPRT technique utilizes a large database of offline handwritten sample scripts (each sample has 10–20 lines) with varying brightness and intensity. In this research article, the descriptive algorithms are used with the distinct parameters given in Table 5. The outcomes of the algorithms are in the form of lines, words and characters (see Figs. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19).

Table 5 Parameter table
Fig. 18
figure 18

Parametric analytical graph

Fig. 19
figure 19

Parametric analytical graph

Execution of the PPTRPRT technique starts with the preprocessing algorithm for segmentation of text regions as well as text lines from the Devanagari offline handwritten script (see Figs. 5, 6, 7). The segmented text lines are then processed by the skew correction algorithm, the skew-corrected lines by the slant detection algorithm (see Figs. 8, 9, 10) and the slant-detected lines by the slant correction algorithm (see Fig. 11). The slant-corrected lines are used by the white-pixel-based word segmentation algorithm (see Figs. 12, 13), and the segmented word images are finally used by the character segmentation algorithm (see Fig. 17).

The outcomes of the PPTRPRT technique are encouraging, and the analysis of the results is shown in Fig. 18. Figure 18 shows the analytical performance graph in the form of minimum and maximum performance ratios for the operations performed during the experiments. The graph shows that the noise correction ratio is minimum 90.078 % and maximum 99.23 %, and the performance ratio of segmented/un-segmented lines is minimum 0 % and maximum 100 %. The performance ratio of skew/de-skew of segmented lines is minimum 99.97 % and maximum 100 %, and the un-skewed/de-skewed lines ratio is 0 %. Furthermore, the word segmentation performance ratio over segmented lines is minimum 99.79 % and maximum 100 %, and the character segmentation performance ratio within a word as well as within a line is minimum 99.037 % and maximum 100 %. Finally, the unit sample performance ratio is minimum 99.578 % and maximum 100 %.

9 Conclusion and further work

This research work presents a realistic character segmentation technique for Devanagari offline handwritten scripts using a Feedforward Neural Network. The PPTRPRT technique provides a concrete basis for designing an optical character reader with high accuracy and low cost, and it offers a new way of reconstructing Devanagari offline handwritten scripts.

The accuracy of character segmentation for a unit sample is 99.578 %, which is considerably better than that of existing techniques [15–26]. Table 6 gives comparative details of Devanagari offline handwritten script recognition systems together with the performance of our proposed technique. Puri et al. [15] reported an accuracy of 80.20 % using the header line as the feature vector with an HMM classifier on a data set of 39,700 samples. Pal et al. [16] and Malik et al. [17] reported accuracies of 80.36 and 82 %, respectively, using chain code features with quadratic as well as RE and MFD classifiers on data sets of 11,270 and 5000 samples, respectively. Sridhar et al. [18] reported an accuracy of 84.31 % using directional chain code features with an HMM classifier on a data set of 39,700 samples. Murthy et al. [19] reported an accuracy of 90.65 % using distance vector features with a fuzzy set classifier on a data set of 4750 samples. Basu et al. [20] reported an accuracy of 90.74 % using shadow and CH features with MLP and MED classifiers on a data set of 7154 samples. Kumar et al. [21] reported an accuracy of 94.1 % using gradient features with an SVM classifier on a data set of 25,000 samples. Wakabayashi et al. [22] reported an accuracy of 94.24 % using Gaussian filter features with a quadratic classifier on a data set of 36,172 samples. Rabha et al. [23] reported an accuracy of 94.91 % using eigen-deformation features with elastic matching on a data set of 3600 samples. Kimura et al. [24] reported an accuracy of 95.13 % using gradient features with an MQDF classifier on a data set of 36,172 samples. Bhatcharjee et al. [25] reported an accuracy of 98.12 % using structural features with an FFNN classifier on a data set of 50,000 samples. Nasipuri et al. [26] reported an accuracy of 98.85 % using combined code features with an MLP classifier on a data set of 1500 samples. Finally, the proposed PPTRPRT technique gives accurate results up to 99.578 % using an FFNN as the classifier on a data set of 49,000 samples. The full comparison of the PPTRPRT technique with the other techniques is given in Table 6.

Table 6 Comparative details of Devanagari offline handwritten script recognition systems

Figure 19 shows an analytical performance graph comparing the performance of the proposed technique with the existing techniques. Our future work is to extend the scope of the PPTRPRT technique to recognize English–Hindi mixed offline handwritten scripts.

In summary, this research article presents a realistic technique that improves the segmentation of words and characters from Devanagari offline handwritten scripts over the existing techniques. It provides a concrete basis for designing an optical character reader (OCR) with high accuracy at low cost, and to the best of our knowledge the Pixel Plot and Trace and Re-plot and Re-trace (PPTRPRT) technique is new in the reconstruction of Devanagari offline handwritten scripts.