A Modified Approach for the Segmentation of Unconstrained Cursive Modi Touching Characters Cluster

Deshmukh, Manisha S.; Kolhe, Satish R.

doi:10.1007/978-981-16-0507-9_36

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1380))

Included in the following conference series:

International Conference on Recent Trends in Image Processing and Pattern Recognition

Abstract

In this paper, a robust character segmentation approach for cursive handwritten Modi script touching character cluster is presented. Prior to segmentation, the middle text region of the touching character cluster is separated by examining the location of Shirorekha and baseline. The middle text region is scrutinized for the estimation of ligature between two characters. Two different strategies are employed to find the location of the ligature. The selection of the strategy is based on the degree of connected component overlapratio. The foreground pixel intensity and vertical projection profile is scrutinized to segment the touching characters. The performance of the system is tested using the touching character clusters of the original archaic handwritten Modi documents. The proposed approach yields efficient touching characters cluster segmentation output and it is feasible to tackle most of the challenges in touching character cluster segmentation.

Access provided by Autonomous University of Puebla. Download conference paper PDF

A New Approach for Unified Characters Cluster Segmentation of Ancient Handwritten Modi Documents

A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents

Article 25 June 2024

Character Segmentation for Cursive Handwritten Text Using Ligature Classification and Transition Feature

Keywords

1 Introduction

The segmentation of the touching character is more challenging task if the script is handwritten, unconstrained, stylish and cursive. The Brahmi base ancient cursive Modi script is derived from Nagari family. This script is originated during 17^th century and used in Maharashtra as administrative script up to 1950. It was used to write Marathi language as well as other languages as Hindi, Guajarati, Persian and so on. Mainly, this script was used for the fast writing. The text is written using Boru (Wooden stick/bird feather) and link over the Shirorekha without lifting the pen. Shirorekha is drawn before to write a text. Modi script text is not separated in sentences and words. Similarly, no punctuation mark is used to indicate the end of words or sentences. Thus, number of challenges are exhibits all over the segmentation of the characters as touching characters; nonuniform Shirorekha and base line; skewed or curved text lines; nonuniform text size and so on. Figure 1 shows the sample images of archaic original Modi script documents [15,16,17,18,19].

The number of segmentation challenges is under perception of the touching handwritten Modi character clusters as depicted in Fig. 2.

The accuracy of character recognition having one most important requisite aspect as the accuracy in character segmentation. However, the research work for touching Modi character cluster segmentation is not reported. Thus, there is essential to achieve the improvement in segmentation of touching handwritten Modi character cluster. The main objective of this research is to improve the accuracy of the freestyle, unconstrained, cursive and handwritten Modi touching character cluster segmentation. To improve the segmentation rate of the touching character cluster, a modified technique is proposed. The global and local zoning is employed with three types of analysis as: i) overlap ratio, ii) foreground pixel intensity and iii) vertical projection profile.

After giving the brief introduction, rest of the paper is clarified the proposed work as follows. Section 2 reviewed the related work. Section 3 discussed the framework of the handwritten Modi touching character cluster system. Details of the dataset which is used to check the performance of the proposed system is illustrated in Sect. 4. The experimental results are reported and discussed in Sect. 5. The conclusion is stated in the Sect. 6.

2 Literature Review

Generally, the character segmentation techniques are classified in two type explicit and implicit approach [2, 5, 6]. In the explicit approaches, segmentation point of the text part is finding out to separate the isolated characters. These approaches having the drawbacks given as follows [5].

Over-segmentation or under-segmentation.
Failed to segment the touching characters.
Failed to segment overlapping characters.

In implicit approaches, to estimate the segmentation point intelligent system has been used. These techniques having two drawbacks given as follows [5].

Failed to identify overlapping and touched text part.
Requires huge training data and time which generates extensive overheads.

Handwritten character segmentation approaches found may vary significantly for the handwritten script like English, Chinese, Devanagari, Bengali, Guajarati etc. [1,2,3,4,5,6,7,8,9]. In the study of the handwritten character segmentation literature, it is found that the approaches are script specific and the number of issues are considered for character segmentation. It uses the prior knowledge about script’s character structuring, writing style of script etc. [14]. The research presented in [13, 16, 18, 19, 21] proposes different types of strategies for the handwritten Devanagari script character segmentation. Graph distance theory-based approach for isolated, overlapping and touching Devanagari character segmentation is discussed in citech36ref10. This approach has two inefficiencies as: i) In locating accurate boundaries between characters through post-processing, and ii) Vertical cuts are gained for overlapped characters. The cursive handwritten touching English character segmentation techniques are presented in [10, 12] and [23].

The research work presented in [13, 16,17,18,19,20,21] proposes different types of strategies for the handwritten Devanagari script character segmentation. The work presented in [13, 18] and [21] is the text-based approach used the structural properties to segments the touching Devanagari characters. This approach is script specific, parametric and having over-segmentation problem. It fails to segment the characters with broken left modifier and the vertical left modifier is too small. It also does not tackle with broken characters, touching characters. Graph distance theory-based approach for isolated, overlapping and touching Devanagari character segmentation is discussed in [16]. This approach has two inefficiencies as: i) In locating accurate boundaries between characters through post-processing, and ii) Vertical cuts are gained for overlapped characters. Morphological operation with minutiae detection algorithm is presented in [19] for the segmentation of Devanagari compound characters. A constrained based Devanagari touching character segmentation system is presented in [20] using analysis of bounding box with vertical bars technique. This system is failed to segment the broken and overlapping characters.

Kapoor S., & Verma V. [20] reported three opinions in the area of the character segmentation approaches as: i) The vertical projection profile-based character segmentation approaches are not suitable for free style, handwritten cursive, unconstrained overlapping characters; ii) The character segmentation approaches using Hidden Markov Model does not works well with overlapping characters; and iii) The water reservoir-based techniques are constrained based and are not able to segment the overlapping characters [22].

3 System Framework

A modified framework for Modi touching character segmentation system is illustrated in Fig. 3. The cluster of touching Modi characters is the input of this system. The proposed handwritten Modi touching character cluster segmentation procedure includes four stages, which are described as below.

3.1 Stage I

The initial stage of the character segmentation includes three processing sub steps. The input cluster images are in gray level. These images are converted into binary level by using Otsu’s thresholding method [23]. The binarization of the images is desirable for the estimation of the location of the Shirorekha and text region. For the formation of the text region (Treg) the location of Shirorekha (Hline) and base line (Bline) is detected by calculating maximum and minimum peak of the horizontal projection profile using Eq. 1 and Eq. 2 respectively [2]. These locations are used for the separation of the text region of the Modi touching character cluster image as demonstrated in Fig. 4.

$$\begin{aligned} Hline=max _ {(j=1:N)} \Bigg ( \sum _{i=1}^{M}C(i,j)\Bigg ) \end{aligned}$$

(1)

$$\begin{aligned} Bline=min _ {(j=1:N)} \Bigg ( \sum _{i=1}^{M}C(i,j)\Bigg ) \end{aligned}$$

(2)

Here, C is the Modi touching characters cluster image of size M X N.

3.2 Stage II

The next successive stage of the cluster segmentation includes local zoning of the text region (Treg). The text region is partitioned into two non-overlapping zones horizontally from the middle. The connected components (CC) from the lower region are intended as depicted in Fig. 5(b).

For each connected component the two elements are determined as its bounding box and area. The bounding box is rectangular region specified by a vector gives [X Y Width Height] of each component as shown in Fig. 5(b). Similarly, area (Areafig) is the actual number of pixels in the component computed which is calculated using Eq. 3.

$$\begin{aligned} Areafig(k)= \sum _{i=1}^{Height_k} \sum _{j=1}^{Width_k} CC_k (i,j) \qquad where k=1..Number of connected component \end{aligned}$$

(3)

Further, the operative connected components (CCfig) are discriminated using the Eq. 4.

$$\begin{aligned} CCfig= {\left\{ \begin{array}{ll} Areafig_k &{} Areafig_k \ge \sqrt{\mu (Areafig) + \sigma (Areafig) }/2 \\ none &{}\text {otherwise} \end{array}\right. } \end{aligned}$$

(4)

Where function $\mu $ and $\sigma $ are mean and standard deviation of the area of the all connected components respectively. The overlap ratio between the bounding boxes of each operative connected component pair is computed using union ratio type as given in Eq. 5.

$$\begin{aligned} Oratio =\frac{CCfig_A \cap CCfig_B}{ CCfig_A \cup CCfig_B} \end{aligned}$$

(5)

Let CCfig_A and CCfig_B be the two connected components with Box_A and Box_B bounding box respectively. The overlap ratio (Oratio) between these two bounding boxes is an M_oXN_o matrix. Each element (r₁, r₂) of this matrix is corresponds to overlap ratio between row r₁ in Box_A and row r₂ in Box_B. The overlapratio is computed in the between 0 and 1 where 1 implies that a perfect overlap. This overlap ratio is exploited for the selection procedure of the touching cluster type. Clusters are classified in two classes as i) Partially touching clusters (Fig. 2 (b, d, e and g)) and iii) Entirely touching clustered (Fig. 2 (a, c and h)). The full process of this stage I and II is illustrated in Algorithm 1 from step1 to step8.

3.3 Stage III

The overlapping ratio (CntOratio) is counted for the all the connected components. And this overlapping ratio is applied for the selection of the criteria of the touching character cluster segmentation process using Eq. 6. This is depicted in Algorithm 1 from step 9 to step10.

$$\begin{aligned} Criteria = {\left\{ \begin{array}{ll} Function2 ( )&{} CntOratio =null \\ Function1 ( ) &{}\text {otherwise} \end{array}\right. } \end{aligned}$$

(6)

A] Function1(): The partial touching character clusters are segmented using Function1 method. A window of size 3 X N_t is scanned through the text region. Related to each window first occurrence of the foreground pixels are searched from the bottom side. The ligature location (LocX) is finalized by scrutinized the location vector (Loc) with height of the text region M_t using Eq. 7.

$$\begin{aligned} LocX(s) = {\left\{ \begin{array}{ll} Loc(i) &{} Loc(i) \le M_t \wedge Loc \ne 0\\ none &{}\text {otherwise} \end{array}\right. } \end{aligned}$$

(7)

Where i = 1...length(Loc). The ligature location vector is used to finalize the segmentation column. This character segmentation method is described in Algorithm 2.

B] Function2(): The entirely touching characters clusters are segmented using vertical projection profile-based method. The vertical projection profile (Vpp) of the text region (Mpartfig) is calculated using Eq. 8.

$$\begin{aligned} Vpp= \sum _{i=1}^{M_t} Mpartfig(i,j) \qquad where\,1 \le j \ge N_t \end{aligned}$$

(8)

The location vector (Loc) is formed by using the Eq. 9.

$$\begin{aligned} Loc(s)= {\left\{ \begin{array}{ll} i &{} Vpp(i) \ge \mu (Vpp) + \sigma (Vpp) \\ none &{}\text {otherwise} \end{array}\right. } \end{aligned}$$

(9)

Where i = 1..Nt.Further, the estimated location vector (Loc) is analysed in the formation of the Rational segmentation column (SegLoc). The illustration of entirely touching characters cluster segmentation procedure is described in Algorithm 3.

3.4 Stage IV

The handwritten Modi script document are highly degraded due to number of reasons like aging, writing material and so on. Consequently, there may be under segmentation problem is occurred during the estimation of the Rational segmentation column. To reduce this problem, the Finalize Segmentation Columns (FSCs) is decided by analysing the Rational segmentation column (SegLoc) vector using the postprocessing method described in [2].

4 Dataset

The archaic original handwritten Modi script documents collected from the various archaeological places as Rajwade Sanshodhan Mandir, Dhule, Shri Samartha Vagdevta Mandir, Dhule etc., are used to test and evaluate the performance of the proposed system. The dataset contains 2540 document images. These document images already preprocessed means de-noised and globally de-skewed using the approaches presented in [25] and [1] respectively. The preprocessed document images are segmented in the text lines using the approach presented in [24]. The Modi text lines are segmented into the isolated Modi characters and touching characters clusters using the approach presented in [2]. These Modi touching characters clusters are the input of the proposed system.

5 Results and Discussions

The performance of the proposed handwritten Modi touching character cluster segmentation method is evaluated using a similar evaluation strategy that was illustrated in [2, 28]. The evaluation method uses five evaluation factors as: Correct segmentation rate (CSR), Successful Segmentation Rate (SSR), precision, recall and F-measure illustrated in Eqs. 10–14 respectively. These aspects are calculated by counting the number of matches between the resultant segmented characters by the algorithm and ground truth characters in text line segments [2, 28].

$$\begin{aligned} CSR =\frac{NC_R - ( NC_c +NC_B)}{ NC_G} \times 100 \end{aligned}$$

(10)

$$\begin{aligned} SSR =\frac{NC_I }{ NC_R} \times 100 \end{aligned}$$

(11)

$$\begin{aligned} Precision =\frac{NC_I + NC_C}{ NC_R} \end{aligned}$$

(12)

$$\begin{aligned} Recall =\frac{NC_I + NC_C}{ NC_G} \end{aligned}$$

(13)

$$\begin{aligned} Precision =2 \times \frac{Recall \times Precision}{ Recall + Precision} \end{aligned}$$

(14)

where,

NC_G: Number of ground truth characters.

NC_R: Number of resultant characters.

NC_I: Number of correctly segmented isolated characters.

NC_C: Number of overlapping or touching characters clusters.

NC_B: Number of incorrectly segmented characters.

Ground truth values are not available for the testing the performance of the handwritten Modi touching character cluster segmentation system. Thus, the numbers of 2249 Modi touching character clusters in 1072 text lines of ninety-two Modi documents are manually calculated and ground truth values are evaluated. Table 1 gives the result of touching Modi character cluster segmentation.

Table 1. The Modi touching character cluster segmentation result.

Full size table

The Successful Segmentation Rate (SSR) and Correct Segmentation Rate (CSR) are achieved using the proposed method for the Modi touching character cluster segmentation as 89.68% and 83.62% respectively. Partial touching Modi character clusters are segmented efficiently by proposed method. There are two issues are found about the bad segmentation. The first issue is with very tightly overlapping characters with touching characters exist in the same cluster. The second issue is the broken touching character as shown in Fig. 6(b). Still very tightly coupled character clusters and overlapping character clusters are not segmented properly. Figures 6, 7, 8 and 9 shows the examples of the segmented touching Modi character clusters.

6 Conclusions

The connected component overlapratio analysis effectively works on the different problems in touching character cluster segmentation. The proposed handwritten Modi touching character cluster segmentation approach is expertly dealing with the non-uniform Shirorekha and baseline. The results obtained proved that the presented character segmentation approach efficiently tackled with the character segmentation challenges like broken character, broken Shirorekha, variable size characters, variable distance between characters, skewed text lines and curved text lines. Highly degraded touching characters ligature is efficiently identified. The experimental results indicates that the proposed system is efficient to tackle the challenges in handwritten Modi touching character cluster segmentation. Hence, this system can be a part of the archaic handwritten freestyle and unconstrained Modi document segmentation system. Currently the work on the overlapping characters cluster segmentation is in progress.

References

Obaidullah, S.M., Halder, C., Santosh, K.C., Das, N., Roy, K.: PHDIndic\_11: page-level handwritten document image dataset of 11 official Indic scripts for script identification. Multimed. Tools Appl. 77(2), 1643–1678 (2017). https://doi.org/10.1007/s11042-017-4373-y
Obaidullah, S.M., Santosh, K.C., Halder, C., Das, N., Roy, K.: Automatic Indic script identification from handwritten documents: page, block, line and word-level approach. Int. J. Mach. Learn. Cybern. 10(1), 87–106 (2017). https://doi.org/10.1007/s13042-017-0702-8
Article Google Scholar
Obaidullah, S.M., Santosh, K.C., Das, N., Halder, C., Roy, K.: Handwritten Indic script identification in multi-script document images: a survey. Int. J. Pattern Recognit. Artif. Intell. 32(10), 1856012 (2018)
Google Scholar
Choudhary A., Rishi, R., Ahlawat, S: A new character segmentation approach for off-line cursive handwritten words. Proc. Comput. Sci. 17, 88–95 (2013)
Google Scholar
Kumar, M., Jindal, M.K., Sharma, R.K.: Segmentation of isolated and touching characters in offline handwritten Gurmukhi script recognition. Int. J. Inf. Technol. Comput. Sci. 6(2), 58–63 (2014)
Google Scholar
Kurniawan, F., Rahim, M.S.M., Daman, D., Rehman, A., Mohamad, D., Mariyam, S.: Region-based touched character segmentation in handwritten words. Int. J. Innov. Comput. Inf. Control 7(6), 3107–3120 (2011)
Google Scholar
Garg, N.K., Kaur, L., Jindal, M.K.: Segmentation of touching modifiers and consonants in middle region of handwritten Hindi text. Pattern Recognit. Image Anal. 25(3), 413–417 (2015). https://doi.org/10.1134/S1054661815030050
Article Google Scholar
Saba, T., Rehman, A., Elarbi-Boudihir, M.: Methods and strategies on off-line cursive touched characters segmentation: a directional review. Artif. Intell. Rev. 42(4), 1047–1066 (2011). https://doi.org/10.1007/s10462-011-9271-5
Article Google Scholar
Sharma, P., Sachan, M.K.: A review on character segmentation of touching and half character in handwritten Hindi text. Int. J. Adv. Res. Comput. Sci. 8(3), 1078–1083 (2017)
Google Scholar
Jindal, K., Kumar, R.: A novel shape-based character segmentation method for Devanagari script. Arabian J. Sci. Eng. 42(8), 3221–3228 (2017). https://doi.org/10.1007/s13369-017-2420-7
Article Google Scholar
Palakollu, S., Dhir, R., Rani, R.: Handwritten Hindi text segmentation techniques for lines and characters. In: Proceedings of the World Congress on Engineering and Computer Science, vol. 1, pp. 24–26 (2012)
Google Scholar
Bag, S., Krishna, A.: Character segmentation of Hindi unconstrained handwritten words. In: Barneva, R.P., Bhattacharya, B.B., Brimkov, V.E. (eds.) IWCIA 2015. LNCS, vol. 9448, pp. 247–260. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-26145-4_18
Chapter Google Scholar
Golait Snehal S., Malik L.: Handwritten Marathi compound character segmentation using minutiae detection algorithm. Proc. Comput. Sci. 87, 18–24 (2016)
Google Scholar
Kapoor S.,Verma V.: Fragmentation of handwritten touching characters in Devanagari script. Int. J. Inf. Technol. Model. Comput. (IJITMC) 2 11–21 (2014)
Google Scholar
Behera, S., Pradhan, A., Majhi, B.: A novel clustering based fuzzy approach for character segmentation in handwritten Odia scripts. In: 2017 Fourth International Conference on Image Information Processing (ICIIP), pp. 1–6. IEEE, December 2017
Google Scholar
Kavitha, A.S., Shivakumara, P., Kumar, G.H., Lu, T.: A new watershed model based system for character segmentation in degraded text lines. AEU-Int. J. Electron. Commun. 71, 45–52 (2017)
Article Google Scholar
Otsu, N.: A threshold selection method from gray level histogram. IEEE Trans. Syst. Man Cybern. 19(1), 62–66 (1979)
Google Scholar
Deshmukh, M.S., Patil, M.P., Kolhe, S.R.: A hybrid text line segmentation approach for the ancient handwritten unconstrained freestyle Modi script documents. Imaging Sci. J. 66(7), 433–442 (2018)
Article Google Scholar
Deshmukh M.S., Kolhe, S.R.: A hybrid character segmentation approach for cursive unconstrained handwritten historical Modi script documents. In: International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019). SSRN Elsevier Digital Library (2019)
Google Scholar
Deshmukh, M.S., Patil, M.P., Kolhe, S.R.: The divide-and-conquer based algorithm to detect and correct the skew angle in the old age historical handwritten Modi Lipi documents. Int. J. Comput. Sci. Appl. 14(2), 47–63 (2017)
Google Scholar
Deshmukh, M.S., Patil, M.P., Kolhe, S.R.: A dynamic statistical nonparametric cleaning and enhancement system for highly degraded ancient handwritten Modi Lipi documents. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1545–1551. IEEE, September 2017
Google Scholar
Deshmukh, M.S., Manoj, P.P., Satish, R.K.: Offline handwritten Modi numerals recognition using chain code. In: Proceedings of the Third International Symposium on Women in Computing and Informatics. ACM (2015)
Google Scholar
Kavallieratou E., Stamatatos E., Fakotakis N., Kokkinakis G.: Handwritten character segmentation using transformation-based learning. In: ICPR, p. 2634. IEEE, September 2000
Google Scholar
Peng, G., Yu, P., Li, H., Li, H., Zhu, X.: A character segmentation algorithm for the palm leaf manuscripts. In: 2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA), pp. 354–358. IEEE, September 2017
Google Scholar
Obaidullah, S.M., Santosh, K.C., Das, N., Halder, C., Roy, K.: Handwritten Indic script identification in multi-script document images: a survey. Int. J. Pattern Recognit. Artif. Intell. 32(10), 1856012:1–1856012:26 (2018)
Google Scholar
Halder, C., Obaidullah, S.M., Santosh, K.C., Roy, K.: Content independent writer identification on Bangla script: a document level approach. Int. J. Pattern Recognit. Artif. Intell. 32(9), 1856011:1–1856011:24 (2018)
Google Scholar
Mukherjee, H., Obaidullah, S.M., Santosh, K.C., Phadikar, S., Roy, K.: Line spectral frequency-based features and extreme learning machine for voice activity detection from audio signal. Int. J. Speech Technol. 21(4), 753–760 (2018). https://doi.org/10.1007/s10772-018-9525-6
Article Google Scholar
Santosh, K.C., Borra, S., Joshi, A., Dey, N.: Special section: advances in speech, music and audio signal processing (Articles 1–13). Int. J. Speech Technol. 22(2), 293–294 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Sciences, Kavayitri Bahinabai Chaudhari North Maharashtra University, Jalgaon, Maharashtra, India
Manisha S. Deshmukh & Satish R. Kolhe

Authors

Manisha S. Deshmukh
View author publications
You can also search for this author in PubMed Google Scholar
Satish R. Kolhe
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manisha S. Deshmukh .

Editor information

Editors and Affiliations

University of South Dakota, Vermillion, SD, USA
K. C. Santosh
Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, India
Bharti Gawali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Deshmukh, M.S., Kolhe, S.R. (2021). A Modified Approach for the Segmentation of Unconstrained Cursive Modi Touching Characters Cluster. In: Santosh, K.C., Gawali, B. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2020. Communications in Computer and Information Science, vol 1380. Springer, Singapore. https://doi.org/10.1007/978-981-16-0507-9_36

Download citation

DOI: https://doi.org/10.1007/978-981-16-0507-9_36
Published: 26 February 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0506-2
Online ISBN: 978-981-16-0507-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics