Multi-verse Optimization Clustering Algorithm for Binarization of Handwritten Documents

Elfattah, Mohamed Abd; Hassanien, Aboul Ella; Abuelenin, Sherihan; Bhattacharyya, Siddhartha

doi:10.1007/978-981-10-8863-6_17

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 727)

Abstract

Binarizing images of historical manuscripts is challenging because degraded manuscripts exhibit many different types of noise. This paper presents an automatic clustering algorithm for the binarization of handwritten documents (HD) based on multi-verse optimization. The multi-verse algorithm is used to find cluster centers in HD, where the number of clusters is predefined. The proposed approach is tested on the benchmark dataset used in the Handwritten Document Image Binarization Contest (H-DIBCO 2014) and is assessed through several performance measures. The experimental results are competitive with well-known binarization methods such as Otsu and Sauvola.

1 Introduction

Binarization remains an open challenge in document analysis. Historical manuscript images suffer from many kinds of noise, and downstream systems such as optical character recognition (OCR) and word spotting (WS) need a properly binarized image, so their accuracy depends directly on this step. Binarization is the process of extracting the text (black), free of noise, from the background (white) [1]. Typical kinds of noise include smudges, multiple colors, bleed-through, unclear backgrounds, shadows, and broken characters. Current systems rely on binarization as their first step, and this step is sensitive to noise. Such systems appear in many applications, such as handwriting recognition, watermarking, and data hiding [2].

Thresholding approaches can be either local or global. In the case of degraded images, global approaches do not perform well [3]. Otsu [4], Kapur et al. [5], and Kittler and Illingworth [6] are considered global methods, while Niblack [7], Sauvola and Pietikäinen [8], and Bernsen [9] are considered local methods. Many different approaches to binarizing degraded images have been presented in the literature. However, the binarization process is still an open challenge [10].

Recently, meta-heuristic optimization algorithms have found a wide range of applications, such as feature selection and image processing. Nature-inspired algorithms are a well-known family of such optimizers. In these algorithms, the local optima problem can be mitigated by sharing information between candidates [11]. Therefore, in this paper, a new clustering algorithm is proposed using one of the recent optimization algorithms, the multi-verse optimizer (MVO) [11]. The algorithm is applied to the binarization of historical documents.

The rest of this paper is organized as follows: Sect. 2 introduces the basics of the MVO algorithm. Section 3 presents the proposed approach. Section 4 reports and discusses the experimental results. Finally, conclusions and future work are presented in Sect. 5.

2 Preliminaries: Multi-verse Optimizer (MVO)

MVO is a recent nature-inspired algorithm proposed by Mirjalili et al. [11]. It is based on three concepts from cosmology: the white hole, the black hole, and the wormhole. The exploration phase is based on white and black holes, while wormholes are employed to improve solution quality in the exploitation phase [11].

At each iteration, the universes are sorted by their inflation rate, and roulette wheel selection is used to choose one of them to hold a white hole. The population of universes is represented as:

$$\mathbf{U} = \begin{bmatrix} y_{1}^{1} & \ldots & y_{1}^{v} \\ \vdots & \ddots & \vdots \\ y_{n}^{1} & \ldots & y_{n}^{v} \end{bmatrix} \tag{1}$$

where v is the number of parameters (variables) and n is the number of universes. Parameters are exchanged between universes according to Eq. 2:

$$y_{i}^{j} = \begin{cases} y_{k}^{j} & r1 < NI(Ui) \\ y_{i}^{j} & r1 \ge NI(Ui) \end{cases} \tag{2}$$

where $y_{i}^{j}$ denotes the jth parameter of the ith universe, Ui is the ith universe, NI(Ui) is the normalized inflation rate of the ith universe, r1 is a random value in [0, 1], and $y_{k}^{j}$ is the jth parameter of the kth universe, chosen by a roulette wheel selection mechanism [11].
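The exchange in Eq. 2 can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function name `white_hole_exchange`, the min-max normalization of the inflation rate, and the choice of roulette weights that favor low-cost universes are all assumptions made for a minimization problem.

```python
import numpy as np

def white_hole_exchange(universes, fitness, rng):
    """Apply the Eq. 2 exchange once to every parameter of every universe.

    universes: (n, v) array, one candidate solution per row.
    fitness:   (n,) array of objective values (lower is better here).
    """
    universes = np.asarray(universes, dtype=float)
    fitness = np.asarray(fitness, dtype=float)
    n, v = universes.shape
    # Assumption: min-max normalisation of the inflation rate into [0, 1].
    span = fitness.max() - fitness.min()
    ni = (fitness - fitness.min()) / span if span > 0 else np.zeros(n)
    # Assumption: roulette weights favour low-cost universes as white holes.
    weights = (1.0 - ni) + 1e-12
    probs = weights / weights.sum()

    updated = universes.copy()
    for i in range(n):
        for j in range(v):
            r1 = rng.random()
            if r1 < ni[i]:                   # first case of Eq. 2
                k = rng.choice(n, p=probs)   # roulette wheel selection
                updated[i, j] = universes[k, j]
            # otherwise y_i^j is left unchanged (second case of Eq. 2)
    return updated
```

Under these assumptions the best universe has a normalized inflation rate of 0, so it never receives parameters from others, while the worst universe is always overwritten.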

To update the solutions, two parameters, the Wormhole Existence Probability (WEP) and the Traveling Distance Rate (TDR), are calculated using Eqs. 3 and 4, respectively:

$$\text{WEP} = \min + l \times \left( \frac{\max - \min}{L} \right) \tag{3}$$

where min (0.2) and max (1) are the minimum and maximum values, as in Table 1, l is the current iteration, and L is the maximum number of iterations. TDR is computed as:

$$\text{TDR} = 1 - \left( \frac{l^{1/p}}{L^{1/p}} \right) \tag{4}$$

where p is the exploitation accuracy; a larger value of p yields more accurate local search (exploitation). The positions of the solutions are updated according to Eq. 5:

$$y_{i}^{j} = \begin{cases} \begin{cases} Y_{j} + \text{TDR} \times \left( \left( ub_{j} - lb_{j} \right) \times r4 + lb_{j} \right) & r3 < 0.5 \\ Y_{j} - \text{TDR} \times \left( \left( ub_{j} - lb_{j} \right) \times r4 + lb_{j} \right) & r3 \ge 0.5 \end{cases} & r2 < \text{WEP} \\ y_{i}^{j} & r2 \ge \text{WEP} \end{cases} \tag{5}$$

where $Y_{j}$ denotes the jth parameter of the best universe; $lb_{j}$ and $ub_{j}$ denote the lower and upper bounds of the jth variable; r2, r3, and r4 are random numbers in [0, 1]; and $y_{i}^{j}$ denotes the jth parameter of the ith universe. TDR and WEP are the coefficients defined in Eqs. 3 and 4 [11].
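To make Eqs. 3 to 5 concrete, the following Python sketch computes WEP and TDR and applies the wormhole update to a whole population at once. It is an illustration under stated assumptions rather than the authors' code: the function names are hypothetical, the default `p=6.0` is a placeholder (the value actually used belongs to Table 1), and clipping the result to the bounds is an added safeguard that Eq. 5 does not specify. The values 0.2 and 1 for min and max come from the text.

```python
import numpy as np

def wep(l, L, wep_min=0.2, wep_max=1.0):
    """Eq. 3: WEP grows linearly from wep_min towards wep_max."""
    return wep_min + l * (wep_max - wep_min) / L

def tdr(l, L, p=6.0):
    """Eq. 4: TDR shrinks towards 0 as l approaches L (p is a placeholder)."""
    return 1.0 - (l ** (1.0 / p)) / (L ** (1.0 / p))

def wormhole_update(universes, best, lb, ub, l, L, rng, p=6.0):
    """Eq. 5 applied element-wise to an (n, v) population."""
    universes = np.asarray(universes, dtype=float)
    n, v = universes.shape
    r2, r3, r4 = rng.random((3, n, v))
    step = tdr(l, L, p) * ((ub - lb) * r4 + lb)
    # Inner cases of Eq. 5: move around the best universe.
    around_best = np.where(r3 < 0.5, best + step, best - step)
    # Outer cases of Eq. 5: a wormhole exists only when r2 < WEP.
    moved = np.where(r2 < wep(l, L), around_best, universes)
    return np.clip(moved, lb, ub)  # assumption: clamp to the search bounds
```

For grayscale cluster centers one would typically call this with `lb=0` and `ub=255`. As l approaches L, WEP approaches 1 and TDR approaches 0, so almost every parameter is pulled onto the best universe with a vanishing step.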

3 The Proposed Binarization Approach

The approach starts by applying the MVO algorithm to the degraded manuscript image to find the optimal cluster centers, using the objective function given in Eq. 6, as in the basic K-means clustering algorithm [12]. The resulting cluster centers are then used to create the binary (BW) image, in which the foreground is represented by white pixels and the darkest cluster denotes the text. At every iteration, each search agent (universe) updates its position with respect to the best position. Finally, the cluster centers are updated, and the binary image is created. Figure 1 illustrates the general architecture of the proposed binarization approach and its phases.
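The final step, turning cluster centers into a binary image, might look like the sketch below. It assumes a single-channel grayscale image and one intensity value per cluster center; the function name `text_mask_from_centers` is hypothetical. It returns a boolean foreground mask (True for text pixels), leaving the choice of black or white rendering to the caller.

```python
import numpy as np

def text_mask_from_centers(gray, centers):
    """Label each pixel with its nearest centre; the darkest cluster is text."""
    gray = np.asarray(gray, dtype=float)
    centers = np.asarray(centers, dtype=float)
    # Distance of every pixel to every centre: shape (H, W, k).
    dist = np.abs(gray[..., None] - centers)
    labels = dist.argmin(axis=-1)
    text_cluster = centers.argmin()   # index of the darkest centre
    return labels == text_cluster     # True where the pixel belongs to text
```

With two clusters this reduces to thresholding at the midpoint between the two centers.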

3.1 Fitness Function and MVO Parameters

Equation 6 is the squared error function used as the objective function of the multi-verse optimization algorithm, as in k-means clustering [12]:

$$J = \sum_{j=1}^{k} \sum_{i=1}^{n} \left\| x_{i}^{(j)} - c_{j} \right\|^{2} \tag{6}$$

where $\left\| x_{i}^{(j)} - c_{j} \right\|^{2}$ is the distance measure between a data point $x_{i}^{(j)}$ and the cluster center $c_{j}$. J therefore indicates the distance of the n data points from their respective cluster centers.

The main goal of MVO is to minimize this function. Each cluster is represented by a single centroid. Each universe represents one solution, and its position is updated with respect to the best solution. As with any optimization algorithm, some parameter values must be set to obtain good performance from the proposed approach. Table 1 lists the MVO parameter settings.
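As an illustration of Eq. 6, the fitness of one universe (one set of candidate centers) could be evaluated as follows. This sketch assumes grayscale intensities as the data points and assigns each point to its nearest center, as in k-means; the function name `clustering_cost` is hypothetical.

```python
import numpy as np

def clustering_cost(pixels, centers):
    """Eq. 6: sum of squared distances from each point to its nearest centre."""
    pixels = np.asarray(pixels, dtype=float).reshape(-1, 1)
    centers = np.asarray(centers, dtype=float).reshape(1, -1)
    sq_dist = (pixels - centers) ** 2        # shape (n, k)
    return float(sq_dist.min(axis=1).sum())  # nearest-centre assignment
```

For example, the pixels 10, 30, 200, 220 with centers 20 and 210 each lie 10 gray levels from their center, giving J = 4 × 100 = 400.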

Table 1 MVO parameter settings


4 Experimental Results and Discussion

The H-DIBCO 2014 dataset [13] is used to evaluate the proposed approach. It contains ten handwritten images with different kinds of noise, collected from the tranScriptorium project [14], and is available with its ground truth. The images exhibit representative degradations such as bleed-through, faint characters, smudges, and low contrast.

To evaluate the proposed approach, several performance measures [15] are used: F-measure [16, 17], Negative Rate Metric (NRM) [18], Peak Signal-to-Noise Ratio (PSNR) [17], Distance Reciprocal Distortion (DRD) [2], and Misclassification Penalty Metric (MPM) [18, 19]. Higher values of F-measure and PSNR and lower values of DRD, NRM, and MPM indicate better results. In addition, visual inspection is used.
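Three of these measures are simple enough to sketch directly from pixel counts. The code below uses the standard definitions (F-measure as the harmonic mean of precision and recall, PSNR with a unit peak value for binary images, NRM as the mean of the false-negative and false-positive rates); these formulas are not restated in this paper, so treat the sketch as an assumption about how the scores are computed. The function name `binarization_scores` is hypothetical, and DRD and MPM are omitted because they need neighborhood and contour-distance information.

```python
import numpy as np

def binarization_scores(pred_text, gt_text):
    """pred_text, gt_text: boolean arrays, True where a pixel is text.

    Assumes both images contain text and background pixels, so no
    denominator below is zero.
    """
    pred = np.asarray(pred_text, dtype=bool)
    gt = np.asarray(gt_text, dtype=bool)
    tp = np.sum(pred & gt)
    fp = np.sum(pred & ~gt)
    fn = np.sum(~pred & gt)
    tn = np.sum(~pred & ~gt)

    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 100.0 * 2 * precision * recall / (precision + recall)

    # With pixel values in {0, 1}, the MSE is the fraction of wrong pixels.
    mse = (fp + fn) / pred.size
    psnr = 10.0 * np.log10(1.0 / mse) if mse > 0 else float("inf")

    nrm = 0.5 * (fn / (fn + tp) + fp / (fp + tn))
    return {"f_measure": f_measure, "psnr": psnr, "nrm": nrm}
```

The F-measure is scaled to a percentage so that it is comparable with values such as 98.09 reported for Table 2.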

Table 2 presents the results of MVO on H-DIBCO 2014. The highest PSNR value is obtained on H08 (24.59), while the lowest is on H07 (15.32). The highest F-measure (98.09) is on H03, while the lowest is on H07 (84.85). The best DRD value is also on H03 (0.92), as are the best NRM and MPM values (0.01 and 0.03, respectively).

Table 2 Results of MVO on H-DIBCO 2014


Table 3 compares the approaches submitted to the H-DIBCO 2014 competition [13] with the proposed MVO algorithm. In Table 3, the numbers 1 to 4 indicate the rank of the submitted methods, listed with their F-measure, PSNR, and DRD values. MVO outperforms the well-known methods (Otsu and Sauvola) and the fourth-ranked method on all performance measures.

Table 3 Results of MVO compared with the state-of-the-art methods on H-DIBCO 2014


For visual inspection, two images, H08 and H10, are selected (Figs. 2 and 3). The figures compare the ground truth images (Figs. 2b and 3b) with the MVO output images (Figs. 2c and 3c). The output images are very close to the ground truth images, with complete character structure, although some minor noise remains.

The convergence rate is the last measure used to evaluate the proposed binarization approach. In each iteration, the solution with the best fitness is kept and used to plot the convergence curves in Fig. 4. The figure shows the convergence curves for two different images; the fitness value falls as the number of iterations increases, which demonstrates the convergence of the proposed approach. The fitness value also drops sharply. Since the optimization problem here is a minimization problem, we conclude from this figure that MVO is a promising approach to the binarization problem.
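A convergence curve of this kind is just the best-so-far fitness recorded after every iteration. A minimal sketch, assuming the best fitness of each iteration has been collected in a list (the function name `convergence_curve` is hypothetical):

```python
import numpy as np

def convergence_curve(best_per_iteration):
    """Best-so-far fitness after each iteration (minimisation)."""
    values = np.asarray(best_per_iteration, dtype=float)
    return np.minimum.accumulate(values)
```

Because the running minimum never increases, the resulting curve is monotonically non-increasing even if an individual iteration produces a worse population.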

5 Conclusions and Future Works

This paper presents a binarization approach based on the MVO algorithm, which is employed to minimize the distance between the data points and their cluster centers. The convergence curves indicate that the MVO algorithm converges quickly. The approach can deal with various kinds of noise.

As future work, a preprocessing phase is planned to improve the accuracy of binarization. Furthermore, hybridization with other optimization algorithms will be used to improve the results in [20,21,22,23,24]. A comparative analysis between the basic MVO and a chaotic version of it, based on different chaos maps and different objective functions, will also be presented, with the aim of improving the OCR recognition rate in [25, 26] by using it in the binarization phase.

References

1. Mesquita RG, Silva RM, Mello CA, Miranda PB (2015) Parameter tuning for document image binarization using a racing algorithm. Expert Syst Appl 42(5):2593–2603
2. Lu H, Kot AC, Shi YQ (2004) Distance-reciprocal distortion measure for binary document images. IEEE Signal Process Lett 11(2):228–231
3. Singh BM, Sharma R, Ghosh D, Mittal A (2014) Adaptive binarization of severely degraded and non-uniformly illuminated documents. Int J Doc Anal Recognit (IJDAR) 17(4):393–412
4. Otsu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1):62–66
5. Kapur JN, Sahoo PK, Wong AK (1985) A new method for gray-level picture thresholding using the entropy of the histogram. Comput Vis Graph Image Process 29(3):273–285
6. Kittler J, Illingworth J (1986) Minimum error thresholding. Pattern Recognit 19(1):41–47
7. Niblack W (1985) An introduction to digital image processing. Strandberg Publishing Company
8. Sauvola J, Pietikäinen M (2000) Adaptive document image binarization. Pattern Recognit 33(2):225–236
9. Bernsen J (1986) Dynamic thresholding of grey-level images. Int Conf Pattern Recognit 2:1251–1255
10. Hadjadj Z, Cheriet M, Meziane A, Cherfa Y (2017) A new efficient binarization method: application to degraded historical document images. Signal Image Video Process 1–8
11. Mirjalili S, Mirjalili S, Hatamlou A (2016) Multi-verse optimizer: a nature-inspired algorithm for global optimization. Neural Comput Appl 27(2)
12. MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Oakland, CA, USA, vol 1, pp 281–297
13. Ntirogiannis K, Gatos B, Pratikakis I (2014) ICFHR2014 competition on handwritten document image binarization (H-DIBCO 2014). In: 2014 14th international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 809–813
14. http://transcriptorium.eu
15. Gatos B, Ntirogiannis K, Pratikakis I (2009) ICDAR 2009 document image binarization contest (DIBCO 2009). In: 10th international conference on document analysis and recognition, 2009 (ICDAR’09). IEEE, pp 1375–1382
16. Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45(4):427–437
17. Ntirogiannis K, Gatos B, Pratikakis I (2013) Performance evaluation methodology for historical document image binarization. IEEE Trans Image Process 22(2):595–609
18. Pratikakis I, Gatos B, Ntirogiannis K (2010) H-DIBCO 2010-handwritten document image binarization competition. In: 2010 international conference on frontiers in handwriting recognition (ICFHR). IEEE, pp 727–732
19. Young DP, Ferryman JM (2005) PETS metrics: on-line performance evaluation service. In: Joint IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance (VS-PETS), pp 317–324
20. Elfattah MA, Abuelenin S, Hassanien AE, Pan JS (2016) Handwritten Arabic manuscript image binarization using sine cosine optimization algorithm. In: International conference on genetic and evolutionary computing. Springer, pp 273–280
21. Mostafa A, Fouad A, Elfattah MA, Hassanien AE, Hefny H, Zhu SY, Schaefer G (2015) CT liver segmentation using artificial bee colony optimisation. Procedia Comput Sci 60:1622–1630
22. Mostafa A, Elfattah MA, Fouad A, Hassanien AE, Hefny H (2016) Wolf local thresholding approach for liver image segmentation in CT images. In: Proceedings of the second international Afro-European conference for industrial advancement (AECIA 2015). Springer, pp 641–651
23. Ali AF, Mostafa A, Sayed GI, Elfattah MA, Hassanien AE (2016) Nature inspired optimization algorithms for CT liver segmentation. In: Medical imaging in clinical applications. Springer, pp 431–460
24. Hassanien AE, Elfattah MA, Aboulenin S, Schaefer G, Zhu SY, Korovin I (2016) Historic handwritten manuscript binarisation using whale optimisation. In: 2016 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 003842–003846
25. Sahlol AT, Suen CY, Zawbaa HM, Hassanien AE, Elfattah MA (2016) Bio-inspired bat optimization algorithm for handwritten Arabic characters recognition. In: 2016 IEEE congress on evolutionary computation (CEC). IEEE, pp 1749–1756
26. Sahlol A, Elfattah MA, Suen CY, Hassanien AE (2016) Particle swarm optimization with random forests for handwritten Arabic recognition system. In: International conference on advanced intelligent systems and informatics. Springer, pp 437–446

Author information

Authors and Affiliations

Faculty of Computers and Information, Computer Science Department, Mansoura University, Dakahlia Governorate, Egypt
Mohamed Abd Elfattah & Sherihan Abuelenin
Faculty of Computers and Information, Cairo University, Giza, Egypt
Aboul Ella Hassanien
Department of Computer Application, RCC Institute of Information Technology, Kolkata, 700015, India
Siddhartha Bhattacharyya
Scientific Research Group in Egypt (SRGE), Cairo, Egypt
Mohamed Abd Elfattah, Aboul Ella Hassanien & Siddhartha Bhattacharyya

Corresponding author

Correspondence to Siddhartha Bhattacharyya.

Editor information

Editors and Affiliations

Department of Computer Application, RCC Institute of Information Technology, Kolkata, West Bengal, India
Siddhartha Bhattacharyya
Department of Engineering Science and Management, RCC Institute of Information Technology, Kolkata, West Bengal, India
Anirban Mukherjee
Department of Information Technology, RCC Institute of Information Technology, Kolkata, West Bengal, India
Hrishikesh Bhaumik
Electronics and Communication Sciences Unit, Indian Statistical Institute, Kolkata, West Bengal, India
Swagatam Das
Department of Human Intelligence Systems, Kyushu Institute of Technology, Wakamatsu-ku, Kitakyushu, Fukuoka, Japan
Kaori Yoshida

Cite this paper

Elfattah, M.A., Hassanien, A.E., Abuelenin, S., Bhattacharyya, S. (2019). Multi-verse Optimization Clustering Algorithm for Binarization of Handwritten Documents. In: Bhattacharyya, S., Mukherjee, A., Bhaumik, H., Das, S., Yoshida, K. (eds) Recent Trends in Signal and Image Processing. Advances in Intelligent Systems and Computing, vol 727. Springer, Singapore. https://doi.org/10.1007/978-981-10-8863-6_17

DOI: https://doi.org/10.1007/978-981-10-8863-6_17
Published: 10 May 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8862-9
Online ISBN: 978-981-10-8863-6
eBook Packages: Engineering, Engineering (R0)
