Introduction

The Ministry of Health (MOH) and all of its affiliated bodies are institutions under the supervision of the Minister of Health in KSA. As part of their missions, doctors, analysts, pharmacists, etc., must process thousands of documents every day from all health organizations (healthcare reimbursements, hospitalizations, physiotherapy sessions, etc.). Besides their huge volume, these documents are sometimes handwritten and require hundreds of working hours per week to be processed and stored in databases.

The objective of our work is to identify the script of each block of text in the processed documents. Script recognition consists of finding the alphabet used to write a document. Observation of medical forms reveals different writings in the blocks representing the regions of interest. These regions are of significant importance for a patient's medical record and should therefore be recognized, whether to facilitate their storage in a knowledge database or their use by an indexing system. A block of text can be written in Arabic or Latin script, in handwritten or printed form (see Fig. 1). Manually identifying the language in which a block is written is no longer practical; automatic identification of the script and of the nature of its writing will route each homogeneous text block to an appropriate optical character recognition (OCR) system. That is why we focus on the best way to characterize the four kinds of writing: printed Arabic, manuscript Arabic, printed Latin and manuscript Latin. We rely on robust image processing tools to characterize homogeneous blocks extracted from medical forms; in particular, we use texture-based methods.

Fig. 1

Example of a medical document filled with Latin manuscript text and Arabic manuscript text

Background

Each script has specific visual characteristics, owing to variations in spatial properties such as character density and stroke orientation. For example, Western languages are characterized by a small alphabet and generally isolated signs. Conversely, the Arabic script is characterized by many ligatures and a strongly cursive style, while Asian languages use several thousand pictograms forming relatively complex internal structures of straight lines.

Script identification is a recent research area compared to optical character recognition (OCR). In any multilingual recognition system, it is very useful to know the script in which a text is written so as to direct it to the appropriate OCR engine (Latin, Arabic, Chinese, etc.). For printed documents, the variety of fonts, styles, sizes, etc., makes script recognition even more difficult.

The state of the art in script recognition shows that research on the topic is scarce and focuses mainly on printed documents, although there is also work on discriminating handwritten from printed text. Script identification works fall into three groups according to the level of analysis: the word (or pseudo-word) level, the line level and the block level. Block-based approaches assume that texts are normalized (equal height and width) and uniformly formatted (constant inter-line and inter-word spacing), and therefore contain only one script.

Zhou et al. [1] and Elgammal and Ismail [2] analyze horizontal and vertical projection profiles to discriminate different scripts. The efficiency of methods based on projection profiles decreases when the number of different alphabets is large.

Chaudhuri [3] presented an approach to the discrimination of printed and handwritten scripts for classifying the Bangla and Devanagari Indian scripts. Discrimination is based on structural and statistical characteristics at the line level.

Contrary to the vast majority of work in this field, which concerns the discrimination of whole blocks of text, the author of [4] proposed a discrimination between Arabic and Latin scripts at the word level on mixed blocks. The approach uses shape similarity for Arabic/non-Arabic script discrimination at the word level. To reduce computation time, the processing is preceded by a pre-selection of Arabic scripts based on the spatial relationships between connected components, without reducing the overall identification performance. This work was taken up by [5] with a discrimination method based on a recognition approach.

In [6], the authors demonstrated an automatic identification method for Arabic and Latin scripts in both handwritten and printed form. Another author suggested a method based on morphological and geometrical analysis for Arabic and Latin text-block discrimination, for both printed and handwritten shapes [7]. A framework based on the steerable pyramid transform at the word level for identifying Arabic and Latin scripts was proposed in [8]. Structural features at the word level were proposed in [9] to effectively recognize the Arabic or Latin script of machine-printed or handwritten documents. Additionally, Li and Tan [10] used statistical features at the character level to identify Arabic, English and Chinese scripts in camera-based images. The researchers in [11] provide a recent and detailed survey of script identification.

Several factors make the development of an automated assistant system for recognizing scripts extracted from handwritten medical forms a difficult task: the complexity of writings that appear illegible and incomprehensible; the variability of the writing of the same doctor or practitioner; the presence of writings resulting from several consultations recorded on the same document, for example overlapping lines and words; text written in the margin and/or between lines; and the deteriorated quality of several color images due to scanning or the use of carbon copies (Fig. 1).

Therefore, this research analyzes the image directly in grayscale, without filtering, geometric correction or restoration. This choice prevents us from using many existing methods, especially those based on segmentation.

The Proposed Approach

The main idea of our approach is to consider the entire text block to be labeled as a particular texture pattern. Each type of texture then has a homogeneous appearance characterizing one class of script, so we can reuse techniques that have already been applied successfully to distinguish textures.

Over the years, many approaches have been proposed for texture detection and quantification; nevertheless, texture analysis remains a very active research topic, and the continued attention it receives attests to the importance of texture for image understanding. Several surveys on texture analysis can be found in [12,13,14,15]. Statistical approaches rely on the intensity values of the pixels to compute numerical descriptors.

SGLD Calculation

The co-occurrence of gray levels, or the spatial dependence of intensities, is the most widely used second-order statistic in image processing. It is better known as the SGLD, for Spatial Gray Level Dependence. The SGLD counts the number of times two points of the image, at a given relative displacement, take a given pair of intensity values.

Let I be the image on which we compute the co-occurrence. I(x, y) is the intensity value observed at (x, y), and I(x + u, y + v) is the intensity value observed after a translation (u, v) of the coordinates. SGLD(u, v, i, j) counts the number of times I(x, y) takes the intensity value i while I(x + u, y + v) takes the intensity value j (Eq. 1). In statistical terms, co-occurrence makes it possible to compute the joint law of simultaneously observing the events (I(x, y) = i) and (I(x + u, y + v) = j) over all pixels (x, y) of the image.

$$SGLD(u,v,i,j) = {\text{Mes}}[(I(x,y) = i) \cap (I(x + u,y + v) = j)].$$
(1)
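As a concrete illustration of Eq. 1, here is a minimal sketch (not the authors' original implementation) that counts co-occurrences for a single fixed displacement (u, v); the function name and the convention that x indexes rows are our own.

```python
import numpy as np

def sgld_cartesian(image, u, v, n_levels):
    """SGLD(u, v, ., .) for one fixed displacement (u, v), as in Eq. 1.

    image: 2-D array of integer gray levels in [0, n_levels);
    here x indexes rows and y indexes columns.
    Returns M with M[i, j] = #{(x, y) : I(x, y) = i and I(x+u, y+v) = j}.
    """
    h, w = image.shape
    m = np.zeros((n_levels, n_levels), dtype=np.int64)
    for x in range(h):
        for y in range(w):
            xt, yt = x + u, y + v
            if 0 <= xt < h and 0 <= yt < w:   # translated point stays inside
                m[image[x, y], image[xt, yt]] += 1
    return m
```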

The SGLD is expressed according to the 4 variables (u, v, i, j) if we represent the spatial relation (u, v) in Cartesian coordinates. We can also represent it in polar coordinates (Eq. 2) by describing the displacement \((u,v)\) by its equivalent \((\rho \times \cos \theta ,\rho \times \sin \theta )\) (see Fig. 2).

Fig. 2

SGLD principle: a second-order statistic [16]

$$SGLD(\rho ,\theta ,i,j) = {\text{Mes}}[(I(x,y) = i) \cap (I(x + \rho \times \cos \theta ,y + \rho \times \sin \theta ) = j)].$$
(2)

Either of these two formulas can be used, but the polar version is more convenient since it makes explicit the displacement ρ and the orientation θ between the two points.

To characterize the different writings, the displacements ρ must be very small so as to measure the local form of the letters. With larger displacements ρ, we measure information on the layout, the interline distance and the height of the lines. The maximum displacement value ρmax defines the size of the analysis window and therefore the measurement scale.

Computing the co-occurrence of gray levels amounts to comparing the image with its translated copy for all possible displacements. The principle of co-occurrence is thus close to that of self-similarity, since it measures the shapes against themselves for all displacements of length ρ and every possible direction θ.

For each direction θ and each displacement ρ, we obtain a co-occurrence matrix of size Ng × Ng, where Ng is the number of gray levels in the image. The gray levels can be quantized into only Nmax levels to reduce the size of the resulting co-occurrence matrices. With displacements ρ limited to ρmax and θmax possible directions, the SGLD is thus an array of ρmax × θmax matrices, each of size Nmax × Nmax.
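The construction can be sketched as follows, reusing sgld_cartesian from above. This is our own illustrative reading of the text: gray levels are quantized to Nmax bins, polar displacements are rounded to integer pixel offsets, and the choice of θ values in [0, π) is an assumption.

```python
import numpy as np

def quantize(image, n_max=16):
    """Requantize gray levels into n_max bins to keep the matrices small."""
    img = image.astype(np.float64)
    span = img.max() - img.min()
    if span == 0:
        return np.zeros(img.shape, dtype=int)
    bins = ((img - img.min()) / span * n_max).astype(int)
    return np.minimum(bins, n_max - 1)

def sgld_stack(image, rho_max=8, n_theta=8, n_max=16):
    """Array of rho_max x n_theta co-occurrence matrices, each n_max x n_max.

    Polar displacements are rounded to the nearest integer offset, so for
    small rho several theta values collapse onto the same displacement.
    """
    q = quantize(image, n_max)
    stack = np.zeros((rho_max, n_theta, n_max, n_max), dtype=np.int64)
    for r in range(1, rho_max + 1):            # rho = 0 carries no information
        for t in range(n_theta):
            theta = t * np.pi / n_theta        # assumed directions in [0, pi)
            u = int(round(r * np.cos(theta)))
            v = int(round(r * np.sin(theta)))
            stack[r - 1, t] = sgld_cartesian(q, u, v, n_max)
    return stack
```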

To measure only the shape of the characters, we choose a small maximum displacement, ρmax = 8. This choice is compatible with the scale of the vast majority of available images regardless of their resolution; the corresponding analysis window is suitable for images with an average text-line height greater than 24 pixels. Since the size of the writing is not normalized across images, the SGLD may find two identical scripts of extremely different sizes dissimilar. On the other hand, the SGLD is robust to variations in image resolution as long as the scale remains the same; a very low-resolution image, however, loses information in the co-occurrence matrices for small displacement values (ρ = 1).

The co-occurrence matrices for ρ = 0 are not meaningful since they correspond to no displacement. Moreover, the discrete nature of images does not allow more than 4 directions for a displacement of 1 pixel, 8 directions for a displacement of 2 pixels, and so on; for small displacement values ρ, not all directions θ are therefore available.

All the SGLD co-occurrence matrices \({\text{Cooccurrence}}(\rho ,\theta ,i,j)\) together form four-dimensional data indexed by ρ, θ, i and j.

With the chosen values ρmax = 8, θmax = 8 and Nmax = 16, we obtain p = ρmax × θmax × Nmax × Nmax = 16,384 descriptors. The co-occurrence matrices therefore contain a considerable number of descriptors that cannot be used directly because of the “curse of dimensionality” [17]; this is precisely why raw co-occurrences are rarely exploited as such in image analysis. An effective reduction of the matrices is therefore necessary for a better exploitation of this multidimensional data.

The method proposed by Haralick et al. [18] remains a standard way to exploit the SGLD, notably for biomedical applications. We chose to use it first; we also designed a new CNN-based method, which we could not finish testing in this work for lack of time. We therefore computed the Haralick features and reduced them to the subset of characteristics that maximizes the discrimination rate (Fig. 3).

Fig. 3

Examples of SGLDs calculated for text blocks of the four script types: printed Latin, manuscript Latin, printed Arabic and manuscript Arabic

Haralick Features Calculation

Haralick [18] proposed 14 descriptors f1 to f14 to be computed on each of the grayscale co-occurrence matrices to describe the texture of an image. Some of these characteristics describe the presence of an organized structure while others reflect the complexity of an image or the nature of the transitions between the gray levels of the points of the image.

We present in the following a full description of these descriptors.

Let M be one of the co-occurrence matrices resulting from the SGLD for fixed values of displacements ρ = ρ0 and direction θ = θ0. If the image has Ng grayscales, then the matrix M will be of size Ng × Ng (Eq. 3).

$$M(i,j) = SGLD(\rho_{0} ,\theta_{0} ,i,j) = {\text{Mes}}[(I(x,y) = i) \cap (I(x + \rho_{0} \cos \theta_{0} ,y + \rho_{0} \sin \theta_{0} ) = j)] .$$
(3)

For each of the co-occurrence matrices, the joint probabilities \(P(i,j)\) are calculated by normalizing each value by the sum of the elements of the respective co-occurrence matrix (Eq. 4).

$$P(i,j) = \frac{M(i,j)}{{\sum\nolimits_{i = 0}^{{i < N_{\text{g}} }} {\sum\nolimits_{j = 0}^{{j < N_{\text{g}} }} {M(i,j)} } }}$$
(4)

\(\{ P(i,j)\}\) then becomes a distribution of joint probabilities to observe the two events \(\left( {I(x,y) = i} \right)\) and \(\left( {I(x + \rho_{0} \cos \theta_{0} ,y + \rho_{0} \sin \theta_{0} ) = j} \right)\) in an image I.

The characteristics of Haralick require the definition of:

Four Projections Px, Py, Px+y, Px−y

$$P_{x} (i) = \sum\limits_{j = 0}^{{j < N_{\text{g}} }} {P(i,j)}$$ is the sum of the P(i, j) along the i-th row.

$$P_{y} (j) = \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {P(i,j)}$$ is defined similarly in the vertical direction.

In terms of probabilities, the sum \(P_{x} (i)\) represents the marginal law of observing \((I(x,y) = i)\) over all points (x, y), for each gray level i. Similarly, the sum \(P_{y} (j)\) represents the second marginal law, that of observing \((I(x + \rho_{0} \cos \theta_{0} ,y + \rho_{0} \sin \theta_{0} ) = j)\) according to the gray level j. The projections \(P_{x + y} (k)\) and \(P_{x - y} (k)\) define two further marginal probability laws by summing along the two diagonals: the first measures the probability distribution of the events \(\left( {I(x,y) + I(x + \rho_{0} \cos \theta_{0} ,y + \rho_{0} \sin \theta_{0} ) = k} \right)\), while the second gives the distribution of the probabilities of the events \(\left( {\left| {I(x,y) - I(x + \rho_{0} \cos \theta_{0} ,y + \rho_{0} \sin \theta_{0} )} \right| = k} \right)\).

$$P_{x + y} (k) = \sum\limits_{i + j = k} {P(i,j)} ,\quad 0 \le k < 2N_{\text{g}} - 1,$$ is the sum of the P(i, j) along the ascending diagonals.

$$P_{x - y} (k) = \sum\limits_{\left| {i - j} \right| = k} {P(i,j)} ,\quad 0 \le k < N_{\text{g}} ,$$ is the sum of the P(i, j) along the descending diagonals.
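A compact sketch of Eq. 4 and the four projections is given below, under the same 0-indexed conventions as above (so i + j ranges over 0 … 2Ng − 2); the function names are ours.

```python
import numpy as np

def joint_probabilities(m):
    """Normalize a co-occurrence matrix into joint probabilities P(i, j) (Eq. 4)."""
    total = m.sum()
    return m / total if total > 0 else m.astype(float)

def projections(p):
    """Marginal laws Px, Py, Px+y and Px-y of a joint distribution P."""
    n = p.shape[0]
    px = p.sum(axis=1)                          # Px(i): sum over columns j
    py = p.sum(axis=0)                          # Py(j): sum over rows i
    i, j = np.indices((n, n))
    pxpy = np.bincount((i + j).ravel(), weights=p.ravel(),
                       minlength=2 * n - 1)     # Px+y(k), k = i + j
    pxmy = np.bincount(np.abs(i - j).ravel(), weights=p.ravel(),
                       minlength=n)             # Px-y(k), k = |i - j|
    return px, py, pxpy, pxmy
```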

Five Entropies HX, HY, HXY, HXY1, HXY2

Entropy is a function that measures the degree of disorder of a system; by extension, it can be used to characterize the regularity of a discrete distribution. Five entropies arise from the possible combinations of computation over the two marginal laws \(P_{x} (i)\) and \(P_{y} (j)\) and the joint probability distribution \(P(i,j)\):

$$\begin{aligned} & {\text{Entropy}}\;{\text{of}}\;P_{x} \quad HX = - \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {P_{x} (i) \times \log (P_{x} (i))} \\ & {\text{Entropy}}\;{\text{of}}\;P_{y} \quad HY = - \sum\limits_{j = 0}^{{j < N_{\text{g}} }} {P_{y} (j) \times \log (P_{y} (j))} \\ & {\text{Entropy}}\;{\text{of}}\;P\quad HXY = - \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {P(i,j) \times \log (P(i,j))} } \\ & {\text{Entropy}}\;{\text{of}}\;P\;{\text{on}}\;P_{x} \times P_{y} \quad HXY1 = - \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {P(i,j) \times \log \left( {P_{x} (i) \times P_{y} (j)} \right)} } \\ & {\text{Entropy}}\;{\text{of}}\;P_{x} \times P_{y} \quad HXY2 = - \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {P_{x} (i) \times P_{y} (j) \times \log \left( {P_{x} (i) \times P_{y} (j)} \right)} } \\ \end{aligned}$$
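These five entropies translate directly into code. In the sketch below, a small epsilon inside the logarithms implements the usual convention 0 · log 0 = 0; this is an implementation detail of ours, not from the paper.

```python
import numpy as np

EPS = 1e-12  # guards log(0); encodes the convention 0 * log(0) = 0

def entropies(p, px, py):
    """The five entropies HX, HY, HXY, HXY1 and HXY2 defined above."""
    hx = -np.sum(px * np.log(px + EPS))
    hy = -np.sum(py * np.log(py + EPS))
    hxy = -np.sum(p * np.log(p + EPS))
    prod = np.outer(px, py)                 # the product law Px(i) * Py(j)
    hxy1 = -np.sum(p * np.log(prod + EPS))
    hxy2 = -np.sum(prod * np.log(prod + EPS))
    return hx, hy, hxy, hxy1, hxy2
```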

Six Statistics

The Haralick descriptors use the averages \(\mu_{x}\) and \(\mu_{y}\) as well as the variances \(V_{x}\) and \(V_{y}\) of gray levels according to the distributions of Px and Py.

$$\begin{aligned} {\text{Averages}}\quad \mu_{x} & = \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {i \times P(i,j)} } \\ \mu_{y} & = \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {j \times P(i,j)} } \\ {\text{Variances}}\quad V_{x} & = \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {(i - \mu_{x} )^{2} \times P(i,j)} } \\ V_{y} & = \sum\limits_{i = 0}^{{i < N_{\text{g}} }} {\sum\limits_{j = 0}^{{j < N_{\text{g}} }} {(j - \mu_{y} )^{2} \times P(i,j)} } \\ \end{aligned}.$$

This yields the standard deviations \(\sigma_{x} = \sqrt {V_{x} }\) and \(\sigma_{y} = \sqrt {V_{y} }\) of the marginal laws Px and Py.

Finally, we define the average of the levels according to the marginal law Px−y: \(m_{x - y} = \sum\nolimits_{k = 0}^{{k < N_{\text{g}} }} {k \times P_{x - y} (k)}.\)
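The six statistics follow the definitions above verbatim; here is a minimal sketch (names ours), taking P and the projection Px−y as inputs.

```python
import numpy as np

def sgld_statistics(p, pxmy):
    """Means, variances, standard deviations, and the mean of Px-y."""
    n = p.shape[0]
    i, j = np.indices((n, n))
    mu_x = np.sum(i * p)
    mu_y = np.sum(j * p)
    v_x = np.sum((i - mu_x) ** 2 * p)
    v_y = np.sum((j - mu_y) ** 2 * p)
    sigma_x, sigma_y = np.sqrt(v_x), np.sqrt(v_y)
    k = np.arange(pxmy.size)
    m_xmy = np.sum(k * pxmy)                # average level of the |i - j| law
    return mu_x, mu_y, v_x, v_y, sigma_x, sigma_y, m_xmy
```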

All of these definitions allow us to present the fourteen textural attributes defined by Haralick [19,20,21] (Table 1):

Table 1 List of 14 Haralick descriptors
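The four descriptors used later in the experiments (f2, f6, f8, f11) can be sketched as follows, reusing projections from above and assuming the numbering of Haralick's original paper (f2 = contrast, f6 = sum average, f8 = sum entropy, f11 = difference entropy).

```python
import numpy as np

EPS = 1e-12

def selected_haralick(p):
    """f2, f6, f8 and f11 under the standard Haralick numbering (assumed)."""
    px, py, pxpy, pxmy = projections(p)
    k_sum = np.arange(pxpy.size)
    k_diff = np.arange(pxmy.size)
    f2 = np.sum(k_diff ** 2 * pxmy)              # contrast
    f6 = np.sum(k_sum * pxpy)                    # sum average
    f8 = -np.sum(pxpy * np.log(pxpy + EPS))      # sum entropy
    f11 = -np.sum(pxmy * np.log(pxmy + EPS))     # difference entropy
    return f2, f6, f8, f11
```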

Results and Discussion

Datasets

To validate our approach, we manually extracted 389 word blocks from medical forms. Each block of text may consist of one or more words of the same type of writing: Arabic manuscript, Latin manuscript, Arabic printed or Latin printed, giving four classes to discriminate. We call this part of our database HEL-ADU-W. The distribution of the words over the 4 classes is presented in the table below (see Table 2).

Table 2 HEL-ADU-W database

Experiments

After computing the SGLDs of the 389 images of homogeneous text blocks, divided into 4 classes, we calculated the 14 Haralick descriptors corresponding to each co-occurrence matrix.

Unlike factor analysis, which maximizes the variance of n observations as a function of p variables, discriminant analysis maximizes the separation of the n observations into their respective classes. There are many discrimination techniques; we retain Fisher's discriminant analysis, which assumes that the classes are approximately Gaussian and linearly separable. Because of the large number of descriptors, we apply a linear discriminant analysis (LDA) to 4 Haralick descriptors at a time (Table 3).

Table 3 Confusion matrix of the four classes MA, ML, PA and PL obtained by the SGLD with f2, f6, f8 and f11 of Haralick features
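As a sketch of this classification step, the snippet below runs a scikit-learn LDA with cross-validation on a feature matrix shaped like ours (389 blocks × 4 descriptors). The data here are random placeholders, since the HEL-ADU-W features are not reproduced in this paper.

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(0)
X = rng.normal(size=(389, 4))                       # placeholder f2, f6, f8, f11
y = rng.choice(["MA", "ML", "PA", "PL"], size=389)  # placeholder labels

lda = LinearDiscriminantAnalysis()                  # Fisher-style linear model
y_pred = cross_val_predict(lda, X, y, cv=5)         # out-of-fold predictions
print(confusion_matrix(y, y_pred, labels=["MA", "ML", "PA", "PL"]))
```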

The LDA results show an unambiguous separation between the classes Manuscript Arabic (MA), Manuscript Latin (ML), Printed Arabic (PA) and Printed Latin (PL) when using the SGLD with the f2, f6, f8 and f11 Haralick features. The confusion matrix shows that the printed classes PA and PL are perfectly recognized. We reached 98.95% discrimination for ML and 97.95% for MA. MA and ML present a small number of confusions, which is predictable because handwritten scripts share common shape characteristics, whether Arabic or Latin, owing to their cursive forms.

In addition, handwritten Arabic or Latin words on medical forms are often barely legible, with characters that are not even individually distinguishable; they are therefore much more difficult to characterize than printed scripts.

Conclusion

This study presented a successful contribution to the characterization of handwritten and printed writings in text blocks, toward the understanding of medical forms. We chose to characterize a whole homogeneous block by statistical methods based on the SGLD. For our Arabic/Latin, printed/manuscript identification problem, this approach, based on a global analysis of any text zone, has the advantage of handling writing without any segmentation into characters, thanks to the textural method.

Indeed, the objective of this study is not character recognition, since the classification of each character separately is not required, but the characterization of a word or block of words, which is treated as a kind of texture. We assume that a block of text containing a sufficiently large number of letters of variable frequencies constitutes a basis of statistically reliable observations for determining the type of writing independently of the textual content. This is the motivation for using a texture measurement as robust as the SGLD.

The results showed that descriptors based on co-occurrences, such as those derived from the SGLD, make it possible to recover the writing classes of our medical documents with high accuracy.