An Automated System to Preprocess and Classify Medical Digital X-Rays

Sumera; Vaidehi, K.; Manivannan, R.

doi:10.1007/978-981-19-2840-6_59

Sumera¹²,
K. Vaidehi¹² &
R. Manivannan¹³

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 475))

493 Accesses
1 Citations

Abstract

Considering, the reality (fact) that innumerable medical (digital) images are collected at hospitals and numerous medical tests centres on a daily basis, a clear manifestation is that these images must be composed, stacked and accessed accurately for future references (CBIR). Sole motivation (reasoning) for proposition of this attempted work is to give a provision for categorizing or classifying medical X-rays automatically at macro-level (global level) to aid lab technicians (analysts) in their day job. GLCM is employed to draw out features or characteristics of (images) X-rays and resultant is utilized in building classification model by taking an advantage of SVM. Six different classes (groups) of X-ray images are taken, namely chest, foot, skull, neck, palm and spine. The proposed attempt to put forward a medical X-ray image classification process involves pre-processing of X-rays with an aim of making them fit for further processing. Digital X-rays in this current research are subjected to Pre-processing using a filtering operation (median filter), histogram filtering (or equalization) and CLAHE. The upshots of each are recorded. Subsequently, segmentation (connected component labelling), feature extraction (GLCM), classification (SVM). Lately post implementation, the outcomes vividly depict around 91% accuracy is acquired utilizing median filter accompanying GLCM and SVM whereas pre-processing images using, histogram equalization (HE) yielded 89% accuracy. Third blend (combination) of CLAHE, GLCM and SVM outshined with 96% accuracy. Consequently, CLAHE in estimation (comparison) to other Pre-processing methodologies outperformed classification results.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Computer Analysis of Chest X-Ray Images to Highlight Pathological Objects

Advancement in Classification of X-Ray Images Using Radial Basis Function with Support of Canny Edge Detection Model

Angular relational signature-based chest radiograph image view classification

Article 22 January 2018

Keywords

1 Introduction

One can offer a tremendous contribution to facilitate doctors and radiologists in (their) work of clinical diagnosis by designing an approach (structure) that could automate their tasks instead of alternately letting them perform manual observations. It gets quite tedious and also incurs surplus time for medical practitioners to manually perform image interpretation. Hence, new vogues for processing of images automatically through computers and medical systems and classifying those images are welcomed now-a-days. Benefitting this needy trend of therapeutic systems, this chore concentrates on developing a methodology (or system) to automate categorization (classification) of X-ray images into six groups. These six groups of X-ray images considered in this study are chest, foot, spine, neck, skull, palm. Images considered in this study are taken from IRMA image CLEF database. This thesis comprises of-four fragments. Fragment 2 provides some information on existing trends, information on work proposed is summarized in Fragment 3, Fragment 4 shows performance assessment with respect to current developed system. Finally, experimental outcomes are handed over to Fragment 5. Fragment 6 gives conclusion.

2 Existing Trends

Enormous work has been done in past and till date by experts across globe in developing medical systems to aid and automate medical diagnosis wherein [1] introduces an effort to interpret digital images, improve diagnosis quality using techniques like median filter, histogram Equalization, for quality enhancement, PCA, K-nearest-neighbour techniques, were applied for selecting features and classification [2]. Introduces a case-study wherein analysis of malicious software is done based on machine learning approaches. Reference [3] introduces a work wherein an effort has been made to develop a CBIR system to retrieve mammographic images of breast tissue and classify them as dense, fatty, glandular using statistical features and SVM. Reference [4] introduces research on a similar area (with splitting/merging scheme) wherein contrast equalizing histogram, FD, ZM and classifiers like KNN, MLP are used. Reference [5] presents work on TRUS images which uses techniques like M3 filter, DBSCAN clustering, SVM to demonstrate a close curative system. Reference [6] presents a work of similar kind wherein classification of images (breast mammograms) based on breast mass is proposed in which segmentation is performed using fuzzy C-means, GLCM features are taken out and adaboost, back propagation, neural networks, sparse representation classifiers are used. Reference [7] presents work wherein pectoral breast muscle cropping is done where Haralicks’ and Zernike descriptors facilitate extraction of features. Classifiers like BPN with SVM operate which resulted accuracy of 95.83%. Reference [8] presents a machine learning based approach to detect/identify images with glaucoma. A series of preprocessing and morphological operations are applied followed by segmenting optic-cup part and extracting rim-disc/cup-disk ratios as features and finally classifying using SVM. Reference [9] In this abstract, collection of (Lung) images are Pre-processed and manipulated with respect to pixel values to either remove distortion, filter the image, enhance contrast for making images fit for better visualization and interpretation [10]. Automated categorizing (of microorganisms) is suggested using SVM classifier. Before performing classification, system feeds image (to module) for Pre-processing then a phase to extract features. Reference [11] introduces thesis presenting a methodology for, classification of lung images in which lung data (images) are collected via lung database with a motto to group them as cancerous/non-cancerous. Prior to the final conclusion, images undergo a set of manipulations constituting Pre-processing (median filter), segmenting (fuzzy c means), GLCM and lastly SVM. Reference [12] shows a machine learning approach to recognize symbols (mathematical symbols) which are handwritten. Reference [13] shows an implementation of a system to detect expressions from a real time dataset of faces by making use of SVM for classification of facial emotions into six categories [14]. Proposes a detection system to ascertain type of breast tissue using GLCM features and FLDA classifier that yielded an accuracy of 72.93% using features based on texture and 82.48% using cascade features [15]. This paper gives an overview on different aspects/studies in educational data mining [16]. Introduces a survey/study on Detection of outliers based on machine learning approaches by considering IOT data for analysis. Reference [17] presents a case study on spam detection based on machine-learning approaches.

Reference [18] Presents classification implementation wherein an approach to classify X-rays by applying M3-filter (better image quality), CCL for segmenting, attributes related to texture, shape are withdrawn by operating with GLCM then classification via SVM [19]. Presents a work on image fusion using guided filters [20]. Introduces a review/study on multi-focus-mage fusion using diversified mechanisms.

3 Proposed Trend

Medical (diagnostic) images accumulated from IRMA (medical image dataset) are fed to Pre-processing module wherein CLAHE (Contrast Limiting AHE (adaptive histogram equalization)) algorithm enhances image calibre (Besides this, other filtering methods like median filter, histogram Equalization; are used to examine, compare the overall system performance) accompanied by segmentation utilizing Connected component Labelling. Texture features for interested region (ROI) are pulled out using GLCM statistics. SVM model is built to which images are fed for classification. System architecture is pictorially introduced in Fig. 1.

A flowchart of system architecture. I R M A; preprocessing; segmentation; feature extraction; classification. Query; preprocessing; segmentation; feature extraction; classification of feature vector into the chest, foot palm, neck, spine, and skull. Each step is followed by an arrow. — **Fig. 1**

3.1 A Module to Improve Quality of Images-Pre-processing

For quality intensification (also termed as image enhancement), intensities (or intensity values, pixel values) of digital X-rays are modified which gives finer visualizing, interpretation and displaying experience. It lowers noise within image. One among the pitfalls of median filter is to pull out any outliers (along) with fine or minute details since its’ hard for it to differentiate (distinguish) between the two. Median rate (value) will very minutely be affected by anything diminutive in size. Histogram Equalization emphasizes (stresses) on finding image’s global contrast that gives too dark, too light images. Since it is not ideal for image enhancement, a variation called CLAHE is implemented here. CLAHE concentrates on finding the local contrast of image areas thereby uniformly distributing image intensities. It also clips off or limits contrast to escape noise amplification if any. Sample images demonstrating Pre-processing strategies (techniques) are shown below in Figs. 2, 3, 4, and 5.

An X-ray and a screenshot of an application side by side. The X-ray on the left displays an original image of a skull. The screenshot on the right displays an equivalent histogram that plots pixel probability versus pixel intensity. Values are estimated. The pixel intensity is maximum at (10, 4500), (125, 1500), and (175, 1700). — **Fig. 2**

An X-ray and a screenshot of an application side by side. The X-ray on the left displays a pre-processed image of a skull. The screenshot on the right displays an equivalent histogram. — **Fig. 3**

3.2 A Module to Extract the Region of Interest-Segmentation

ROI (segmented part) whose features need to be extracted is found using CCL (Connected Component Labelling). This segmentation mode scans an X-ray, group pixels of it considering pixel connectivity. Pixels inside same group share indistinguishable characteristics. Each group of pixels are assigned a label, largest labelled component is the desired region-of-interest (ROI). Since CCL provides a very efficient, user-friendly way to segment the images thereby providing further ease to calculate and display portions of images based on 4-connectivity or 8-connectivity, this has been used in this work.

3.3 A Module to Calculate Statistics-Feature Extraction

When load (input) to algorithm is bulky to be processed (when image sizes are large enough), this input (images) can be represented as compact (feature vector). Here, a statistical methodology called GLCM (Gray Level co-occurrence matrix) is utilized for withdrawing of textural feature-based particulars of the images. GLCM matrices takes (original image’s) pixel intensity co-occurrences into consideration and forms a matrix. Resultant matrix is operated to discover texture statistics (features). Proposed work presented here dealt with extraction of 22 features altogether, couple of them are: correlation, dissimilarity, contrast, variance, entropy, energy homogeneity, etc.

3.4 A Module that does the Purpose-Classification

Feature sets of X-rays (used here) computed in previous segment are forwarded to establish a classifier (SVM here). Since all six categories/classes of images are finely pre-processed and features are withdrawn using ROI, SVM here analyzes these feature sets (of images) and classifies them into six categories where images belonging to same class share similar feature values. SVM principal is depicted pictorially. See Fig. 6.

A diagram displays a line overlapping on 3 parallel lines. The line at the center and the overlapping line are labeled optimal hyperplane l and separator hyperplane, respectively. The distance between the first and third line is labeled maximum margin. Square box and the circles on the right are labeled classe 1 and 2 support vectors, respectively. — **Fig. 6**

4 Performance-Evaluation

Thankfully, Machine Learning (ML) approaches/algorithms are furnished with innumerable alternatives to calculate correctness (accuracy) of any model being developed. This study aims at calculating accuracy (how well did the classifier performed) of classifier by utilizing a confusion matrix (which is a-combination of True positives (TP), True negatives (TN), False positives (FP),False negatives (FN) of predicted versus actual task results. Here the classifiers’ quality is ascertained using specificity, accuracy, sensitivity and specificity measures.

$${\text{Sensitivity}} = {\text{ TP}}/\left( {{\text{TP}}\, + \,{\text{FN}}} \right)\,{\text{Specificity }} = {\text{TN}}/\left( {{\text{TN}}\, + \,{\text{FP}}} \right)\,{\text{Precision}}\, = \,{\text{ TP}}/\left( {{\text{TP}}\, + \,{\text{FP}}} \right)$$

$${\text{Accuracy}} = \left( {{\text{TP}}\, + \,{\text{TN}}} \right)/\left( {{\text{TP}}\, + \,{\text{TN}}\, + \,{\text{FP}}\, + \,{\text{FN}}} \right)$$

5 Investigational Outcomes

Eventually, CLAHE Pre-processing worked best with GLCM, SVM offering 96% accuracy unlike median filtering, histogram filtering wherein classification result or accuracy was found as 91% ,89% respectively. Overall performance results of classifying X-ray images preprocessed using CLAHE are tabulated in Table 1 wherein; for the class Skull, precision, sensitivity, specificity values are 100%, 80%, 100% respectively with an accuracy of about 96.55% in contrast to the class Foot wherein the accuracy is much lesser i.e.; about 89.65%.Overall performance results of classifying X-ray images preprocessed using Histogram Equalization are tabulated in Table 2 wherein; for the class Neck, precision, sensitivity, specificity values are 100%, 70%, 100% respectively with an accuracy of about 94.82% in contrast to the class Spine wherein the accuracy is much lesser i.e.; about 84.48%.Overall performance results of classifying X-ray images preprocessed using Median Filter are tabulated in Table 3 wherein; for the class Neck, precision, sensitivity, specificity values are 100%, 80%, 100% respectively with an accuracy of about 96.55% in contrast to the class Spine wherein the accuracy is much lesser i.e.; about 84.48%. Figures 7, 8 and 9 presents output X-rays (images) post pre-processing, segmentation when those images were preprocessed using CLAHE, Histogram Equalization and Median Filter respectively. The performance measures discloses (shows) that CLAHE preprocessed (X-ray) along with GLCM (texture features) and SVM (classifier) gives a better/finer accuracy over other filters and serves the purpose. Hence, SVM-classification is quite acceptable for classifying X-rays (images). A bar graph (as) in Fig. 10 is opted to present a pictorial/vivid look on classification results.

Table 1 Performance results (measures) for GLCM, SVM- with CLAHE

Full size table

Table 2 Performance results (measures) for GLCM, SVM -with histogram equalization.

Full size table

Table 3 Performance results (measures) for GLCM, SVM with- median filter

Full size table

A chart of X-rays in six rows and four columns. The column headers are input, pre-processed images, C C L, and segmented images of the chest, foot, hand, face, neck, and spine from rows 1 to 6, respectively. — **Fig. 7**

A chart of x-rays in four columns and six rows. The column headers are input, pre-processed images, C C L, and segmented images. The row entries from columns 1 to 4 are as follows. Row 1: chest; row 2: foot; row 3: hand; row 4: face; row 5: neck; and row 6: spine. — **Fig. 8**

A grouped bar graph. Values are estimated. Median filter, G L C M, and S V M (precision, 84), (sensitivity, 63), (specificity, 97), (accuracy, 91). Histogram E Q, G L C M, S V M (precision, 73), (sensitivity, 62), (specificity, 96), (accuracy, 89). Clahe, G L C M, and S V M (precision, 92), (sensitivity, 82), (specificity, 98), and (accuracy, 95). — **Fig. 10**

6 Conclusion

In Proposal described here, medical (X-rays) images of six categories namely chest, spine, palm, skull, foot, neck are acquired from IRMA medical database. These images (X-rays) are subjected to Pre-processing module wherein three different image manipulation techniques/strategies namely Median-filter, Histogram Equalization and CLAHE are implemented (or worked with) to compare their performance/results and find which Pre-processing technique works best with GLCM, SVM. Images when Pre-processed with CLAHE outshined among all three with accuracy of 96% altogether, sensitivity of 82.91%, specificity of 97.96% and precision of 92.91%. This gave a conclusion that images when Pre-processed with CLAHE gave better accuracy post classification. Future scope of this proposed work is the extension (of current work) for content based image retrieval systems (CBIR). Unlike the six groups/classes of X-rays images specified above, other parts like knee, elbow, limbs, ankle etc. can also be taken into consideration for future upgrading of medical systems. A Heterogeneous (image) types like C.T, M.R.I, U.S.G etc. can also be worked with in future.

References

Fang Yang, MuratHamit ‘Feature-Extraction-and-classification-of-Esophageal-X-ray-images-of-Xinjiang-Kazak Nationality’Volume 2017, Article:ID 4620732.
Google Scholar
Dr. M. Upender kumar, Dr. D Shravani “Novel Design of Machine Learning for Malicious Software Analysis – Malicious URL Case Study” Vol 6 Issue 4 October 2018 – December 2018 pp 292–298 International Journal of Interdisciplinary Research and Innovations (IJIRI).
Google Scholar
Vaidehi, K., and T. S. Subashini. "Content- Based –Mammogram-Retrieval-based-on-Breast-Tissue-Characterization-using Statistical-Features." Research Journal of Applied Sciences, Engineering and Technology 8.7 (2014): 871–878.
Google Scholar
Nooshin,jafari and Hosssein “Medical-X –ray-image-hierarchical-classification-using-a-merging-and-splitting-scheme-in feature- space” March 2013 Volume 2, Issue11.
Google Scholar
R. Manavalan K. Thangaval “Evaluation-of-textural-feature-extraction-from-GLCM-for-prostate-cancer-TRUS-medical- images” ‘International Journal of- Computer Applications’ (No.0975 _ 8887), Volume :36– No:12, December2011
Google Scholar
Vaidehi, K., and T. S. Subashini. "Automatic-characterization-of-benign-and-malignant-masses-in-mammography." Procedia Computer Science 46 (2015): 1762–1769.
Google Scholar
V. Kaliyaperumal, S.Selvarajan “Automated-characterization- of mammographic-density-for-early-detection-of-breast-cancer risk”International-Journal-of-Simulation :Systems.Science & Technology. Feb 2014.
Google Scholar
Sumera, K. Vaidehi, K., and J. Shahistha. "Automated glaucoma detection using machine learning approaches" Turkish online journal of qualitative enquiry”-2021.
Google Scholar
S.Perumal1 and T.Velmurugan “Preprocessing-by-Contrast-Enhancement-Techniques-for-Medical-Images.” International- Journal -of Pure and Applied- Mathematics” Volume 118 No. 18 2018, 3681–3688.
Google Scholar
Vanitha.L. and Venmathi.A.R “Classification of- Medical-Images-Using-Support-Vector-Machine” 2011 International Conference-on-Information -and-Network-Technology.
Google Scholar
Usha Kumari “Lung Cancer Image-Feature Extraction- and-Classification using ‘GLCM’ and ‘SVM’ classifier” September 2019.
Google Scholar
Firdaus, Syeda Aliya, and K. Vaidehi. "Handwritten-mathematical-symbol-recognition-using-machine-learning techniques." Advances-in-Decision-Sciences,-Image Processing, Security-and-Computer-Vision. Springer, Cham, 2020. 658–671.
Google Scholar
Sathya, R., R. Manivannan, and K. Vaidehi. "Vision-Based Personal Face Emotional Recognition Approach Using Machine Learning and Tree-Based Classifier." Inventive Computation and Information Technologies. Springer, Singapore, 2022. 561–573.
Google Scholar
Vaidehi, K., and T. S. Subashini. "Automatic classification and retrieval of mammographic tissue density using texture features." 2015 IEEE 9th International Conference on Intelligent Systems and Control (ISCO). IEEE, 2015.
Google Scholar
Anjum, Nadia, and Srinivasu Badugu. "A study of different techniques in educational data mining." Advances in Decision Sciences, Image Processing, Security and Computer Vision (2020): 562–571.
Google Scholar
Nenavath chander, Dr. M Upender, Machine-learning-based-Outlier-Detection-Techniques-for-IOT: A comprehensive Survey Volume 12 Issue 1 January 2021 pp 144–158 International Journal of Advanced Research in Engineering and Technology (IJARET) IAEME publication.
Google Scholar
Amogh Deshmukh, Dr. M. Upender “Cyber-Security-Engineering-for-Malware-Analysis – Machine Learning for Spam detection case study” Vol 10 Issue 10 October 2019 pp 301–306 Journal of Engineering Sciences JES ISSN 0377-9254.
Google Scholar
Sumathi Ganesan ,T.S. Subashini “Classification-of-medical- X-ray images-for-Automated-annotation” Journal-of-Theoretical and-Applied-Information-Technology 31, May 2014. Vol. 63, No.3.
Google Scholar
Dulhare, Uma N., and Areej Mohammed Khaleed. "Taj-Shanvi Framework for Image Fusion Using Guided Filters." Data Management, Analytics and Innovation. Springer, Singapore, 2020. 419–427.
Google Scholar
Dulhare, Uma, Areej Mohammed Khaled, and Mohd Hussam Ali. "A review on diversified mechanisms for multi focus image fusion." Proceedings of International Conference on Communication and Information Processing (ICCIP). 2019.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Artificial Intelligence, Data Science and Computer Engineering, Stanley College of Engineering and Technology for Women, Abids, Hyderabad, India
Sumera & K. Vaidehi
Department of Computer Science Engineering, Stanley College of Engineering and Technology for Women, Abids, Hyderabad, India
R. Manivannan

Authors

Sumera
View author publications
You can also search for this author in PubMed Google Scholar
K. Vaidehi
View author publications
You can also search for this author in PubMed Google Scholar
R. Manivannan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sumera .

Editor information

Editors and Affiliations

Electronics and Communication Engineering, Gnanamani College of Technology, Namakkal, India
G. Ranganathan
Czech Technical University in Prague, Prague, Czech Republic
Robert Bestak
Ryerson Communications Lab, Toronto, ON, Canada
Xavier Fernando

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sumera, Vaidehi, K., Manivannan, R. (2023). An Automated System to Preprocess and Classify Medical Digital X-Rays. In: Ranganathan, G., Bestak, R., Fernando, X. (eds) Pervasive Computing and Social Networking. Lecture Notes in Networks and Systems, vol 475. Springer, Singapore. https://doi.org/10.1007/978-981-19-2840-6_59

Download citation

DOI: https://doi.org/10.1007/978-981-19-2840-6_59
Published: 02 September 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2839-0
Online ISBN: 978-981-19-2840-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

An Automated System to Preprocess and Classify Medical Digital X-Rays

Abstract

Similar content being viewed by others

Computer Analysis of Chest X-Ray Images to Highlight Pathological Objects

Advancement in Classification of X-Ray Images Using Radial Basis Function with Support of Canny Edge Detection Model

Angular relational signature-based chest radiograph image view classification

Keywords

1 Introduction

2 Existing Trends

3 Proposed Trend

3.1 A Module to Improve Quality of Images-Pre-processing

3.2 A Module to Extract the Region of Interest-Segmentation

3.3 A Module to Calculate Statistics-Feature Extraction

3.4 A Module that does the Purpose-Classification

4 Performance-Evaluation

5 Investigational Outcomes

6 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

An Automated System to Preprocess and Classify Medical Digital X-Rays

Abstract

Similar content being viewed by others

Computer Analysis of Chest X-Ray Images to Highlight Pathological Objects

Advancement in Classification of X-Ray Images Using Radial Basis Function with Support of Canny Edge Detection Model

Angular relational signature-based chest radiograph image view classification

Keywords

1 Introduction

2 Existing Trends

3 Proposed Trend

3.1 A Module to Improve Quality of Images-Pre-processing

3.2 A Module to Extract the Region of Interest-Segmentation

3.3 A Module to Calculate Statistics-Feature Extraction

3.4 A Module that does the Purpose-Classification

4 Performance-Evaluation

5 Investigational Outcomes

6 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation