Introduction

Celiac disease is an autoinflammatory ailment, triggered in genetically susceptible individuals by the consumption of food containing dietary gluten. Gluten is a protein found in some grains such as wheat, barley, and rye [1]. In celiac disease, gluten reactivity causes destruction of the small intestinal villi, which are primarily responsible for absorption of nutrients into the blood. Damage to the villi can result in diarrhea, abdominal discomfort and malabsorption [2].

The disease is present in countries globally, with a prevalence rate of about 1%. Although more celiac cases are surfacing with the advent of better diagnostic tools, a large number of cases remain undiagnosed [3]. Detecting celiac disease is challenging due to its limited awareness amongst some clinicians and patients, and limited diagnostic capabilities in under-resourced countries. The prevalence of celiac disease, being associated with other diseases like diabetes mellitus, thyroid and liver diseases, is high [4]. Hence, accurate diagnosis and treatment is imperative. These reasons warrant the need for an accurate, automated diagnostic tool.

Celiac disease can be pathologically subdivided, depending upon the severity of mucosal atrophy, into stages 0, 1, 2, and 3, using the Marsh scoring system [4].

Stage 0: This is known as the pre-infiltrative stage, the mucosa appears normal.

Stage 1: This is the infiltrative stage, where cells on the surface of the intestinal lining (the epithelium) have more lymphocytes than normal.

Stage 2: This is the infiltrative-hyperplastic stage, also known as crypt hyperplasia. Besides having more intra-epithelial lymphocytes than normal, depressions known as “crypts” that exist between the intestinal villi become longer than that of the normal intestinal lining, giving rise to crypt hyperplasia.

Stage 3: At this stage, increased intraepithelial lymphocytes are accompanied by crypt hyperplasia and shrinkage of the villi (villus atrophy). Stage 3 can be subdivided into stages 3a, 3b and 3c, in which there is progressive flattening of the villi. [5]. Figure 1 illustrates a healthy intestinal mucosa and an affected (atrophic) mucosa. Intestinal biopsy during standard endoscopy with analysis of the villi in the biopsy specimen is the yardstick for celiac disease diagnosis [7]. Normally the duodenum, which is a component of the small intestine, is most affected by celiac disease. Hence, healthcare experts conduct standard endoscopic biopsy by examining the proximal duodenum. Besides being invasive however, this method can be costly [7]. On the contrary, video capsule endoscopy (VCE) is noninvasive and relatively inexpensive [8]. The capsule, which is swallowed, has a video camera embedded within. This transits along the gastrointestinal tract after it is swallowed by the subject together with some water. The images obtained by the capsule during its journey along the gastrointestinal tract are recorded by a device attached to a retaining belt. The recording device is removed from the belt after about eight hours, while the capsule is excreted with the feces. Once data is obtained, by reading the video capsule data, one can only comment on whether villous atrophy is present and if it looks severe or mild. A quantitative classification based on the VCE image is currently not possible. Visual examination of the images from VCE is subject to interobserver variability, and has limited spatiotemporal resolution. Hence, diagnosing celiac disease via VCE is less precise and arduous. This warrants the need for a more pragmatic, computer-aided detection(CAD) system to diagnose the disease using VCE. Herein, this study underscores the use of a computerized technique to assist in diagnosis of celiac disease, and summarizes the CAD systems employed in other studies. Table 1 discusses the varied CAD systems exploited by other researchers for the automated categorization of celiac disease. Vecsei et al. [9] employed Fourier domain features coupled with SVM, kNN and Bayes classifiers, achieving an accuracy of 94% while Koh et al. [6] examined nonlinear features yielding an accuracy of 86.47%. Ciaccio et al. [10, 11] employed statistical methods such as using mean and standard deviation to compare image brightness, achieving specificities of 96% and 93.5%, respectively. The same researchers delved into dominant frequency analysis [12] and also explored histogram level techniques [14], achieving a sensitivity of 84.6% for combined and single basis images respectively. Gadermayr et al. [13] and Zhou et al. [15] developed unique features extracted from deep learning techniques. In our study herein, daisy descriptors are used as a cutting edge and novel method for feature extraction, prior to assessment for the presence of celiac disease After concealing video image borders, feature extraction was performed via DAISY description. The features were then reduced using Shannon entropy, after which particle swarm optimisation (PSO) was executed for its proficiency in selection of features, for classification.

Fig. 1:
figure 1

Illustration of healthy mucosa(left) and atrophic mucosa(right).(Figure taken with permission from Ref [6]).

Table 1: A summary of CAD systems used for celiac disease.

Data Used

The data used in this study are VCE images obtained from 16 healthy individuals and 21 celiac patients from the Columbia University Medical Center in New York. A PillCam™ SB [Medtronic, Minneapolis, MN] was swallowed by each subject, one that is adept in attaining images of the small intestine. Figure 2 presents images of damaged (celiac) and normal villi present in the duodenum. Two different PillCam devices were incorporated in the study, specifically the PillCams SB2 [16] and SB3 models [17]. PillCam SB2 was used for acquisition from 9 celiac patients and 8 control individuals, while PillCam SB3 obtained images from 12 celiac patients and 8 control individuals. The rate at which images are attained is two frames per second for the SB2, and an adaptive frame rate of 2-6 frames per second is used for the SB3 capsule. The image resolution is 576 × 576 pixels, and the videoclip function enables between 100 to 1000 images to be downloaded as a movie. The SB2 images were captured from portions of the small intestine consisting of the duodenal bulb, distal duodenum and jejunum, as well as the jejunum and ileum in close proximity. The SB3 images were captured from three levels, i.e., the duodenum, jejunum, and ileum.

Fig. 2:
figure 2

Representation of celiac (a) and normal (b) images of villi attained via SB3 PillCam.

Methodology

A combination of the SB2 and SB3 videos were utilized for this study. Table 2 lists the characteristics of the images obtained from the small intestine. The images acquired from the PillCam devices were masked before extraction of DAISY descriptor features [18]. The DAISY descriptor is a computation that enables the transformation of a local image domain into low dimensional continuous descriptors that are apposite for pairing and classification. The DAISY descriptor [19] can be applied in a concentrated manner or to areas extracted around points of interest. Employing this method reduces error rates, cost of computation, as well as the storage requirements of the descriptor. The DAISY descriptor typically comprises five facets: feature detector, summation, robust normalization, PCA dimension reduction, and quantize and compress. The first three aspects are particularly pertinent for all types of application. The feature detector calculates a vector of k feature detector responses for every pixel in the input data. In summation, the feature detector responses are integrated spatially by pooling them via the two-dimensional Gaussian weighting profiles. These pooling centers have an arrangement akin to flower petals, with the size expanding radially away from the data centroid. In normalization, the vector from the summation undergoes unit-normalization.

Table 2: Table of image characteristics

The features were vectorized, and Shannon entropy [20] was used to reduce the feature size to 300. PSO [21] was employed to select a subset of features for categorization. Figure 3 represents the methodology. The illustration of the method is presented in Fig. 4.

Fig. 3:
figure 3

Block-diagram of the proposed CAD system.

Fig. 4:
figure 4

Illustration of proposed method.

Pre-processing

The border of the acquired video images were first masked to eradicate any artefact surrounding the image. This is so that the results of the automated system will not be affected.

DAISY

Scale invariant features (SIFT) are commonly employed for identifying and obtaining characteristic local features from images [22]. In contrast, DAISY has been portrayed to be apt for intense computation with favourable performance. Hence DAISY descriptors are employed in this study. Figure 5 presents the shape of the DAISY descriptor. Existing in histograms of positioned gradients, measurements are made around a significant point with varying orientations and radii. Orientation gradients that are dissimilar are gauged, earmarking the positive directions. The descriptor has a design similar to that of a flower, with areas that intersect. In this study, SIFT was applied on the DAISY descriptors, causing a change in gradient. The descriptors of 2-dimensional images ( 331776 × 200) were projected to a 1-dimensional (1 × 66355200)vector. The optimal features were extracted from the 1-dimensional vector thereafter.

Fig. 5:
figure 5

Diagram of DAISY Descriptors.

Data reduction

The acquired data was quantitatively analyzed using Entropy calculation. Entropy is a measure of disorder or variability in the pixel intensities or variation in the samples [23]. Hence, entropy is useful to detect pixel intensity variations and the value of entropy obtained is high if pixel variation is high. Shannon entropy, the most basic form of entropy, is governed by the equation [20]:

H =  −  ∑ pk log pk

where pk denotes the likelihood of a data point being located in bin k. Shannon entropy is applied to each non-overlapping window(size of 221184). This helps to reduce the feature set to 300, which is needed to mitigate overfitting, so that the classifier is able to yield accurate results [24,25,26].

Selection of features

The PSO technique was employed to automatically select a group of optimum features for categorization, following feature extraction and reduction. PSO is a feature selection technique first mooted by Kennedy and Eberhart, and is invigorated by swarm intelligence [27]. PSO emulates the behavior of a group of birds that scout around for food in a particular domain. The birds are oblivious to the location of food initially, but with each round, they become more familiar with the location, by trailing the bird which is closest to the food source. Following this phenomenon, each feature in PSO acts like a bird in the parameter space. Every feature has its own velocity, which determines its direction. The features shadow the best feature at present. This process is replicated in a circle, till the reiteration is completed. The two existing algorithms are known as the 'global best' and 'local best' methods. The former identifies the position of features based on the most befitting particle from the absolute swarm, while the latter does so based on its neighbors [28]. 30 discriminant features are selected using PSO for classification in this study. PSO and normalization are executed per fold for the 10-fold, to obtain the average accuracy level.

Classification

The central part of each videoclip, comprising a segment length 20% of the total length, was selected to train and test the classifiers. The summation of frames was considered and divided by three to obtain the midmost frames. Some classifiers together with the support vector machine (SVM) [29] were utilized in the study. The types of SVM classifiers used were those with radial basis (RBF) function and polynomial 1,2,3, kernel functions. The 10-fold cross-validation [30] technique was implemented to evaluate the performance of the developed CAD system. The statistical metrics: accuracy(Acc), sensitivity(Sen), specificity(Spe) and positive predictive value(Ppv) were used to gauge system performance.

Results

Thirty features were identified using the PSO method. Figure 6 presents the boxplot for the unique features used to differentiate the normal and abnormal classes. After conducting 10 iterations, the average performances (Acc, Sen, Spe and Ppv ) were obtained. Figure 7 highlights these values attained in each fold, with the SVM RBF classifier. The performance per fold varies marginally, with 30 features.

Fig. 6:
figure 6

A summary of unique DAISY descriptor features for normal and celiac classes.

Fig. 7:
figure 7

Average performances of SVM RBF using the 10-fold.

Table 3 depicts the performance results of the best classifier exploited in this study. The average of the 10 repetitions conducted is the hallmark of the performance of the proposed technique. Hence, it is apparent from Table 3 that using 30 features, SVM with a parameter(sigma) of 1.4 displays the highest accuracy of 89.82% as compared to the other classifiers, with the application of the 10-fold cross-validation method.

Table 3: Performance of best classifier exposed to the 10-fold.

Discussion

The DAISY descriptor is an advanced technique, explored in our study for feature extraction and classification of celiac disease. It is shown to be expedient in the extraction of optimal features from VCE imageries of the small intestine. This technique has been previously used to assess compact depth maps from broad-baseline image pairs, and purports to show competency in gauging the depth and for occlusion identification [31]. It is explicit from Fig. 2 that the VCE image of the normal villi comprises more pixel variations, contributing to high entropy. On the contrary, the VCE image of celiac villi contains more pixels harboring the same intensity, thus reducing the variability. This in turn lowers the entropy values, vindicated by the high mean values of the Shannon entropy for the control images as compared to that of celiac, as seen in Fig. 6. Thus the features used are unique. The greater variation in local detail that is evident in control images may result from densely packed normal villi, which are minute in size, while in celiac patients with villous atrophy, clumping of villi occurs [16, 17], which would tend to smooth the local pixel variation. Analyzing Table 1, the method proposed herein appears to be best suited for automated assessment of celiac disease, as it has the most number of images used, compared to the other techniques, fetching results of relatively high accuracy.

Thus, extracting DAISY descriptors from the masked images and using the PSO technique and SVM (RBF) classifier is helpful in classifying celiac disease, and hence for the automated diagnosis of celiac disease.

Nonetheless, Zhou et al. [15] investigated the use of a CAD system involving deep learning. It comprises a deep convolution neural network (CNN) with twenty-two layers, dexterous in distinguishing normal versus celiac images with an accuracy of 100%. CNN is a subset of the deep learning mechanism, which circumvents the conventional feature extraction and selection processes [32, 33]. The model is trained to spontaneously sieve out the outstanding features from the input data. However, the implementation of this technique requires a sizeable data pool. Hence, in the near future, we are optimistic about gathering more images, and adopting deep learning techniques, to automatically classify celiac disease, thereby improving automation.

The proposed system has an advantage and disadvantage. The advantage is :

  1. I.

    Compared to some of the other studies, this study utilized a greater subject pool (37) with more images being used, and better accuracy yielded. Hence the developed system may possibly be more robust to differing data characteristics.

The main disadvantage is:

  1. I.

    The performance of the developed system should be further improved by using more images. Usage of larger data will enable advanced deep learning methods to be applied.

The deep learning model has provided a new impetus for the accurate diagnosis of diseases at present. Deep learning models include the Artificial Neural Network (ANN), Convolutional Neural Network (CNN), Long Short Term Memory (LSTM) and autoencoders. These models that are spearheading artificial intelligence, constitute multiple processing layers, to allow substantial data to be learnt [34]. Deep learning is an innovative tool that is being used to process video, speech and audio, as well as images. The model is lauded for its ability to automatically extract the best features, and classify signals or images thereafter, evading traditional feature extraction and selection processes, paramount in machine learning techniques. Hence, deep learning paradigms coupled with machine learning techniques are bound to be important to the future of celiac disease diagnosis.

Conclusion

Celiac disease is a systemic disorder, characterized by hypersensitivity to gluten, that affects the small intestine as well as acting systemically. Celiac disease remains underdiagnosed in many countries, and this may lead to serious health conditions if left undetected. As discussed in Section 1, conventional methods of celiac disease diagnosis are often slow and manually determined, with prohibitive cost. Thus, it could be useful to employ a CAD system. Herein, a CAD system was described to diagnose celiac disease automatically using DAISY descriptors. An accuracy of 89.82% was attained with the 10-fold cross-validation technique. The proposed system is therefore likely to be useful to healthcare professionals in diagnosing early stages of celiac disease.