Image Retrieval Systems: From Underlying Feature Extraction to High Level Intelligent Systems

Dhingra, Shefali; Bansal, Poonam

doi:10.1007/978-981-15-7533-4_79

Shefali Dhingra⁸ &
Poonam Bansal⁹

Part of the book series: Algorithms for Intelligent Systems ((AIS))

633 Accesses

Abstract

In this digital era, the profound amounts of complex images are being produced due to the up gradation of image capturing devices. So there is a huge demand of an efficient retrieval system for indexing and retrieving these images. Content based image retrieval (CBIR) system has been an active and promising research field in the area of image retrieval and processing. This system aims at retrieving the most appropriate and visually similar images from the large databases with the extraction of low level features of the images like color, edge and texture by various extraction techniques. This paper analyzes the basic CBIR system and the various achievements obtained in these systems mainly in the areas of feature extraction, indexing and intelligent CBIR systems. The most of the research in this area is now being focussed in developing of an advanced and intelligent CBIR system by using various deep learning algorithms which includes Convolutional neural network, auto encoders; long short term neural networks etc. so that the accuracy of the system can be improved. Finally, in the paper our insights and challenges are also provided for future research.

Access provided by Autonomous University of Puebla. Download conference paper PDF

A Comprehensive Survey on Content-Based Image Retrieval Using Machine Learning

Content-Based Image Retrieval Using Deep Learning

Content-Based Image Retrieval Using Convolutional Neural Networks

Keywords

1 Introduction and Motivation

With the up gradation of internet technology and digital image repositories, large amount of different media databases are being created in every area. In image repositories, recovery of the images can be performed in two ways i.e. text based and content based. The text based retrieval approach is the conventional technique to retrieve the images and it suffers from many drawbacks such as this process is very cumbersome for humans to manually annotate the images. Moreover, spelling mistakes may also be there. So it is not feasible for large databases [1]. To blown away these problems, content based image retrieval was designed in 1992 by T. Kato which retrieves the images based on the visual attributes of the images [2].

Since then, the research on this emerging area has grown rapidly and the researchers now have done much progress and have made this system from low level basic system to high level semantic and intelligent system. Basically the CBIR works in two steps: First step is extraction of features and the next is similarity calculation as shown in Fig. 1. This system works by extracting the features of both query as well as database images. It then compares the query image feature vector with the stored feature vectors of the database images and retrieves the relevant images which are visually same on the basis of low level features [3]. The basic block diagram showing how the image retrieval takes place is demonstrated in Fig. 1.

CBIR systems have several applications: It is used in the medical domain for the diagnosis of diseases as large digital images are now being created by X-rays MRI’s etc. So this system assists the physician in the diagnosis of the problem by retrieving the similar images from the medical database [4]. Another important application is in criminal investigations by searching sketch of the criminal from the criminal database available. These systems are now being utilized in many areas such as remote sensing, GIS, graphic and fashion design etc.

2 Feature Extraction Techniques

The images are retrieved with the extraction of low level features such as color, shape and texture. Based on these features the images are retrieved from the visual attributes of the image but for retrieving from high level semantic features special techniques are used. The commonly used methods of color feature extraction are color histogram, dominant color descriptor (DCD), color moment (CM) and many more which are displayed Fig. 2.

Color moment technique is most suitable for the extraction of color feature due to its lowest complexity and quicker response than other methods such as histogram based or dominant color descriptor. This technique computes the statistical measures which have the capability to express all the important details present in the image [5]. Mean and Standard deviation are evaluated from the RGB color space which is described in Eqs. 1 and 2.

$${\text{Mean}}\left( {\text{Ir}} \right) = \frac{1}{X \times Y}\mathop \sum \limits_{i = 1}^{X} \mathop \sum \limits_{j = 1}^{Y} Pc_{ij} ,r = [{\text{R,G,B}}]$$

(1)

$Pc_{ij}$, image pixel value in ith row and jth column

$${\text{Std}}(I_{r} ) = \left( {\frac{1}{X \times Y}\mathop \sum \limits_{i = 1}^{X} \mathop \sum \limits_{j = 1}^{Y} (P_{cij} - {\text{Mean(}}I_{r} ))^{2} } \right)^{{\frac{1}{2}}}$$

(2)

X and Y are the row and column of the image

Color histogram is also an extensively used feature extraction technique in retrieval systems. It embodies the frequency distribution of the color pixels in the images. It figure out number of similar pixels of the image and stores that pixels. The major drawback of this technique is that the spatial information is not evaluated during its computation. And secondly the two divergent images will produce the similar histograms with same color distribution. In CCV technique, every bit of a histogram is divided into two types: incoherent and coherent type. If the value of the image pixel is attached with the uniform colored region then it is coherent otherwise incoherent. The coherent pixel belongs to the contiguous region of the image. The CCV presents the allocation of every color present in an image.

Another important feature is texture of the image. These features are classified as statistical features, model-based features, signal processing and lastly structural features. Mostly used texture feature techniques are based on signal processing due to their better performance; few of them are Discrete Cosine Transform (DCT), Gabor filter, Wavelet transform etc. Some important and influential texture extraction techniques are given in Fig. 3.

LBP is used on a large scale for ample number of applications in image processing because of its simplicity, performance and implementation. The LBP texture descriptor has illumination and rotational invariant properties. In this technique, the image is subdivided into smaller number of sub-matrices and from that the features are extracted. All the extracted features procured from these smaller sub matrices are merged to form one feature histogram that represents the whole image [6]. GLCM is also an accurate technique for the extraction of texture feature from the images. It computes the various second order statistical properties present in the images. Some of the important properties are shown in below Eqs. 3 to 5.

$${\text{Energy}}\;E = \mathop \sum \limits_{a} \mathop \sum \limits_{b} \left( {k\left( {a,b} \right)} \right)$$

(3)

$${\text{Contrast}}\;c = \mathop \sum \limits_{a} \mathop \sum \limits_{b} \left( {a - b} \right)^{2} k\left( {a,b} \right)$$

(4)

$${\text{Entropy}}\;T = \mathop \sum \limits_{a} \mathop \sum \limits_{b} \left( {k\left( {a,b} \right)} \right)\log \left( {k\left( {a,b} \right)} \right)$$

(5)

where, a and b are the co-occurrence matrix coefficients.

The Discrete wavelet transform have the frequency and spatial characteristics which are capable to obtain the multi-scale resolution of the images. For extracting the texture features via this technique, the coefficient distribution of the mother wavelet is computed. This wavelet is translated by b and scaled by a is given in Eq. 6.

$$\Delta_{a,b} (t) = \frac{1}{\sqrt a }\varPsi \left( {\frac{t - b}{a}} \right)$$

(6)

Shape feature is also most important feature for image retrieval as it describes the contour and position information. But to extract these features image segmentation is required. It is very difficult to retrieve the images based on simply shape feature. The various models which describe the target contour or shapes include spline fitting curves, line segments, gaussian curves and Fourier descriptors etc. [7].

3 Performance Evaluation Metrics

The second step of this system is similarity measurement or calculation. For this measurement, the difference between feature vectors of the query image and feature vectors of database images is calculated with the help of various distance metrics. The various distance matrices are Euclidean distance, manhattan distance, mahalanobis distance and Murkowski distance and many more [8]. Some of the most significant distance measures are shown in Eqs. 7, 8 and 9.

$$D_{\text{Euclidean}} = \sqrt {\mathop \sum \limits_{i = 1}^{n} (\left| {I_{i} - D_{i} } \right|} )^{2}$$

(7)

$$D_{\text{Manhattan}} = \sum\limits_{i = 1}^{n} {\left| { I_{i} - D_{i} } \right|}$$

(8)

$$D_{\text{Minkowski}} = \left[ {\mathop \sum \limits_{i = 1}^{n} \left( {\left| { I_{i} - D_{i} } \right|} \right)} \right]^{{\frac{1}{p}}}$$

(9)

In these above equations, I denote the feature vector of the query image and D denotes the feature vector extracted for every image of the database.

After the step of similarity evaluation the competence of the particular CBIR system can be measured in terms of two most important and well known metrics known as precision and recall [9,10,11]. These are shown in Eqs. 10 and 11.

$${\text{Precision}} = \frac{\text{Retrieved number of relevant images }}{\text{Number of images Retrieved }}$$

(10)

$${\text{Recall}} = \frac{\text{Number of retrieved relevant images}}{\text{Total Number of relevant images in database}}$$

(11)

4 Hybrid CBIR Systems and Their Performance

The capability of the CBIR systems depends upon the proper and appropriate selection of the feature extraction techniques. The low level features described in the above section when individually used in CBIR systems for feature extraction do not provide highlighted results. When the image is a complex one, only the use of primary feature will not be sufficient. Because it will not be able to capture the variable details present in the images. To overcome this limitation, integration of features is employed. Different types of hybrid CBIR systems have been designed and proposed in the literature on different datasets, which shows the better performance in terms of average precision and recall rate. In [12] CBIR model was proposed in which the retrieval was done in two stages. In first stage the images were analyzed by splitting them in small patches and in the next stage the same information was used for retrieving the similar images. The performance of the system in terms of precision and recall is increased by 55% and 25% respectively by the described technique.

Two texture techniques i.e. LBP and Gabor filters were used for extracting the image features. This designed approach proved to be less sensitive to histogram equalization as well as rotational invariant. Another new method was designed in [13] which combine the DWT and Handamard matrix results in increased accuracy along with speed of the CBIR system.

Singh and Kaur [14] Schemed a system on the combined approach of texture and color features which was very efficient and fast. New techniques based on block difference and block variations were used for textual features and color histogram for extracting color. It was observed that out of several distance metrics square-chord gives the better results. Non training based classifiers were used due to their efficiency and simplicity. Sadegh Fadaei et al. [9] designed the CBIR scheme on the optimized combination of two different features to improve the precision value of image retrieval. Dominant Color Descriptor (DCD) features were extracted from HSV color space and to extract texture features wavelet and curvelet were applied and finally these two features are combined optimally by optimization algorithm which is particle swarm optimization algorithm and provides much better accuracy as compared to other systems. The performance of some hybrid systems on Wang database with their average precision rate is tabulated in Table 1. Wang database is taken as it is one of the standard databases used to check the competency of CBIR systems.

Table 1 Hybrid systems with their average precision values (%)

Full size table

5 Hybrid and Intelligent CBIR Systems with Their Performance

The application of artificial intelligence including machine learning and deep learning algorithms in CBIR systems has been heightened successfully in recent years. Due to the employment of all these algorithms basic CBIR system has now become an intelligent CBIR system with the improvement in efficiency and retrieval time. These deep learning algorithms are based on neural networks and they have adaptive learning power. So this is primarily the main reason behind their achievement in every field of multimedia processing [17]. Various types of machine and deep learning algorithms are being used successfully in CBIR systems for the purpose of classification of images or for the feature extraction of the images. Mostly used are Convolutional neural networks (CNN), SVM, auto-encoders, Extreme learning machine (ELM), clustering and simple forward and back propagated neural networks. An intelligent hybrid CBIR based on SVM was designed on the combined approach of color, edge and texture. It has increased the speed and accuracy of the system to a great extent [18].

The average precision of some CBIR systems using some intelligent techniques on Wang dataset is presented in Table 2.

Table 2 The average value of precision in intelligent CBIR systems (%)

Full size table

It can be clearly observed from the Tables 1 and 2, that the execution results of the CBIR systems with some intelligent algorithms or techniques are more prominent as compared to the simple hybrid CBIR systems. Moreover, their accuracy and retrieval time is also more preferable than others which is also the matter of concern in CBIR systems in case of large datasets.

6 Semantic Gap Reduction

The basic CBIR system suffers from the problem of semantic gap which is the difference in the low level features of the image captured by the system and human perception. This issue can be effectively overcome by the mechanism of Relevant Feedback [2]. It acts as an interface which connects the user with the search engine. Depending upon the feedback from the user, this technique helps in refining the images. When the query image is entered in CBIR system, number of images is retrieved. Then these images are examined by the user and by Relevant Feedback technique selects the best matching images out of the total retrieved images. This operation repeats until the appropriate results are obtained. The basic CBIR system with Relevant Feedback is shown in Fig. 4.

Grigorova et al. [22] designed a RF based CBIR system in order to increase the accuracy of the system and to reduce the problem of semantic gap. This system dynamically selects the features and assigns them appropriate weights depending upon the user feedback. Various machine and deep learning algorithms are also being used as relevant feedback techniques in these systems. Another study was done on SFS algorithm along with relevance feedback for determining the matching images for the particular query image [23]. The system was tested on eighteen different distance measures and evaluation metrics were calculated.

These techniques can also enhance the efficiency of unlabeled image databases used for the image retrieval. In [24] at every iteration of feedback, query image and user feedback images are trained and after this the unlabeled images are labeled in the database. After that with the retraining of the images, the unlabeled images are again labeled and merged into the previously labeled images.

7 Conclusion, Issues and Future Scope

This paper presents the précised portraiture of CBIR systems along with its feature extraction techniques. The most important and critical issue in this area is the feature selection and feature combination as the hybrid CBIR systems yields better performance as compared with individual techniques. The other problem which arises here with large image datasets is ‘Curse of dimensionality’, so proper indexing technique should be used in order to reduce the feature dimensions of the images so that faster results can be obtained. Some intelligent and hybrid CBIR systems are also discussed here and their performance parameter is compared with the simple hybrid CBIR systems. These intelligent technique based CBIR systems have very high precision rate and higher accuracy values. When some deep learning, machine learning algorithms, indexing techniques, relevance feedback and other promising techniques are successfully employed in these systems then they will be able to make a revolution in various image related applications.

References

Wang H, Feng L, Zhang J, Liu Y (2016) Semantic discriminative metric learning for image similarity measurement. IEEE Trans Multimed 18(8):1579–1589
Article Google Scholar
Alsmadi MK (2017) An efficient similarity measure for content based image retrieval using memetic algorithm. Egypt J Basic Appl Sci 4(2):112–122
Google Scholar
Grigorova A, De Natale FGB, Dagli C, Huang TS (2007) Content-based image retrieval by feature adaptation and relevance feedback. IEEE Trans Multimed 9(6):1183–1192
Google Scholar
Wang J (2011) Bag-of-features based medical image retrieval via multiple assignment and visual words weighting. IEEE Trans Med Imaging 30(11):1996–2011
Article Google Scholar
Lin C-H, Chen R-T, Chan Y-K (2009) A smart content-based image retrieval system based on color and texture feature. Image Vis Comput 27(6):658–665
Article Google Scholar
Reddy PVN, Prasad KS (2011) Color and texture features for content based image retrieval. Int J Comput Technol Appl 2:1016–1020
Google Scholar
Mistry Y, Ingole DT, Ingole MD (2017) Content based image retrieval using hybrid features and various distance metric. J Electr Syst Inf Technol
Google Scholar
Jenni K, Mandala S, Sunar MS (2015) Content based image retrieval using colour strings comparison. Procedia Comput Sci 50:374–379
Article Google Scholar
Fadaei S, Amirfattahi R, Ahmadzadeh MR (2017) New content-based image retrieval system based on optimised integration of DCD, wavelet and curvelet features. IET Image Process 11(2):89–98
Article Google Scholar
Shriwas MK, Raut VR (2015) Content based image retrieval: a past, present and new feature descriptor. In: 2015 International conferences circuits, power computer technology [ICCPCT-2015], pp 1–7
Google Scholar
Vogel J, Schiele B (2006) Performance evaluation and optimization for content-based image retrieval. Pattern Recognit 39(5):897–909
Article Google Scholar
Liao S, Law MWK, Chung ACS (2009) Dominant local binary patterns for texture classification. IEEE Trans Image Process 18(5):1107–1118
Article MathSciNet Google Scholar
Mohamadzadeh S, Farsi H (2013) Colour and texture feature-based image retrieval by using hadamard matrix in discrete wavelet transform. IET Image Process 7(3):212–218
Article MathSciNet Google Scholar
Singh C, Kaur KP (2016) A fast and efficient image retrieval system based on color and texture features. J Vis Commun Image Recogn 41:225–238
Article Google Scholar
Pavithra LK, Sharmila TS. (2017) An efficient framework for image retrieval using color, textureand edge features. J Comput Electr Eng 1–14
Google Scholar
Mangijao S, Hemachandran K (2012) Content based image retrieval using color moment and gabor texture feature. Int J Comput Sci Issues 9:299–309
Google Scholar
Yasmin M, Mohsin S, Shariff M (2014) Intelligent image retrieval techniques: a survey. J Appl Res Technol 12(1):87–103
Article Google Scholar
Nikkam P, Eswara Reddy B (2017) An efficient approach for content based image retrieval using hierarchical part-template and tree modeling. J Image Video Process 8(2):1607–1613
Article Google Scholar
Dhingra S, Bansal P (2019) A competent and novel approach of designing an intelligent image retrieval system. EAI Trans Scalable Inf Syst 7(24):1–15
Google Scholar
Liu F, Wang Y, Wang F, Chang Y, Lin J (2019) Intelligent and secure content-based image retrieval for mobile users. IEEE Trans 4:1–14
Article Google Scholar
Mohamad D, Saba T (2014) Content-based image retrieval using PSO and k-means clustering algorithm algorithm. Arabian J Geosci 1–14. https://doi.org/10.1007/s12517-014-1584-7
Grigorova A, Natale FG, Dagli C, Huang T (2007) Content-based image retrieval by feature adaptation and relevance feedback. IEEE Trans Multimed. https://doi.org/10.1109/TMM.2007.902828:1-10
Article Google Scholar
Mosbah M, Boucheham B (2017) Distance selection based on relevance feedback in the context of CBIR using the SFS meta-heuristic with one round. Egyptian Inform J 18(1):1–9
Article Google Scholar
Zhou Z, Chen K, Dai H (2006) Enhancing relevance feedback in image retrieval using unlabeled data. ACM Trans Inf Syst 24(2):219–244
Google Scholar

Download references

Author information

Authors and Affiliations

Guru Gobind Singh Indraprastha University, New Delhi, India
Shefali Dhingra
Maharaja Surajmal Institute of Technology, New Delhi, India
Poonam Bansal

Authors

Shefali Dhingra
View author publications
You can also search for this author in PubMed Google Scholar
Poonam Bansal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shefali Dhingra .

Editor information

Editors and Affiliations

Department of Computer Engineering, National Institute of Technology, Kurukshetra, Kurukshetra, India
Mayank Dave
Department of Computer Engineering, National Institute of Technology, Kurukshetra, Kurukshetra, India
Ritu Garg
Department of Computer Engineering, National Institute of Technology, Kurukshetra, Kurukshetra, India
Mohit Dua
School of Information Technology, Deakin University, Geelong, VIC, Australia
Jemal Hussien

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dhingra, S., Bansal, P. (2021). Image Retrieval Systems: From Underlying Feature Extraction to High Level Intelligent Systems. In: Dave, M., Garg, R., Dua, M., Hussien, J. (eds) Proceedings of the International Conference on Paradigms of Computing, Communication and Data Sciences. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-15-7533-4_79

Download citation

DOI: https://doi.org/10.1007/978-981-15-7533-4_79
Published: 20 February 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-7532-7
Online ISBN: 978-981-15-7533-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Image Retrieval Systems: From Underlying Feature Extraction to High Level Intelligent Systems

Abstract

Similar content being viewed by others

A Comprehensive Survey on Content-Based Image Retrieval Using Machine Learning

Content-Based Image Retrieval Using Deep Learning

Content-Based Image Retrieval Using Convolutional Neural Networks

Keywords

1 Introduction and Motivation

2 Feature Extraction Techniques

3 Performance Evaluation Metrics

4 Hybrid CBIR Systems and Their Performance

5 Hybrid and Intelligent CBIR Systems with Their Performance

6 Semantic Gap Reduction

7 Conclusion, Issues and Future Scope

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Image Retrieval Systems: From Underlying Feature Extraction to High Level Intelligent Systems

Abstract

Similar content being viewed by others

A Comprehensive Survey on Content-Based Image Retrieval Using Machine Learning

Content-Based Image Retrieval Using Deep Learning

Content-Based Image Retrieval Using Convolutional Neural Networks

Keywords

1 Introduction and Motivation

2 Feature Extraction Techniques

3 Performance Evaluation Metrics

4 Hybrid CBIR Systems and Their Performance

5 Hybrid and Intelligent CBIR Systems with Their Performance

6 Semantic Gap Reduction

7 Conclusion, Issues and Future Scope

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation