Clustered redundant keypoint elimination method for image mosaicing using a new Gaussian-weighted blending algorithm

Hossein-Nejad, Zahra; Nasri, Mehdi

doi:10.1007/s00371-021-02261-9

Clustered redundant keypoint elimination method for image mosaicing using a new Gaussian-weighted blending algorithm

Original article
Published: 19 July 2021

Volume 38, pages 1991–2007, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

The Visual Computer Aims and scope Submit manuscript

Clustered redundant keypoint elimination method for image mosaicing using a new Gaussian-weighted blending algorithm

Download PDF

457 Accesses
11 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper, a new method for image mosaicing (image stitching) is introduced based on Scale Invariant Feature transform (SIFT). One of the main drawbacks of SIFT is the redundancy of the extracted keypoints, which leads to lower image mosaicing quality. Recently, a new method called Redundant Keypoint Elimination (RKEM) was presented to remove these redundant features, and enhance image registration performance. Despite the applicability of RKEM, its threshold value is considered the same in all parts of the image. This characteristic leads to inappropriate removal of keypoints due to the fact that distribution of keypoints in the high-detailed region is denser than the low-detailed ones. This paper proposes a new method to improve RKEM called Clustered RKEM (CRKEM) which is based on keypoints distribution. Moreover, in this paper a new blending algorithm is proposed based on a Gaussian-weighted function. In the proposed blending method, the Gaussian function is proposed based on the mean and variance of the pixels in the overlapped region of images to be mosiaced. In comparison with the classical methods, the experimental results confirm the superiority of the proposed method in image mosaicing as well as to image registration and matching.

Image mosaicking using improved auto-sorting algorithm and local difference-based harris features

Article 11 June 2020

Image Stitching using AKAZE Features

Article 09 September 2020

Image stitching by feature positioning and seam elimination

Article 11 March 2021

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Image mosaicing (image stitching) is the process of stitching two or more images together in order to create a larger, consistent and seamless composite image [1,2,3,4]. Image mosaicing is one of the most important and commonly used in the fields of image processing, machine vision, and computer graphics, which has recently been the focus of many researchers [5,6,7,8,9]. Mosaicing process has many applications in image compression [10], image stabilization [11, 12], medical image analysis [13], video indexing [14], panoramic photography [15, 16], and mapping [17]. Although there are various methods for mosaicing images, the majority of them are composed of three steps including feature extraction, image registration, image warping and blending [3, 18, 19]. In general, image mosaicing methods are divided into two categories: pixel-based (direct) methods [20,21,22] and feature-based methods [23,24,25,26]. In the direct methods, the information of the entire pixels of the image is used for mosaic image. Direct methods are good for mosaicing images with large overlap, translations, and small rotations [23, 27]. In the feature-based methods, image features are identified and then matching and homography between feature points are done. Feature-based methods are suitable for mosaicing images with little overlapping areas [27, 28]. In general, feature-based methods have more precision compared to direct methods and are used more often in practical applications [29].

In feature-based methods, features are extracted from images in a variety of ways. In some methods, these features are detected by deep neural networks [30, 31], and in some other methods, features are extracted by common feature detectors [32, 33]. Deep neural networks are composed of three main layers: convolution, pooling and fully connected [34]. The features are identified by the convolution layer and these features are converted to the features vector by the pooling layer. Deep neural networks are widely used in image processing [35,36,37,38], but these networks have weaknesses in feature extraction, including the incomprehensibility of image features physically, time consumin, and computational complexity [39]. These disadvantages have led to the continued attention to classical detectors in feature extraction. Feature-based detectors for image mosaicing are divided into two categories [40,41,42,43]: mosaicing images based on high-level features and low-level features. In the high-level feature algorithms, one part of the image (contours) is identified as feature. In the low-level feature algorithms, keypoints or corners are identified as features [44, 45]. Typically, low-level feature algorithms are used more often for the image mosaicing process than high-level feature algorithms [2, 23, 29]. Low-level feature algorithms consist of two stages: identifying features and crating descriptors [46, 47]. At the feature identification stage, distinct positions of the image stable against different deviations (rotation, scale, affine, illumination) are extracted, and at the stage of creating descriptors, a descriptor is created for each identified feature using the specific area around each feature such as the gradient size, image information, texture, and edges. The most important and widely-used algorithm used in low-level feature extraction is the Scale Invariant Feature Transform (SIFT) algorithm [48, 49].

SIFT is a computer vision algorithm used to extract specific features of the images, and was originally proposed by David Lowe in 2004 at Columbia University, patented in the United States [32]. The features obtained from this algorithm are invariable against image scale and rotation as well as the change in the viewpoint, and the changes in intensity are partly fixed [50]. Despite the advantages mentioned, the algorithm has limitations that can reduce the image mosaicing process quality [51, 52]. Many studies have been conducted to solve the problems of this algorithm [53,54,55,56,57,58,59,60,61,62,63,64]. The PCA-SIFT algorithm has been introduced to reduce the size of the vector in the SIFT descriptor [53]. In this approach, the Principal Component Analysis is introduced to reduce the size of the descriptors vector. In this algorithm, 128 dimensional SIFT descriptors are turned into a 40-dimensional vector, increasing the speed of the algorithm performance and enhancing its performance in cases when images do not have much affine transformation. The Speed Up Robust Features (SURF) algorithm is a resilient local feature detector that has been proposed to enhance the speed of SIFT, which is also constant against rotation and scale [54]. SURF uses fast measurement of the Hessian matrix for the detector and distribution-based descriptor. Although SURF performance is similar to SIFT, at a same time, SURF can operate faster. The N-Dimensional Scale Invariant Feature Transform (NSIFT) method has been presented to improve N-dimensional images matching in SIFT [55]. This procedure is effective in the medical image matching that is three and four dimensional. Scale Restriction Scale Invariant Feature Transform) SRSIFT (has been introduced to reduce mismatches in SIFT [56]. This method eliminates some of the mismatches by modifying the keypoints gradient direction and scale limit standard leading to SIFT improvement in multi-modal remote sensing images. Auto-Adaptive SIFT (A²SIFT) has been proposed to improve SIFT over spatial distribution [57]. Affine SIFT (ASIFT) is the improved version of SIFT that is constant to affine deviations and much stronger against angle changes [58]. ASIFT extracts an impressive number of matching points in images, but often a large number of mismatches occur among them. The Iterative SIFT algorithm has been presented to reduce the number of features of SIFT [59]. Although this method can reduce the number of features and control the feature extraction step of SIFT but the distribution of keypoints in this method is not uniform. The Uniform Robust SIFT (UR-SIFT) algorithm has been introduced for more control over SIFT feature extraction [60]. RKEM-SIFT has been introduced to delete unnecessary keypoints in SIFT, the effectiveness of which in image registration [61], retinal image matching [63], and copy-move forgery detection [65] has been confirmed. In this algorithm, after identifying the keypoints according to the classic SIFT, in each image, the keypoint’s distance from each other is calculated first. Afterward, when the distance between two different keypoints is less than a certain threshold value, the redundant keypoint is removed and the other one is maintained for the matching process. One of the major disadvantages of RKEM-SIFT is that it considers a single threshold value for the removal of redundant keypoints in all parts of the image regardless of the details of the image. In an image, one part may have many details and another part may have few details; a fixed threshold value in the whole image cannot lead to good results in the removal of redundant keypoints.

In this paper, to solve the mentioned problem of RKEM-SIFT, a new method is proposed to find the appropriate threshold values based on the clustering approach. In this proposed method, after detecting the features by SIFT, based on the Fuzzy C-Means clustering algorithm which is one the most applicable methods in data clustering, keypoints are placed in certain order clusters. Afterward, to remove redundant keypoints, according to the details of each region of the image, the appropriate threshold value is considered. The proposed Clustered Redundant Keypoint Elimination Method-SIFT (CRKEM-SIFT) can increase the matching precision and ultimately improves the performance of image mosaicing. Another innovation of the paper is the proposal of a new method in image blending. One of the major problems with classical methods is the combination of image artifacts and common lines. The Gaussian weight function is used in the proposed method of combining the mean and its variance, based on the mean and variance of the overlapped region, respectively.

The organization of the rest of paper is as follows. In Sect. 2, classical algorithms and their problems in image mosaicing are described, and in Sect. 3, the proposed approach is explained and in Sect. 4, we deal with the results of testing by different applications such as image matching, image registration, image mosaicing and change detection. Finally, the paper is concluded in Sect. 5.

2 Classical algorithms and their problems in image mosaicing

The image mosaicing process consists of three different steps according to Fig. 1. At first, the features of both input images are detected, and then the process of registration is done. Image registration refers to the geometric alignment of the image sets. Finally, image blending is carried out which modifies the image gray levels in the vicinity of a boundary to obtain a smooth transition between images by removing these seams and creating a blended image. RKEM-SIFT, and the common methods of blending such as the weighted average method [66], and ORB-improved blending [67] is described briefly, and finally the disadvantages of each of these algorithms are investigated in image mosaicing.

2.1 RKEM-SIFT algorithm

RKEM-SIFT algorithm was introduced by Hossein-Nejad et al. in 2017 [61]. This algorithm is the improved form of SIFT algorithm in the feature extraction step. In this method, the features distance identified in the SIFT algorithm are calculated and proposed as the redundancy index criterion according to the distance between features. Then the features distance which is less than the predetermined threshold value, the feature of the redundancy index of which is more than other features, is omitted.

2.2 RKEM-SIFT problems

In an image, there are areas with many details, such as trees, and areas with least details such as the sky, the sea, where one of the problems of RKEM-SIFT method exits: in deletion of redundant keypoints, it does not pay attention to image details, so acting the same in all regions of the image. As shown in Fig. 2, image details at the bottom are more than those at the top of the image, so the number of the keypoints identified at the top of the image is less than those at the bottom. Finally, the distance between the keypoints is closer to the bottom of the image and there is even overlap in some areas. To remove features that overlap, attention should be paid to the image details because some of the features that are close to each other are not redundant in areas where there are many details. This problem mentioned in the RKEM-SIFT algorithm may interfere with the next processes, leading to problems such as the reduction in precision in image mosaicing. In the next section, we will examine the blending methods.

2.3 Blending algorithms

The process of image mosaicing requires to create an integrated image without border-stitching after the registration stage. Image stitching is to create an image from blending of two or several other images so that their most important information is reserved and the boundaries of the images are not clear. The three common methods in image blending are the Weighted Average [21, 66], Wavelet Transform [68, 69], and Spline Multi-resolution Method [70]. The Weight Average method is used most [67, 71]. Blending two images is presented according to Eq. (1), which provides various methods for calculating the amount $\beta \left( x \right)$.

$$ I\left( {x,y} \right) = \beta \left( x \right)I_{{{\text{Ref}}}} \left( {x,y} \right) + \left( {1 - \beta \left( x \right)} \right)I_{{{\text{Tgt}}}} \left( {x,y} \right) $$

(1)

In this equation, I_Ref (x, y) and I_Tgt (x, y) are pixels in the reference image and the target image at the same location of the overlapping region.

In [66], the Weighted Average method is used for image blending, where $\beta \left( x \right)$ is the cross-fade weighting function; its value is calculated according to Eq. (2).

$$ \beta \left( x \right) = 1 - \frac{{x - x_{\min } }}{{x_{\max } - x_{\min } }} $$

(2)

In this equation, X_min and x_max representing the left and right edge of a certain row in the overlapping region, respectively. The disadvantages of the weighted average method is the use of cross-fade function as this function creates artificial edges and blurring [67]. In [67] to solve the problem mentioned in the weighted average method, the Gaussian model is used instead of cross-fade function according to Eq. (3). The advantage of this method, is fewer artificial edges in the blended images. One of the disadvantages of this method [67] is the existence of boundary lines in the image.

$$ \beta \left( x \right) = 2e^{{ - \frac{{\left( {x - u} \right)^{2} }}{{2\sigma^{2} }}}} - \frac{1}{2} $$

(3)

In this equation, u and $2\sigma^{2}$ are obtained according to Eq. (4–5).

$$ u = x_{\max } $$

(4)

$$ 2\sigma^{2} = \left( {x_{\max } - x_{\min } } \right)^{2} $$

(5)

In this equation, X_min and x_max represent the left and right edge of a certain row in the overlapping region, respectively.

2.4 Problems of blending methods

Each of image blending methods has some problems, each of which will be discussed below.

Most of the blended methods described shortly in the previous section are used to blend two images. However, normally more than two images are used to create panoramic images. Geometric adjustments, radiometric adjustments, altered imaging position, and changes in image texture and color cause problems in image blending. The most common problems are blurring of the image in common areas (Fig. 3a), artifacts (Fig. 3.b), boundary line in the blending area (Fig. 3c) and artificial edges (Fig. 3d).

3 Method

In this section, CRKEM-SIFT is proposed at the feature extraction stage and Gaussian-weighted blending function is proposed at the blending stage to improve image mosaic processes. The details of each method are as follows.

3.1 Proposed CRKEM-SIFT method in feature extraction

The details of the proposed method are as follows.

Stage 0 A set of different images to be mosaicked is considered as the input to the algorithm.

Stage 1 The keypoints are identified in each image using SIFT.

Stage 2 For each keypoint of the image, one orientation is considered according to the classic SIFT [12].

Stage 3 The keypoints in each image are placed in m clusters using the Fuzzy C-Means clustering algorithm [46].

Stage 4 In each cluster, using Eq. (6), the number of each cluster’s keypoints (Nci) to the total detected keypoints in the image (N_t) is calculated. This value is considered as the threshold for each cluster. This threshold value is appropriate because the number of keypoints required for each cluster is identified by considering the image content in each cluster compared to the overall content of the image.

$$ {\text{threshold}} = \frac{{N_{{{\text{ci}}}} }}{{N_{t} }}\quad i = 1, \ldots , m $$

(6)

Stage 5 In each cluster, the Manhattan distance between each keypoint and other keypoints is calculated, respectively, according to Eq. (7).

$$ d\left( {p_{a} , p_{b} } \right) = \mathop \sum \limits_{i = 1}^{k} \left| {p_{a} \left( i \right) - p_{b} \left( i \right)} \right| $$

(7)

In these Equation, p_a (i) is the ith coordinate of the keypoint pa, p_b (i) is the ith coordinate of the p_b and k is the dimension of the keypoints. Then the total distance between each keypoint and other keypoints is calculated in each cluster according to Eq. (8).

$$ {\text{SD}}\left( {p_{a} } \right) = \mathop \sum \limits_{j = 1}^{N} d\left( {p_{a} , p_{j} } \right) $$

(8)

In this equation, N is the number of keypoints in each cluster and d(p_a, p_j) is the distance between the keypoint p_a and p_j calculated according to Eq. (7).

Stage 6 If the Manhattan distance of two distinct keypoints is less than the threshold value obtained according to Eq. (6) in each cluster, we should eliminate one of the keypoints. Among these points, the keypoint, according to Eq. (9), whose redundancy is high is removed because Eq. (9) shows the amount of redundancy of the keypoints. The greater this amount is, the closer the keypoints are to one another.

$$ {\text{RI}}\left( {p_{a} } \right) = \frac{1}{{{\text{SD}}\left( {P_{a} } \right)}} $$

(9)

Stage 7 at this stage, for each remaining keypoint of the image, a descriptor vector is computed according to the classic SIFT [12].

3.2 Proposed Gaussian-weighted method in blending

Image blending is the mosaic final step to blend the pixels intensity in the overlapped region to avoid the seams. Image blending is used according to Eq. (10).

$$ I\left( {x,y} \right) = \beta \left( {x,y} \right)I_{{{\text{Ref}} }} \left( {x,y} \right) + \left( {1 - \beta \left( {x,y} \right)} \right)I_{Tgt} \left( {x,y} \right) $$

(10)

In this equation, $I_{{{\text{Ref}} }} \left( {x,y} \right)$ and $I_{Tgt} \left( {x,y} \right)$ are the pixels of the reference and target image in overlapping areas. $\beta \left( {x,y} \right)$ is the Gaussian weighted function that gives a value to the pixel as weight according to the distance of the pixel to the boundary line. The Gaussian weighted function $\beta \left( {x,y} \right)$ is according to Eq. (11).

$$ \begin{gathered} \beta \left( {x,y} \right) = 2e^{{ - \frac{{\left( {x - u} \right)^{2} }}{{2\sigma^{2} }}}} - \frac{1}{2} \hfill \\ u = {\text{mean}}\left( {\left\{ {x_{i} \left| {i = 1,...,n} \right.} \right\}} \right) \hfill \\ \sigma^{2} = {\text{var}} \left( {\left\{ {x_{i} \left| {i = 1,...,n} \right.} \right\}} \right) \hfill \\ \end{gathered} $$

(11)

In these equations, x_i is the data in the overlapping areas and n is the number of pixels in the overlapping areas.

4 Simulation results

In this section, a set of experiments in different applications is considered to assess the performance of the proposed CRKEM-SIFT method compared with SIFT [32], SURF [54], MSER-SIFT [72], RKEM-SIFT [61], ARKEM-SIFT [73], ORB-Weighted Mean method [67], AKAZE-MAC [3], SIFT-Voronoi Diagram [51], SURF-Multiband Blending [74], SIFT-weighted Average [75], SURF-LM [76], and SIFT-Improved Weighted Average [77]. In the first experiment series, effects of different threshold values on preservation of image details in RKEM and CRKEM-SIFT are investigated. In the second, the performance of the proposed CRKEM-SIFT approach in the matching process is investigated. In the third and fourth sets, the performance of the proposed method in the applications of registration process and image mosaicing process are used, respectively. In these experiments, in order to prevent the selection of the parameters values from affecting the results, the algorithm parameters are selected constant. In the proposed algorithm of CRKEM-SIFT, the value of m which is the number of clusters is considered 3 and this number is considered experimental. The amount of the parameter T_c is considered 0.01 in order to extract the suitable number of features according to Lowe’s suggestion [32].

4.1 Image database

To assess the proposed method, it is necessary that the images be chosen in a way that include all distortions (different scales, different viewpoints, different times and changes in rotation) between the images. To do so, the collection of images includes tree dataset, wall images downloaded from (www.robots.ox.ac.uk), bough images downloaded from (www.robots.ox.ac.uk), graffiti images downloaded from (http://lear.inrialpes.fr/people/mikolajczyk/Database), building images downloaded from (https://www.mathworks.com), and Waterfall images.

4.2 Effects of different threshold values on preservation of image details in RKEM and CRKEM-SIFT

In this experiment, an image was used to examine the removal of redundant keypoints and the performance of removing the redundant keypoints by RKEM-SIFT with different thresholds and the proposed CRKEM-SIFT is shown in Fig. 4.

As shown in Fig. 4b–f, by increasing the threshold value in the RKEM-SIFT method, more keypoints are considered as redundant points and eliminated. RKEM-SIFT does not pay attention to the details of the image to remove the redundant keypoints, and in the part of the image that has a lot of details, the keypoints are removed in the same proportion as that of the part of the image with few details. It is shown in Fig. 4g with the proposed CRKEM-SIFT, much attention has been paid to the details of the image to remove the redundant keypoints, so that in the parts of the image where the details are many, more keypoints have been preserved. In general, RKEM-SIFT does not pay attention to the content and texture of the image removing redundant points, and uniformly removes keypoints that are closer than the predetermined threshold throughout the image. The proposed CRKEM-SIFT, on the contrary, pays attention to the content and texture of the image while removing the redundant points and adaptively removes fewer keypoints in parts of the image that have more content.

4.3 Experiment in the matching process

Matching is the process of determining correspondence between two or more images of the same scene received at different times, with different angles, or by different sensors. To investigate the effectiveness of the proposed CRKEM-SIFT, a series of experiments with varied scales and viewpoints is performed, and the resulting matching process is evaluated by the matching precision according to Eq. (12), feature repeatability rates according to Eq. (13), and the total number of matches.

$$ {\text{Precision}} = \frac{{{\text{TP}}}}{m} $$

(12)

$$ F_{r} = \frac{{{\text{TP}}}}{{\min \left( {N_{{{\text{ref}}}} , N_{{{\text{sens}}}} } \right)}} $$

(13)

In these equations, TP is the number of true matches and m is the total number of matches. N_ref and N_senc denote the total number of the detected features of the reference image and the target image, respectively. If the matching precision and feature repeatability rates are high, it has a better performance in the matching process.

4.3.1 Matching process on images with varied scales

In this test, the "tree" dataset including images with different scales is used, and the image matching performance is evaluated, which is shown in Fig. 5.

In Fig. 5, false matches are grouped by red squares based on the unnecessary SIFT keypoints, what indicates removal of redundant points using RKEM-SIFT, and ARKEM-SIFT in addition to the improvement of matching precision by the proposed CRKEM-SIFT. The total number of matches is one of the important criteria. As seen in Fig. 4, the total number of matches in the ARKEM-SIFT method is much less than the proposed CRKEM-SIFT method. In ARKEM-SIFT, in order to remove the redundant keypoints, the details of the image have not received attention, what has reduced the total number of matches, so it is important that the details of the image have been considered in the proposed CRKEM-SIFT method. Quantitative results obtained by different methods are given in Table 1, which shows the proposed CRKEM-SIFT performs the best versus SIFT, SURF, MSER-SIFT, RKEM-SIFT and ARKEM-SIFT in term of feature repeatability rate. In ARKEM-SIFT, the matching precision is more than the proposed CRKEM-SIFT, which does not indicate the proper performance of ARKEM-SIFT versus the proposed CRKEM-SIFT. As the total number of matches in ARKEM-SIFT is significantly different from the proposed CRKEM-SIFT, the proposed CRKEM-SIFT is more robust to scale changes than the other methods.

Table 1 Matching process results of different methods on tree images with different scales by classic criteria

Full size table

4.3.2 Matching process on images with viewpoint changes

In this test, the "wall" dataset with different angles is used, and the matching performance in the proposed CRKEM-SIFT is evaluated, shown in Table 2.

Table 2 Matching process results of different methods on wall images with different viewpoints by classic criteria

Full size table

In terms of the precision and repeatability metrics, RKEM-SIFT [61], ARKEM-SIFT [73] and specially the proposed CRKEM-SIFT, also have performed better than SIFT. The SURF method performs the worst among the other methods in two sets of experiments. It has obtained fewer repeatability and lower matching precision (in Table 2).

4.4 Experiment on the image registration process

Image registration is the process of aligning two images from the same scene taken under various imaging situations [78]. To investigate the effectiveness of the proposed CRKEM-SIFT, a series of tests on images with various rotations, scales, and viewpoints is performed, and the functionality of the proposed CRKEM-SIFT in the registration process is evaluated visually and by the RMSE criterion according to Eq. (14).

$$ {\text{RMSE}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {x_{i} - x_{i}^{\prime } } \right)^{2} + \left( {y_{i} - y_{i}^{\prime } } \right)^{2} }}{m} $$

(14)

In this equation, (x_i,y_i) and (x′_i, y′_i) are the coordinates of the ith matching keypoint pair. If RMSE is relatively low, it has a better performance in image registration.

4.4.1 Registration process on images with different rotations and scales

In this experiment, seven pairs of the “bough” dataset which consist of images with changes rotation and scale are used, the performance results of which in the registration process are presented in Fig. 6.

As shown in Fig. 6, the proposed CRKEM-SIFT method has better performance than others. Performance of the registration process on a pair of images is shown in Fig. 7. As shown in this figure, CRKEM-SIFT has improved the registration process.

4.4.2 Registration process on images with viewpoint changes

In this experiment, the "graffiti" dataset which consists of images with different viewpoints is used, and the functionality of the proposed CRKEM-SIFT in the registration process is evaluated, shown in Fig. 8.

Figure 8 and Table 3 show that alignment accuracy of the proposed CRKEM-SIFT compared to the others is more suitable.

Table 3 Registration process results of different methods on graffiti images with different viewpoints by classic criterion

Full size table

4.5 Experiment on the image mosaicing process

In general, the image mosaicing process includes two important steps: feature extraction and image blending. In this section, to assess these two steps, first the proposed CRKEM-SIFT method and then the proposed method (proposed CRKEM-SIFT and proposed blending method) for two and more image mosaicings are evaluated. Image mosaicing evaluation methods are divided into two categories: visual methods and objective methods. To evaluate the performance of image mosaicing algorithms objectively, several articles have used the relationship between mosaicing and reference images, including the Median error (MEE), Maximum error (MAE) [79], and RMSE. On the other hand, overlapping areas are very important in mosaicing the image process. The Structural SIMilarity (SSIM) index [80], Peak Signal-to-Noise Ratio (PSNR) [3], Feature SIMilarity Index (FSIM) index [81] and Visual Saliency-Induced Index Visual saliency (VSI) [82] are overlapping criteria. The mechanism for calculating these criteria is shown in Fig. 9.

4.5.1 Studying proposed CRKEM-SIFT method in mosaicing process

To demonstrate the performance of the proposed CRKEM-SIFT in the mosaicing process, a “Margoon waterfall^{Footnote 1}” dataset which consists of images with different viewpoints is employed in this experiment (Fig. 10).

As shown in Table 4, the proposed method has obtained the highest alignment precision versus SIFT, RKEM-SIFT, ARKEM-SIFT, which denotes the proper performance of the proposed CRKEM-SIFT in image mosaicing. The mosaicing process has not shown a good performance by SURF, MSER-SIFT and ARKEM-SIFT. An example of improper performance of image mosaicing has been shown in Fig. 9 with red squares, showing that it has not been able to mosaic the specified part properly.

Table 4 Mosaicing process results of different methods on waterfall images with different viewpoints by classic criterion

Full size table

4.5.2 Studying the proposed method for two-image mosaicing

To demonstrate the performance of mosaicing process by the proposed method (proposed CRKEM-SIFT and proposed blending method), a dataset of the “underground city of Kariz^{Footnote 2}” which consists of images with different viewpoints is employed in this experiment (Fig. 11).

As shown in Fig. 11, some boundary lines can be seen by the results of ORB-Weighted Mean [67], AKAZE-MAC [3], SIFT-Voronoi Diagram [51] and SIFT-Improved Weighted Average [77]. In addition, parts of the image are not well stitched, indicating poor performance. By the proposed blending method, the blending process is well observed without any boundary lines.

As seen in Table 5, the proposed method has the highest value by the PSNR, SSIM, FSIM and VSI evaluation criteria and the lowest value by the MEE, RMSE and MAE evaluation criteria, which indicates its successful performance in image mosaicing. Following the proposed method, based on PSNR, SSIM, FSIM and VSI evaluation criteria are AKAZE-MAC [3], SIFT-improved weighted average [77], SIFT-Voronoi diagram [51] and finally ORB-Weighted Mean [67] for success in image mosaicing, respectively.

Table 5 Mosaicing process results of blending methods

Full size table

4.5.3 Studying the proposed method for several images mosaicing process

In this experiment, a “building” dataset which consists of images with angle changes is used, the results of the mosaicing process on which are shown in Fig. 12. As it can be seen in this figure, the image mosaicing process by methods [74,75,76,77] have a large number of boundary lines (red squares in Fig. 12), but only one boundary line is shown in the proposed method, indicating the effective performance of the proposed method. In the SIFT-Weighted Average method [75], in addition to the boundary line, a false edge (blue square in Fig. 12) is also observed.

5 Conclusion and future works

In this paper, a new approach for natural image mosaicing process using a combination of two proposed methods CRKEM-SIFT and a Gaussian-Weighted blending method was introduced. At first, the features were extracted from the images using the proposed CRKEM-SIFT algorithm, which individually showed that the proposed CRKEM-SIFT increased the matching precision and ultimately improved the performance of image mosaicing. Then, the mean and variance of the pixels of common areas were considered as the mean and variance in the Gaussian Weight function, showing the enhancement of the image blending. Therefore, the proposed method seems to be improving the accuracy of the natural image mosaicing. Application of the proposed CRKEM-SIFT method in other research fields such as image tracking might reasonably be the topic of future works.

Notes

Margoon Waterfall in Fars Province, Iran.
Uunderground city of Kariz, Kish, Iran.

References

Kekec, T., Yildirim, A., Unel, M.: A new approach to real-time mosaicing of aerial images. Robot. Auton. Syst. 62, 1755–1767 (2014)
Article Google Scholar
Vaghela D, Naina P. A review of image mosaicing techniques. arXiv preprint arXiv:1405.2539, (2014)
Sharma, S.K., Jain, K.: Image stitching using AKAZE features. J. Indian Soc. Remote Sens. 48, 1389–1401 (2020)
Article Google Scholar
Wang, Z., Yang, Z.: Review on image-stitching techniques. Multimed. Syst. 26, 1–18 (2020)
Article Google Scholar
Saha, M., Chakraborty, M., Biswas, T.: An improved approach for document image mosaicing. Int. J. 6, 51–55 (2016)
Google Scholar
Kaur, J.: A robust technique for image mosaicing using modified. Indian J. Sci. Technol. (2016). https://doi.org/10.17485/ijst/2016/v9i47/101722
Article Google Scholar
Jinwei, C., Bin, G., Gangxiang, G.: Image registration and mosaicking based on the criterion of four collinear points. DEStech Trans. Eng. Technol. Res. (2016). https://doi.org/10.12783/dtetr/ICMITE20162016/4576
Article Google Scholar
Yan, W., Yue, G., Fang, Y., Chen, H., Tang, C., Jiang, G.: Perceptual objective quality assessment of stereoscopic stitched images. Signal Process. 172, 107541 (2020)
Article Google Scholar
Zhang, Y., Lai, Y.-K., Zhang, F.-L.: Stereoscopic image stitching with rectangular boundaries. Vis. Comput. 35, 823–835 (2019)
Article Google Scholar
Irani, M., Hsu, S., Anandan, P.: Video compression using mosaic representations. Signal Process.: Image Commun. 7, 529–552 (1995)
Google Scholar
Hu R, Shi R, Shen I.-F, Chen W. Video stabilization using scale-invariant features. In Information Visualization, 2007. IV'07. 11th International Conference, (2007), pp. 871–877
Okade, M., Biswas, P.K.: Improving video stabilization using multi-resolution MSER features. IETE J. Res. 60, 373–380 (2014)
Article Google Scholar
Niu, C., Zhong, F., Xu, S., Yang, C., Qin, X.: Cylindrical panoramic mosaicing from a pipeline video through MRF based optimization. Vis. Comput. 29, 253–263 (2013)
Article Google Scholar
Choi Y.-H, Seong Y. K, Choi T.-S. Image mosaicing with automatic scene segmentation for video indexing. In: Consumer Electronics, 2002. ICCE. 2002 Digest of Technical Papers. International Conference on, (2002), pp. 74-75
Szeliski R, Shum H-Y. Creating full view panoramic image mosaics and environment maps. In: Proceedings of the 24th annual conference on Computer graphics and interactive techniques, pp. 251–258. (1997)
Zhang, T., Zhao, R., Chen, Z.: Application of migration image registration algorithm based on improved SURF in remote sensing image mosaic. IEEE Access 8, 163637–163645 (2020)
Article Google Scholar
Gracias N, Costeira J. P, Victor J. Linear global mosaics for underwater surveying. In 5th IFAC Symposium on Intelligent Autonomous Vehicles, pp. 78–83. (2004)
Zhang X, Zhu X. An accurate and efficient image registration algorithm in the aerial infrared images. In: Eleventh International Conference on Graphics and Image Processing (ICGIP 2019), p. 113730W. (2020)
Deshmukh P, Paikrao P. A review of various image mosaicing techniques. In: 2019 Innovations in Power and Advanced Computing Technologies (i-PACT), pp. 1–4. (2019)
Szeliski, R.: Image alignment and stitching: a tutorial. Found. Trends® Comput. Gr. Vis. 2, 1–104 (2006)
MathSciNet MATH Google Scholar
Shum, H.-Y., Szeliski, R.: Construction of panoramic image mosaics with global and local alignment. In: Benosman, R., Kang, S.B. (eds.) Panoramic Vision, pp. 227–268. Springer, New York (2001)
Chapter Google Scholar
Wei, L., Zhong, Z., Lang, C., Yi, Z.: A survey on image and video stitching. Virtual Real. Intell. Hardw. 1, 55–83 (2019)
Article Google Scholar
Jain, P.M., Shandliya, V.K.: A review paper on various approaches for image mosaicing. Int. J. Comput. Eng. Res. 3, 106–109 (2013)
Google Scholar
Monali R, Moonka S, Priya A, Tripathy S. S. Effects of noise and relative overlap on image mosaicing using SURF features. In: Recent Trends in Electronics, Information & Communication Technology (RTEICT), IEEE International Conference on, pp. 773–777. (2016)
Adel E, Elmogy M, Elbakry H. Image stitching based on feature extraction techniques: a survey. In: International Journal of Computer Applications (0975–8887) Volume, pp. 1–8, (2014)
Krishnakumar, K., Gandhi, S.I.: Video stitching based on multi-view spatiotemporal feature points and grid-based matching. Vis. Comput. 36, 1837–1846 (2020)
Article Google Scholar
Pandey, A., Pati, U.C.: Panorama generation using feature-based mosaicing and modified graph-cut blending. In: Pant, M., Ray, K., Sharma, T.K., Rawat, S., Bandyopadhyay, A. (eds.) Soft Computing: Theories and Applications. Springer, Singapore (2018)
Google Scholar
Mistry, S., Patel, A.: Image stitching using Harris feature detection. Int. Res. J. Eng. Technol. (IRJET) 3, 2220–2226 (2016)
Google Scholar
Bheda, D., Joshi, M., Agrawal, V.: A study on features extraction techniques for image mosaicing. Int. J. Innov. Res. Comput. Commun. Eng. 2, 3432–3437 (2014)
Google Scholar
Khan, H.A., Haider, M.A., Ansari, H.A., Ishaq, H., Kiyani, A., Sohail, K., et al.: Automated feature detection in dental periapical radiographs by using deep learning. Oral Surg., Oral Med., Oral Pathol. Oral Radiol. 131, 711–720 (2020)
Article Google Scholar
Bhowmik A, Gumhold S, Rother C, Brachmann E. Reinforced feature points: optimizing feature detection and description for a high-level task. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 4948–4957. (2020)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60, 91–110 (2004)
Article Google Scholar
Derpanis K. G. The harris corner detector," York University, vol. 2, (2004)
Yang, A., Yang, X., Wu, W., Liu, H., Zhuansun, Y.: Research on feature extraction of tumor image based on convolutional neural network. IEEE Access 7, 24204–24213 (2019)
Article Google Scholar
Kamboj, A., Rani, R., Nigam, A.: A comprehensive survey and deep learning-based approach for human recognition using ear biometric. Vis. Comput. (2021). https://doi.org/10.1007/s00371-021-02119-0
Article Google Scholar
Scheidegger, F., Istrate, R., Mariani, G., Benini, L., Bekas, C., Malossi, C.: Efficient image dataset classification difficulty estimation for predicting deep-learning accuracy. Vis. Comput. 37, 1–18 (2020)
Google Scholar
Hemanth, D.J., Estrela, V.V.: Deep Learning for Image Processing Applications. IOS Press, Amsterdam (2017)
Google Scholar
Jiao, L., Zhao, J.: A survey on the new generation of deep learning in image processing. IEEE Access 7, 172231–172263 (2019)
Article Google Scholar
Joshi, K., Patel, M.I.: Recent advances in local feature detector and descriptor: a literature survey. Int. J. Multimed. Inf. Retr. 9, 1–17 (2020)
Google Scholar
Ghosh, D., Kaabouch, N.: A survey on image mosaicing techniques. J. Vis. Commun. Image Represent. 34, 1–11 (2016)
Article Google Scholar
Prathap K. S. V, Jilani S, Reddy P. R A critical review on image mosaicing. In: Computer Communication and Informatics (ICCCI), 2016 International Conference on, pp. 1–8 (2016)
Bhosle, U., Chaudhuri, S., Roy, S.D.: A fast method for image mosaicing using geometric hashing. IETE J. Res. 48, 317–324 (2002)
Article Google Scholar
Vishwakarma, A., Bhuy, M.: Image mosaicking using improved auto-sorting algorithm and local difference-based harris features. Multimed. Tools Appl. 79, 1–18 (2020)
Article Google Scholar
Zagrouba, E., Barhoumi, W., Amri, S.: An efficient image-mosaicing method based on multifeature matching. Mach. Vis. Appl. 20, 139–162 (2009)
Article Google Scholar
Kang P, Ma H. An automatic airborne image mosaicing method based on the SIFT feature matching. In: Multimedia Technology (ICMT), 2011 International Conference on, pp. 155–159. (2011)
Murali, Y., Madanapalle, M.: Image mosaic using speeded up robust feature detection. Image 1, 40–45 (2012)
Google Scholar
Prathap, K.S.V., Jilani, S., Reddy, P.R.: A real-time image mosaicing using scale invariant feature transform. Indian J. Sci. Technol. 9, 1–6 (2016)
Article Google Scholar
Hossein-nejad Z, Nasri M. Image registration based on SIFT features and adaptive RANSAC transform. In: Communication and Signal Processing (ICCSP), 2016 International Conference on. pp. 1087–1091. (2016)
Hossein-Nejad, Z., Nasri, M.: An adaptive image registration method based on SIFT features and RANSAC transform. Comput. Electr. Eng. 62, 524–537 (2017)
Article Google Scholar
Hossein-Nejad, Z., Agahi, H., Mahmoodzadeh, A.: Detailed review of the scale invariant feature transform (sift) algorithm; concepts, indices and applications. J. Mach. Vis. Image Process. 7, 165–190 (2020)
Google Scholar
Laraqui, A., Baataoui, A., Saaidi, A., Jarrar, A., Masrar, M., Satori, K.: Image mosaicing using voronoi diagram. Multimed. Tools Appl. 76, 8803–8829 (2017)
Article Google Scholar
Laraqui, A., Saaidi, A., Satori, K.: MSIP: multi-scale image pre-processing method applied in image mosaic. Multimed. Tools Appl. 77, 7517–7537 (2018)
Article Google Scholar
Ke Y, Sukthankar R. PCA-SIFT: A more distinctive representation for local image descriptors. In: Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on, pp. II-506-II-513 Vol. 2. (2004)
Bay, H., Tuytelaars, T., Van Gool, L.: Surf: Speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) Computer Vision–ECCV 2006, pp. 404–417. Springer, Berlin (2006)
Chapter Google Scholar
Cheung W, Hamarneh G. N-sift: N-dimensional scale invariant feature transform for matching medical images. In: Biomedical Imaging: From Nano to Macro, 2007. ISBI 2007. 4th IEEE International Symposium on, pp. 720–723. (2007)
Yi, Z., Zhiguo, C., Yang, X.: Multi-spectral remote image registration based on SIFT. Electron. Lett. 44, 107–108 (2008)
Article Google Scholar
Lingua, A., Marenchino, D., Nex, F.: Performance analysis of the SIFT operator for automatic feature extraction and matching in photogrammetric applications. Sensors 9, 3745–3766 (2009)
Article Google Scholar
Morel, J.-M., Yu, G.: ASIFT: A new framework for fully affine invariant image comparison. SIAM J. Imag. Sci. 2, 438–469 (2009)
Article MathSciNet MATH Google Scholar
Tamimi, H., Andreasson, H., Treptow, A., Duckett, T., Zell, A.: Localization of mobile robots with omnidirectional vision using particle filter and iterative sift. Robot. Auton. Syst. 54, 758–765 (2006)
Article Google Scholar
Sedaghat, A., Mokhtarzade, M., Ebadi, H.: Uniform robust scale-invariant feature matching for optical remote sensing images. IEEE Trans. Geosci. Remote Sens. 49, 4516–4527 (2011)
Article Google Scholar
Hossein-Nejad, Z., Nasri, M.: RKEM: redundant keypoint elimination method in image registration. IET Image Proc. 11, 273–284 (2017)
Article Google Scholar
Hossein-Nejad, Z., Nasri, M.: A-RANSAC: adaptive random sample consensus method in multimodal retinal image registration. Biomed. Signal Process. Control 45, 325–338 (2018)
Article Google Scholar
Hossein-Nejad Z, Nasri M. Retinal image registration based on auto-adaptive SIFT and redundant keypoint elimination method. In: 2019 27th Iranian Conference on Electrical Engineering (ICEE), pp. 1294–1297 (2019)
Liu, Y., Yu, D., Chen, X., Li, Z., Fan, J.: TOP-SIFT: the selected SIFT descriptor based on dictionary learning. Vis. Comput. 35, 667–677 (2019)
Article Google Scholar
Hossein-Nejad, Z., Nasri, M.: Copy-move image forgery detection using redundant keypoint elimination method. In: Ramakrishnan, S. (ed.) Cryptographic and Information Security Approaches for Images and Videos, pp. 773–797. CRC Press, Boca Raton (2019)
Google Scholar
Yonghong, J.: Fusion of landsat TM and SAR image based on principal component analysis. Remote Sens. Technol. Appl. 13, 46–49 (1998)
Google Scholar
Tian F, Shi P. Image mosaic using orb descriptor and improved blending algorithm. In: Image and Signal Processing (CISP), 2014 7th International Congress on, pp. 693–698 (2014)
Chipman L. J, Orr T. M, Graham L. N. Wavelets and image fusion. In: Image Processing, 1995. Proceedings., International Conference on, pp. 248-251 (1995)
Li, H., Manjunath, B., Mitra, S.K.: Multisensor image fusion using the wavelet transform. Gr Models Image Process. 57, 235–245 (1995)
Article Google Scholar
Burt, P.J., Adelson, E.H.: A multiresolution spline with application to image mosaics. ACM Trans. Gr. (TOG) 2, 217–236 (1983)
Article Google Scholar
Li A, Zhou S, Wang R. An improved method for eliminating ghosting in image stitching. In: 2017 9th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), pp. 415–418 (2017)
Zhang, Q., Wang, Y., Wang, L.: Registration of images with affine geometric distortion based on maximally stable extremal regions and phase congruency. Image Vis. Comput. 36, 23–39 (2015)
Article Google Scholar
Hossein-Nejad, Z., Agahi, H., Mahmoodzadeh, A.: Image matching based on the adaptive redundant keypoint elimination method in the SIFT algorithm. Pattern Anal. Appl. 24, 669–683 (2020)
Article Google Scholar
Hong J, Lin W, Zhang H, Li L. Image mosaic based on surf feature matching. In: 2009 First International Conference on Information Science and Engineering, pp. 1287–1290 (2009)
Zhen Y, Sun Z, Li J, Peng Y. An airborne remote sensing image mosaic algorithm based on feature points. In: 2016 Sixth International Conference on Instrumentation & Measurement, Computer, Communication and Control (IMCCC), pp. 202–205 (2016)
Zhang, W., Li, X., Yu, J., Kumar, M., Mao, Y.: Remote sensing image mosaic technology based on SURF algorithm in agriculture. J. Image Video Proc. 2018(85), 2018 (2018)
Google Scholar
Ai, Y., Kan, J.: Image mosaicing based on improved optimal seam-cutting (January 2020). IEEE Access 8, 181526–181533 (2020)
Article Google Scholar
Ma, W., Wen, Z., Wu, Y., Jiao, L., Gong, M., Zheng, Y., et al.: Remote sensing image registration with modified SIFT and enhanced feature matching. IEEE Geosci. Remote Sens. Lett. 14, 3–7 (2017)
Article Google Scholar
Tang, H., Pan, A., Yang, Y., Yang, K., Luo, Y., Zhang, S., et al.: Retinal image registration based on robust non-rigid point matching method. J. Med. Imag. Health Inform. 8, 240–249 (2018)
Article Google Scholar
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13, 600–612 (2004)
Article Google Scholar
Zhang, L., Zhang, L., Mou, X., Zhang, D.: FSIM: a feature similarity index for image quality assessment. IEEE Trans. Image Process. 20, 2378–2386 (2011)
Article MathSciNet MATH Google Scholar
Zhang, L., Shen, Y., Li, H.: VSI: A visual saliency-induced index for perceptual image quality assessment. IEEE Trans. Image Process. 23, 4270–4281 (2014)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their valuable comments which improve the quality of the paper.

Author information

Authors and Affiliations

Department of Electrical Engineering, Sirjan Branch, Islamic Azad University, Sirjan, Iran
Zahra Hossein-Nejad
Department of Electrical Engineering, Khomeinishahr Branch, Islamic Azad University, Khomeinishahr, Iran
Mehdi Nasri

Authors

Zahra Hossein-Nejad
View author publications
You can also search for this author in PubMed Google Scholar
Mehdi Nasri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mehdi Nasri.

Ethics declarations

Conflict of interest

No conflict of interest exists in the submission of this manuscript, and all the authors have agreed to have seen and approved the manuscript for submission.

Human participants or animals

The paper does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hossein-Nejad, Z., Nasri, M. Clustered redundant keypoint elimination method for image mosaicing using a new Gaussian-weighted blending algorithm. Vis Comput 38, 1991–2007 (2022). https://doi.org/10.1007/s00371-021-02261-9

Download citation

Accepted: 12 July 2021
Published: 19 July 2021
Issue Date: June 2022
DOI: https://doi.org/10.1007/s00371-021-02261-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Clustered redundant keypoint elimination method for image mosaicing using a new Gaussian-weighted blending algorithm

Abstract

Similar content being viewed by others

Image mosaicking using improved auto-sorting algorithm and local difference-based harris features

Image Stitching using AKAZE Features

Image stitching by feature positioning and seam elimination

Explore related subjects

1 Introduction

2 Classical algorithms and their problems in image mosaicing

2.1 RKEM-SIFT algorithm

2.2 RKEM-SIFT problems

2.3 Blending algorithms

2.4 Problems of blending methods

3 Method

3.1 Proposed CRKEM-SIFT method in feature extraction

3.2 Proposed Gaussian-weighted method in blending

4 Simulation results

4.1 Image database

4.2 Effects of different threshold values on preservation of image details in RKEM and CRKEM-SIFT

4.3 Experiment in the matching process

4.3.1 Matching process on images with varied scales

4.3.2 Matching process on images with viewpoint changes

4.4 Experiment on the image registration process

4.4.1 Registration process on images with different rotations and scales

4.4.2 Registration process on images with viewpoint changes

4.5 Experiment on the image mosaicing process

4.5.1 Studying proposed CRKEM-SIFT method in mosaicing process

4.5.2 Studying the proposed method for two-image mosaicing

4.5.3 Studying the proposed method for several images mosaicing process

5 Conclusion and future works

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Human participants or animals

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation