Abstract
In real applications, the same object may appear with different shapes depending on the moment and the angle of image acquisition, and a complete contour cannot always be extracted without being disturbed by noise or distortion. In this paper, we propose a new method for matching partially occluded shapes based on the affine curvature scale space. First, an affine curve re-parameterization is defined, inspired by the properties of the affine curvature scale space (ACSS) shape descriptor. Then, the different parts are matched by minimizing the \( L_{2} \) distance: following the affine curve matching (ACM) algorithm, the translation and the linear transformation are estimated through the computation of a pseudo-inverse matrix. Finally, a curve matching algorithm is obtained that handles any planar affine transformation and any case of partial occlusion. Experiments are conducted on the multi-view curve dataset.
5.1 Introduction
The shape matching of planar curves that are subject to occlusion and viewing transformations is motivated by a broad range of problems arising in fields such as robot navigation [1, 2], medical image matching [3, 4], face recognition [5, 6], and object tracking [7, 8].
Research on shape matching is a classical, well-studied field. However, several challenges remain open, such as noise, distortion, and deformation. They have two main causes. First, images of the same object taken at different moments and from different angles suffer from perspective distortions [9]. This transformation can be approximated by a two-dimensional affine transformation [10] if the object is planar and far enough from the image plane. Second, the extracted curve sometimes represents only a small part of the shape [11].
It is not obvious how to define an approach that satisfies these different criteria, so an appropriate trade-off is necessary. In the last decades, various methods for shape matching have been developed. The best-known approaches include the affine-invariant Fourier descriptors (AIFD) method [12, 13], shape context (SC) methods [14], the inner distance shape context (IDSC) [15], the height function (HF) descriptor [16], and so on. However, they assume that the complete shape can be extracted from the images [17], whereas in reality the extracted shape may be affected by many distortions [18] such as noise, articulations, and missing contour portions, as well as affine transformations. In [19], Chen et al. introduce a partial shape matching method based on the Smith–Waterman algorithm. Latecki et al. [20] define an elastic partial shape matching to model distortion and occlusion of shapes. In [21], the authors propose a matching algorithm based on dynamic time warping. Only a few methods treat both affine and open curve matching [9, 22]. In [17], Mai et al. define an affine-invariant partial shape matching approach in which each contour is divided into affine-invariant segments using the local maxima of the curvature scale space (CSS). Then, the different parts are matched using the Smith–Waterman algorithm. However, the Smith–Waterman (SW) algorithm is sensitive to position jitter of a point sequence, as indicated in [9]. Moreover, Huijing Fu et al. [9] present an affine planar shape matching method and exploit it for partial object recognition, where an affine-invariant curve descriptor (AICD) based on the affine-invariant signature is defined. A partial curve matching algorithm is then developed by combining the AICD with a curve segmentation strategy based on inflection points. Yet, their method has limited accuracy in the presence of noise because of the large number of derivatives it requires [23].
In this paper, we are interested in part-to-part affine shape matching. The partial shape matching problem can be stated as follows: given two partial shapes as input, often called source and target, the goal is to recover the affine transformation that optimizes the pairing between the source and the target. To achieve this pairing, we must define the same parameterization for both curves. The underlying idea is to divide each shape into ordered affine-invariant segments, inspired by the expression of the affine curvature scale space [24]. Then, to handle the matching problem, we estimate the affine transformation parameters using the affine curve matching (ACM) algorithm [25]. This algorithm minimizes an approximation of the \( L_{2} \) distance between pairs through the computation of a pseudo-inverse matrix. In the experiments, we evaluate the performance of the proposed approach in the context of shape retrieval and compare it with different affine curve matching methods from the state of the art.
The remainder of the paper is organized as follows. Section 5.2 gives a detailed description of the proposed curve matching algorithm. Section 5.3 investigates the effectiveness of the proposed approach through experiments and analyses. Finally, Sect. 5.4 concludes the paper.
5.2 Affine Curvature Scale Space Curve Matching Algorithm
Here, we define our new partial contour matching algorithm based on the curvature scale space. This section first recalls the affine re-parameterization based on the affine curvature scale space (ACSS) [24]. After that, we recall the recently developed affine curve matching (ACM) algorithm [25] and apply the pseudo-inverse matrix to estimate the translation vector B and the special linear transformation A that relate the two curves, that is, a special affine transformation.
5.2.1 Affine Curvature Scale Space Descriptor (ACSS)
Let Ω1 and Ω2 be two planar shapes represented by closed or open continuous curves. We extract the contour of each shape; the two contours are represented by the parameterizations \( \Omega_{1} = [f(t) = (f^{x}(t), f^{y}(t))] \; (t = 1, 2, \ldots, N) \) and \( \Omega_{2} = [h(t^{\prime}) = (h^{x}(t^{\prime}), h^{y}(t^{\prime}))] \; (t^{\prime} = 1, 2, \ldots, N) \), respectively, whose relation is defined by:
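Assuming the usual affine model, in which t and \( t^{\prime} \) denote corresponding parameter values on the two contours, this relation can be written as follows (the condition on the determinant is what "special linear" means):

```latex
\[ h(t^{\prime}) = A\,f(t) + B, \qquad \det(A) = 1 \]
```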
where B is a translation vector and A is a special linear transformation. Each curve can obviously be represented by different parameterizations. Therefore, we cannot assume that two contours of the same planar shape observed from different viewpoints share the same parameterization. To handle this problem, we must ensure that the parameterization is independent of transformations and distortions, so the contour points need to be re-parameterized. In [24], it is shown that the locations of the contour local maxima in the affine curvature scale space (ACSS) image are invariant under affine transformations. Moreover, they are robust to noise, as indicated in [17]. The underlying idea is therefore to perform an affine re-parameterization of the curves by applying the ACSS descriptor. The main steps of the ACSS method, as described in [24], are as follows. First, we re-parameterize the contour using the affine length function l(t) defined by:
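Assuming the usual equi-affine arc length, taken in absolute value and normalized by the total length L (both choices are assumptions made here), this function can be written as:

```latex
\[ l(t) = \frac{1}{L} \int_{0}^{t} \left| \det\big( \dot{f}(u), \ddot{f}(u) \big) \right|^{1/3} \, du \]
```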
where the total affine length L of the considered curve is given by:
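Assuming the usual equi-affine arc length taken in absolute value, with the contour parameterized on [0, T], the total length reads:

```latex
\[ L = \int_{0}^{T} \left| \det\big( \dot{f}(u), \ddot{f}(u) \big) \right|^{1/3} \, du
     = \int_{0}^{T} \left| \dot{f}^{x}(u)\,\ddot{f}^{y}(u) - \ddot{f}^{x}(u)\,\dot{f}^{y}(u) \right|^{1/3} \, du \]
```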
where \( \dot{f} \) and \( \ddot{f} \) denote, respectively, the first and second derivatives of f, and T is a positive real number. After re-parameterization by this affine length, the relation between the two curves becomes:
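Under the assumption that corresponding points of the two contours receive the same normalized affine length (that is, the starting points already coincide), the relation takes the form:

```latex
\[ h^{*}(l) = A\,f^{*}(l) + B, \qquad l \in [0, 1] \]
```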
where \( h^{*} \) and \( f^{*} \) denote the curves re-parameterized by this affine length. In the rest of the paper, we write f for \( f^{*} \) and h for \( h^{*} \) to simplify the notation. Then, we compute the curvature function \( k(l) \), expressed by:
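Assuming the usual expression for the curvature of a planar curve, which is consistent with the derivatives listed next, \( k(l) \) reads:

```latex
\[ k(l) = \frac{\dot{f}^{x}(l)\,\ddot{f}^{y}(l) - \ddot{f}^{x}(l)\,\dot{f}^{y}(l)}
               {\left( \dot{f}^{x}(l)^{2} + \dot{f}^{y}(l)^{2} \right)^{3/2}} \]
```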
where \( \dot{f}^{x} (l) \), \( \dot{f}^{y} (l) \) and \( \ddot{f}^{x} (l) \), \( \ddot{f}^{y} (l) \) are the first and second derivatives of the components of f with respect to l. If each component of the curve is convolved with g(l, σ), a 1-D Gaussian kernel of width σ, then \( f_{\sigma }^{x} \left( {l,\sigma } \right) \) and \( f_{\sigma }^{y} \left( {l,\sigma } \right) \) are the components of the resulting curve \( f_{\sigma } \):
where ⊛ is the convolution operator. The curvature of \( f_{\sigma } \) is given by:
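Written out component by component, and assuming the standard normalized Gaussian for g, the smoothed curve is:

```latex
\[ f_{\sigma}^{x}(l,\sigma) = f^{x}(l) \circledast g(l,\sigma), \qquad
   f_{\sigma}^{y}(l,\sigma) = f^{y}(l) \circledast g(l,\sigma), \qquad
   g(l,\sigma) = \frac{1}{\sigma\sqrt{2\pi}} \exp\!\left( -\frac{l^{2}}{2\sigma^{2}} \right) \]
```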
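Assuming the same curvature expression applied to the smoothed components, that is \( k(l,\sigma) = (\dot{f}_{\sigma}^{x} \ddot{f}_{\sigma}^{y} - \ddot{f}_{\sigma}^{x} \dot{f}_{\sigma}^{y}) / ((\dot{f}_{\sigma}^{x})^{2} + (\dot{f}_{\sigma}^{y})^{2})^{3/2} \), the steps of this section can be sketched in Python with NumPy. This is only an illustration under stated assumptions, not the implementation of [24]: the contour is taken to be closed and stored as an (N, 2) array, derivatives are approximated by periodic finite differences, σ is expressed in samples rather than in units of l, and all function names are hypothetical.

```python
import numpy as np


def _d1(v):
    # Periodic central first difference (unit sample spacing).
    return (np.roll(v, -1) - np.roll(v, 1)) / 2.0


def _d2(v):
    # Periodic second difference (unit sample spacing).
    return np.roll(v, -1) - 2.0 * v + np.roll(v, 1)


def affine_length(curve):
    """Normalized affine length l in [0, 1) at each sample of a closed (N, 2) curve."""
    x, y = curve[:, 0], curve[:, 1]
    # |det(f', f'')|^(1/3), the affine arc-length density.
    density = np.abs(_d1(x) * _d2(y) - _d2(x) * _d1(y)) ** (1.0 / 3.0)
    # Trapezoid rule on each segment k -> k+1, including the closing segment.
    steps = 0.5 * (density + np.roll(density, -1))
    return np.concatenate(([0.0], np.cumsum(steps)[:-1])) / steps.sum()


def resample_by_affine_length(curve, n):
    """Resample the contour at n values of l that are uniformly spaced in [0, 1)."""
    l_ext = np.append(affine_length(curve), 1.0)  # close the loop at l = 1
    closed = np.vstack([curve, curve[:1]])
    targets = np.arange(n) / n
    # np.interp expects increasing abscissae, so the density should not vanish
    # (inflection points would need extra care in a real implementation).
    return np.column_stack(
        [np.interp(targets, l_ext, closed[:, k]) for k in range(2)]
    )


def gaussian_smooth(v, sigma):
    """Periodic convolution of one contour component with a sampled Gaussian."""
    radius = int(np.ceil(4.0 * sigma))
    u = np.arange(-radius, radius + 1)
    g = np.exp(-0.5 * (u / sigma) ** 2)
    g /= g.sum()
    padded = np.pad(v, radius, mode="wrap")
    return np.convolve(padded, g, mode="valid")


def curvature_at_scale(curve, sigma):
    """Curvature of the smoothed contour f_sigma at every sample."""
    xs = gaussian_smooth(curve[:, 0], sigma)
    ys = gaussian_smooth(curve[:, 1], sigma)
    num = _d1(xs) * _d2(ys) - _d2(xs) * _d1(ys)
    den = (_d1(xs) ** 2 + _d1(ys) ** 2) ** 1.5
    return num / den
```

Because the curvature expression does not depend on the sampling step, unit-spaced differences are sufficient here. Evaluating `curvature_at_scale` on the resampled contour for increasing values of σ gives the rows of a scale-space image of the kind the ACSS descriptor relies on.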
5.2.2 Affine Curve Matching (ACM) Algorithm
To solve the matching problem, we must find A and B in order to estimate the relation and the motion between the different contours, as indicated in [25]. The re-parameterization of the two curves by the ACSS gives the following rectangular system of 2N equations in 6 unknowns:
where f(l) and h(l) are the re-parameterizations of the two contours f(t) and \( h\left( {t^{\prime}} \right) \), respectively, B = (\( B^{x} \); \( B^{y} \)) and \( A = \left( {a_{ij} } \right)_{1 \le i,j \le 2} \). Our goal is to minimize the error e between the two contours by estimating A and B; this error is defined by:
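Assuming N common sample values \( l_{1}, \ldots, l_{N} \) of the affine length, the system can be written out as below; in practice the smoothed components \( f_{\sigma} \) and \( h_{\sigma} \) would take the place of f and h.

```latex
\[ \begin{cases}
h^{x}(l_{k}) = a_{11}\,f^{x}(l_{k}) + a_{12}\,f^{y}(l_{k}) + B^{x} \\
h^{y}(l_{k}) = a_{21}\,f^{x}(l_{k}) + a_{22}\,f^{y}(l_{k}) + B^{y}
\end{cases} \qquad k = 1, \ldots, N \]
```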
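Assuming the discrete squared \( L_{2} \) distance over N sample values \( l_{1}, \ldots, l_{N} \) (the exact normalization used in [25] is not known here), the error can be written as:

```latex
\[ e = \sum_{k=1}^{N} \left\| h(l_{k}) - A\,f(l_{k}) - B \right\|^{2} \]
```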
This system can be written in matrix notation:
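Assuming D denotes the 2N × 6 matrix built from the samples of the first contour, the matrix form is:

```latex
\[ H = D\,U \]
```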
where \( U = \left[ {a_{11} \;a_{12} \;a_{21} \;a_{22} \;B^{x} \;B^{y} } \right]^{t} \), \( H = \left[ {h_{\sigma }^{x} \left( {l_{1} } \right),h_{\sigma }^{y} \left( {l_{1} } \right) \ldots h_{\sigma }^{x} \left( {l_{N} } \right),h_{\sigma }^{y} \left( {l_{N} } \right)} \right]^{t} \), and the matrix D is given by:
In [25], the authors apply the least squares method to solve this overdetermined system of linear equations, in which there are more equations than unknowns. The system can thus be solved by minimizing the error, inverting it with the pseudo-inverse of the matrix D, as indicated in [25].
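Given the ordering of the entries of U and H stated above, a matrix D consistent with the system has the following form (a reconstruction from these orderings):

```latex
\[ D = \begin{bmatrix}
f_{\sigma}^{x}(l_{1}) & f_{\sigma}^{y}(l_{1}) & 0 & 0 & 1 & 0 \\
0 & 0 & f_{\sigma}^{x}(l_{1}) & f_{\sigma}^{y}(l_{1}) & 0 & 1 \\
\vdots & \vdots & \vdots & \vdots & \vdots & \vdots \\
f_{\sigma}^{x}(l_{N}) & f_{\sigma}^{y}(l_{N}) & 0 & 0 & 1 & 0 \\
0 & 0 & f_{\sigma}^{x}(l_{N}) & f_{\sigma}^{y}(l_{N}) & 0 & 1
\end{bmatrix} \]
```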
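Assuming D has full column rank, which holds as soon as the sample points are not all collinear, the least squares solution obtained with the pseudo-inverse \( D^{+} \) is:

```latex
\[ \hat{U} = D^{+} H = \left( D^{t} D \right)^{-1} D^{t} H \]
```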
Then, we calculate the normal matrix \( \left( {D^{t} D} \right) \) which has the following expression:
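For the 2N × 6 matrix D whose rows alternate \( [f_{\sigma}^{x}(l_{k}) \; f_{\sigma}^{y}(l_{k}) \; 0 \; 0 \; 1 \; 0] \) and \( [0 \; 0 \; f_{\sigma}^{x}(l_{k}) \; f_{\sigma}^{y}(l_{k}) \; 0 \; 1] \), the product works out as below. The symbols \( S_{xx} \), \( S_{xy} \) and \( S_{yy} \) are shorthand introduced here for the sums of products of the smoothed components; the original may present the matrix with a different normalization.

```latex
\[ D^{t} D = \begin{bmatrix}
S_{xx} & S_{xy} & 0 & 0 & N\overline{X} & 0 \\
S_{xy} & S_{yy} & 0 & 0 & N\overline{Y} & 0 \\
0 & 0 & S_{xx} & S_{xy} & 0 & N\overline{X} \\
0 & 0 & S_{xy} & S_{yy} & 0 & N\overline{Y} \\
N\overline{X} & N\overline{Y} & 0 & 0 & N & 0 \\
0 & 0 & N\overline{X} & N\overline{Y} & 0 & N
\end{bmatrix}, \qquad
S_{xx} = \sum_{k=1}^{N} f_{\sigma}^{x}(l_{k})^{2}, \;
S_{xy} = \sum_{k=1}^{N} f_{\sigma}^{x}(l_{k})\,f_{\sigma}^{y}(l_{k}), \;
S_{yy} = \sum_{k=1}^{N} f_{\sigma}^{y}(l_{k})^{2} \]
```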
where \( \overline{X} = \frac{1}{N}\mathop \sum \nolimits_{k = 1}^{N} f_{\sigma }^{x} \left( {l_{k} } \right) \) and \( \overline{Y} = \frac{1}{N}\mathop \sum \nolimits_{k = 1}^{N} f_{\sigma }^{y} \left( {l_{k} } \right) \) are the mean coordinates of the smoothed contour.
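The estimation step of this section can be sketched in Python with NumPy as follows. This is a minimal illustration rather than the implementation of [25]: it assumes both contours are already re-parameterized and sampled at the same N values of l, it stores them as (N, 2) arrays, and the function names are hypothetical.

```python
import numpy as np


def build_design_matrix(f):
    """Build the (2N, 6) matrix D for U = [a11, a12, a21, a22, Bx, By]^t."""
    n = f.shape[0]
    D = np.zeros((2 * n, 6))
    D[0::2, 0:2] = f   # x-equations: a11 * fx + a12 * fy
    D[0::2, 4] = 1.0   #              + Bx
    D[1::2, 2:4] = f   # y-equations: a21 * fx + a22 * fy
    D[1::2, 5] = 1.0   #              + By
    return D


def estimate_affine(f, h):
    """Least squares estimate of (A, B) such that h is close to A f + B."""
    D = build_design_matrix(f)
    H = h.reshape(-1)              # [hx(l1), hy(l1), ..., hx(lN), hy(lN)]
    U = np.linalg.pinv(D) @ H      # pseudo-inverse solution
    A = U[:4].reshape(2, 2)
    B = U[4:]
    residual = np.linalg.norm(D @ U - H)
    return A, B, residual
```

For example, if `h = f @ A_true.T + B_true` for some matrix `A_true` and vector `B_true`, then `estimate_affine(f, h)` returns them up to rounding error, with a residual close to zero. For noisy or partially matching segments, the residual can serve as a dissimilarity between the two curves.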
5.3 Experiments
In this section, we report the recognition rates of the proposed approach and compare it with existing shape matching methods. The experiments are carried out on the multi-view curve dataset (MCD). First, we evaluate the performance of our approach in shape retrieval. Then, we demonstrate the performance of the proposed algorithm on shape registration. Finally, we analyze the complexity of our approach.
5.3.1 Retrieval Accuracy
To evaluate our algorithm on the shape retrieval task, we compute the bulls-eye score, as defined in [26, 27], on the multi-view curve dataset (MCD) [28], since it contains shapes that undergo affine transformations. To compute the bulls-eye score, each curve is compared with every curve of the MCD dataset (including itself). Then, the number of contours of the same class that are among the 2 * Nc most similar ones is counted, where Nc is the number of samples per class. The bulls-eye score is the ratio of the number of correct results to the highest possible number of correct results [9, 27].
The MCD contains 40 shape classes taken from the MPEG-7 dataset. Each class contains 14 curve samples that correspond to different perspective distortions of the same shape. Samples of contours from the MCD dataset are shown in Fig. 5.1.
Table 5.1 compares the bulls-eye retrieval score on the first 10 contours of the MCD database obtained by our approach with those of some existing methods. In terms of average rate, the proposed approach performs reasonably well compared to other techniques such as the affine curve matching (ACM) algorithm [25], with a rate of 94%, and especially the curvature scale space Smith–Waterman (CSS-SW) approach [17]. This indicates that our algorithm is more efficient in terms of registration, since its re-parameterization step is based on the ACSS.
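The bulls-eye computation described above can be sketched in Python with NumPy as follows. This is a minimal illustration, assuming a precomputed matrix of pairwise dissimilarities between all curves and one class label per curve; the function name and the input layout are hypothetical.

```python
import numpy as np


def bulls_eye_score(dist, labels):
    """Bulls-eye score from an (M, M) dissimilarity matrix and M class labels."""
    dist = np.asarray(dist)
    labels = np.asarray(labels)
    hits = 0
    best = 0
    for i in range(len(labels)):
        nc = int(np.sum(labels == labels[i]))   # samples in the class of curve i
        # Indices of the 2 * Nc most similar curves (curve i itself included).
        top = np.argsort(dist[i], kind="stable")[: 2 * nc]
        hits += int(np.sum(labels[top] == labels[i]))
        best += nc                              # highest possible number of hits
    return hits / best
```

With the MCD figures quoted in this section (40 classes of 14 samples), each query would be checked against its 28 nearest curves, and the highest possible number of correct results would be 560 * 14.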
5.3.2 Shape Registration
Shape registration is a crucial application of the proposed algorithm [29]. Most shapes of the MCD dataset are represented by closed curves. To evaluate the proposed algorithm on partial occlusion and deformable registration, we therefore remove some parts of the contour to make it open, as indicated in [17]. Figure 5.2 shows the results of our method in the full-to-full and part-to-part registration cases.
5.3.3 Algorithm Complexity
Here, we compare the complexity of our algorithm with that of CSS-SW. In the proposed approach, the computation of the shape descriptors and the shape matching are two independent steps, so their complexities are evaluated separately. Let N be the number of sample points of the contour. For each curve, we start with the re-parameterization step based on the ACSS descriptor, which is common to both methods. In the matching step, the proposed algorithm runs in O(N): the number of unknowns is fixed at 6, so building the 6 × 6 normal matrix takes a number of operations linear in N and inverting it takes constant time. In contrast, the complexity of the Smith–Waterman matching is reported as O(\( N^{3} \)) [9], since it requires an SVD computation.
5.4 Conclusion
In this paper, we propose a new matching method that can deal with both occlusion and affine transformations. First, the contour is divided into affine-invariant segments by applying the ACSS descriptor. Then, we estimate the affine transformation using the ACM algorithm. As a result, an affine curve matching is achieved. Experimental results show that the proposed algorithm is simple and can cope with partial occlusion and affine transformations. In future work, we will apply our algorithm in different application domains such as remote sensing and robotic recognition.
References
1. Diedrich, W., Latecki, L.J.: Shape matching for robot mapping. In: Pacific Rim International Conference on Artificial Intelligence. Springer, Berlin, Heidelberg (2004)
2. Marius, M., Lowe, D.: Fast matching of binary features. In: Ninth Conference on Computer and Robot Vision (CRV). IEEE (2012)
3. Hava, L., Arridge, S.: A survey of hierarchical non-linear medical image registration. Pattern Recogn. 32(1), 129–149 (1999)
4. Hemamalini, G., Prakash, J.: Medical image analysis of image segmentation and registration techniques. Int. J. Eng. Technol. (IJET) 8(5), 2234–2241 (2016)
5. Ehsan Fazl, E., Zelek, J.: Local feature matching for face recognition. In: The 3rd Canadian Conference on Computer and Robot Vision. IEEE (2006)
6. Jilin, T., Huan, T., Tao, H.: Face as mouse through visual face tracking. In: Proceedings of the 2nd Canadian Conference on Computer and Robot Vision. IEEE (2005)
7. Yixin, C., Das, M., Bajpai, D.: Vehicle tracking and distance estimation based on multiple image features. In: Fourth Canadian Conference on Computer and Robot Vision, CRV’07. IEEE (2007)
8. Marc, L.: Real-time eye blink detection with GPU-based SIFT tracking. In: Fourth Canadian Conference on Computer and Robot Vision, CRV’07. IEEE (2007)
9. Huijing, F.: Novel affine-invariant curve descriptor for curve matching and occluded object recognition. IET Comput. Vis. 7(4), 279–292 (2013)
10. Forsyth, D.: Invariant descriptors for 3D object recognition and pose. IEEE Trans. Pattern Anal. Mach. Intell. 10, 971–991 (1991)
11. Turney, J.L., Trevor Mudge N., Richard A.V.: Recognizing partially occluded parts. IEEE Trans. Pattern Anal. Mach. Intell. 4, 410–421 (1985)
12. Ghorbel, F.: Towards a unitary formulation for invariant image description: application to image coding. Annales des Telecommun. 53, 242–260. Springer (1992)
13. Arbter, K.: Application of affine-invariant Fourier descriptors to recognition of 3-D objects. IEEE Trans. Pattern Anal. Mach. Intell. 12(7), 640–647 (1990)
14. Mori, G., Serge, B., Jitendra, M.: Efficient shape matching using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 27(11), 1832–1837 (2005)
15. Ling, H., David, W.: Shape classification using the inner-distance. IEEE Trans. Pattern Anal. Mach. Intell. 29(2), 286–299 (2007)
16. Wang, J.: Shape matching and classification using height functions. Pattern Recogn. Lett. 33(2), 134–143 (2012)
17. Mai, F., Chang, C.Q., Hung, Y.S.: Affine-invariant shape matching and recognition under partial occlusion. In: 17th IEEE International Conference on Image Processing (ICIP). IEEE (2010)
18. Yang, C., Hui, W., Qian, Y.: A novel method for 2D nonrigid partial shape matching. Neurocomputing 275, 1160–1176 (2018)
19. Chen, L., Rogerio, F., Turk, M.: Efficient partial shape matching using Smith–Waterman algorithm. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. IEEE (2008)
20. Latecki, L., et al.: An elastic partial shape matching technique. Pattern Recogn. 40(11), 3069–3080 (2007)
21. Bouagar, S., Slimane, L.: Efficient descriptor for full and partial shape matching. Multimedia Tools Appl. 75(6), 2989–3011 (2016)
22. Zhang, G., JiYuan Xu X., JianXin, L.: A new method for recognition partially occluded curved objects under affine transformation. In: 10th International Conference on Intelligent Systems and Knowledge Engineering (ISKE). IEEE (2015)
23. Arulmozhi, P., Abirami, S.: Shape based image retrieval: a review. Int. J. Comput. Sci. Eng. 6(4), 147 (2014)
24. Mokhtarian, F., Sadegh, A.: Affine curvature scale space with affine length parametrisation. Pattern Anal. Appl. 4(1), 1–8 (2001)
25. Elghoul, S., Ghorbel, F.: An efficient 2D curve matching algorithm under affine transformations. In: VISIGRAPP (4: VISAPP) (2018)
26. Yang, C., Wei, H., Yu, Q.: Multiscale triangular centroid distance for shape-based plant leaf recognition. In: ECAI (2016)
27. Yang, C., Wei, H., Yu, Q.: A novel method for 2D nonrigid partial shape matching. Neurocomputing 275 (2018)
28. Zuliani, M.: Affine-invariant curve matching. In: International Conference on Image Processing, ICIP’04, vol. 5. IEEE (2004)
29. Mai, F., Chang, C.Q., Hung, Y.S.: A subspace approach for matching 2D shapes under affine distortions. Pattern Recogn. 44(2), 210–221 (2011)
30. Hanbyul, J.: Graph-based robust shape matching for robotic application. In: IEEE International Conference on Robotics and Automation, ICRA’09. IEEE (2009)
31. Mark, G., Bekaert, P.: Local stereo matching with segmentation-based outlier rejection. In: The 3rd Canadian Conference on Computer and Robot Vision. IEEE (2006)
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Elghoul, S., Ghorbel, F. (2021). Partial Contour Matching Based on Affine Curvature Scale Space Descriptors. In: Kountchev, R., Mironov, R., Li, S. (eds) New Approaches for Multidimensional Signal Processing. Smart Innovation, Systems and Technologies, vol 216. Springer, Singapore. https://doi.org/10.1007/978-981-33-4676-5_5
DOI: https://doi.org/10.1007/978-981-33-4676-5_5
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4675-8
Online ISBN: 978-981-33-4676-5
eBook Packages: Engineering, Engineering (R0)