Symmetry-Aware Human Shape Correspondence Using Skeleton

Xu, Zongyi; Zhang, Qianni

doi:10.1007/978-3-319-27671-7_53


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 9516)

Included in the following conference series:

International Conference on Multimedia Modeling


Abstract

In this paper, we propose a symmetry-aware method for extracting human shape correspondences. We address the symmetric flip problem, which arises when establishing correspondences between intrinsically symmetric models, and improve the accuracy of the final corresponding pairs. To this end, we extend a state-of-the-art approach by using skeleton information to remove symmetrically flipped correspondences. Traditional approaches that rely only on surface geometry can hardly discriminate between symmetric surface points. With the appearance of inexpensive RGB-D cameras such as Kinect, skeleton information can easily be obtained along with the mesh. Therefore, after the initial correspondences are obtained, we expand the candidate set for each point on the template and then use the skeleton to remove the symmetrically flipped false candidates. Among the remaining candidates, the final correspondences are those with minimum geodesic distortion with respect to a base vertex set, which is formed by sampling the mesh. Experiments demonstrate that the proposed method effectively removes all symmetrically flipped candidates. Moreover, the final correspondence pairs are more accurate than those of the state of the art.



1 Introduction

Shape correspondence is a fundamental problem in many research topics such as 3D mesh retrieval, shape registration and mesh deformation. A 3D shape correspondence is a mapping from one point set on the source mesh to another on the target mesh. There are three kinds of mapping: one-to-one, one-to-many and many-to-one. In this paper, we aim to establish accurate one-to-one correspondences between isometric human models that are intrinsically symmetric.

The goal of shape correspondence is to find point pairs that are similar or semantically equivalent. Isometric shapes appear in various contexts, such as different poses of an articulated human model or two shapes representing different but semantically similar objects [16]. Isometric shape correspondence is in high demand since most real-world deformations are isometric. Moreover, correspondences between isometric shapes have practical value. For instance, deformation based on an isometric template is much more efficient because the shapes are similar. If two shapes are perfectly isometric, the geodesic distance between two points on one shape is the same as the geodesic distance between their correspondences on the other shape [16].

Embedding-based methods are popular techniques for the 3D shape correspondence problem. In these methods, the original mesh is embedded into a new domain where isometric deviation can be measured and optimized. Euclidean embedding can be achieved with various techniques such as classical multidimensional scaling (MDS) [14], least-squares MDS [6], heat kernel embedding [10] and spectral embedding [7]. Besides embedding methods, other approaches [15, 16] minimize the isometric distortion directly in 3D Euclidean space. However, most existing algorithms tend to be confused by intrinsically symmetric features and suffer from symmetric flips. They can hardly discriminate symmetric points on the surface even if the mesh to be matched is not perfectly symmetric. It is therefore common for a point on the right hand of the source mesh to be matched to a point on the left hand of the target mesh, as shown in Fig. 1.

We propose a method for finding correspondences between isometric human shape models that is able to solve the symmetric flipping problem. Our idea is to incorporate skeleton information to distinguish intrinsic symmetry. Given two meshes with their skeletons, we first use local features to find one-to-many correspondences between the two meshes. The candidate set for each feature point is distributed symmetrically on the mesh. A skeleton segment associated with surface points is capable of discriminating between the symmetric sides. The final correspondence is located and refined by minimizing the isometric distortion with respect to the base vertex set.

In summary, our contributions are: (a) we integrate skeleton information to robustly address the symmetric flip problem, which still exists in state-of-the-art techniques; (b) we take advantage of the base vertex set to refine the final one-to-one correspondence and achieve better accuracy.

2 Related Work

Shape correspondence is a long-standing and well-studied problem. In areas such as shape matching, 3D shape retrieval, mesh registration and 3D reconstruction, many recent efforts have been made to find corresponding points on two meshes.

The SCAPE model [1] uses markers to manually locate correspondences between two meshes. Besides manual assignment, many works develop local surface descriptors to establish correspondences automatically. Some extend local descriptors from 2D images to triangulated meshes, such as MeshHOG [18] or 3D shape context [9]. Embedding-based methods are more reliable under isometric deformation. Multidimensional scaling (MDS) [14] approximates geodesic distance with Euclidean distance in the embedding space. Dey et al. [5] use the Global Point Signature (GPS) [13] for spectral embedding of meshes and thereby find the mesh extremities. Sahillioglu et al. [15] also transfer vertices into the spectral domain and optimize the result using the expectation-maximization algorithm. However, these methods sometimes produce false correspondences due to model symmetries. Ovsjanikov et al. [11] first identify the intrinsic symmetry of an object in a quotient space and then factor it out. Zhang et al. [19] differentiate intrinsically symmetric points by calculating a signed angle field from the gradient fields of the harmonic field derived from four points on the hands and feet. For scan data produced by RGB-D cameras such as Kinect, many imperfections make it harder to find correspondences. Holes and non-smooth meshes make geodesic distances difficult to compute. Noise and missing data degrade the matrix structure on which embedding-based methods rely. Jiang et al. [8] and Zheng et al. [20] detect the intrinsic symmetry of point clouds using a skeleton, but the skeleton they use is itself derived from the surface or the point cloud.

Sahillioglu et al. [16] propose a coarse-to-fine scheme to track symmetric flips. Although this method is more accurate than previous embedding-based ones, it still produces symmetrically flipped false pairs even at the final level. We compare our method with theirs in terms of both accuracy and the handling of symmetric flips. The strength of our approach is that it takes skeleton information into account.

3 Skeleton-Based Symmetry-Aware Approach

Intrinsic symmetry leads to symmetrically flipped correspondences between two meshes. Neither embedding-based methods such as MDS and GMDS nor local descriptors can differentiate them effectively. Previous works that rely solely on surface-related information, e.g. geodesic distance or face normals, are unable to solve the symmetry problem completely. However, given skeleton information in which different skeleton segments carry different labels and each surface point is associated with a skeleton segment, the symmetric flipping problem can be addressed. Moreover, with the appearance of the Kinect camera, the skeleton of a mesh can be obtained with ease. To perform skeleton attachment, we use an algorithm based on the work in [2], whose input is the joint positions tracked by Kinect and whose output is the skeleton attached to the human model. In the following, we first discuss how to obtain the candidate set for a source point, and then present our method for addressing the symmetric flip problem and for refining the final correspondence so that it is more accurate both visually and semantically.

3.1 Correspondence Candidate Set

As mentioned before, to make sure that the candidate set includes the correct correspondence whenever possible, we first compute one-to-many correspondences using the Heat Kernel Signature (HKS) [17]: we select the top N most similar points to construct the candidate set, which is shown in Fig. 2(a). To compute the heat at point i at time $t$, we first compute the eigendecomposition of the Laplace-Beltrami operator L of the mesh. Let $\Lambda $ be the diagonal matrix of the eigenvalues of L and $\varPhi $ be the matrix of the corresponding eigenvectors. The heat kernel of the mesh is then computed as in Eq. 1:

$$\begin{aligned} K_t=\varPhi \exp (-t\Lambda )\varPhi ^T \end{aligned}$$

(1)

Each entry $k_t(i,j)$ of $K_t$ represents the heat diffusion between points i and j. The diagonal elements of this matrix make up the HKS. Thus, the HKS feature of point $p_i$ is a vector whose entry $k_{t_j}(p_i,p_i)$ is the heat at point i at time $t_j$:

$$\begin{aligned} \left\{ k_{t_1}(p_i,p_i),k_{t_2}(p_i,p_i),\ldots ,k_{t_n}(p_i,p_i) \right\} \end{aligned}$$

(2)
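As a minimal sketch of Eqs. 1 and 2, the diagonal of the heat kernel can be evaluated directly from the eigenpairs as $k_t(p_i,p_i)=\sum _k e^{-t\lambda _k}\varPhi _{ik}^2$. The function name `heat_kernel_signature`, the list-based data layout and the toy two-vertex Laplacian are illustrative assumptions, and the eigendecomposition is assumed to have been computed beforehand by a separate eigensolver.

```python
import math

def heat_kernel_signature(eigenvalues, eigenvectors, times):
    """Return hks[i][j] = k_{t_j}(p_i, p_i) = sum_k exp(-t_j * lambda_k) * Phi[i][k]**2.

    eigenvalues:  list of floats lambda_k (assumed precomputed)
    eigenvectors: one row per vertex, Phi[i][k] is eigenvector k at vertex i
    times:        diffusion times t_1..t_n (hypothetical values, chosen by the user)
    """
    signature = []
    for row in eigenvectors:
        signature.append([
            sum(math.exp(-t * lam) * phi * phi for lam, phi in zip(eigenvalues, row))
            for t in times
        ])
    return signature

# Toy example (assumption): graph Laplacian [[1, -1], [-1, 1]] of a single edge,
# whose eigenvalues are 0 and 2 with eigenvectors (1, 1)/sqrt(2) and (1, -1)/sqrt(2).
s = 1.0 / math.sqrt(2.0)
eigenvalues = [0.0, 2.0]
eigenvectors = [[s, s], [s, -s]]
hks = heat_kernel_signature(eigenvalues, eigenvectors, times=[0.1, 1.0])
```

On this symmetric toy graph both vertices receive the same signature, which is a small-scale picture of why a purely intrinsic descriptor cannot separate symmetric points.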

When the dissimilarity between the HKS of a template point and that of a target point, defined in Eq. 3, is less than a threshold t, the target point is selected as a candidate for the template point.

$$\begin{aligned} \varDelta s = ||HKS(p_t) - HKS(p_s)||, \end{aligned}$$

(3)

where HKS(p) is the heat kernel signature at point p, and $p_t$ and $p_s$ are points on the template and the target, respectively. Here, we apply the scale-invariant HKS (si-HKS) [3] to compute the features of the meshes.
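The candidate selection around Eq. 3 can be sketched as follows. The text mentions both a top-N rule and a threshold, so this hypothetical helper `candidate_set` supports either or both; the Euclidean norm and all numeric values are assumptions made for illustration.

```python
import math

def hks_dissimilarity(sig_a, sig_b):
    """Eq. 3 with the norm taken as the Euclidean norm (an assumption)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(sig_a, sig_b)))

def candidate_set(template_sig, target_sigs, top_n=None, threshold=None):
    """Return target vertex indices ordered by increasing dissimilarity.

    threshold keeps only targets with dissimilarity below it;
    top_n keeps at most the N most similar of those.
    """
    scored = sorted(
        (hks_dissimilarity(template_sig, sig), idx)
        for idx, sig in enumerate(target_sigs)
    )
    if threshold is not None:
        scored = [(d, idx) for d, idx in scored if d < threshold]
    if top_n is not None:
        scored = scored[:top_n]
    return [idx for _, idx in scored]

# Toy signatures (made-up numbers): targets 0 and 2 resemble the template point.
target_sigs = [[1.0, 0.5], [0.2, 0.1], [1.0, 0.52], [0.9, 0.5]]
candidates = candidate_set([1.0, 0.5], target_sigs, top_n=3, threshold=0.05)
```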

After the initial correspondence is obtained with si-HKS, an expanded set of candidate points is obtained, as shown in Fig. 2(a). As can be observed, the expanded candidates for a point on the right foot of the source model are distributed over both feet of the target model, reflecting the symmetry of the shape.

3.2 Skeleton-Based Symmetry-Aware Shape Correspondence

To locate the single correspondence for a template point, the next step is to remove the symmetrically flipped points. The skeleton is an important cue for filtering flipped correspondences. As shown in Fig. 3, the skeleton divides the mesh into 17 parts; each mesh part attached to a segment has a unique label, so a right extremity and its left counterpart have different labels. Therefore, our method is able to discriminate points on the right from their counterparts on the left, addressing the symmetric flip problem. Candidate points that lie on the same skeleton segment as the template point are kept; all other candidates are removed. The filtered candidate set for the template point is shown in Fig. 2(b).
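The filtering rule above amounts to a label comparison. The sketch below assumes that every target vertex already carries the label of its attached skeleton segment and that template and target use the same labeling; the label strings and vertex indices are hypothetical.

```python
def filter_by_segment(template_label, candidates, target_labels):
    """Keep only candidates attached to the same skeleton segment as the template point."""
    return [c for c in candidates if target_labels[c] == template_label]

# Hypothetical labels: candidates for a right-foot template point fall on both feet.
target_labels = {10: "right_foot", 11: "right_foot", 42: "left_foot", 43: "left_foot"}
kept = filter_by_segment("right_foot", [10, 42, 11, 43], target_labels)
```

The order of the surviving candidates is preserved, so a ranking by descriptor similarity from the previous step carries over.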

3.3 One-to-One Correspondence

After the symmetric flip problem is solved, the remaining candidates need to be further filtered to find the one-to-one correspondence pair. Therefore, the next step in our method uses the sum of relative distances from candidates to the base vertex set to filter invalid candidates.

The base vertex set [15] is selected based on Gaussian curvature. This process is illustrated in Fig. 4. Initially, at each vertex of the original mesh, we compute the Gaussian curvature using the simple scheme proposed in [12], given by Eq. 4.

$$\begin{aligned} gc(p)=3(2 \pi -\sum \alpha _i)/\sum A(f_i), \end{aligned}$$

(4)

where $A(f_i)$ is the area of a face $f_i$ adjacent to the vertex and $\alpha _i$ is the angle of $f_i$ at the vertex. We then sort the vertices in descending order of curvature, as in Fig. 4(a), and choose the top vertex as the first base vertex, e.g. the marked point $(x_1,y_1,z_1)$ in Fig. 4(b). Then, as shown in Fig. 4(c), we compute geodesic distances from this vertex and mark all neighboring points lying within a radius r. In our experiments, we use Dijkstra’s shortest path algorithm to compute the geodesic distance between two vertices, as in Eq. 5. The weight of each edge on a Dijkstra path is the Euclidean distance between neighboring vertices, given by Eq. 6.
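The angle-deficit curvature of Eq. 4 can be sketched as below for a triangle mesh given as vertex coordinates and index triples. The function name and data layout are assumptions, the mesh is assumed to be closed and free of degenerate (zero-area) triangles, and the toy input is a regular tetrahedron rather than a human model.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0])

def _norm(a):
    return math.sqrt(_dot(a, a))

def gaussian_curvature(vertices, faces):
    """Eq. 4: gc(p) = 3 * (2*pi - sum of incident angles) / (sum of adjacent face areas)."""
    angle_sum = [0.0] * len(vertices)
    area_sum = [0.0] * len(vertices)
    for face in faces:
        for k in range(3):
            i, j, l = face[k], face[(k + 1) % 3], face[(k + 2) % 3]
            e1 = _sub(vertices[j], vertices[i])
            e2 = _sub(vertices[l], vertices[i])
            cos_a = _dot(e1, e2) / (_norm(e1) * _norm(e2))
            angle_sum[i] += math.acos(max(-1.0, min(1.0, cos_a)))  # clamp for safety
            area_sum[i] += 0.5 * _norm(_cross(e1, e2))
    return [3.0 * (2.0 * math.pi - a) / s if s > 0.0 else 0.0
            for a, s in zip(angle_sum, area_sum)]

# Toy mesh (assumption): regular tetrahedron with edge length 2*sqrt(2).
tet_vertices = [(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)]
tet_faces = [(0, 1, 2), (0, 3, 1), (0, 2, 3), (1, 3, 2)]
gc = gaussian_curvature(tet_vertices, tet_faces)
```

Each tetrahedron vertex sees three angles of $\pi /3$ and three faces of area $2\sqrt{3}$, so every entry of `gc` equals $\pi /(2\sqrt{3})$.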

$$\begin{aligned} g(i,j)=\sum _{i\in P}\omega _i \end{aligned}$$

(5)

$$\begin{aligned} \omega _i=\min _{v_k\in N_i}||v_i-v_k||, \end{aligned}$$

(6)

where $N_i$ is the set of neighbors of point i. The next base vertex is the first unmarked vertex in the list, e.g. $(x_3,y_3,z_3)$ in Fig. 4(d). This process is repeated until all points are marked and the base vertex set is built. The final base vertex set is illustrated in Fig. 4(f). Given the base vertex set $\phi $, we compute the relative surface distance from each candidate to $\phi $ with Eq. 7. The candidate C with the minimum relative distance to $\phi $ is regarded as the final correspondence, as shown in Fig. 2(c).
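The base vertex sampling described above can be sketched with a standard Dijkstra search over the mesh edge graph. Following the prose, each edge is weighted by its Euclidean length and a path length is the sum of its edge weights; the function names, the adjacency-dict layout, the radius and the toy chain graph are assumptions made for illustration.

```python
import heapq
import math

def build_graph(vertices, faces):
    """Adjacency dict of the mesh edges, weighted by Euclidean edge length."""
    graph = {i: {} for i in range(len(vertices))}
    for a, b, c in faces:
        for u, v in ((a, b), (b, c), (c, a)):
            w = math.dist(vertices[u], vertices[v])
            graph[u][v] = w
            graph[v][u] = w
    return graph

def geodesic_distances(graph, source, max_dist=math.inf):
    """Dijkstra shortest-path lengths from source, limited to max_dist."""
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, math.inf):
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd <= max_dist and nd < dist.get(v, math.inf):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist

def sample_base_vertices(graph, curvature, radius):
    """Pick unmarked vertices in descending curvature order, marking each one's radius-r patch."""
    order = sorted(range(len(curvature)), key=lambda i: curvature[i], reverse=True)
    marked, base = set(), []
    for v in order:
        if v in marked:
            continue
        base.append(v)
        marked.update(geodesic_distances(graph, v, max_dist=radius))
    return base

# Toy example (assumption): five vertices on a chain with unit-length edges.
chain = {0: {1: 1.0}, 1: {0: 1.0, 2: 1.0}, 2: {1: 1.0, 3: 1.0},
         3: {2: 1.0, 4: 1.0}, 4: {3: 1.0}}
curvature = [0.1, 0.9, 0.2, 0.3, 0.8]
base = sample_base_vertices(chain, curvature, radius=1.0)
```

Vertex 1 is chosen first and marks vertices 0 to 2; vertex 4 is the next unmarked vertex in curvature order and marks the rest, so the sampling stops with two base vertices.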

$$\begin{aligned} D_{iso}\left( c_i,\phi \right) =\sum _{\left( v_j\in \phi , c_i\in \varTheta \right) }g(c_i, v_j) \end{aligned}$$

(7)

$$\begin{aligned} C=\arg \min _{c_i\in \varTheta } \left( D_{iso}\left( c_i,\phi \right) -D_{iso}\left( p,\phi \right) \right) \end{aligned}$$

(8)

Here, g(., .) is the geodesic distance between two vertices and p is the point on the template. Once the distances from the candidates to the base vertex set are computed, we select the candidate with the minimum relative distance to the base vertex set as the final correspondence, as given by Eq. 8.
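A minimal sketch of the selection in Eqs. 7 and 8 follows. Eq. 8 is printed without an absolute value; the sketch compares the magnitude of the difference, which is an assumption about the intended meaning of "relative distance". It further assumes that the geodesic distances to the base vertex sets have already been computed on both meshes, and all names and numbers are hypothetical.

```python
def isometric_distance(geodesics_to_base):
    """Eq. 7: sum of geodesic distances from one point to every base vertex."""
    return sum(geodesics_to_base)

def select_correspondence(template_geodesics, candidate_geodesics):
    """Eq. 8 read with an absolute difference (assumption).

    template_geodesics:  distances from template point p to its base vertex set
    candidate_geodesics: {candidate id: distances from that candidate to the
                          base vertex set on the target mesh}
    """
    d_p = isometric_distance(template_geodesics)
    return min(
        candidate_geodesics,
        key=lambda c: abs(isometric_distance(candidate_geodesics[c]) - d_p),
    )

# Made-up distances: candidate 9 best reproduces the template point's total of 6.0.
template_geodesics = [1.0, 2.0, 3.0]
candidate_geodesics = {7: [1.1, 2.1, 3.3], 9: [0.9, 2.0, 3.2], 12: [0.5, 1.0, 2.0]}
best = select_correspondence(template_geodesics, candidate_geodesics)
```

With a signed difference, candidate 12 would win simply because its total distance is smallest, which is why the sketch uses the magnitude.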

4 Experiments

4.1 Dataset

The SCAPE human dataset [1] was built by Anguelov et al. in 2005. It is composed of a pose dataset and a shape dataset. The pose dataset contains scans of 70 different poses of a particular person. The shape dataset consists of 45 different people in similar but not identical poses. In the pose dataset, one mesh is chosen as the template mesh and the others are denoted instance meshes. Each mesh has 25000 triangle faces and 12500 vertices. Although the original work made use of both shape and pose data, only the pose data is distributed together with its skeleton information. Meshes in the SCAPE model are hole-filled using the algorithm of Davis et al. [4]. The SCAPE model also constructs a skeleton for the template mesh, based on the fact that vertices on the same skeleton joint are spatially contiguous and exhibit similar motion across different scans. Thus, after scanning the pose instances of a particular person, the authors decompose the mesh into several approximately rigid parts and recover the location of the parts in the different pose instances as well as the articulated skeleton linking the parts. Based on the pose dataset, a tree-structured articulated skeleton with 16 parts is constructed automatically. Since the SCAPE dataset contains both symmetric and variously deformed shapes, we use it to evaluate our method with respect to symmetrically flipped correspondences as well as the accuracy of the final correspondences.

4.2 Performance Evaluation

We compare our method with the latest shape correspondence algorithm, coarse-to-fine combinatorial matching (C2FCM) [16]. First, we compare the results visually in Fig. 5. Our method outperforms C2FCM with respect to semantic equivalence. For the same point on the third toe of the template, C2FCM finds its correspondence on the sole of the foot, whereas our method locates it almost on the third toe. Second, the average geodesic error is compared with that of C2FCM in Table 1. For every proportion of correspondences, the geodesic errors of our method are lower than those of C2FCM. For the average geodesic error over all correspondences, shown in the 100 % column, our method also outperforms C2FCM, which means that it finds correspondences more accurately. More results are shown in Fig. 6. The correspondences of the coarse-to-fine algorithm, shown in the top row, exhibit symmetry for the template point (they lie on both the left and the right foot). Our method finds unique correspondences that are more accurate than those of [16] in terms of semantics. Moreover, it successfully removes the symmetrically flipped invalid correspondences and achieves an accurate one-to-one mapping from the template to the target meshes.

Table 1. Comparison of our method with C2FCM method


5 Conclusion

In summary, we present a robust method to address the symmetric flip problem in shape correspondence. The approach effectively removes flipped correspondences by introducing skeleton information and by minimising distortion error. It locates one-to-one, semantically similar correspondences more accurately. Experimental results indicate that the proposed approach outperforms traditional approaches that rely only on surface information.

In the future, we hope to investigate mesh data obtained from cheap scanners such as Kinect, in order to find correspondences between a template mesh and a scanned mesh. The focus will be on tackling the intrinsic challenges posed by the incomplete and noisy data from which such scanned meshes are built.

References

1. Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: SCAPE: shape completion and animation of people. ACM Trans. Graph. (TOG) 24, 408–416 (2005). ACM
2. Baran, I., Popović, J.: Automatic rigging and animation of 3D characters. ACM Trans. Graph. (TOG) 26, 72 (2007). ACM
3. Bronstein, M.M., Kokkinos, I.: Scale-invariant heat kernel signatures for non-rigid shape recognition. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1704–1711. IEEE (2010)
4. Davis, J., Marschner, S.R., Garr, M., Levoy, M.: Filling holes in complex surfaces using volumetric diffusion. In: Proceedings of the First International Symposium on 3D Data Processing Visualization and Transmission, pp. 428–441. IEEE (2002)
5. Dey, T.K., Fu, B., Wang, H., Wang, L.: Automatic posing of a meshed human model using point clouds. Comput. Graph. 46, 14–24 (2015)
6. Elad, A., Kimmel, R.: On bending invariant signatures for surfaces. IEEE Trans. Pattern Anal. Mach. Intell. 25(10), 1285–1295 (2003)
7. Jain, V., Zhang, H.: Robust 3D shape correspondence in the spectral domain. In: IEEE International Conference on Shape Modeling and Applications, SMI 2006, pp. 19–19. IEEE (2006)
8. Jiang, W., Xu, K., Cheng, Z.Q., Zhang, H.: Skeleton-based intrinsic symmetry detection on point clouds. Graph. Models 75(4), 177–188 (2013)
9. Körtgen, M., Park, G.J., Novotni, M., Klein, R.: 3D shape matching with 3D shape contexts. In: The 7th Central European Seminar on Computer Graphics, vol. 3, pp. 5–17 (2003)
10. Ovsjanikov, M., Mérigot, Q., Mémoli, F., Guibas, L.: One point isometric matching with the heat kernel. Comput. Graph. Forum 29, 1555–1564 (2010). Wiley Online Library
11. Ovsjanikov, M., Mérigot, Q., Pătrăucean, V., Guibas, L.: Shape matching via quotient spaces. Comput. Graph. Forum 32, 1–11 (2013). Wiley Online Library
12. Rugis, J., Klette, R.: Surface curvature extraction for 3D image analysis or surface rendering
13. Rustamov, R.M.: Laplace-Beltrami eigenfunctions for deformation invariant shape representation. In: Proceedings of the Fifth Eurographics Symposium on Geometry Processing, pp. 225–233. Eurographics Association (2007)
14. Sahillioğlu, Y., Yemez, Y.: 3D shape correspondence by isometry-driven greedy optimization. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 453–458. IEEE (2010)
15. Sahillioğlu, Y., Yemez, Y.: Minimum-distortion isometric shape correspondence using EM algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2203–2215 (2012)
16. Sahillioğlu, Y., Yemez, Y.: Coarse-to-fine isometric shape correspondence by tracking symmetric flips. Comput. Graph. Forum 32, 177–189 (2013). Wiley Online Library
17. Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. Comput. Graph. Forum 28, 1383–1392 (2009). Wiley Online Library
18. Zaharescu, A., Boyer, E., Varanasi, K., Horaud, R.: Surface feature detection and description with applications to mesh matching. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 373–380. IEEE (2009)
19. Zhang, Z., Yin, K., Foong, K.W.: Symmetry robust descriptor for non-rigid surface matching. Comput. Graph. Forum 32, 355–362 (2013). Wiley Online Library
20. Zheng, Q., Hao, Z., Huang, H., Xu, K., Zhang, H., Cohen-Or, D., Chen, B.: Skeleton-intrinsic symmetrization of shapes. Comput. Graph. Forum 34, 275–286 (2015). Wiley Online Library


Author information

Authors and Affiliations

Queen Mary University of London, Mile End, London, E1 4NS, UK
Zongyi Xu & Qianni Zhang


Corresponding author

Correspondence to Qianni Zhang.

Editor information

Editors and Affiliations

University of Texas at San Antonio, San Antonio, USA
Qi Tian
Dept. of Information Engineering, University of Trento, Povo, Trento, Italy
Nicu Sebe
EECS, University of Central Florida, Orlando, Florida, USA
Guo-Jun Qi
EURECOM, Sophia-Antipolis, France
Benoit Huet
Hefei University of Technology, Hefei, Anhui, China
Richang Hong
School of Computing and Information, Hefei University of Technology, Hefei, Anhui, China
Xueliang Liu

About this paper

Cite this paper

Xu, Z., Zhang, Q. (2016). Symmetry-Aware Human Shape Correspondence Using Skeleton. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science, vol 9516. Springer, Cham. https://doi.org/10.1007/978-3-319-27671-7_53


DOI: https://doi.org/10.1007/978-3-319-27671-7_53
Published: 03 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27670-0
Online ISBN: 978-3-319-27671-7
eBook Packages: Computer Science, Computer Science (R0)
