1 Introduction

Simultaneous Localization and Mapping (SLAM) is a basic prerequisite for intelligent robots and a necessary capability of driverless vehicles. Although there are many accurate and effective solutions to the SLAM problem, such as methods based on vision [1] or Lidar [2, 3], or methods that integrate their advantages [4], most existing approaches [5, 6], including these, rely on the assumption of a static world, which greatly limits their scope of application. The typical way to estimate Lidar ego-motion is to apply the iterative closest point (ICP) method to the point clouds of adjacent frames [7]. Compared with cameras, Lidar can work even in the dark, and many 3D Lidars capture the details of the environment over a long distance [6]. Therefore, this work focuses on using 3D Lidar to address the SLAM problem in urban environments containing complex outliers.

The main contributions of this paper are as follows:

  • We integrate a robust outlier elimination method based on 4-point congruent sets into a state-of-the-art SLAM framework to further improve the accuracy and robustness of odometry and mapping.

  • We use a coarse-to-fine registration method in place of the traditional single-step registration to further reduce the computation of the registration step and thus the total runtime of the system.

The rest of the paper is organized as follows: we introduce related work in Sect. 10.2, describe the proposed method in Sect. 10.3, compare our method with state-of-the-art methods in Sect. 10.4, and summarize our work and discuss future directions in Sect. 10.5.

2 Related Work

2.1 Point Cloud Registration

The overlap rate of the two point clouds to be registered is a key parameter and is usually inversely proportional to the number of outliers. If two point clouds have a large overlap rate or obvious point correspondences, their registration can be achieved by the iterative closest point (ICP) algorithm [8] and its various improved variants without initialization; otherwise, ICP falls into a local optimum. Go-ICP [9] and Gogma [10] use branch and bound in six-dimensional space to achieve global registration of point clouds without corresponding points. Because the branch and bound method has exponential complexity, these methods are computationally expensive, and they often diverge when the overlap is small [15].

Common coarse registration methods are divided into hypothesis-and-test methods, represented by Random Sample Consensus (RANSAC), and geometric feature-based methods, represented by 4PCS [11]. RANSAC and its improved variants have cubic complexity and are not suitable for large-scale point clouds. Mellado et al. [13] reduced the complexity successively to quadratic and linear by matching congruent four-point sets in the two point clouds. Theiler et al. [12] match sets of key points, which reduces the size of the set to be searched. Mohamad et al. [14] generalize the construction of the coplanar four-point basis, which further improves the efficiency of point cloud registration. Raposo and Barreto [15] use only the topological relationship and normal vectors between two points to construct matching rules, which greatly improves registration efficiency and suits point clouds with lower overlap ratios. Starting from the initial solution of coarse registration, fine registration locally minimizes a nonconvex error function to obtain the exact solution. Shan and Englot [6] suggest searching rotation and translation separately to reduce the registration complexity.

There is a large body of related work, but these methods either cannot fully meet the real-time, robustness, and high-precision requirements of Lidar SLAM or have not been systematically evaluated on Lidar point cloud datasets.

2.2 Lidar Odometry and Mapping

LOAM [2], ranked second in the odometry evaluation of the KITTI vision benchmark [16], has very low drift in short-trajectory scenarios and has greatly promoted the development of the Lidar SLAM field. However, LOAM has no loop closure detection to eliminate the continuous accumulation of errors, so it may produce significant errors over long trajectories. More importantly, LOAM does not consider dynamic objects, resulting in incorrect results in dynamic scenes. LeGo-LOAM [6] improves LOAM by adding loop closure detection to reduce motion drift and by adding clustering and segmentation modules that discard clusters with fewer than 30 points to filter noise. LOL [17] improves odometry accuracy by detecting the geometric similarity between online 3D point clouds and prior offline maps.

3 Proposed System

3.1 Task Description and Definitions

The problem addressed in this work is how to use the point cloud observed by 3D Lidar to estimate its ego-motion and build a global map for the traversed environment.

We define the measured point cloud as P, the Lidar pose as X, and the global map as M, use the superscript to denote the time stamp, and define the k-th scan point cloud as \({\varvec{P}}^{{\varvec{k}}}\), where a point is \(p_{i}^{k}\), that is, \({\varvec{P}}^{{\varvec{k}}} = \left( {p_{1}^{k} ,p_{2}^{k} , \ldots ,p_{n}^{k} } \right)\), \(p_{i}^{k} = \left( {x_{i}^{k} ,y_{i}^{k} ,z_{i}^{k} } \right)\), where n is the size of the point cloud and \(k \in {\mathbb{N}}^{ + }\).

Under the above definitions, the problem can be modeled as follows: given the map \(M^{t - 1}\) and Lidar pose \(X^{t - 1}\) at time t − 1 and the point cloud \({\varvec{P}}^{{\varvec{t}}}\) at time t, update \(X^{t}\) and \(M^{t}\), as shown in (10.1) and (10.2).

$$ X^{t} = {\mathbb{F}}\left( {X^{t - 1} ,{\varvec{P}}^{{\varvec{t}}} } \right) = {\mathbb{F}}\left( {X^{t - 1} ,{\mathbb{W}}\left( {{\varvec{P}}^{{*}{\varvec{t}}} ,W^{t} } \right)} \right) $$
(10.1)
$$ M^{t} = {\mathbb{G}}\left( {M^{t - 1} ,{\varvec{P}}^{{\varvec{t}}} } \right) = {\mathbb{G}}\left( {M^{t - 1} ,{\mathbb{W}}({\varvec{P}}^{{*}{\varvec{t}}} ,W^{t} )} \right) $$
(10.2)

where \(P^{{{*}t}}\) is the description of the real scene in the field of view at time t, \(W^{t} = (w_{1}^{t} ,w_{2}^{t} , \ldots ,w_{i}^{t} )\) collects the interference factors such as sensor noise, abnormal measurement values, and dynamic objects, and \({\mathbb{W}}\) is the unknown noise function that transforms the real scene into the measured point cloud. Next, we define the set of outliers in the point cloud caused by \(W^{t}\) that are harmful to registration as \({\varvec{P}}_{{{\varvec{out}}}}\); the remaining points are \({{\varvec{P}}}_{{\varvec{i}}{\varvec{n}}}\), namely,

$$ {\varvec{P}} = {\varvec{P}}_{{{\varvec{out}}}} \cup {\varvec{P}}_{{{\varvec{in}}}} = {\mathbb{W}}\left( {P^{*} ,W} \right) $$
(10.3)

The task of this paper is to find the optimal functions \({\mathbb{F}}\) and \({\mathbb{G}}\), and the core procedure is to find and eliminate \({{\varvec{P}}}_{{\varvec{o}}{\varvec{u}}{\varvec{t}}}\) in order to optimize them.

3.2 Segmentation

The input of the segmentation module is the original 3D Lidar point cloud \({\varvec{P}} = \left\{ {p_{1} ,p_{2} , \ldots ,p_{n} } \right\}\); the output is m clusters \(\{ P_{{C_{1} }} ,P_{{C_{2} }} , \ldots ,P_{{C_{m} }} \}\) and the corresponding cluster label vector \(L = \left( {l_{1} ,l_{2} , \ldots ,l_{n} } \right)\), where \(l_{i} \in \left[ { - 1,m} \right] \cap {\mathbb{Z}}\); label −1 indicates that the point belongs to the ground, and a positive integer gives the serial number of a non-ground cluster.

The point cloud \({\varvec{P}}^{{\varvec{k}}}\) is first projected onto the circular range image \(P_{U,V} \triangleq \{ p_{{\left( {u,v} \right)}} |u \in \left[ {1,360^\circ /R_{{{\text{hori}}}} } \right] \cap {\mathbb{Z}}\), \(v \in \left[ {1,N_{{{\text{scan}}}} } \right] \cap {\mathbb{Z}}\}\) of size \(N_{{{\text{pixel}}}} = 360^\circ /R_{{{\text{hori}}}} *N_{{{\text{scan}}}}\). The row index \({\text{row}}_{i}\) of \(p_{i}\) is shown in (10.4), and the column index \({\text{col}}_{i}\) is shown in (10.5).

$$ {\text{row}}_{i} = \left( {\tan^{ - 1} \frac{{z_{i} }}{{\sqrt {x_{i}^{2} + y_{i}^{2} } }} + {\text{Pit}}_{{{\text{bottom}}}} } \right)/R_{{{\text{vert}}}} $$
(10.4)
$$ {\text{col}}_{i} = \tan^{ - 1} \frac{{y_{i} }}{{x_{i} }}/R_{{{\text{hori}}}} $$
(10.5)

The value of each image pixel is the distance from \(p_{i}\) to the Lidar, \(r_{i} = \sqrt {x_{i}^{2} + y_{i}^{2} + z_{i}^{2} }\), where \(R_{{{\text{hori}}}}\) is the horizontal resolution of the Lidar, \(R_{{{\text{vert}}}}\) is the vertical resolution of the Lidar, \({\text{Pit}}_{{{\text{bottom}}}}\) is the minimum pitch of the Lidar beams, and \(N_{{{\text{scan}}}}\) is the number of Lidar lines. After the projection, a point \(p_{i}\) in \(P\) can be uniquely identified by its row index and column index, denoted by:

$$ p_{i} \equiv p_{{\left( {u,v} \right)}} = \left( {x_{{\left( {u,v} \right)}} ,y_{{\left( {u,v} \right)}} ,z_{{\left( {u,v} \right)}} } \right) $$
(10.6)
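To make the projection concrete, the following is a minimal Python/NumPy sketch of (10.4)–(10.6). The sensor parameters (n_scan, res_hori, res_vert, pitch_bottom) are illustrative values, not ones specified in this chapter, and shifting the yaw angle by 180° to obtain non-negative column indices is an implementation choice of this sketch.

```python
import numpy as np

def project_to_range_image(points, n_scan=64, res_hori=0.2, res_vert=0.4,
                           pitch_bottom=24.8):
    """Project an (n, 3) point cloud onto a range image following (10.4)-(10.6).
    The resolution and field-of-view values are illustrative, not from the paper."""
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    r = np.sqrt(x**2 + y**2 + z**2)                        # pixel value r_i
    pitch = np.degrees(np.arctan2(z, np.sqrt(x**2 + y**2)))
    yaw = np.degrees(np.arctan2(y, x))                     # in (-180, 180]

    row = np.floor((pitch + pitch_bottom) / res_vert).astype(int)   # (10.4)
    col = np.floor((yaw + 180.0) / res_hori).astype(int)            # (10.5), shifted

    n_col = int(360.0 / res_hori)
    image = np.full((n_scan, n_col), np.nan)
    valid = (row >= 0) & (row < n_scan) & (col >= 0) & (col < n_col)
    image[row[valid], col[valid]] = r[valid]               # p_i <-> p_(u,v), (10.6)
    return image, row, col, r
```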

Then the ground is extracted from the range image \(P_{U,V}\). For points \(p_{u,v + 1}\) and \(p_{u,v}\), where \(v \in \left[ {1,\frac{1}{2}N_{{{\text{scan}}}} } \right] \cap {\mathbb{Z}}\), if the pitch between the adjacent rows \({\text{Pit}}_{v,v + 1}\) (as shown in (10.7)) is less than 10°, both points are marked as ground points.

$$ {\text{Pit}}_{v,v + 1} = \tan^{ - 1} \frac{{z_{u,v + 1} - z_{u,v} }}{{\sqrt {(x_{u,v + 1} - x_{u,v} )^{2} + (y_{u,v + 1} - y_{u,v} )^{2} } }} $$
(10.7)
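The following is a hedged sketch of the ground extraction rule in (10.7): for each column, vertically adjacent pixels in the lower half of the image whose inter-row pitch is below 10° are marked as ground. Storing one 3D point per range-image pixel (NaN where empty) in xyz_image is our assumption about the data layout.

```python
import numpy as np

def ground_mask_from_range_image(xyz_image, pitch_thresh_deg=10.0):
    """Mark ground pixels by the pitch angle between vertically adjacent
    pixels, as in (10.7).  xyz_image: (n_scan, n_col, 3), NaN for empty pixels.
    Only the lower half of the rows is tested, as in Sect. 3.2."""
    n_scan, n_col, _ = xyz_image.shape
    ground = np.zeros((n_scan, n_col), dtype=bool)
    for v in range(n_scan // 2):               # lower beams only
        d = xyz_image[v + 1] - xyz_image[v]    # (n_col, 3) differences
        dz = d[:, 2]
        dxy = np.sqrt(d[:, 0]**2 + d[:, 1]**2)
        pitch = np.degrees(np.arctan2(dz, dxy))            # (10.7)
        is_ground = np.abs(pitch) < pitch_thresh_deg       # NaN compares False
        ground[v, is_ground] = True
        ground[v + 1, is_ground] = True
    return ground
```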

Finally, the range-image-based segmentation method [18] is applied to cluster the points into several clusters, and points from the same cluster are assigned a unique label. After this process, for \(p_{i}^{k} \in {\varvec{P}}^{{\varvec{k}}}\), we obtain \(l_{i}\), \({\text{row}}_{i}\), \({\text{col}}_{i}\), and \(r_{i}\), where \(i \in \left[ {1,N_{{{\text{pixel}}}} } \right] \cap {\mathbb{Z}}\). \({\varvec{P}}^{{\varvec{k}}}\) is divided into a ground point set and several non-ground point sets, that is, \({\varvec{P}}^{{\varvec{k}}} = {\varvec{P}}_{{\varvec{g}}} \cup {\varvec{P}}_{{{\varvec{ng}}}}\); it is worth noting that both types of points may contain outliers. Extracting features only from non-ground points reduces time consumption, and label matching and noise filtering improve the accuracy of registration.
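The clustering itself follows [18]; as a rough stand-in, the sketch below labels connected components on the range image with a simple range-difference criterion (max_range_diff) instead of the angle-based criterion of [18], and drops clusters smaller than min_cluster points as noise. Both thresholds are assumed values.

```python
import numpy as np
from collections import deque

def label_clusters(range_image, ground, max_range_diff=0.5, min_cluster=30):
    """Simplified stand-in for the range-image segmentation of [18]:
    4-connected BFS merging neighbouring non-ground pixels whose range
    difference is below max_range_diff.  Returned labels: -1 ground,
    -2 small-cluster noise, 1..m valid clusters."""
    n_scan, n_col = range_image.shape
    labels = np.zeros((n_scan, n_col), dtype=int)
    labels[ground] = -1
    next_label = 1
    for v in range(n_scan):
        for u in range(n_col):
            if labels[v, u] != 0 or np.isnan(range_image[v, u]):
                continue
            queue, members = deque([(v, u)]), [(v, u)]
            labels[v, u] = next_label
            while queue:
                cv, cu = queue.popleft()
                for dv, du in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    nv, nu = cv + dv, (cu + du) % n_col    # wrap in azimuth
                    if not 0 <= nv < n_scan or labels[nv, nu] != 0:
                        continue
                    if np.isnan(range_image[nv, nu]):
                        continue
                    if abs(range_image[nv, nu] - range_image[cv, cu]) < max_range_diff:
                        labels[nv, nu] = next_label
                        queue.append((nv, nu))
                        members.append((nv, nu))
            if len(members) < min_cluster:                 # discard small clusters
                for mv, mu in members:
                    labels[mv, mu] = -2
            else:
                next_label += 1
    return labels
```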

3.3 Feature Extraction

To reduce the computation of registration, we extract representative feature points to reduce the number of points to be registered. The feature extraction method is similar to that used in LeGo-LOAM. We first use (10.8) to evaluate the curvature \(c_{i}\) of point \(p_{i}\), defining points whose curvature is greater than a threshold \(c_{{{\text{thre}}}}\) as edge features and points whose curvature is less than \(c_{{{\text{thre}}}}\) as planar features.

$$ c_{i} = \frac{1}{{\left| {\varvec{S}} \right| \cdot \left\| {r_{i} } \right\|}}\left\| {\mathop \sum \limits_{j \in S,j \ne i} \left( {r_{j} - r_{i} } \right)} \right\| $$
(10.8)

where S is a set of consecutive points from the same row of the range image, uniformly distributed on both sides of \(p_{i}\), and |S| is the number of points in S.
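Below is a small sketch of the curvature evaluation in (10.8) over each row of the range image, together with the threshold-based split into edge and planar features. The window size half_window and the threshold c_thre are assumed values; the chapter does not give them.

```python
import numpy as np

def curvature_per_row(range_image, half_window=5):
    """Smoothness/curvature c_i of (10.8), computed per row of the range image
    with |S| = 2 * half_window neighbours around each pixel."""
    n_scan, n_col = range_image.shape
    c = np.full_like(range_image, np.nan)
    S = 2 * half_window
    for v in range(n_scan):
        r = range_image[v]
        for u in range(half_window, n_col - half_window):
            if np.isnan(r[u]):
                continue
            neigh = np.concatenate((r[u - half_window:u],
                                    r[u + 1:u + 1 + half_window]))
            if np.isnan(neigh).any():
                continue
            c[v, u] = np.abs(np.sum(neigh - r[u])) / (S * np.abs(r[u]))  # (10.8)
    return c

def classify_features(c, c_thre=0.1):
    """Edge features: c > c_thre; planar features: c < c_thre (threshold assumed)."""
    return c > c_thre, c < c_thre
```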

3.4 Lidar Odometry

The input of the Lidar odometry module is the feature points of the point cloud; the module estimates the Lidar motion between adjacent scans and eliminates the motion distortion of the point cloud through feature matching. The motion between adjacent scans is estimated by finding the rotation matrix \(\tilde{\user2{R}} \in {\text{SO}}(3)\) and translation vector \(\tilde{t} \in {\mathbb{R}}^{3}\) that minimize the point cloud registration error, as in (10.9).

$$ \tilde{\user2{T}} \triangleq \left( {\tilde{\user2{R}},\tilde{t}} \right) = \arg \min \mathop \sum \limits_{i} \left\| {\tilde{\user2{R}}^{t} p_{i}^{t} + \tilde{t} - q_{i}^{t - 1} } \right\| $$
(10.9)

where \(q_{i}^{t - 1}\) is the corresponding point at t − 1 found by \(p_{i}^{t}\).
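For intuition about (10.9), the sketch below gives the closed-form least-squares (Kabsch/Umeyama) solution for the purely point-to-point case with known correspondences. The actual odometry in LOAM/LeGo-LOAM minimizes point-to-edge and point-to-plane distances with the L-M algorithm, so this is only an illustrative simplification.

```python
import numpy as np

def best_rigid_transform(p, q):
    """Closed-form least-squares solution of the point-to-point variant of
    (10.9): find R, t minimizing sum_i ||R p_i + t - q_i||^2, where p and q
    are (n, 3) arrays of matched points."""
    p_mean, q_mean = p.mean(axis=0), q.mean(axis=0)
    H = (p - p_mean).T @ (q - q_mean)              # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])  # avoid reflection
    R = Vt.T @ D @ U.T
    t = q_mean - R @ p_mean
    return R, t
```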

LOAM describes \({\mathbb{F}}\) in detail, and LeGo-LOAM optimizes it. For brevity, this chapter focuses on the further optimization built on these methods, which is achieved by matching only the valid feature points in adjacent frames. The optimization process can be defined as:

$$ \min \left\| {\tilde{\user2{T}}^{t} - {\varvec{T}}^{{{*}{\varvec{t}}}} } \right\|\quad {\text{s}}.{\text{t}}.\;p_{i}^{t} \in {\varvec{F}}_{{{\varvec{in}}}}^{{\varvec{t}}} $$
(10.10)

where \({\varvec{T}}^{{*}{\varvec{t}}}\) is the optimal transformation matrix from \({\varvec{P}}^{{\varvec{t}}}\) to \({\varvec{P}}^{{{\varvec{t}} - 1}}\), \({\varvec{R}}^{\user2{*}}\) and \({\text{tr}}^{*}\) are the corresponding rotation matrix and translation vector, and \({\varvec{F}}_{{{\varvec{in}}}}^{{\varvec{t}}}\) is the set of valid feature points at time t, namely \({\varvec{F}}^{{\varvec{t}}} = \left( {{\varvec{F}}_{{{\varvec{out}}}}^{{\varvec{t}}} \cup {\varvec{F}}_{{{\varvec{in}}}}^{{\varvec{t}}} } \right)\). The process of finding the correspondences is shown in Fig. 10.1: the transformation is searched in the planar feature point set and the edge feature point set separately, and the computational effort is reduced by narrowing the candidate range of corresponding points. We focus on two strategies: outlier elimination and weight compensation of feature points.

Fig. 10.1
figure 1

Corresponding relation of feature points at adjacent moments

(1) Outlier elimination: To solve this problem, we use a two-step coarse-to-fine registration strategy: rough registration is first used to detect outliers, and the outliers are then excluded from the Levenberg–Marquardt (L-M) optimization. We can express the outlier feature points \({\varvec{F}}_{{{\varvec{out}}}}\) as:

$$ {\varvec{F}}_{{{\varvec{out}}}}^{{\varvec{t}}} = \left\{ {p_{i}^{t} \,|\,\left\| {{\varvec{R}}^{*} p_{i}^{t} + {\text{tr}}^{*} - q_{i}^{t - 1} } \right\| > \varepsilon \, \wedge \,p_{i}^{t} \in {\varvec{F}}^{{\varvec{t}}} } \right\} $$
(10.11)

where \(q_{i}^{t - 1}\) is the feature point corresponding to \(p_{i}^{t}\). We first define feature descriptor \({\varvec{d}}^{{\varvec{t}}}\) and then use \({\varvec{d}}^{{\varvec{t}}}\) and \({\varvec{d}}^{{{\varvec{t}} - 1}}\) to calculate pose transformation \(\tilde{\user2{T}}_{{{\text{rough}}}}\) as follows:

  1. Finding the first point in the four-point basis: We select the feature point \(f_{{\left( {u_{1} ,v_{1} } \right)}}^{t} \in {\varvec{F}}^{{\varvec{t}}}\) as the first point \(a_{1}^{t}\) in the ith range image, namely \(a_{1}^{t} \triangleq f_{{\left( {u_{1} ,v_{1} } \right)}}^{t}\), where \(u_{1}^{t} \in \left[ {1 + \frac{{360^\circ *\left( {i - 1} \right)}}{{N_{{{\text{sub}}}} *R_{{{\text{hori}}}} }},\frac{360^\circ *i}{{N_{{{\text{sub}}}} *R_{{{\text{hori}}}} }}} \right] \cap {\mathbb{Z}}\) and \(v_{1}^{t} \in \left[ {\frac{{N_{{{\text{scan}}}} }}{4},\frac{{N_{{{\text{scan}}}} }}{2}} \right] \cap [v_{1}^{t - 1} - 1,v_{1}^{t - 1} + 1] \cap {\mathbb{Z}}\).

  2. Finding the second point: We select \(a_{2}^{t} \triangleq f_{{\left( {u_{2} ,v_{2} } \right)}}^{t}\) in the sub-image farthest from \(a_{1}^{t}\), where \(u_{2}^{t} \in \left[ {1 + \frac{{360^\circ *\left( {i - 1 + \frac{{N_{{{\text{div}}}} }}{2}} \right)}}{{N_{{{\text{sub}}}} *R_{{{\text{hori}}}} }},\frac{{360^\circ *\left( {i + \frac{{N_{{{\text{div}}}} }}{2}} \right)}}{{N_{{{\text{sub}}}} *R_{{{\text{hori}}}} }}} \right] \cap {\mathbb{Z}}\), \(f_{{\left( {u_{2} ,v_{2} } \right)}}^{t} \in P_{{\left( {U,V} \right)_{{i + \frac{1}{2}N_{{{\text{sub}}}} }} }}^{t}\); the line segment with \(a_{1}^{t}\) and \(a_{2}^{t}\) as endpoints is \(l_{1}^{t} \triangleq a_{1}^{t} a_{2}^{t}\).

  3. Constructing feature descriptors: We select \(a_{3}^{t}\) and \(a_{4}^{t}\) from the sub-images excluding those of \(a_{1}^{t}\) and \(a_{2}^{t}\) to satisfy the conditions that \(l_{2}^{t} \triangleq a_{3}^{t} a_{4}^{t}\) is coplanar with \(l_{1}^{t}\), \(\left| {l_{2}^{t} } \right| > \frac{{\left| {l_{1}^{t} } \right|}}{4}\), and \(\angle \left( {l_{1}^{t} ,l_{2}^{t} } \right) > \frac{180^\circ }{{N_{{{\text{sub}}}} }}\). The intersection of \(l_{1}^{t}\) and \(l_{2}^{t}\) is \(e^{t}\). We then record the feature descriptor \({\varvec{d}}^{{\varvec{t}}}\), with \({\varvec{d}}^{{\varvec{t}}} \triangleq \left( {d_{1}^{t} ,d_{2}^{t} } \right)\) when the above three conditions hold and \({\varvec{d}}^{{\varvec{t}}} \triangleq \left( {d_{1}^{t} ,d_{3}^{t} } \right)\) otherwise, where \(d_{1}^{t} \triangleq \left( {a_{1}^{t} ,a_{2}^{t} } \right)\), \(d_{2}^{t} \triangleq \left( {a_{3}^{t} ,a_{4}^{t} ,e^{t} ,\angle \left( {l_{1}^{t} ,l_{2}^{t} } \right)} \right)\), and \(d_{3}^{t} \triangleq \left( {c_{{a_{1} }}^{t} ,c_{{a_{2} }}^{t} } \right)\).

  4. Matching feature descriptors: If the conditions in step 3 hold, we use Super4PCS [13] to match \({\varvec{d}}^{{\varvec{t}}}\) and \({\varvec{d}}^{{{\varvec{t}} - 1}}\); otherwise, we use 2PNS [15].

The process of the second, L-M step is similar to that of the first step, but the edge feature points are used instead, and the number of rows of the feature points in steps 1 and 2 is no longer limited (Fig. 10.2).

Fig. 10.2
figure 2

Schematic diagram of four-point base selection
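The rough transform obtained from the descriptor matching is then used to screen feature points before the L-M refinement. The sketch below applies (10.11) under the assumption that the correspondences \(q_i^{t-1}\) have already been found; the threshold eps is an assumed value, not one given in the chapter.

```python
import numpy as np

def split_inlier_outlier_features(F_t, Q_prev, R_rough, t_rough, eps=0.3):
    """Coarse-to-fine step of Sect. 3.4: feature points whose residual under
    the rough transform exceeds eps form F_out (10.11) and are excluded from
    the L-M refinement.  F_t, Q_prev: (n, 3) matched feature points at t and t-1."""
    residual = np.linalg.norm((F_t @ R_rough.T) + t_rough - Q_prev, axis=1)
    outlier_mask = residual > eps              # F_out, Eq. (10.11)
    return F_t[~outlier_mask], F_t[outlier_mask]
```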

(2) Weight compensation of feature points: In the L-M iterations, LeGo-LOAM gives each feature point a weight \(s_{i}\) that is inversely proportional to the distance between the Lidar and the point. The difference is that we also consider the influence of the point cloud holes left in the background by the occlusion of dynamic objects on the stability of the system. Therefore, we reduce the weight of the feature points within a certain range of a dynamic cluster.

$$ \widetilde{{s_{i} }} = s_{i} {*}\left( {1 - \frac{{R_{{{\text{hori}}}} }}{{\left| {\theta_{{{\text{yaw}}}} } \right|}}} \right) $$
(10.12)

where \(\widetilde{{s_{i} }}\) is the new feature weight and \(\theta_{{{\text{yaw}}}}\) is the minimum yaw difference between a feature point and a dynamic object.
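The following is a sketch of the weight compensation in (10.12). Computing theta_yaw by brute force over the yaw angles of the dynamic-cluster points, and clamping it to at least R_hori so that the weights stay non-negative, are our own assumptions.

```python
import numpy as np

def compensate_weights(s, feature_yaw_deg, dynamic_yaw_deg, res_hori=0.2):
    """Down-weight feature points close in yaw to a dynamic cluster, per (10.12).
    s: (n,) original L-M weights; feature_yaw_deg: (n,) yaw of each feature;
    dynamic_yaw_deg: (m,) yaw of dynamic-cluster points, all in degrees."""
    diff = np.abs(feature_yaw_deg[:, None] - dynamic_yaw_deg[None, :])
    diff = np.minimum(diff, 360.0 - diff)           # wrap-around yaw difference
    theta_yaw = diff.min(axis=1)                    # minimum yaw difference
    theta_yaw = np.maximum(theta_yaw, res_hori)     # keep weights non-negative
    return s * (1.0 - res_hori / theta_yaw)         # (10.12)
```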

3.5 Lidar Mapping

Let us review the mapping model \({\mathbb{G}}\) in (10.2), which uses \(M^{t - 1}\) and \({\varvec{P}}^{{\varvec{t}}}\) to build \(M^{t}\); the global map contains a large amount of feature information, and the earlier information contains less error. The Lidar mapping module further optimizes the pose estimation by matching \({\varvec{F}}^{{\varvec{t}}}\) with the submap \(M_{{{\text{sub}}}}^{t - 1}\) around the Lidar; please refer to [20] for the detailed matching and optimization procedures. We use the map storage method of LeGo-LOAM and store only the feature point sets \({\varvec{F}}^{{\varvec{t}}}\).

$$ M^{t - 1} \triangleq \left\{ {\underbrace {{\left\{ {{\varvec{F}}^{1} } \right\},{ }\left\{ {{\varvec{F}}^{2} } \right\}, \ldots ,\left\{ {{\varvec{F}}^{{{\varvec{t}} - {\varvec{k}}2}} } \right\}}}_{{M_{{{\text{hist}}}}^{t - 1} }}, \ldots ,\underbrace {{\left\{ {{\varvec{F}}^{{{\varvec{t}} - {\varvec{k}}}} } \right\}, \ldots ,\left\{ {{\varvec{F}}^{{\varvec{t}}} } \right\}}}_{{M_{{{\text{sub}}}}^{t - 1} }}} \right\} $$
(10.13)

In addition, an optional function is to further eliminate the error of the mapping module by detecting loop closures. In LeGo-LOAM, if corresponding features of \({\varvec{F}}^{{\varvec{t}}}\) can be found by ICP in the earlier map \(M_{{{\text{hist}}}}^{t - 1}\) or in recent features, a new constraint is added. The difference is that we again use the coarse-to-fine registration strategy of the odometry module to replace the single-step ICP.

Then, the estimated pose of the sensor is updated by sending the pose graph to the optimization system (e.g., [19]).
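A minimal sketch of the feature-only map storage of (10.13): the global map is kept as the ordered list of per-scan feature sets, the most recent k sets form the submap M_sub used for scan-to-map matching, and the older sets form M_hist used for loop closure detection. The window size k is an assumed parameter, not a value from the chapter.

```python
class FeatureMap:
    """Feature-only map storage following (10.13)."""

    def __init__(self, k=50):
        self.k = k                          # submap window size (assumed)
        self.history = []                   # ordered feature sets F^1 .. F^t

    def add(self, features_t):
        self.history.append(features_t)     # append F^t after each scan

    def submap(self):
        return self.history[-self.k:]       # M_sub: the k most recent scans

    def hist(self):
        return self.history[:-self.k]       # M_hist: older scans, loop closure
```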

4 Experiments

All experiments were performed on a laptop equipped with a 2.5 GHz Intel i7-4700 processor and 16 GB of RAM.

4.1 Feature Extraction

Figure 10.3 shows a frame of the point cloud and its feature points (in green) in scenario 1, which contains five moving pedestrians (circled in white boxes); Fig. 10.3a shows the incorrect feature extraction results of LeGo-LOAM. It can be seen that many feature points lie on unstable outliers belonging to the pedestrians, while our result (Fig. 10.3b) is more reasonable because we detect outliers and avoid extracting feature points from them, which prevents the incorrect odometry and mapping results caused by wrong feature matching. For conciseness, we show only the edge feature points, which are sufficient to illustrate the problem.

Fig. 10.3
figure 3

Feature extraction results of LeGo-LOAM and ours

The results show that our method has the advantage of effective feature extraction in scenes containing a large proportion of outliers (Table 10.1).

Table 10.1 Comparison of effective features ratio between LeGo-LOAM and ours

4.2 Odometry

Figure 10.4 shows the odometry comparison for scenario 1, where the red curve is the result of LeGO-LOAM and the blue one is ours. The results of the two methods differ little from a global perspective, but there are clear differences in detail. We choose the most representative location (the upper left corner of scenario 1, where pedestrians gather) to show the contrast. As can be seen from the green box, the red line has a distinct jagged edge, which is exactly the wrong odometry estimation caused by the outliers, while the blue line has no such problem.

Fig. 10.4
figure 4

Odometry comparison results in scenario 1 between the two methods, where the green box is enlarged to highlight the differences. Ours is smoother and more accurate than that of LeGo-LOAM

Figure 10.5 shows the comparison between the two methods and the ground truth in scenario 2; we can see from Fig. 10.5 that the accuracy is still maintained in this long-trajectory scene (with a length of about 3730 m). The relative pose estimation error is calculated by comparing the final pose with the initial pose. The rotation and translation errors of LeGo-LOAM are 5.65° and 3.19 m, while ours are 4.52° and 2.75 m. These two groups of experiments illustrate the advantage of our method in odometry estimation.

Fig. 10.5
figure 5

Odometry comparison results in scenario 2 between the two methods. Green box is enlarged to highlight the differences
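The relative pose error reported above can be computed as in the sketch below: the relative motion from the initial pose to the final pose is formed for both the estimate and the ground truth, the rotation error is the angle of the residual rotation, and the translation error is the norm of the residual translation. Representing poses as homogeneous 4 × 4 matrices is our assumption.

```python
import numpy as np

def relative_pose_error(T_est_start, T_est_end, T_gt_start, T_gt_end):
    """Rotation (deg) and translation (m) error between the estimated and
    ground-truth relative motion from the initial pose to the final pose."""
    rel_est = np.linalg.inv(T_est_start) @ T_est_end
    rel_gt = np.linalg.inv(T_gt_start) @ T_gt_end
    err = np.linalg.inv(rel_gt) @ rel_est                    # residual transform
    cos_angle = np.clip((np.trace(err[:3, :3]) - 1.0) / 2.0, -1.0, 1.0)
    rot_err_deg = np.degrees(np.arccos(cos_angle))
    trans_err = np.linalg.norm(err[:3, 3])
    return rot_err_deg, trans_err
```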

4.3 Mapping

The results of representative scenarios indicate that LeGo-LOAM produces incorrect mapping results because it does not consider outliers, as shown in Figs. 10.6, 10.7, 10.8, and 10.9.

Fig. 10.6
figure 6

Mapping result of scenario 1. a The result of LeGO-LOAM, in which the shadows left by outliers are clearly visible. b Our result: we obtain a clean map by identifying outliers and avoiding extracting feature points from them and mapping with them

Fig. 10.7
figure 7

Details comparison in scenario 1. a, b, and c are the results of LeGo-LOAM. d, e, and f are the results of our method. The white arrows highlight the differences

Fig. 10.8
figure 8

Mapping result of scenario 2. a The result of LeGO-LOAM. b The result of our method

Fig. 10.9
figure 9

Details comparison in scenario 2. a, b, and c are the results of LeGo-LOAM. d, e, and f are the results of our method. The white arrow highlights the differences

Figure 10.6 shows the mapping result of scenario 1, where Fig. 10.6a is the LeGO-LOAM mapping result. Figure 10.7 shows the details of the mapping result of scenario 1, where (a), (b), and (c) are the results of incorrect mapping caused by incorrect feature extraction: dynamic pedestrians and vehicles participate in feature extraction and association and are thus added to the map, leaving obvious residual shadows. Figure 10.7a is the enlarged result at the top of the map, where the residual shadow is the movement track left by four pedestrians walking forward. Figure 10.7b is a local enlargement on the left of the map, where the residual shadow is the track left by a vehicle moving upward. Figure 10.7c is a local enlargement of the lower right part of the map, where the residual shadow is left by a moving car. Different from LeGO-LOAM, we first detect and eliminate outliers and extract only valid features, so we have an obvious advantage when mapping in dynamic scenarios.

Figure 10.8 shows the mapping result of scenario 2. When we zoom in on the local map near the loop closure, as shown in Fig. 10.9, we can see obvious differences, as indicated by the white arrows: the buildings in the map of LeGO-LOAM are obviously offset, while ours remains globally consistent, which benefits from the two-step coarse-to-fine registration strategy.

4.4 Runtime

The running times of the odometry and mapping modules for our method and LeGo-LOAM are shown in Table 10.2. The time consumption of the odometry module is reduced by about 15%, and that of the mapping module by about 31%. The results demonstrate the advantage of our approach in terms of time consumption, which stems from replacing single-step registration with the two-step coarse-to-fine registration strategy.

Table 10.2 Runtime (ms) comparison of one frame between LeGo-LOAM and ours

5 Conclusions and Future Works

In this chapter, we integrated an outlier elimination method based on 4-point congruent sets and a coarse-to-fine registration strategy into a Lidar SLAM framework, improving the accuracy and robustness of odometry and mapping while reducing runtime. Future work includes further improving the accuracy of the SLAM system in complex and highly dynamic scenes, because we find that the proposed system still performs unsatisfactorily when many dense moving objects surround the Lidar; methods for the robot kidnapping problem may offer solutions to this issue. We will also try to deploy the SLAM system on multiple base stations so that cooperative operation can compensate for the severe occlusion of a single base station by moving objects. In addition, we will further combine the semantic and motion information of the scene, which will also provide a new idea for the construction of semantic maps.