1 Introduction

The capture of non-cooperative targets using autonomous robotic manipulators, as shown in Roe et al. (2004) and Ogilvie et al. (2008), presents a set of unique and interesting challenges as described in Edan (1995), Fumagalli et al. (2012) and Slaughter et al. (2008), not the least of which is the autonomy problem described in great detail in Liu and Chopra (2011) and Liang and Ma (2011). Mobile robotics brings together sensing, decision making, and acting, as discussed in Chung et al. (2011). In autonomous robotic tracking and capturing of a non-cooperative target, computer vision is used as the primary feedback sensor to control the pose of the end-effector with respect to the target, as seen in Cretual and Chaumette (2001), Fang et al. (2012) and Wong et al. (1996). Among vision-based control schemes, position-based visual servoing with an eye-in-hand camera configuration has the advantages of a precise view of the target and a natural relationship between the end-effector’s pose and the target in the workspace. However, the pose estimation in this configuration is prone to camera calibration errors, target model inaccuracies, image noise, and unexpected disturbances. Therefore, the focus of this study is the impact of pose estimation in visual servoing, where the relative pose between a camera and a target is used for real-time control of robot motion, attempted with limited success in Fang et al. (2011), Zhang et al. (2012) and Ghadyok et al. (2012). Furthermore, autonomous grasping is a critical task in a robotic capture operation, as shown in Wang and Xie (2009), Hsiao et al. (2011) and Ignakov et al. (2012), and introduces a change in the system kinematic and dynamic behavior that is not precisely known in advance.

To achieve the objective, an improved control scheme has been developed to obtain reliable pose and velocity estimates of a moving target from noisy image data using photogrammetry and Kalman filtering. The Kalman filter not only enhances the pose and velocity estimation from the noisy image data but, more importantly, provides smooth pose and rate estimates for the robotic control system when the vision system momentarily loses its tracking of the target. Once the target’s pose and velocity are determined, a robotic trajectory to intercept the target at a rendezvous point is planned and executed. The trajectory planning is subject to constraints such as the maximum allowed velocity of the end-effector and the physical limits of the workspace and the joint ranges. A decision making metric has been developed and examined for a robotic manipulator attempting to approach and capture a non-cooperative target, where five critical alignment offsets are identified based on the geometric character of the target and on existing work (two rotational offsets), as shown in Rekleitis et al. (2007). These offsets are then combined by a weighted summation into a composite alignment index measuring the total alignment of the end-effector with respect to the grasping point of the target. Individual weights are assigned depending on the stage of the capture operation in order to allow for a rapid approach, a high level of collision avoidance, and an increased rate of successful captures. The developed control scheme has been validated experimentally using a custom-built robotic manipulator to lock on, track, approach and capture a moving target.
The experimental results demonstrate successful captures of a moving non-cooperative target with the proposed control scheme.

2 Kinematics of robotic manipulator

Consider a Pieper-type robotic manipulator (see McKerrow 1991 for a description) in an eye-in-hand configuration (as defined in Fang et al. 2002), as shown in Fig. 1. The actuators consist of three stepper motors attached to the first three links to control the torso (\(\theta _{1}\)), shoulder (\(\theta _{2}\)), and elbow (\(\theta _{3}\)) joints, similar to the work of Schutter (1987). The orientation of the end-effector is controlled by two wrist joints actuated by two servo motors, namely, the wrist rotation (\(\theta _{4}\)) and wrist angle (\(\theta _{5}\)) joints. The last joint is a translational joint that controls the gripper. Since the gripper is activated only after the end-effector is aligned with the grasping point of the target, only the degrees of freedom (DOF) of the first five joints, which affect the pose of the end-effector, need to be considered in the robotic controller development, as shown in Kurniawati et al. (2012) and Klein and Huang (1983). The visual sensor in this work is a digital camera mounted next to the end-effector to monitor the target motion in the 3-dimensional (3D) workspace, similar to Lippiello et al. (2008). The camera is assumed calibrated, and its intrinsic and extrinsic parameters, such as the focal length, the physical size and resolution of the image sensor, and the transformation matrix between the camera and the end-effector, are known. However, the target can move in the 3D workspace and its pose and velocity are unknown in advance. Therefore, the robotic control problem can be defined as the following two tasks:

Fig. 1
figure 1

Annotated robotic manipulator

Task 1: Given a desired pose of the target in the workspace, either static or dynamic, design a proper set of joint trajectories for the manipulator such that the end-effector moves to the desired grasp position and orientation smoothly and in minimum time.

Task 2: Once the end-effector is in the vicinity of the target, design a proper set of joint inputs for the manipulator to grasp the stationary or moving target without collision.

Let \(\left\{ X_g\right\} \) and \(\left\{ X_e\right\} \) \( \in \mathfrak {R}^3\) be the end-effector position vectors defined in the global and local coordinate systems in the 3D workspace and \(\left\{ \theta \right\} = \left\{ \theta _1, \theta _2, \dots , \theta _5\right\} ^T \in \mathfrak {R}^5\) be the vector of corresponding joint angles defined in the joint space. The kinematics of a robotic manipulator defines the relationship between the joint positions and the corresponding position of the end-effector in the workspace, such that,

$$\begin{aligned} \begin{Bmatrix} X_{g} \end{Bmatrix} =[T_{ge}(\theta _1, \theta _2, \theta _3, \theta _4, \theta _5)] \begin{Bmatrix} X_{e} \end{Bmatrix} \end{aligned}$$
(1)

where \([T_{ge}]\) is the transformation matrix from local to global coordinate systems.

Similarly, the transformation from the camera frame \(\{X_c\}\) to the global stationary frame \(\{X_g\}\) can be expressed as,

$$\begin{aligned} \begin{Bmatrix} X_{g} \end{Bmatrix} =[T_{gc}(\theta _1, \theta _2, \theta _3, X_{c}, Y_{c}, Z_{c} )]\begin{Bmatrix} X_{c} \end{Bmatrix} \end{aligned}$$
(2)

where \((X_{c}, Y_{c}, Z_{c})\) are the coordinates of the origin of the camera frame in the frame \(\{X_e\}\).

Then, the velocity of the end-effector can be found by taking the first-order time derivative of (1),

$$\begin{aligned} \{\dot{X}_{g}\} = [J]\{\dot{\theta }\} \end{aligned}$$
(3)

where \([J]\) is the Jacobian matrix of the robotic manipulator.

For the manipulator path planning, the inverse kinematics gives the joint angle velocity \(\{\dot{\theta }\}\) in terms of the end-effector velocity,

$$\begin{aligned} \{\dot{\theta }\} = [\tilde{J}^{-1}]\{\dot{X}_{g}\} \end{aligned}$$
(4)

where \([\tilde{J}^{-1}]\) is the pseudo-inverse of the Jacobian matrix in (3) if the matrix is non-square,

$$\begin{aligned}{}[\tilde{J}^{-1}] = ([J]^T[J])^{-1}[J]^T \end{aligned}$$
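As a minimal numerical sketch of the inverse-velocity mapping in (4), assuming a Python/NumPy implementation, the joint rates can be obtained with the Moore–Penrose pseudo-inverse; the Jacobian entries below are hypothetical placeholders rather than the manipulator's actual Jacobian.

```python
import numpy as np

def joint_rates(J, xdot_g):
    """Map a desired end-effector velocity to joint rates, Eq. (4).

    np.linalg.pinv computes the Moore-Penrose pseudo-inverse, which reduces
    to ([J]^T [J])^-1 [J]^T when [J] has full column rank and remains well
    defined when the Jacobian is wide (redundant manipulator)."""
    return np.linalg.pinv(np.asarray(J, float)) @ np.asarray(xdot_g, float)

# Hypothetical 3x5 Jacobian (3D wrist velocity, five joints) and a desired
# end-effector velocity in the global frame.
J = np.random.default_rng(0).normal(size=(3, 5))
theta_dot = joint_rates(J, [0.2, -0.1, 0.05])   # joint rates
```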

3 Pose estimation

The pose information of the target in a 3D workspace is extracted from a 2D image space by photogrammetry (Fang et al. 2002; Mittal and Paragios 2004) based on the collinearity equations and a least squares solution. As shown in Fig. 2, the image coordinates of markers on the target can be established by collinearity equations, such as,

$$\begin{aligned} x_{c}=-f\frac{X_{tc}}{Y_{tc}} ,\quad z_{c}=-f\frac{Z_{tc}}{Y_{tc}} \end{aligned}$$
(5)

where (\(x_{c}\), \(z_{c}\)) are the 2D coordinates of the projected point on the camera’s image plane, (\(X_{tc}, Y_{tc}, Z_{tc}\)) are the spatial coordinates of the marker with respect to the camera coordinate system, and \(f\) is the focal length of the camera. Here, it is implied that the focal length is much less than the distance between the marker and the camera.

Fig. 2
figure 2

Photogrammetric frames of reference

The target point \((X_{tc}, Y_{tc}, Z_{tc})\) can be further decomposed into two parts:

$$\begin{aligned} \begin{Bmatrix} X_{tc} \\ Y_{tc} \\ Z_{tc} \end{Bmatrix} =\begin{bmatrix} T_{tc}\end{bmatrix} \begin{Bmatrix} X_{t} \\ Y_{t} \\ Z_{t} \end{Bmatrix} + \begin{Bmatrix} X_{T0} \\ Y_{T0} \\ Z_{T0} \end{Bmatrix} \end{aligned}$$
(6)

where \([T_{tc}]\) is the rotation matrix from the coordinate system attached to the target to the camera coordinate system, \((X_{t}, Y_{t}, Z_{t})\) are the coordinates of the marker in the local target coordinate system, and \((X_{T0}, Y_{T0}, Z_{T0})\) are the coordinates of the origin of the local target coordinate system in the camera coordinate system.

Substituting (6) into (5), we obtain:

$$\begin{aligned} \begin{Bmatrix} x_{c} \\ z_{c} \end{Bmatrix} = \begin{Bmatrix} - f\dfrac{r_{11}X_{t}+r_{12}Y_{t}+r_{13}Z_{t}+X_{T0}}{r_{21}X_{t}+r_{22}Y_{t}+r_{23}Z_{t} +Y_{T0}} \\ - f\dfrac{r_{31}X_{t}+r_{32}Y_{t}+r_{33}Z_{t}+Z_{T0}}{r_{21}X_{t}+r_{22}Y_{t}+r_{23}Z_{t} +Y_{T0}} \end{Bmatrix} \end{aligned}$$
(7)

where \(r_{ij}\) are functions of the Euler (roll–pitch–yaw) angles (\(\varOmega \), \(\varPhi \), \(\varTheta \)) of the target with respect to the camera’s image plane. These equations are highly non-linear and involve six unknowns: the three rotational angles (\(\varOmega \), \(\varPhi \), \(\varTheta \)) and the three Cartesian coordinates (\(X_{T0}, Y_{T0}, Z_{T0}\)), which together describe the orientation and position of the target relative to the camera coordinate system. In order to solve (7), a minimum of three markers is required. In this work, four markers are used to (i) avoid solution ambiguity and (ii) provide redundancy so that the loss of one target point can be tolerated during the tracking process. The resulting equations are solved by the least-squares method iteratively until the residual errors of the measurement satisfy a pre-set convergence criterion.
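One possible realization of this iterative least-squares solution is sketched below using SciPy; the marker layout, focal length, and roll–pitch–yaw rotation convention are illustrative assumptions and not the configuration actually used in the experiments.

```python
import numpy as np
from scipy.optimize import least_squares

def rot_rpy(omega, phi, theta):
    """Rotation matrix from roll-pitch-yaw angles (one possible convention)."""
    cr, sr = np.cos(omega), np.sin(omega)
    cp, sp = np.cos(phi), np.sin(phi)
    cy, sy = np.cos(theta), np.sin(theta)
    Rx = np.array([[1, 0, 0], [0, cr, -sr], [0, sr, cr]])
    Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])
    Rz = np.array([[cy, -sy, 0], [sy, cy, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

def residuals(pose, markers_t, image_pts, f):
    """Residuals of the collinearity equations (7) for all markers.
    pose = [Omega, Phi, Theta, X_T0, Y_T0, Z_T0]."""
    R = rot_rpy(*pose[:3])
    p_c = markers_t @ R.T + pose[3:]            # markers in camera frame, Eq. (6)
    x_pred = -f * p_c[:, 0] / p_c[:, 1]         # Eq. (5)
    z_pred = -f * p_c[:, 2] / p_c[:, 1]
    return np.concatenate([x_pred - image_pts[:, 0], z_pred - image_pts[:, 1]])

# Hypothetical four-marker pattern (target frame, cm) and focal length (cm)
markers_t = np.array([[0, 0, 0], [5, 0, 0], [0, 0, 5], [5, 0, 5]], float)
f = 0.4
# Synthetic measurements generated from an assumed "true" pose
true_pose = np.array([0.05, -0.02, 0.10, 1.0, 50.0, -2.0])
image_pts = np.column_stack(np.split(residuals(true_pose, markers_t, np.zeros((4, 2)), f), 2))

sol = least_squares(residuals, x0=np.array([0, 0, 0, 0, 40.0, 0]),
                    args=(markers_t, image_pts, f))
print(sol.x)   # recovered (Omega, Phi, Theta, X_T0, Y_T0, Z_T0), close to true_pose
```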

The photogrammetry algorithm is implemented together with the Open Source Computer Vision (OpenCV) library. The program feeds the coordinates of the markers on the image plane into the photogrammetry algorithm to calculate the 6-DOF pose of the target with respect to the camera. In order to improve the stability of the vision system, a Kalman filter provided by OpenCV runs in parallel with the photogrammetry algorithm to predict the positions of the markers in subsequent images in case the vision system loses tracking of the markers, see Bradski and Kaehler (2008). Testing of the photogrammetry algorithm was carried out with a webcam of one-megapixel resolution, and the results are shown in Table 1. The accuracy of the algorithm is high, with the lowest value reading 87.32 %, which at the capture distance translates to less than 0.9 cm along the Y axis of the camera frame. Once the pose of the target in the camera coordinate system is determined, it is transformed into the global coordinate system \((X_{g})\) by (2) as input to the robot controller.

Table 1 Test results of photogrammetry algorithm
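A minimal sketch of how such marker prediction can be set up with OpenCV's KalmanFilter class is given below; the constant-velocity state model, noise levels, and frame period are illustrative assumptions rather than the tuned values used in this work.

```python
import numpy as np
import cv2

def make_marker_filter(dt=1.0 / 30.0):
    """Constant-velocity Kalman filter for one marker's (u, v) image position."""
    kf = cv2.KalmanFilter(4, 2)                       # state: [u, v, du, dv]
    kf.transitionMatrix = np.array([[1, 0, dt, 0],
                                    [0, 1, 0, dt],
                                    [0, 0, 1, 0],
                                    [0, 0, 0, 1]], np.float32)
    kf.measurementMatrix = np.array([[1, 0, 0, 0],
                                     [0, 1, 0, 0]], np.float32)
    kf.processNoiseCov = 1e-3 * np.eye(4, dtype=np.float32)
    kf.measurementNoiseCov = 1e-1 * np.eye(2, dtype=np.float32)
    kf.errorCovPost = np.eye(4, dtype=np.float32)     # initial state uncertainty
    return kf

kf = make_marker_filter()
for uv in [(320, 240), (322, 241), None, (326, 243)]:   # None = marker not detected
    predicted = kf.predict()                 # prior estimate of the marker position
    if uv is not None:
        kf.correct(np.array([[uv[0]], [uv[1]]], np.float32))
    # when uv is None, predicted[:2] is passed to the photogrammetry module instead
```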

4 Kalman filter enhanced target tracking

In eye-in-hand visual servoing, the pose estimated by photogrammetry is noisy and sensitive to the residual vibrations of the manipulator linkages caused by the inherent mechanical flexibility and by unexpected disturbances. To avoid instability of the robot control, the output of the photogrammetry is processed by a Kalman filter to obtain the most stable and smooth pose estimate consistent with the available knowledge of the system. In order to accomplish this, an unscented Kalman filter is employed with the state space shown in Table 2.

Table 2 Kalman filter state space

Let \(\{x\}\) be the state vector of the target, comprising the pose and its velocity and acceleration with respect to the camera coordinate system:

$$\begin{aligned} \{x\} = \{T_x, \dot{T}_x, \ddot{T}_x, T_y, \dot{T}_y, \ddot{T}_y, T_z, \dot{T}_z, \ddot{T}_z, \varTheta , \dot{\varTheta }, \ddot{\varTheta }, \varPhi , \dot{\varPhi }, \ddot{\varPhi }, \varOmega , \dot{\varOmega }, \ddot{\varOmega }\} \end{aligned}$$
(8)

Then, the Kalman filter model of the target in a discrete time form can be defined as follows

$$\begin{aligned} \{x\}_{k+1} = [A]\{x\}_k + [B]\{w\}_k \end{aligned}$$
(9)

where the subscript (\(k+1\)) denotes the states at time step \(k+1\), \([A]\) is the state transition matrix, and \([B]\) is the disturbance transition matrix associated with the process noise vector \(\{w\}_k\).

The \(18 \times 18\) transition matrix \([A]\) is block diagonal, constructed from six identical 3\(\times \)3 sub-matrices of the form,

$$\begin{aligned}{}[a] = \begin{bmatrix} 1&dt&dt^{2}/2 \\ 0&1&dt \\ 0&0&1 \end{bmatrix} \end{aligned}$$

while the disturbance transition matrix \([B]\) is an 18\(\times \)6 sparse matrix with the following non-zero elements:

$$\begin{aligned} B_{3(i-1)+1, i} = \frac{dt^3}{6},\quad B_{3(i-1)+2, i} = \frac{dt^2}{2},\quad B_{3(i-1)+3, i} = dt \end{aligned}$$

Here, \(dt\) is the sampling period and \(i=1,2,\dots , 6\). The process noise vector \(\{w\}_k\) contains the jerks of the target and is assumed to obey a zero-mean white Gaussian distribution with its covariance, \([Q]_k\),

$$\begin{aligned} \{w\}_k = \{\dddot{T}_x, \dddot{T}_y, \dddot{T}_z, \dddot{\varTheta }, \dddot{\varPhi }, \dddot{\varOmega } \}^T \sim N(0,[Q]_k) \end{aligned}$$
(10)

Generally, the process noise covariance matrix \([Q]\) is difficult to determine in advance because of the non-cooperative nature of the target and the unknown motion of the camera. In the current work, the following constant process noise covariance matrix was found to work well after tuning the Kalman filter in experiments,

$$\begin{aligned}{}[Q] = \begin{bmatrix} 5&\quad 0&\quad 0&\quad 0&\quad 0&\quad 0 \\ 0&\quad 5&\quad 0&\quad 0&\quad 0&\quad 0 \\ 0&\quad 0&\quad 5&\quad 0&\quad 0&\quad 0 \\ 0&\quad 0&\quad 0&\quad 5&\quad 0&\quad 0 \\ 0&\quad 0&\quad 0&\quad 0&\quad 5&\quad 0 \\ 0&\quad 0&\quad 0&\quad 0&\quad 0&\quad 5 \\ \end{bmatrix} \times 10^{-6} \end{aligned}$$
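For reference, a compact way to assemble \([A]\), \([B]\), and this \([Q]\) is sketched below (NumPy assumed; the sampling period shown is an illustrative value, not the actual vision update rate).

```python
import numpy as np

def process_model(dt):
    """Assemble the 18x18 [A], the 18x6 [B], and the tuned 6x6 [Q]."""
    a = np.array([[1.0, dt, dt**2 / 2],
                  [0.0, 1.0, dt],
                  [0.0, 0.0, 1.0]])
    A = np.kron(np.eye(6), a)              # six identical 3x3 blocks on the diagonal
    B = np.zeros((18, 6))
    for i in range(6):                     # non-zero elements of [B]
        B[3 * i:3 * i + 3, i] = [dt**3 / 6, dt**2 / 2, dt]
    Q = 5e-6 * np.eye(6)                   # constant process noise covariance
    return A, B, Q

A, B, Q = process_model(dt=1.0 / 15.0)     # dt: vision sampling period (assumed value)
```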

The measurement model of the Kalman filter is defined as

$$\begin{aligned} \{z\}_{k+1} = [H]\{x\}_{k+1}+\{v\}_{k+1} \end{aligned}$$
(11)

where \(\{z\}_{k+1} = \{T_x, T_y, T_z, \varTheta , \varPhi , \varOmega \}^T\) is the measurement vector defined in Table 3, which is the output of the photogrammetry module. The matrix \([H]\) is a 6\(\times \)18 sparse matrix with the following non-zero elements:

$$\begin{aligned} H_{1,1} = H_{2,4} = H_{3,7} = H_{4,10} = H_{5,13} = H_{6,16} = 1 \end{aligned}$$
(12)

while the 6\(\times \)1 vector \(\{v\}_k\) is the measurement noise and obeys a zero-mean Gaussian distribution with its covariance \([R]_k\),

$$\begin{aligned} \{v\}_k \sim N(0, [R]_k) \end{aligned}$$
(13)

The measurement noise covariance matrix \([R]_k\) is determined by tuning with real data of the residual measurement errors from the photogrammetry algorithm in experiments using a non-linear regression analysis. For the given marker pattern on the target and the vision configuration in the current study, it is found that the following constant matrix is a good starting point for the Kalman filter,

$$\begin{aligned}{}[R] = \begin{bmatrix} 4&\quad 0&\quad 0&\quad 0&\quad 0&\quad 0 \\ 0&\quad 2&\quad 0&\quad 0&\quad 0&\quad 0 \\ 0&\quad 0&\quad 4&\quad 0&\quad 0&\quad 0 \\ 0&\quad 0&\quad 0&\quad 3&\quad 0&\quad 0 \\ 0&\quad 0&\quad 0&\quad 0&\quad 1&\quad 0 \\ 0&\quad 0&\quad 0&\quad 0&\quad 0&\quad 1 \\ \end{bmatrix} \times 10^{-6} \end{aligned}$$

After the vision system starts, the \([R]\) matrix can be updated based on the output of the photogrammetry algorithm if needed.

Table 3 Kalman filter measurement space

The Kalman filter defined in this way is decoupled from the photogrammetry process to facilitate an easy implementation. Following the standard procedure, the Kalman filter recursively runs in two major steps as follows (a compact implementation sketch is given after the equations):

  1. (1)

    Time update

    $$\begin{aligned} \{\hat{x}\}_{k+1,k}&= [A]\{\hat{x}\}_{k,k}\end{aligned}$$
    (14)
    $$\begin{aligned} \left[ P\right] _{k+1,k}&= [A][P]_{k,k}[A]^T + [Q]_k \end{aligned}$$
    (15)
  2. (2)

    Measurement update

    $$\begin{aligned} \{\hat{x}\}_{k+1,k+1}&= \{\hat{x}\}_{k+1,k}+[K_g]_{k+1}(\{z\}_{k+1}- \left[ H\right] \{\hat{x}\}_{k+1,k})\nonumber \\ \end{aligned}$$
    (16)
    $$\begin{aligned} \left[ P\right] _{k+1,k+1}&= [P]_{k+1,k}-[K_g]_{k+1}[H][P]_{k+1,k} \end{aligned}$$
    (17)

    where the matrix \([K_g]\) is the Kalman gain at the time step \(k+1\):

    $$\begin{aligned}{}[K_g]_{k+1}=[P]_{k+1,k}[H]_{k+1}^T([H]_{k+1}[P]_{k+1,k}[H]_{k+1}^T+ [R]_{k+1})^{-1} \end{aligned}$$
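A compact sketch of this recursion is given below (NumPy assumed). Note that, since \([Q]\) is the 6\(\times \)6 covariance of the jerk vector, it enters the 18-dimensional covariance propagation through \([B]\), i.e. as \([B][Q][B]^T\) in (15); the measurement matrix is assembled from (12).

```python
import numpy as np

# Measurement matrix [H]: selects the six pose states from the 18-state vector, Eq. (12)
H = np.zeros((6, 18))
H[np.arange(6), np.arange(0, 18, 3)] = 1.0

def kalman_step(x_hat, P, z, A, B, H, Q, R):
    """One recursion: time update (14)-(15) and measurement update (16)-(17)."""
    # Time update
    x_pred = A @ x_hat                               # Eq. (14)
    P_pred = A @ P @ A.T + B @ Q @ B.T               # Eq. (15), [Q] mapped through [B]
    # Measurement update
    S = H @ P_pred @ H.T + R                         # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)              # Kalman gain
    x_new = x_pred + K @ (z - H @ x_pred)            # Eq. (16)
    P_new = P_pred - K @ H @ P_pred                  # Eq. (17)
    return x_new, P_new
```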

5 Motion predictive control in autonomous robotic capture

The motion predictive control in autonomous robotic capture can be generally divided into two phases: (i) the tracking and approaching, and (ii) the capture.

5.1 Tracking and approaching

The target tracking and approaching involves identifying, locking on, and following a static or dynamic target in the 3D workspace by controlling the joint positions in the joint space. This includes the planning of the trajectory for the robot joint positions, the wrist position, and the orientation of the end-effector to optimize the capture potential. Assuming there are no obstacles between the end-effector and the target, a direct path is planned in the joint space, instead of the workspace, in order to minimize the time taken for the end-effector to approach and align with the target. It should be noted that this may result in a curved trajectory of the wrist in the workspace.

Let us define the state errors in the position-based visual servoing by the relative position error of the wrist, \(\{e\}\), and the orientation error, \(\{\Phi \}\) of the gripper, such that,

$$\begin{aligned} \{e\}&= \{X_g\}-\{X_g^d\}=\left\{ \begin{array}{c} X_g \\ Y_g \\ Z_g \end{array} \right\} - \left\{ \begin{array}{c} X_g^d \\ Y_g^d \\ Z_g^d \end{array} \right\} \end{aligned}$$
(18)
$$\begin{aligned} \{\Phi \}&= \{\alpha _e\}-\{\alpha _e^d\}=\left\{ \begin{array}{c} \alpha _x \\ \alpha _y \\ \alpha _z \end{array} \right\} - \left\{ \begin{array}{c} \alpha _x^d \\ \alpha _y^d \\ \alpha _z^d \end{array} \right\} \end{aligned}$$
(19)

where \(\{X_g\}\) and \(\{ X_g^d\}\) are the current and desired wrist positions in the global frame, and \(\{\alpha _e\}\) and \(\{\alpha _e^d\}\) are the current and desired gripper orientations in the local coordinate system \(\{X_e\}\).

Correspondingly, the velocity errors can be written as

$$\begin{aligned} \{\dot{e}\} = \{\dot{X}_g\}-\{\dot{X}_g^d\}; \{\dot{\Phi }\} = \{\dot{\alpha }_e\}-\{\dot{\alpha }_e^d\} \end{aligned}$$
(20)

The desired wrist pose is obtained by the vision system via the photogrammetry algorithm and the Kalman filter.

For the dynamic tracking and approaching, the desired pose has to be modified so that the end-effector will approach the intercept position with proper orientation instead of the current position and orientation. This can be achieved by replacing the desired pose in (18) and (19) with the following,

$$\begin{aligned} \{\tilde{X}_g^d\}&= \{X_g^d\} + \{\dot{X}_g^d\}(\Delta T + t_0)\end{aligned}$$
(21)
$$\begin{aligned} \{\tilde{\alpha }_e^d\}&= \{\alpha _e^d\}+\{\dot{\alpha }_e^d\}(\Delta T + t_0) \end{aligned}$$
(22)

where \(t_0\) is the time interval between two updates of the computer vision system, \(\Delta T = ||\{e\}||/ ||\{ \dot{X}_{g}^{max} \} ||\) is the nominal shortest remaining time from the current position to the target position, \(\{\dot{X}_{g}^{max}\}\) is the maximum velocity limit imposed on the end-effector, and the velocities of the desired position \(\{\dot{X}_g^d\}\) and orientation \(\{\dot{\alpha }_e^d\}\) are estimated by the Kalman filter. As the end-effector approaches the target, \(\Delta T\) becomes smaller and the estimate of the intercept pose becomes more accurate.
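A direct transcription of (21) and (22) is given below (NumPy assumed; the argument names are illustrative).

```python
import numpy as np

def intercept_pose(x_d, xdot_d, alpha_d, alphadot_d, e, xdot_max, t0):
    """Predicted intercept position and orientation, Eqs. (21)-(22).

    x_d, xdot_d         : desired wrist position and its velocity (Kalman filter output)
    alpha_d, alphadot_d : desired gripper orientation and its rate
    e                   : current wrist position error {e}
    xdot_max            : maximum end-effector velocity limit
    t0                  : time interval between two vision updates
    """
    dT = np.linalg.norm(e) / np.linalg.norm(xdot_max)            # nominal remaining time
    x_tilde = np.asarray(x_d) + np.asarray(xdot_d) * (dT + t0)             # Eq. (21)
    alpha_tilde = np.asarray(alpha_d) + np.asarray(alphadot_d) * (dT + t0)  # Eq. (22)
    return x_tilde, alpha_tilde
```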

Thus, the error in the joint space consists of two parts, resulting from the position and orientation errors,

$$\begin{aligned} \Delta \{ \theta \} \approx [\tilde{J}^{-1}] \{e\} + [T_{\Phi }] \{ \Phi \};\quad \Delta \{\dot{ \theta } \} \approx [\tilde{J}^{-1}] \{\dot{e}\} + [T_{\Phi }] \{ \dot{\Phi } \} \end{aligned}$$
(23)

where \([T_{\Phi }]\) is the transformation matrix taking the angular misalignments from the local coordinate system \(\{X_e\}\) to the joint space.

The path tracking of the end-effector will be achieved by a proportional and derivative controller due to its simplicity and reliability, such that,

$$\begin{aligned} \Delta \{\theta _{c}\} = K_p \Delta \{\theta \} + K_d\Delta \{\dot{\theta }\} \end{aligned}$$
(24)

where (\(K_p\), \(K_d\)) are the gains of proportional and derivative control, subject to the constraint of the maximum velocity of the end-effector,

$$\begin{aligned} \{\dot{\theta }\} \approx [\tilde{J}^{- 1}]\{\dot{X}_{g}^{d}\}+[T_\Phi ]\{\dot{\Phi }\} \le [\tilde{J}^{-1}]\{\dot{X}_{g}^{max}\} \end{aligned}$$
(25)
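A sketch of the resulting joint-space PD step is shown below (NumPy assumed). The velocity constraint of (25) is enforced here by clipping the joint-rate error term to the bound \([\tilde{J}^{-1}]\{\dot{X}_{g}^{max}\}\), which is one possible way of applying the limit; the exact mechanism is not prescribed by the equations above.

```python
import numpy as np

def pd_joint_command(J, T_phi, e, e_dot, Phi, Phi_dot, Kp, Kd, xdot_max):
    """Joint-space PD command from Eqs. (23)-(24) with the limit of Eq. (25)."""
    J_pinv = np.linalg.pinv(J)
    d_theta = J_pinv @ e + T_phi @ Phi                    # Eq. (23), position part
    d_theta_dot = J_pinv @ e_dot + T_phi @ Phi_dot        # Eq. (23), rate part
    theta_dot_max = np.abs(J_pinv @ xdot_max)             # joint-rate bound, Eq. (25)
    d_theta_dot = np.clip(d_theta_dot, -theta_dot_max, theta_dot_max)
    return Kp * d_theta + Kd * d_theta_dot                # Eq. (24)
```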

5.2 Capture

Once the end-effector is close to the target and prepares for a capture, a metric is defined that takes into account the misalignments and distances between the gripper and the grasping point, both to transition between phases and to decide when to effect a capture. The individual measures are modularized so that their weightings can be scaled throughout the individual phases of the operation. They are as follows:

$$\begin{aligned} M_1&= \sqrt{\Delta X_e^2 + \Delta Y_e^2 + \Delta Z_e^2} \end{aligned}$$
(26)
$$\begin{aligned} M_2&= \sqrt{\Delta Y_e^2} \end{aligned}$$
(27)
$$\begin{aligned} M_3&= \sqrt{(\alpha _x - \alpha _x^d)^2 + (\alpha _y - \alpha _y^d)^2 + (\alpha _z -\alpha _z^d)^2} \end{aligned}$$
(28)
$$\begin{aligned} M_4&= Y_e \tan (\alpha _x - \alpha _x^d)+Z_e \end{aligned}$$
(29)
$$\begin{aligned} M_5&= Y_e \tan (\alpha _z - \alpha _z^d) + X_e \end{aligned}$$
(30)

where \(M_1\) represents the misalignment in the non-critical total distance, \(M_2\) the critical horizontal distance, \(M_3\) the total orientation misalignment, \(M_4\) and \(M_5\) the horizontal and vertical orientations of the end-effector, respectively.

These five offsets are then combined into a composite alignment index for the total alignment measurement of the end-effector by a weighted sum,

$$\begin{aligned} h = \sum _{i=1}^{5} w_i M_i \end{aligned}$$
(31)

where \(w_i\) are the weights that are tuned in the experiment.

The index is further conditioned by a sliding average to define a threshold logic function (TL) to control the capture action, such that,

$$\begin{aligned} TL = {\left\{ \begin{array}{ll} j < 100 &{} \text {Initialization}\qquad \, \text {Approach}\\ j \ge 100 &{} {\left\{ \begin{array}{ll} 500<h &{} \text {Approach} \\ 85<h\le 500 &{} \text {Approach and Alignment} \\ h \le 85 &{} \text {Grasp} \end{array}\right. } \end{array}\right. } \end{aligned}$$
(32)

where \(j\) is the number of vision measurements. Once the approach and alignment phase (\(85<h\le 500\)) is initiated, the weight for \(M_2\) is increased to ensure that a collision with the target does not occur.
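The decision metric and threshold logic can be summarized as follows (NumPy sketch; the argument names and the assumed ordering of the orientation angles \((\alpha _x, \alpha _y, \alpha _z)\) are for illustration, while the numeric thresholds are those of (32)).

```python
import numpy as np

def alignment_index(p_e, p_e_d, alpha, alpha_d, weights):
    """Composite alignment index h of Eq. (31) from the offsets (26)-(30).

    p_e, p_e_d     : current and desired grasp-point position in the end-effector frame
    alpha, alpha_d : current and desired orientation angles (alpha_x, alpha_y, alpha_z)
    weights        : the five weights w_1..w_5, tuned per capture phase
    """
    dX, dY, dZ = np.subtract(p_e, p_e_d)
    X_e, Y_e, Z_e = p_e
    da = np.subtract(alpha, alpha_d)
    M = np.array([
        np.sqrt(dX**2 + dY**2 + dZ**2),        # M1: total distance
        abs(dY),                               # M2: critical horizontal distance
        np.linalg.norm(da),                    # M3: total orientation misalignment
        Y_e * np.tan(da[0]) + Z_e,             # M4: horizontal orientation offset
        Y_e * np.tan(da[2]) + X_e,             # M5: vertical orientation offset
    ])
    return float(np.dot(weights, M))

def capture_decision(h, j):
    """Threshold logic TL of Eq. (32); j is the number of vision measurements."""
    if j < 100:
        return "Approach"                      # initialization window
    if h > 500:
        return "Approach"
    if 85 < h <= 500:
        return "Approach and Alignment"
    return "Grasp"
```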

6 Experimental validation

6.1 Experimental set-up

The experimental validation was carried out with the use of a custom designed, constructed, and programmed robotic manipulator as shown in Fig. 3. The experimental set-up employed a visually low-noise target, shown in Fig. 4, with four non-collinear markers in a pattern known and hard-coded into the photogrammetry algorithm. The pattern does not require any specific shape or position, only a high contrast.

Fig. 3
figure 3

Robotic manipulator

Fig. 4
figure 4

Close up of target and pattern

The motion of the target is generated through the use of a single motor (referred to as the target motor in the following) that can run at different velocities. In addition, a pendulum anchor point can be moved, generating a modifiable arc motion for the target. An example of this motion is shown in Fig. 5 along with the robotic manipulator in the foreground. By varying the anchor points and keeping the knowledge of the path separate from the manipulator controller, we can generate several distinct capture scenarios for testing. Two scenarios were tested in this work, classified by the target velocity at capture as low and high velocity captures, respectively. The upper limit for the low velocity capture was determined by increasing the velocity of the target motor until the point where, without motion prediction, the robotic manipulator was unable to track and capture the target. This velocity was established as \(0.27\, cm/s\). The high velocity capture was performed at twice this velocity, namely, \(0.54\, cm/s\).

Fig. 5
figure 5

Experimental set-up for validation

The capture follows several phases that are defined as follows.

  1. 1.

    Target acquisition

    1. (a)

      Markers finding

    2. (b)

      Target locking on and tracking

  2. 2.

    Approach

    1. (a)

      High speed approach to the vicinity of target

    2. (b)

      Collision avoidance approach before capture

  3. 3.

    Capture

    1. (a)

      Verify misalignments

    2. (b)

      Verify weighted summation

    3. (c)

      Activate capture

The transitions between the phases are a critical component of the operation and require an autonomous solution capable of actively sensing the distance, the position, and the misalignment, and of deciding when the optimal transitions should occur. This is illustrated in the following section.

6.2 Results and discussion

The experimental results of autonomous captures of a non-cooperative target are presented here to illustrate the desired and current joint positions of the torso (\(\theta _1\)), the shoulder (\(\theta _2\)), and the elbow (\(\theta _3\)). The capture operation is examined in detail as follows: the target is acquired in phase 1 and the predictive algorithm begins to output desired values for the joint angles. Phase 2 denotes the beginning of the tracking and the introduction of the actual joint positions. Phase 2a is the fast approach, marked by rapid changes in joint positions and large oscillations in the motion. Phase 2b is the near-range approach, characterized by slow changes in joint positions and small oscillations in the motion due to an increased care in collision avoidance. Phase 3 is activated once the metric of misalignments defined in Sect. 5.2 drops below the threshold value.

6.2.1 Low velocity captures

Low velocity captures refer to the capture operation without predicting the target motion by (21) and (22) in the controller. The target was moving at a linear velocity of \(0.27\, cm/s\). Two cases were conducted to compare the impact of the Kalman filter on the capture performance, and the results are shown in Figs. 6 and 7. The differentiating factor between the two operations is that the Kalman filter was employed only in the second operation, shown in Fig. 7, which resulted in a much faster capture with smaller errors and fewer overshoots in joint positions. In terms of phase transitions, both operations perform similarly.

Fig. 6
figure 6

Typical low velocity capture operation with the phases outlined and identified. Without Kalman filter

Fig. 7
figure 7

Typical low velocity capture operation with the phases outlined and identified. With Kalman filter

As shown in Figs. 6 and 7, Phase 1 was dominated by the vision system establishing a lock on the target and resolving the inverse kinematics. The Kalman filter reduced the time required to perform this phase significantly. Phase 2a emphasized the approaching speed, which resulted in a rapid motion of the end-effector of the robotic manipulator. This can be best seen in the high velocity captures in the following section. Phase 2b started in close proximity to the target and was executed at a slower end-effector velocity, emphasizing collision avoidance between the end-effector and the target. Of particular importance in this phase is to avoid any perturbation of the target by the end-effector. This is because, at close proximity, the markers employed to locate the target in the 3D workspace take up a large percentage of the field of view of the camera, and a quick acceleration of the target may result in the loss of lock on the markers and necessitate aborting the capture operation. Phase 3 is significant only in the sense that it marks the capture of the target and the beginning of a new control regime aimed at decelerating the target, which will be a separate research subject. Table 4 provides a synopsis of the times at which these events occur for the low velocity capture operations. The Kalman filter significantly enhanced the capture performance by shortening the operating time for a fast capture operation.

Table 4 Low velocity capture synopsis

6.2.2 High velocity captures

The high velocity captures were executed when the target moved at the higher velocity of \(0.54\, cm/s\). The target motion prediction in (21) and (22) must be included in the controller in this case. In addition, the higher velocity leads to new challenges, including a reduced time window to effect a capture due to the limited time the target spends within the operational field of the robotic manipulator. Furthermore, higher velocities incur larger errors in pose estimation and higher chances of collision due to the system delay. Again, comparisons between the cases with and without the Kalman filter were performed.

Figure 8 shows the capture operation without using the Kalman filter; it exhibits fairly similar trends to the low velocity captures from the previous section. The velocity slopes are steeper due to the faster joint speeds, as would be expected.

Fig. 8
figure 8

Capture operation: Typical high velocity capture operation with the phases outlined and identified. Without Kalman filter

To closely examine the behavior of the autonomous controller, we consider a separate high velocity capture with the Kalman filter, shown in Fig. 9, and its companion graph displaying the errors, shown in Fig. 10. The joint angle errors shown in Fig. 10 make it easy to distinguish the thresholds that trigger the different phases of the capture operation. In Phase 1, the vision system only identifies and locks on the target; no joint actuation is activated. Phase 2a is characterized by the sharp reduction in the joint angle errors, which corresponds to the fast approach of the end-effector to the target. Once in close proximity to the target, the joint angle errors are reduced significantly and Phase 2b starts, lasting until a successful capture is performed.

Fig. 9
figure 9

Typical high velocity capture operation with the phases outlined and identified. With Kalman filter

Fig. 10
figure 10

Typical tracking errors in high velocity capture with the phases outlined and identified. With Kalman filter

The lines in Fig. 10 represent the joint errors. In particular, the \(\theta _1\) behaviour clearly illustrates the distinct behaviour profiles of phase 2a and phase 2b. In phase 2a, the motion is jagged and indicates rapid changes in joint positions that correspond to a rapid motion of the end-effector of the robotic manipulator. Once the error value crosses into the near range operation mode, phase 2b, a much smoother curve emerges, which signifies close proximity operations. This type of smooth phase transition provides an intuitive and bio-mimetic style of capture that increases capture reliability while reducing collisions and jarring events. Table 5 provides a synopsis of the transitional times for the high velocity captures.

Table 5 High velocity capture synopsis

7 Conclusion

This paper investigated the autonomous capture of a non-cooperative target by a robotic manipulator using visual servoing and Kalman filter enhanced motion predictive control. The Kalman filter is used to enhance the real-time state and pose estimation of the target from the noisy image data and the trajectory planning for the dynamic capture of a non-cooperative target. The control of the capture process has been divided into different phases, and a transitional decision making process has been developed to guide the robotic manipulator between the different phases of the capture operation using a custom metric that translates visual misalignments between the end-effector and the target into a guidance measurement. A custom 6-DOF Pieper-type robotic manipulator with an eye-in-hand configuration was constructed to test and validate the performance of the proposed visual servoing, Kalman filter enhanced motion predictive control scheme for the autonomous capturing task, and to demonstrate the robustness of the proposed control scheme in the presence of noise and unexpected disturbances in the vision system and constraints on the manipulator operation in real environments. The experimental results demonstrated a successful phase transitioning strategy and capture operation. They also demonstrate the feasibility and applicability of the proposed control scheme.