Comparison of different pseudo-linear estimators for vision-based target motion estimation

Ning, Zian; Zhang, Yin; Zhao, Shiyu

doi:10.1007/s11768-023-00161-y

Comparison of different pseudo-linear estimators for vision-based target motion estimation

Research Article
Published: 04 September 2023

Volume 21, pages 448–457, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Control Theory and Technology Aims and scope Submit manuscript

Comparison of different pseudo-linear estimators for vision-based target motion estimation

Download PDF

Zian Ning^1,2,
Yin Zhang² &
Shiyu Zhao²

388 Accesses
1 Citation
Explore all metrics

Abstract

Vision-based target motion estimation based Kalman filtering or least-squares estimators is an important problem in many tasks such as vision-based swarming or vision-based target pursuit. In this paper, we focus on a problem that is very specific yet we believe important. That is, from the vision measurements, we can formulate various measurements. Which and how the measurements should be used? These problems are very fundamental, but we notice that practitioners usually do not pay special attention to them and often make mistakes. Motivated by this, we formulate three pseudo-linear measurements based on the bearing and angle measurements, which are standard vision measurements that can be obtained. Different estimators based on Kalman filtering and least-squares estimation are established and compared based on numerical experiments. It is revealed that correctly analyzing the covariance noises is critical for the Kalman filtering-based estimators. When the variance of the original measurement noise is unknown, the pseudo-linear least-squares estimator that has the smallest magnitude of the transformed noise can be a good choice.

Performance analysis of α- β- γtracking filters using position and velocity measurements

Article Open access 09 April 2015

Three-Dimensional Bearings-Only Target Tracking: Comparison of Few Sigma Point Kalman Filters

Extended least square unbiased FIR filter for target tracking using the constant velocity motion model

Article 19 January 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The work in this paper is motivated by the aerial target pursuit task, where one micro aerial vehicle (MAV) uses its onboard camera to detect, localize, and then pursue another flying MAV [1, 2]. The study of this task can be potentially applied to the defence of malicious MAVs. In particular, while MAVs have been applied in many domains nowadays such as video shooting, surveillance, and inspection [3], misused ones have caused serious security problems. Aerial target pursuit may provide a promising technical solution that is complementary to the existing countermeasures [1, 4].

One key problem in the aerial target pursuit task is to use vision measurements to estimate the motion of the target. This problem is the prerequisite for subsequent tasks such as pursuit control. The existing approaches for vision-based motion estimation can be classified into two categories.

In the first category, the vision measurement is modelled as a bearing-only vector that points from the camera to the target. Using bearing-only vectors to estimate the target’s motion is usually called bearing-only target motion estimation [2, 5, 6]. One fundamental problem about bearing-only estimation is observability. In particular, since the depth/distance of the target cannot be measured directly, it is necessary to ensure proper relative motion between the camera and the target to recover the target’s motion. In fact, the camera’s motion must have high-order components than the target’s motion [5]. Otherwise, the target’s motion is unobservable.

In the second category, in addition to the bearing vector, other vision measurements are also assumed to be available [1, 7]. For example, if the exact geometric information of the target is known, its relative position and attitude can be recovered by using a pose estimation algorithm [1]. If the target can be approximated as a sphere and the diameter of the sphere is known, then the distance of the target can be recovered based on the pin-hole camera model [8].

The advantage of the bearing-only approach is that it does not require knowing the size of the target. As long as the target can be recognized by a vision algorithm, its bearing can be readily obtained from the pixel coordinate of the target in the image. However, the disadvantage is that it requires that the camera’s motion must be of higher order than the target’s motion. This requirement may be difficult to achieve in practice. In this case, it is favourable to directly estimate the target’s distance from the size information of the target. The size information may be available in practice. For example, a vision algorithm may not only be able to detect an MAV but also can classify its model. When we know it is a specific type of commercial MAV, then the size information can be known. In addition, the size information is available in cooperative tasks such as vision-based formation control since all the robots are known in advance [9, 10].

Once we have obtained vision measurements, we can readily establish an estimator such as a Kalman filter to estimate the target’s motion. It is notable that vision measurements such as bearing vectors are highly nonlinear functions of the target’s motion. The conventional extended Kalman filter shows unstable performances when handling bearing measurements [11]. Therefore, pseudo-linear measurements and pseudo-linear Kalman filters have been widely used in bearing-related estimation problems [2, 12, 13].

This paper focuses on a very specific problem. That is, once we have obtained the vision measurements, there are different ways to convert the nonlinear vision measurements to pseudo-linear. Which pseudo-linear measurement should be used and how to set the covariance matrix for the pseudo-linear noise? These problems are important for achieving high-performance motion estimation. However, we notice that practitioners usually do not pay special attention to them and often make mistakes. Motivated by this, we compare different types of pseudo-linear measurements and provide advice for practitioners on how to properly utilize vision measurements.

First of all, three different pseudo-linear measurements are established based on the bearing and angle measurements. Then, three pseudo-linear Kalman filter (PL-KF) estimators and three pseudo-linear recursive least square (PL-RLS) estimators are built up based on proposed pseudo-linear measurement equations, respectively. To reveal the roles of different measurements, we compared the performances of these estimators based on numerical experiments for three different scenarios.

In the first set of numerical experiments, it is revealed that all the estimators show similar performances even though they are built based on different measurement equations. The experimental results are supported by the theoretical analysis that the three pseudo-linear measurement equations have the same Fisher information about the states. Although the estimators show similar performances, some differences are still observed. In particular, the three PL-RLS estimators have different performances. That is because the magnitudes of transformed noises in the three different measurement equations are different. However, the PL-RLS estimators do not well incorporate this property. By contrast, the three PL-KF estimators show almost the same performance when the covariances of the measurement noises can be correctly setup.

Furthermore, the covariance setups in the PL-KF estimators are studied in the second set of numerical experiments. Because of the pseudo-linear transformation, the covariance of the pseudo-linear measurement noises are different in different measurement equations. It is revealed that incorrect values of the covariance parameters in PL-KF estimators would downgrade the estimation performance.

In summary, the contribution of this paper is to clarify some mistakes that practitioners may make when conducting vision-based motion estimation in practice. The following advice is given to practitioners. First, if the variances of the original measurement noises are known in advance, any PL-KF estimator is the best choice as long as the calculation of the covariance of the noises is correct. Second, if the variance of the original measurement noise is unknown, the PL-RLS estimator that has the smallest magnitude of the transformed noise can be a good choice.

2 Measurements obtained from visual sensing

This section presents the measurements obtained from vision and the corresponding nonlinear measurement equations.

2.1 Vision measurements

Consider a target MAV flying in 3-D space. Its position and velocity are denoted as $p_T,v_T\in {\mathbb {R}}^3$ respectively. Let $\ell \in {\mathbb {R}}$ be the physical size of the wheelbase of the target multirotor MAV. Suppose that we know $\ell $. There is an observer carrying a monocular camera to observe the target. The position of the observer camera is denoted as $p_o\in {\mathbb {R}}^3$.

A bounding box surrounding the target in the image can be obtained if the target MAV can be detected by a vision algorithm. There are two measurements that can be extracted from the bounding box.

First, the position of the bounding box in the image $q_\text {pix}=[x_\text {pix}, y_\text {pix}, 1]^\textrm{T}\in {\mathbb {R}}^3$ can be used to calculate the bearing vector that points from the observer to the target (see Fig. 1). Denote $g\in {\mathbb {R}}^3$ as the unit bearing vector, and ${\hat{g}}\in {\mathbb {R}}^3$ as the noise corrupted bearing measurement. According to the pin-hole camera model [8], it can be calculated as

$$\begin{aligned} {\hat{g}}=\dfrac{R_\text {c}^\text {w}P_\text {cam}^{-1}q_\text {pix}}{\Vert R_\text {c}^\text {w}P_\text {cam}^{-1}q_\text {pix}\Vert }, \end{aligned}$$

where $P_\text {cam}\in {\mathbb {R}}^{3\times 3}$ is the intrinsic parameter matrix of the camera [8], $R_\text {c}^\text {w}\in {\mathbb {R}}^{3\times 3}$ is the rotation matrix from the camera frame to the world frame.

Second, the size of the bounding box can be used to calculate the angle subtended by the target object in the camera’s field of view. Here, the size can be either the width or the height of the bounding box. In this paper, we consider the width which corresponds to the wheelbase of the multi-copter (Fig. 1). Denote $\theta \in (0, \pi /2)$ as the subtended angle, and ${\hat{\theta }}$ as the noisy angle measurement. According to the pin-hole camera model [8], the angle can be calculated as

$$\begin{aligned} {\hat{\theta }} = \arccos \Big (\dfrac{l_\text {left}^2+l_\text {right}^2 -s_\text {pix}^2}{2l_\text {left}l_\text {right}}\Big ), \end{aligned}$$

where $l_\text {left}=\sqrt{(f/\alpha )^2+(\delta x_\text {pix} - s_\text {pix}/2)^2 + \delta y_\text {pix}^2}$ and $l_\text {right}=\sqrt{(f/\alpha )^2+(\delta x_\text {pix} + s_\text {pix}/2)^2 + \delta y_\text {pix}^2}$ are distances in pixels from the center of the camera to the middle points of the left and right sides of the bounding box respectively (Fig. 1). Here, f and $\alpha $ are the camera’s focal length and single-pixel size, respectively. Here, $\delta x_\text {pix} = \Vert x_\text {pix}-i_\text {width}/2\Vert \in {\mathbb {R}}$ and $\delta y_\text {pix} = \Vert y_\text {pix}-i_\text {height}/2\Vert \in {\mathbb {R}}$ are the distances between the center of the bounding box and the center of the image, respectively.

The goal of the vision-based target motion analysis is to estimate the target’s position and velocity, $p_T$ and $v_T$, based on the noisy bearing ${\hat{g}}$ and angle ${\hat{\theta }}$ measurements, with known target’s physical size $\ell $ and observer’s position $p_o$.

2.2 Nonlinear measurement equations

We next establish the relationship between the vision measurements and the state of the target.

Denote

$$\begin{aligned} x=\begin{bmatrix}p_T \\ v_T\end{bmatrix}\in {\mathbb {R}}^6 \end{aligned}$$

as the state vector that we aim to estimate. Then, the bearing ${\hat{g}}$ and angle ${\hat{\theta }}$ measurements are non-linear functions of the state vector.

The work in [14] proved that the measurement noises from the camera are Gaussian distributed. In the vision-based target motion analysis problem, many works treat the vision-based measurement noises as Gaussian distribution and perform well [1, 2, 7]. In particular, the noise-corrupted non-linear measurement equations are

$$\begin{aligned} {\hat{g}}&=\dfrac{p_T-p_o}{r}+\mu , \end{aligned}$$

(1)

$$\begin{aligned} {\hat{\theta }}&\approx \dfrac{\ell }{r} +w, \end{aligned}$$

(2)

where

$$\begin{aligned} r = \Vert p_T-p_o\Vert \end{aligned}$$

is the distance between the target and the observer. Here, $\mu \sim {\mathcal {N}}(0, \sigma _\mu ^2 I_{3\times 3})$ and $w\sim {\mathcal {N}}(0, \sigma _w^2)$ are the measurement noises. The approximation in (2) is accurate. Specifically, when $r>3\ell $, the approximation error is less than $0.08\%$.

It should be noted that the original noise of bearing measurements is productive ${\hat{g}}=R(\eta , \epsilon )g$, where $R(\eta , \epsilon )\in {\mathbb {R}}^{3\times 3}$ is a rotation matrix that perturbs g. This rotation matrix rotates the vector g by an angle $\epsilon $ around the axis $\eta $. This productive noise can be transformed into an additive one in (1). This kind of operation has been widely used and shown stable performance in bearing-only localization [2, 15, 16].

3 Pseudo-linear measurement equations

It is noted that the measurement equations in (1) and (2) are nonlinear. When the conventional extended Kalman filter is directly applied, the estimation is usually unstable [11]. Therefore, we convert them to pseudo-linear in this section. We will see that there are three different ways to obtain pseudo-linear measurements and equations.They will be compared later in this paper.

Three pseudo-linear measurements are proposed first. Then, three pseudo-linear measurement equations are proposed based on transformed pseudo-linear measurements.

3.1 Three pseudo-linear measurements

In the following, we convert the non-linear measurement Eqs. of (1) and (2) into pseudo-linear.

First, we convert the bearing measurement in (1) to be pseudo-linear. Multiplying $rP_{{\hat{g}}}$ on both side of (1) yields

$$\begin{aligned} P_{{\hat{g}}} p_o=P_{{\hat{g}}} p_T + rP_{{\hat{g}}}\mu , \end{aligned}$$

(3)

where $P_{{\hat{g}}}=I-{\hat{g}}{\hat{g}}^\textrm{T}\in {\mathbb {R}}^{3\times 3}$ is an orthogonal projection matrix, which plays an important role in the bearing-related estimation and control problems [17]. The matrix $P_{{\hat{g}}}$ has an important property: $P_{{\hat{g}}}{\hat{g}}=0_{3\times 1}$. Here, $P_{{\hat{g}}} p_o$ is the transformed pseudo-linear measurements, and $rP_{{\hat{g}}}\mu $ is the transformed noise.

Second, we convert the angle measurement in (2) to be pseudo-linear. There are two types of pseudo-linear measurements.

a)
Multiplying $r{\hat{g}}$ on both side of Eq. (2) yields
$$\begin{aligned} r{\hat{\theta }}{\hat{g}} = \ell {\hat{g}}+rw{\hat{g}}. \end{aligned}$$
(4)
Equation (1) implies $r{\hat{g}}=p_T-p_o+r\mu $. Substituting it into the left-hand of (4) gives
$$\begin{aligned} {\hat{\theta }}p_o+\ell {\hat{g}}={\hat{\theta }}p_T+r({\hat{\theta }}\mu -w{\hat{g}}). \end{aligned}$$
(5)
Here, ${\hat{\theta }}p_o+\ell {\hat{g}}$ is the transformed pseudo-linear measurements, and $r({\hat{\theta }}\mu -w{\hat{g}})$ is the corresponding transformed noise.
b)
Bearing and angle measurements can be used to calculate the target’s position directly. Then, we have another pseudo-linear transformation of the angle measurement. Dividing both sides of (5) by ${\hat{\theta }}$ yields
$$\begin{aligned} p_o+\dfrac{\ell }{{\hat{\theta }}}{\hat{g}}=p_T+r\Big (\mu -\dfrac{w}{{\hat{\theta }}}{\hat{g}}\Big ). \end{aligned}$$
(6)
Here, $p_o+\dfrac{\ell }{{\hat{\theta }}}{\hat{g}}$ is the transformed pseudo-linear measurements, and $r\Big (\mu -\dfrac{w}{{\hat{\theta }}}{\hat{g}}\Big )$ is the corresponding transformed noise.

To sum up, $P_{{\hat{g}}} p_o$, ${\hat{\theta }}p_o+\ell {\hat{g}}$, and $p_o+\dfrac{\ell }{{\hat{\theta }}}{\hat{g}}$ in the left-hand of Eqs. (3), (5), and (6) are three transformed pseud-linear measurements. It should be noted that the corresponding transformed noises are different.

3.2 Three pseudo-linear measurement equations

Choosing one or combining two of the proposed pseudo-linear measurements properly yields the pseudo-linear measurement equations that can be used in Kalman filtering or least square algorithms. To fully compare the difference of transformed noises under different transformations, we propose three pseudo-linear measurement equations that correspond to three PLKF or PLLS estimators.

First, the pseudo-linear measurements in (6) directly give the target’s position. As a result, (6) itself can be seen as an independent measurement equation. Rewriting equation (6) in terms of the state vector x yields the first pseudo-linear measurement equation:

$$\begin{aligned} z_1 = H_1x+\nu _1, \end{aligned}$$

(7)

where

$$\begin{aligned} z_1&= p_o+\dfrac{\ell }{{\hat{\theta }}}{\hat{g}}\in {\mathbb {R}}^3,\\ H_1&= \begin{bmatrix}I_{3\times 3}&0_{3\times 3} \end{bmatrix}\in {\mathbb {R}}^{3\times 6},\\ \nu _1&= r\left( \mu -\dfrac{w}{{\hat{\theta }}}{\hat{g}}\right) \in {\mathbb {R}}^3, \end{aligned}$$

are the corresponding pseudo-linear measurement, pseudo-linear measurement matrix, and transformed noises, respectively. Although the expression of $\nu _1$ is complex, it can be approximately treated as a zero mean Gaussian noise $\nu _1\sim {\mathcal {N}}(0, \varSigma _{\nu _1})$. This kind of approximation has been widely used in many works [2, 12, 13, 18]. Here $\varSigma _{\nu _1}$ can be calculated by

$$\begin{aligned} \varSigma _{\nu _1}=r^2\Big (\sigma _\mu ^2I_{3\times 3} +\dfrac{\sigma _w^2}{{\hat{\theta }}^2}{\hat{g}}{\hat{g}}^\textrm{T}\Big )\in {\mathbb {R}}^{3\times 3}. \end{aligned}$$

(8)

Here, the distance r is unknown. We use ${\hat{r}}=\Vert \hat{p}_{T}-p_o\Vert $ to replace it [2, 19]. Here $\hat{p}_{T}$ is the estimated target’s position.

Second, the pseudo-linear measurements in (5) have a pseudo-linear relationship with the target’s position. Rewriting (5) in terms of the state vector x yields the second pseudo-linear measurement equation:

$$\begin{aligned} z_2 = H_2x+\nu _2, \end{aligned}$$

(9)

where

$$\begin{aligned} z_2&= {\hat{\theta }}p_o+\ell {\hat{g}} \in {\mathbb {R}}^3,\\ H_2&= \begin{bmatrix}{\hat{\theta }} I_{3\times 3}&0_{3\times 3} \end{bmatrix}\in {\mathbb {R}}^{3\times 6},\\ \nu _2&= r({\hat{\theta }}\mu -w{\hat{g}}) \in {\mathbb {R}}^3, \end{aligned}$$

are the corresponding pseudo-linear measurement, pseudo-linear measurement matrix, and transformed noise, respectively.

Similar to $\nu _1$, we approximate $\nu _2$ as a Gaussian noise $\nu _2\sim {\mathcal {N}}(0,\varSigma _{\nu _2})$, where the covariance matrix $\varSigma _{\nu _2}$ can be calculated as:

$$\begin{aligned} \varSigma _{\nu _2}={\hat{r}}^2({\hat{\theta }}^2\sigma _\mu ^2I_{3\times 3}+\sigma _w^2{\hat{g}}{\hat{g}}^\textrm{T})\in {\mathbb {R}}^{3\times 3}. \end{aligned}$$

(10)

Third, the pseudo-linear measurements in (3) alone can not recover the target’s position, because $P_{{\hat{g}}}$ does not have full column rank. However, combining pseudo-linear measurements in (3) and (5) together can. Combining (3) and (5) together and rewriting it in terms of the state vector x yields the third pseudo-linear measurement equation:

$$\begin{aligned} z_3 = H_3x+\nu _3, \end{aligned}$$

(11)

where

$$\begin{aligned} z_3&= \begin{bmatrix} P_{{\hat{g}}} p_o \\ {\hat{\theta }}p_o+\ell {\hat{g}} \end{bmatrix}\in {\mathbb {R}}^6,\\ H_3&= \begin{bmatrix} P_{{\hat{g}}} &{} 0_{3\times 3}\\ {\hat{\theta }}I_{3\times 3} &{} 0_{3\times 3} \end{bmatrix}\in {\mathbb {R}}^{6\times 6},\\ \nu _3&= r\begin{bmatrix} P_{{\hat{g}}}\mu \\ {\hat{\theta }}\mu -w{\hat{g}} \end{bmatrix}\in {\mathbb {R}}^6, \end{aligned}$$

are the corresponding pseudo-linear measurement, pseudo-linear measurement matrix, and transformed noise, respectively.

Similarly, the transformed noise $\nu _3$ is treated as a Gaussian noise $\nu _3\sim {\mathcal {N}}(0, \varSigma _{\nu _3})$, where the covariance matrix $\varSigma _{\nu _3}$ can be calculated by

$$\begin{aligned} \varSigma _{\nu _3}={\hat{r}}^2\begin{bmatrix}\sigma _\mu ^2P_{{\hat{g}}} &{} {\hat{\theta }}\sigma _\mu ^2P_{{\hat{g}}}\\ {\hat{\theta }}\sigma _\mu ^2P_{{\hat{g}}} &{} {\hat{\theta }}^2\sigma _\mu ^2I_{3\times 3}+\sigma _w^2{\hat{g}}{\hat{g}}^\textrm{T} \end{bmatrix}\in {\mathbb {R}}^{6\times 6}. \end{aligned}$$

(12)

To sum up, the key specifications of three pseudo-linear measurement equations are listed in Table 1.

Table 1 Comparison of key specifications of three pseudo-linear measurement equations

Full size table

4 Pseudo-linear estimators

This section proposes three PL-KF estimators and three PL-RLS estimators based on the proposed three pseudo-linear measurement equations.

4.1 PL-KF estimators

To estimate the motion of the target, it is common to model its motion as a noise-driven double integrator:

$$\begin{aligned} x(t_{k+1})=Fx(t_k)+q(t_k), \end{aligned}$$

(13)

where

$$\begin{aligned} F=\begin{bmatrix} I_{3\times 3} &{} \delta t I_{3\times 3} \\ 0_{3\times 3} &{} I_{3\times 3} \end{bmatrix}\in {\mathbb {R}}^{6\times 6}, \end{aligned}$$

where $\delta t$ is the sampling time, and I and 0 are identity and zero matrices, respectively. Here, $q\sim {\mathcal {N}}(0,\varSigma _q)$ is the process noise, whose covariance is

$$\begin{aligned} \varSigma _q=\text {diag}(0,0,0,\sigma _v^2,\sigma _v^2,\sigma _v^2) \in {\mathbb {R}}^{6\times 6}, \end{aligned}$$

where $\sigma _v$ is the standard deviation of the target’s velocity.

With the state transition Eq. (13) and three pseudo-linear measurement equations, three different PL-KF estimators can be realized. They all share the same filtering procedure as shown below. The differences between these three PL-KF estimators are reflected in $z_i$, $H_i$, and $\varSigma _{\nu _i}$. The prediction steps are

$$\begin{aligned} {\hat{x}}^-(t_k)&=F{\hat{x}}(t_{k-1}),\\ P_\text {kf}^-(t_k)&=FP_\text {kf}(t_{k-1})F^T+\varSigma _q, \end{aligned}$$

where ${\hat{x}}^-(t_k)\in {\mathbb {R}}^6$ and $P^-(t_k)\in {\mathbb {R}}^{6\times 6}$ are the prior estimated states and covariance matrix, respectively. The correction and update steps are

$$\begin{aligned}&K_\text {kf}(t_k)=P_\text {kf}^-(t_k)H_i^\textrm{T}(t_k)[H_i(t_k) P_\text {kf}^-(t_k)H_i^\textrm{T}(t_k)+\varSigma _{\nu _i}]^{-1},\\&{\hat{x}}(t_k)={\hat{x}}^-(t_k)+K_\text {kf}(t_k)[z_i(t_k)-H_i(t_k){\hat{x}}^-(t_k)],\\&P_\text {kf}(t_k)=[I-K_\text {kf}(t_k)H_i(t_k)]P_\text {kf}^-(t_k), \end{aligned}$$

where $K(t_k)$ is the gain matrix of Kalman filter, ${\hat{x}}(t_k)$ and $P(t_k)$ are posterior estimated states and corresponding covariance matrix.

4.2 PL-RLS estimators

Similarly, three PL-RLS estimators can be proposed based on the proposed three pseudo-linear measurement equations.

The form of the least square algorithm is as follows:

$$\begin{aligned} x(t_k)=(A_i^\textrm{T}A_i)^\dagger A_i^\textrm{T}b_i, \end{aligned}$$

where

$$\begin{aligned} A_i=\begin{bmatrix} H_i(t_1)F^{-(k-1)} \\ H_i(t_2)F^{-(k-2)} \\ \vdots \\ H_i(t_{k-1})F^{-1} \\ H_i(t_k) \end{bmatrix} \end{aligned}$$

and

$$\begin{aligned} b_i = \begin{bmatrix} z_i(t_1) \\ z_i(t_2) \\ \vdots \\ z_i(t_{k-1}) \\ z_i(t_k) \end{bmatrix}. \end{aligned}$$

Here, $t_1$ is the initial time, and $t_k$ is the current time.

It should be noted that the original form of the least square algorithm uses all times of measurements, which shows unstable performance if the target has time-varying velocity. We use the recursive least square (RLS) algorithm together with a time-decay factor, which can solve this problem [20].

The process of the recursive least square algorithm is as follows:

$$\begin{aligned}&{\hat{x}}^-(t_k)= F{\hat{x}}(t_{k-1}),\\&P_\text {ls}^-(t_k) =FP_\text {ls}(t_{k-1})F^\textrm{T},\\&K_\text {ls}(t_k)=P^-_\text {ls}(t_k)H_i^\textrm{T}(t_k) [H_i(t_k)P^-_\text {ls}(t_k)H_i^\textrm{T}(t_k)+\lambda I]^{-1}, \\&{\hat{x}}(t_k)={\hat{x}}^-(t_k)+K_\text {ls}(t_k)[z_i(t_k)-H_i(t_k){\hat{x}}^-(t_k)],\\&P_\text {ls}(t_k)=\lambda ^{-1}[I-K_\text {ls}(t_k)H_i(t_k)]P_\text {ls}^-(t_k), \end{aligned}$$

where $\lambda \in {\mathbb {R}}$ is the decay factor.

As can be seen, the differences between the three PL-RLS estimators are reflected in $z_i$ and $H_i$.

5 Numerical results

This section presents the simulation results to verify the performances of three PL-KF estimators and three PL-RLS estimators. Three different scenarios are considered in the simulation: 1) MAV tracking scenario; 2) MAV guidance scenario; 3) ground camera monitoring scenario. 1000 independent experiments are done for each scenario. Deep analyses of the results are also given. We analyze the performances of six estimators from the perspective of transformed noises, the Fisher information, and the architecture of algorithms.

5.1 Simulation setup

First, we give the simulation setup. All experiments use the same setup and parameters.

The variances of measurements’ noises are set to $\sigma _\mu = 0.01$, and $\sigma _w = 0.01$. The variance of the process noise is set to $\sigma _v = 0.1$. The decay factor in the PL-RLS estimators is $\lambda =0.8$. The initial estimated states are set to ${\hat{p}}_T=p_o+{\hat{r}}{\hat{g}}$ and ${\hat{v}}_T=[0,0,0]^\textrm{T}$. The initial variance of estimated states is set to ${\hat{P}}=I$. The simulation time interval is set to $\textrm{dt} = 0.1$ s.

1) MAV tracking scenario. The initial target’s position is at $p_T=[10, 0, 5]^\textrm{T}$. The target’s motion is a maneuver with two $90^\circ $’s turns, whose radius is 10 m. The magnitude of the target’s velocity is 2 m/s. Then, the magnitude of the target’s lateral acceleration can be calculated as around $0.4~\mathrm{m/s^2}$. The initial observer’s position is set to $p_o=[0, 0, 10]^\textrm{T}$. The observer’s velocity is controlled by a tracking algorithm

$$\begin{aligned} v_o={\hat{v}}_T + k\dfrac{r^2-d_{\textrm{track}}^2}{d_{\textrm{track}}^2}{\hat{g}}, \end{aligned}$$

where $k=1$ is the control parameter, and $d_\text {track}=5$ is the desired tracking distance. As a result, the initial horizontal distance between two MAVs is 10 m. With the observer tracking the target, their horizontal distance gradually narrowed to 5 m. The trajectory of two MAVs is shown on the left-hand of Fig. 2.

2) MAV guidance scenario. The target moves along a straight line from the initial position $p_T=[10, 0, 5]^\textrm{T}$ with the speed of 2 m/s. The initial observer’s position is set to $p_o=[0, 0, 0]^\textrm{T}$. The observer’s velocity is controlled by a PNG guidance law to pursue the target with a speed of 2.4 m/s. As a result, their distance is getting smaller and smaller. The trajectory of two MAVs is shown on the left-hand of Fig. 3.

3) Ground camera monitoring scenario. The setup of the target MAV is the same as the MAV tracking scenario. While, the observer remains stationary at the position of $p_o=[20, 10, 0]^\textrm{T}$. This scenario simulates the ground monitoring camera. As a result, the target MAV flies overhead the observer, and their distance first decreases and then increases. The trajectory of two MAVs is shown in the left-hand of Fig. 4.

5.2 Simulation 1: Comparison between different estimators

The estimation performances of three PL-KF estimators and three PL-RLS estimators for three scenarios are shown in Figs. 2, 3, and 4 respectively.

We use the norm error of estimated states $\Vert {\hat{x}}-x\Vert $ as metrics to evaluate six estimators. The error of estimated states is shown in the middle-upper of Figs. 2, 3, and 4. In the meantime, the average error of estimated states for 1000 times of independent experiments is shown in the middle-down of Figs. 2, 3, and 4. Both results show that all six estimators have similar performance.

In detail, three PL-KF estimators have the same performance, while the performances of three PL-RLS estimators are different. The PL-RLS estimator 3 has the highest performance, PL-RLS estimator 2 is the second, and PL-RLS estimator 1 is the worst. The performance differences of three PL-RLS estimators are affected by the distance between the target and the observer. Bigger distance results in greater differences.

We next analyze the simulation results from three perspectives: the transformed noises, the Fisher information, and the architecture of the algorithms.

First, the results of similar performance between PL-KF and PL-RLS estimators are analyzed from the perspective of Fisher information, which indicates how much information the measurements contain about the states. The Fisher information matrix is calculated by

$$\begin{aligned} FIM_i = \textstyle \sum \limits _{t=1}^{t_k}\left[ H_i^\textrm{T}(t) \varSigma _{\nu _i}^{-1}(t)H_i(t)\right] . \end{aligned}$$

Here, we only consider the target’s position as states. As a result, only the first three columns of $H_i$ are extracted.

The average determinant of the Fisher information matrix for 1000 times experiments are shown in the right-upper of Figs. 2, 3, and 4. As can be seen, three different pseudo-linear measurement equations have the same Fisher information, and the determinant of the Fisher information matrix increases with time. It makes sense because the pseudo-linear transformation does not bring new information about the states. The exactly same Fisher information means that six estimators use the same amount of information to estimate states. That is why different estimators have similar performance.

Second, the result of different performances between the three PL-RLS estimators is analyzed from the perspective of transformed noises. We plot the average norm of pseudo-linear measurements’ noises for 1000 times of experiments in (3), (5), and (12) in the right-down of Figs. 2, 3, and 4. For a quick review, the average norms of pseudo-linear measurements’ noises are calculated by $\Vert rP_{{\hat{g}}}\mu \Vert /3$, $\Vert r({\hat{\theta }}\mu -w{\hat{g}})\Vert /3$, and $\Vert r(\mu -(w/{\hat{\theta }}){\hat{g}})\Vert /6$.

As can be seen, the noises are influenced by the distance, but in most time, the noise of pseudo-linear measurement 3 is the biggest, while the noises of pseudo-linear measurements 1 and 2 are smaller. Compared to the performances of PL-RLS estimators, we can conclude that smaller transformed noises have better performance. From another perspective, the transformed noises are actually the residual error of the least square algorithm.

Third, the result that three PL-KF estimators have exactly the same performance is analyzed from the perspective of algorithm architecture. Comparing the estimation process of PL-KF estimators and PL-RLS estimators, the calculation of the gain matrix in PL-KF estimators considers the covariance of measurements’ noises, while PL-RLS estimators do not. Together with the same Fisher information, the performance of the PL-KF estimator is not affected by transformed noises.

5.3 Simulation 2: incorrect parameters in PL-KF estimators

Sometimes, the variances of the original measurement noises are difficult to estimate. Some users may overlook or make a mistake in the calculation of the covariance of the transformed noises when applying PL-KF estimators. This section investigates the influence of inappropriate setting of the parameter (covariance of the pseudo-linear measurement noise) in the PL-KF estimator.

Many new users directly use I as the covariance matrix of the measurement noises. The second simulation experiment setups $\varSigma _{\nu _i}=I$, and all other parameters and measurement data are the same as in simulation 1.

The simulation results are shown in Fig. 5. As can be seen, inappropriate measurement covariance can bring uncertain effects on the estimation performance in all three scenarios. As a result, our advice is to use the correct calculation of the covariance matrix of the pseudo-linear measurement noises when applying PL-KF estimators.

5.4 Simulation 3: wrong variance of original noises

Sometimes, it is difficult to estimate the variance of the original measurement noises. However, wrong value may bring bad effects. The third simulation experiment setups $\sigma _\mu =0.2$ and $\sigma _w=0.2$ when calculating the covariance matrix of $\varSigma _{\nu _i}$ in the PL-KF estimators, while the real variances of measurement noises remain $\sigma _\mu =0.01$ and $\sigma _w=0.01$. Other parameters and measurement data are the same as in simulation 1.

The simulation results are shown in Fig. 6. As can be seen, the performances become worse but remain the same in all three scenarios. As a result, our advice is to use PL-RLS estimators if the variances of the original measurement noises are difficult to estimate.

6 Conclusion

This paper investigates the effectiveness of using different pseudo-linear measurements on the estimation performance. We choose the vision-based target motion estimation problem as an example to introduce. The vision measurements are modelled as bearing and angle, and are then transformed into three different pseudo-linear measurements. Three different pseudo-linear measurement equations with corresponding transformed noises are then proposed. Based on the proposed pseudo-linear measurement equations, three PL-KF and three PL-RLS estimators are proposed.

Simulation results and corresponding analysis show that a) three PL-KF and three PL-RLS estimators have similar performance; b) the smaller transformed noises, the better performance of the PL-RLS estimator; c) three PL-KF estimators have the same performance.

For the inappropriate parameters in PL-KF estimators, the simulation results show that a) incorrect covariance brings unknown influence; b) wrong original variance of the measurement noises makes the performances worse but the same.

The final advice is to use the PL-KF estimator with correct parameters if the variance of the original measurement noise is known in advance. However, if the variance of the original measurement noise is unknown, the PL-RLS estimator with the smallest transformed noises is the best choice.

Data Availability

The data used in this study are available from the corresponding author upon request.

References

Vrba, M., & Saska, M. (2020). Marker-less micro aerial vehicle detection and localization using convolutional neural networks. IEEE Robotics and Automation Letters, 5(2), 2459–2466. https://doi.org/10.1109/lra.2020.2972819
Article Google Scholar
Li, J., Ning, Z., He, S., Lee, C.-H., & Zhao, S. (2022). Three-dimensional bearing-only target following via observability-enhanced helical guidance. IEEE Transactions on Robotics, 39(2), 1509–1526. https://doi.org/10.1109/tro.2022.3218268
Kang, H., Joung, J., Kim, J., Kang, J., & Cho, Y. S. (2020). Protect your sky: A survey of counter unmanned aerial vehicle systems. IEEE Access, 8, 168671–168710. https://doi.org/10.1109/access.2020.3023473
Article Google Scholar
Dressel, L., & Kochenderfer, M.J.: (2019). Hunting drones with other drones: Tracking a moving radio target. In: 2019 International Conference on Robotics and Automation (ICRA). Montreal, QC, Canada. https://doi.org/10.1109/icra.2019.8794243
Fogel, E., & Gavish, M. (1988). Nth-order dynamics target observability from angle measurements. IEEE Transactions on Aerospace and Electronic Systems, 24(3), 305–308. https://doi.org/10.1109/7.192098
Article MathSciNet Google Scholar
He, S., Shin, H.-S., & Tsourdos, A. (2019). Trajectory optimization for target localization with bearing-only measurement. IEEE Transactions on Robotics, 35(3), 653–668. https://doi.org/10.1109/tro.2019.2896436
Article Google Scholar
Griffin, B.A., & Corso, J.J. (2021). Depth from camera motion and object detection. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Nashville, TN, USA. https://doi.org/10.1109/cvpr46437.2021.00145
Ma, Y., Soatto, S., & Kosecká, J., & Sastry, S.S. (2012). An Invitation to 3-D Vision, Interdisciplinary Applied Mathematics (1st ed., Vol. 26). New York: Springer. https://doi.org/10.1007/978-0-387-21779-6
Walter, V., Staub, N., Franchi, A., & Saska, M. (2019). UVDAR system for visual relative localization with application to leader/follower formations of multirotor UAVs. IEEE Robotics and Automation Letters, 4(3), 2637–2644. https://doi.org/10.1109/lra.2019.2901683
Article Google Scholar
Schilling, F., Schiano, F., & Floreano, D. (2021). Vision-based drone flocking in outdoor environments. IEEE Robotics and Automation Letters, 6(2), 2954–2961. https://doi.org/10.1109/lra.2021.3062298
Article Google Scholar
Lin, X., Kirubarajan, T., Bar-Shalom, Y., Maskell, S. (2002). Comparison of EKF, pseudomeasurement, and particle filters for a bearing-only target tracking problem. In: O.E. Drummond (Ed.), SPIE Proceedings. Orlando, FL, USA. https://doi.org/10.1117/12.478508
Lingren, A., & Gong, K. (1978). Position and velocity estimation via bearing observations. IEEE Transactions on Aerospace and Electronic Systems, AES–14(4), 564–577. https://doi.org/10.1109/taes.1978.308681
Article Google Scholar
Aidala, V., & Nardone, S. (1982). Biased estimation properties of the pseudolinear tracking filter. IEEE Transactions on Aerospace and Electronic Systems, AES–18(4), 432–441. https://doi.org/10.1109/taes.1982.309250
Article Google Scholar
Boie, R. A., & Cox, I. J. (1992). An analysis of camera noise. IEEE Transactions on Pattern Analysis & Machine Intelligence, 14(6), 671–674. https://doi.org/10.1109/34.141557
Zhao, S., Chen, B. M., & Lee, T. H. (2013). Optimal sensor placement for target localisation and tracking in 2D and 3D. International Journal of Control, 86(10), 1687–1704. https://doi.org/10.1080/00207179.2013.792606
Article MathSciNet MATH Google Scholar
Bishop, A. N., Fidan, B., Anderson, B. D. O., Doğançay, K., & Pathirana, P. N. (2010). Optimality analysis of sensor-target localization geometries. Automatica, 46(3), 479–492. https://doi.org/10.1016/j.automatica.2009.12.003
Article MathSciNet MATH Google Scholar
Zhao, S., & Zelazo, D. (2019). Bearing rigidity theory and its applications for control and estimation of network systems: Life beyond distance rigidity. IEEE Control Systems Magazine, 39(2), 66–83. https://doi.org/10.1109/mcs.2018.2888681
Article MathSciNet MATH Google Scholar
Dogancay, K. (2015). 3D pseudolinear target motion analysis from angle measurements. IEEE Transactions on Signal Processing, 63(6), 1570–1580. https://doi.org/10.1109/tsp.2015.2399869
Article MathSciNet MATH Google Scholar
He, S., Wang, J., & Lin, D. (2018). Three-dimensional bias-compensation pseudomeasurement kalman filter for bearing-only measurement. Journal of Guidance, Control, and Dynamics, 41(12), 2678–2686. https://doi.org/10.2514/1.g003785
Article Google Scholar
Islam, S. A. U., & Bernstein, D. S. (2019). Recursive least squares for real-time implementation [lecture notes]. IEEE Control Systems, 39(3), 82–85. https://doi.org/10.1109/mcs.2019.2900788
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, Zhejiang University, Hangzhou, 310058, Zhejiang, China
Zian Ning
School of Engineering, Westlake University, Hangzhou, 310024, Zhejiang, China
Zian Ning, Yin Zhang & Shiyu Zhao

Authors

Zian Ning
View author publications
You can also search for this author in PubMed Google Scholar
Yin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Shiyu Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shiyu Zhao.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Ning, Z., Zhang, Y. & Zhao, S. Comparison of different pseudo-linear estimators for vision-based target motion estimation. Control Theory Technol. 21, 448–457 (2023). https://doi.org/10.1007/s11768-023-00161-y

Download citation

Received: 27 May 2023
Revised: 02 July 2023
Accepted: 08 July 2023
Published: 04 September 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s11768-023-00161-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Comparison of different pseudo-linear estimators for vision-based target motion estimation

Abstract

Similar content being viewed by others

Performance analysis of α- β- γtracking filters using position and velocity measurements

Three-Dimensional Bearings-Only Target Tracking: Comparison of Few Sigma Point Kalman Filters

Extended least square unbiased FIR filter for target tracking using the constant velocity motion model

1 Introduction