Keywords

1 Introduction

UAVs have been widely used in the field of military reconnaissance, and have shown a development trend of miniaturization, clustering and intelligence. The UAV clusters can not only make up for the shortcomings of a single UAV, but also broaden the mission fields through the information fusion and resource complementation between multiple drones.

In the field of single-target passive localization, the commonly used method is pseudo-linear estimation (PLE) [1], which lumps the nonlinearity into the noise term, and then obtains the target position estimate by least squares. After considering the statistical characteristics of the noise term in the PLE equation, the weighted pseudo-linear estimation (WPLE) [2, 3] is proposed. But neither PLE nor WPLE can reduce the estimation bias very well. A commonly used method to reduce bias is maximum likelihood estimation (MLE) [4].

However, networked radar has been widely used due to its wider spatial coverage, higher target detection probability and stronger anti-jamming ability. Scenarios with multiple targets like networked radar that needed to be located are more common. Therefore, single-target passive localization technology is difficult to meet actual needs, and it is particularly important to research the problem of multi-target passive localization.

Under the above background, this paper proposes an algorithm for passive localization of multiple targets by UAV clusters. Firstly, the PLE algorithm is used to obtain the coarse localization information of the targets, and then the UAV resources are allocated to multiple targets based on greedy algorithm. Finally, the improved WPLE method or MLE method is employed in the acquisition of accurate target information.

The main symbols used in the paper are listed as follows (Table 1):

Table 1. The definition of symbols

2 The Passive Localization Mechanism of a Single Target by a UAV Cluster

2.1 Scene Model

The single target passive localization model [9] based on UAV clusters is shown in Fig. 1. K UAVs and one target are distributed in a two-dimensional plane space. The position of the kth (k = 1,2…K) UAV \({{\varvec{s}}}_{{\varvec{k}}}={[{\mathrm{x}}_{\mathrm{k}},{\mathrm{y}}_{\mathrm{k}}]}^{\mathrm{T}}\) is known. \({\uptheta }_{\mathrm{k}}\in [\mathrm{0,2\Pi })\) is the real azimuth angle from the target to the kth UAV. The goal of passive localization of a single target is to obtain the unknown real position of the target \({\varvec{p}}={[\mathrm{x},\mathrm{y}]}^{\mathrm{T}}\). The real azimuth angle, the position of the UAVS, and the position of the target have the following relationships:

$$\begin{array}{c}sin \, {\uptheta }_{\mathrm{k}}=\frac{\mathrm{y}-{\mathrm{y}}_{\mathrm{k}}}{{\mathrm{r}}_{\mathrm{k}}} , cos \, {\uptheta }_{\mathrm{k}}=\frac{\mathrm{x}-{\mathrm{x}}_{\mathrm{k}}}{{\mathrm{r}}_{\mathrm{k}}}\end{array}$$
(1)

Among them, \({\mathrm{r}}_{\mathrm{k}}=|\left|{{\varvec{s}}}_{{\varvec{k}}}-{\varvec{p}}\right||\) is the real distance between the kth UAV and the target. The model assumes that each UAV is affected by independent zero-mean Gaussian additive noise while observing the target. Then the bearing measured by the kth UAV \(\tilde{\uptheta }_{\mathrm{k}}\) is composed of the true bearing and the bearing noise:

$${\tilde{\uptheta }_{\mathrm{k}}}={\uptheta }_{\mathrm{k}}+{\mathrm{n}}_{\mathrm{k}}, {\mathrm{n}}_{\mathrm{k}}\sim N(0, {{\upsigma }_{\mathrm{nk}}}^{2})$$
(2)
Fig. 1.
figure 1

Scene model

2.2 Principle of PLE Method

For the kth UAV, the following observation equation holds [2]:

$$\begin{array}{c}{\varvec{a}}_{\varvec{k}}{\varvec{p}}={\mathrm{b}}_{\mathrm{k}}+{\upeta }_{\mathrm{k}}\end{array}$$
(3)

where \({\varvec{a}}_{\varvec{k}}=\left[{sin}{\tilde{{\uptheta }}_{\mathrm{k}}},-{cos}{\tilde{{\uptheta }}_{\mathrm{k}}}\right]\), \({\mathrm{b}}_{\mathrm{k}}=\left[{sin}{\tilde{{\uptheta }}_{\mathrm{k}}},-{cos}{\tilde{{\uptheta }}_{\mathrm{k}}}\right]{\varvec{s}}_{\varvec{k}}\), \({\upeta }_{\mathrm{k}}= {\mathrm{r}}_{\mathrm{k}}{ sin} \, {\mathrm{n}}_{\mathrm{k}}\) is the error term caused by the bearing noise.

Combine the observation equations of K UAVs together and write them in the form of a matrix:

$$\begin{array}{c}{\varvec{Ap=b+\eta}} \end{array}$$
(4)

where \(\varvec{A}={\left[{{\varvec{a}}_{\varvec{1}}}^{\mathrm{T}} {{\varvec{a}}_{\varvec{2}}}^{\mathrm{T}}\dots \, {{\varvec{a}}_{\varvec{K}}}^{\mathrm{T}}\right]}^{\mathrm{T}}\), \(\varvec{b}={\left[{{\mathrm{b}}_{1}}^{\mathrm{T}} {{\mathrm{b}}_{2}}^{\mathrm{T}}\dots \, {{\mathrm{b}}_{\mathrm{K}}}^{\mathrm{T}}\right]}^{\mathrm{T}}\) and \(\varvec{\upeta} ={\left[{{\upeta }_{1}}^{\mathrm{T}} {{\upeta }_{2}}^{\mathrm{T}}\dots\, {{\upeta }_{\mathrm{K}}}^{\mathrm{T}}\right]}^{\mathrm{T}}\). The PLE of the target position \({\widehat{\varvec{p}}}_{\mathbf{PLE}}\) can be obtained by the least squares method:

$$\begin{array}{c}{\widehat{\varvec{p}}}_{\mathbf{PLE}}={\left({\varvec{A}}^{\mathrm{T}}{\varvec{A}}\right)}^{-1}{\varvec{A}}^{\mathrm{T}}{\varvec{b}}\end{array}$$
(5)

2.3 Principle of WPLE Method

WPLE is based on the PLE method, using the inverse of the covariance matrix C of the noise term η in (4) as the weighting matrix W (that is, trust those observations with low bearing noise). Then weights the bearing information measured by each UAV to obtain the estimate.

The covariance matrix of the noise term is as follows:

$$\begin{array}{c}{\varvec{C}}=E\left[\varvec{\upeta {\upeta }}^{\mathrm{T}}\right]\end{array}$$
(6)

when the bearing noise \({\mathrm{n}}_{\mathrm{k}}\) is small, \({\upeta}_{\mathrm{k}}\) can be replaced by the following formula:

$$\begin{array}{c}{\upeta }_{\mathrm{k}}= {\mathrm{r}}_{\mathrm{k}}{\mathrm{sin}\, n}_{\mathrm{k}}\approx {\mathrm{r}}_{\mathrm{k}}{\mathrm{n}}_{\mathrm{k}}\end{array}$$
(7)

Therefore, the covariance matrix of the noise term η can be calculated:

$$\begin{array}{c}{\varvec{C}}=diag\left({{\mathrm{r}}_{1}}^{2}{{\upsigma }_{\mathrm{n}1}}^{2},{{\mathrm{r}}_{2}}^{2}{{\upsigma }_{\mathrm{n}2}}^{2},\dots {,{\mathrm{r}}_{\mathrm{k}}}^{2}{{\upsigma }_{\mathrm{nk}}}^{2}\right)\end{array}$$
(8)

where diag is a function that converts a row vector into a diagonal matrix. The weighting matrix can be obtained by:

$$\begin{array}{c}{\varvec{W}}={\varvec{C}}^{-1}\end{array}$$
(9)

The WPLE form is:

$$\begin{array}{c}{\widehat{\varvec{p}}}_{\mathbf{WPLE}}={\left({\varvec{A}}^{\mathrm{T}}\varvec{WA}\right)}^{-1}{\varvec{A}}^{\mathrm{T}}{\varvec{Wb}}\end{array}$$
(10)

It is worth noting that the distance between the drone and the target needs to be used in (8). However, the target position is unknown so that the distance is also unknown. The traditional WPLE method uses the localization result obtained by PLE to calculate the target distance. The improvement method of this paper is: first use the covariance matrix Q of the bearing noise as the initial value of the covariance matrix C:

$$\begin{array}{c}{\varvec{C}}_{\varvec{0}}=\varvec{Q}=diag\left({{\upsigma }_{\mathrm{n}1}}^{2},{{\upsigma }_{\mathrm{n}2}}^{2},\dots {{\upsigma }_{\mathrm{nk}}}^{2}\right)\end{array}$$
(11)

Then, the initial value of the target position is calculated by (10), and this initial value is used to calculate the target distance. Then (8) is used to recalculate the covariance matrix of the noise term η, and (10) is used to calculate the iterative value of the target position. By repeating the above iterative process, the estimated value of WPLE can be obtained.

2.4 Principle of MLE Method

The likelihood function of the target bearings (K observations) can be written as:

$$\begin{array}{c}f\left(\tilde{\varvec{\theta }}|\varvec{p}\right)=\frac{1}{{\left(2\Pi \right)}^{\frac{\mathrm{K}}{2}}{\left|\varvec{Q}\right|}^\frac{1}{2}} {{e}}^{-\frac{1}{2}{\left(\tilde{\varvec{\theta }}-{\varvec{\uptheta}} \left(\varvec{p}\right)\right)}^{\varvec{T}}{\varvec{Q}}^{-1} \left(\tilde{\varvec{\theta }}-{\varvec{\uptheta}} \left(\varvec{p}\right)\right)}\end{array}$$
(12)

where \({\varvec{\uptheta }}\left(\varvec{p}\right){=[{\uptheta }_{1}\left(\mathrm{p}\right) {\uptheta }_{2}\left(\mathrm{p}\right)\dots {\uptheta }_{\mathrm{K}}\left(\mathrm{p}\right)]}^{\mathrm{T}}\), \({\tilde{\varvec{\theta }}}={\left[\tilde{{\uptheta }_{1}} \tilde{{\uptheta }_{2}}\dots {\tilde{{\uptheta }}_{\mathrm{K}}}\right]}^{\mathrm{T}}\). The MLE method is to use the position where the likelihood function is maximized as the estimated value of the target position. Also, the ML problem can be written as:

$$\begin{array}{c}\underset{\varvec{p}}{{min}}\, {{f}\left(\varvec{p}\right)=\left(\tilde{\varvec{\theta }}-{\varvec{\uptheta }}\left(\varvec{p}\right)\right)}^{\mathrm{T}}{\varvec{Q}}^{-1} \left(\tilde{\varvec{\theta }}-{\varvec{\uptheta }}\left(\varvec{p}\right)\right)\end{array}$$
(13)

Therefore, solving the problem of maximum likelihood function is reduced to solving the problem of minimum \(\mathrm{f}(\varvec{p})\). We can use this function as a cost function and use the gradient descent method to iterate the estimated value of the target position. Derivation from Eq. (1) can be obtained:

$$\begin{array}{c}\frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {x}}={\left[\frac{-\Delta {{y}}_{1}}{{{\mathrm{r}}_{1}}^{2}} \frac{-\Delta {{y}}_{2}}{{{\mathrm{r}}_{2}}^{2}} \dots \frac{-\Delta {{y}}_{{K}}}{{{\mathrm{r}}_{\mathrm{K}}}^{2}}\right]}^{\mathrm{T}} , \frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {y}}={\left[\frac{\Delta {{x}}_{1}}{{{\mathrm{r}}_{1}}^{2}} \frac{\Delta {{x}}_{2}}{{{\mathrm{r}}_{2}}^{2}} \dots \frac{\Delta {{x}}_{{K}}}{{{\mathrm{r}}_{\mathrm{K}}}^{2}}\right]}^{\mathrm{T}}\end{array}$$
(14)

where \(\Delta {\mathrm{x}}_{\mathrm{k}}=\mathrm{x}-{\mathrm{x}}_{\mathrm{k}},\Delta {\mathrm{y}}_{\mathrm{k}}=\mathrm{y}-{\mathrm{y}}_{\mathrm{k}}\). Combined with (13), the partial derivative of the cost function with respect to the target position can be obtained by:

$$\begin{array}{c}\frac{\partial {f}\left(\varvec{p}\right)}{\partial {x}}=2{\left({\varvec{\uptheta }}\left(\varvec{p}\right)-\tilde{{\varvec{\uptheta }}}\right)}^{\mathrm{T}}{\varvec{Q}}^{-1}\frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {x}} , \frac{\partial {f}\left(\varvec{p}\right)}{\partial \mathrm{y}}=2{\left({\varvec{\uptheta }}\left(\varvec{p}\right)-\tilde{\varvec{\theta }}\right)}^{\mathrm{T}}{\varvec{Q}}^{-1}\frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial \mathrm{y}}\end{array}$$
(15)

Therefore, the iterative formula is:

$$\begin{array}{c}{\varvec{p}}_{\varvec{new}}={\varvec{p}}_{\varvec{old}}-{\mathrm{r}}*{\left[\frac{\partial {f}\left(\varvec{p}\right)}{\partial {x}} \frac{\partial {f}\left(\varvec{p}\right)}{\partial {y}}\right]}^{\mathrm{T}}\end{array}$$
(16)

where \({\varvec{p}}_{\varvec{new}}\) is the position estimate after one iteration and \(\mathrm{r}\) is the iteration rate. Both the number of iterations and the iteration rate are hyperparameters and should be selected reasonably according to the actual situation.

2.5 Cramer-Rao Lower Bound Calculation

In the unbiased estimation problem, Cramer-Rao lower bound (CRLB) [9] is usually used to measure the validity of an estimate. If the Root Mean Square Error (RMSE) of an estimated value is smaller and closer to the value obtained by CRLB, then the estimate is said to be more effective. In order to calculate CRLB, we introduce the Fisher Matrix (FIM):

$$\begin{array}{c}{\varvec{F}}=\left[\begin{array}{cc}-\mathrm{E}\left[\frac{{\partial }^{2}}{\partial {{x}}^{2}}{f}\left(\tilde{\varvec{\theta }}|\varvec{p}\right)\right]& -\mathrm{E}\left[\frac{{\partial }^{2}}{\partial {x}\partial {y}}{f}\left(\tilde{\varvec{\varvec{\uptheta }}}|{\varvec{p}}\right)\right]\\ -\mathrm{E}\left[\frac{{\partial }^{2}}{\partial {y}\partial {x}}{f}\left(\tilde{\varvec{\theta }}|\varvec{p}\right)\right]& -\mathrm{E}\left[\frac{{\partial }^{2}}{\partial {{y}}^{2}}{f}\left(\tilde{\varvec{\theta }}|\varvec{p}\right)\right]\end{array}\right]\end{array}$$
(17)

According to the likelihood function expression of (12), the above formula can be rewritten as:

$$\begin{array}{c}\varvec{F}={\left[\frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {x}} \frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {y}}\right]}^{\mathrm{T}} {\varvec{Q}}^{-1}\left[\frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {x}} \frac{\partial{\varvec{\uptheta }}\left(\varvec{p}\right)}{\partial {y}}\right]\end{array}$$
(18)

Combining the partial derivative of \({\varvec{\uptheta }}\left(\varvec{p}\right)\) in (14) with respect to the target position, the further expression of Fisher matrix F can be obtained by:

$$\begin{array}{c}\varvec{F}=\sum\limits_{\mathrm{k}=1}^{\mathrm{K}}\frac{1}{{{\upsigma }_{\mathrm{nk}}}^{2}{{\mathrm{r}}_{\mathrm{k}}}^{4}}\left[\begin{array}{cc}{\left(\Delta {{y}}_{{K}}\right)}^{2}& -\Delta {{x}}_{{K}}\Delta {{y}}_{{K}}\\ -\Delta {{x}}_{{K}}\Delta {{y}}_{{K}}& {\left(\Delta {{x}}_{{K}}\right)}^{2}\end{array}\right]\end{array}$$
(19)

The Cramer-Rao lower bound matrix CRLB can be obtained by the Fisher matrix:

$$\begin{array}{c}CRLB={\varvec{F}}^{-1}\end{array}$$
(20)

\(\widehat{\varvec{p}}\) = \({[\widehat{\mathrm{x}},\widehat{\mathrm{y}}]}^{\mathrm{T}}\) is the estimation of target position. The relationship between RMSE and CRLB is:

$$\begin{array}{c}RMSE =\sqrt{\mathrm{E}\left[{\left({x}-\widehat{\mathrm{x}}\right)}^{2}+{\left({y}-\widehat{\mathrm{y}}\right)}^{2}\right]}\ge \sqrt{\varvec{CRLB}\left(\mathrm{1,1}\right)+\varvec{CRLB}\left(\mathrm{2,2}\right)}\end{array}$$
(21)

3 Resource Allocation Method Based on Greedy Algorithm

In this paper, the greedy algorithm is used to realize the resource allocation of the UAV clusters to multiple targets. The principle of the greedy algorithm is to split a complex problem into simple problems, and to obtain an optimal solution for each simple problem.

It can be seen from (3) that when the distance between the UAV and the target is large, the error term \({\upeta }_{\mathrm{k}}\) brought by the bearing noise will be correspondingly larger, which is unfavorable for the estimation. It can be seen from (8) that when the distance between the UAV and the target is far, the corresponding value in the covariance matrix element of the noise term η will also be larger, so the bearing measured by the UAV is not so credible. Therefore, the optimal solution for the resource allocation of a UAV to multiple targets depends on the distance between the drone and the target. Then, the mathematical model for the allocation of K UAVs to N target resources is:

$$\begin{array}{c}{\mathrm{selected}}_{\mathrm{k}}=\underset{{i}}{{min}}\left|\left|{\varvec{s}}_{\varvec{k}}-{\varvec{p}}_{\varvec{i}}\right|\right|\end{array}$$
(22)

where i = 1,2…N, \({\mathrm{selected}}_{\mathrm{k}}\) is the target number assigned to the kth UAV, and \({\mathrm{p}}_{\mathrm{i}}\) is the position coordinate of the i-th target. To prevent drones from being overallocated to the same target, resulting in waste of resources, the constraint below should be satisfied while allocating resources:

$$\begin{array}{c}{\mathrm{N}}_{\mathrm{i}}\le \frac{\mathrm{K}}{\mathrm{N}} \end{array}$$
(23)

where \({\mathrm{N}}_{\mathrm{i}}\) is the number of UAVs assigned to the i-th target.

4 Passive Localization Algorithm Flow of UAV Clusters for Multi-target

In the above, we have discussed the passive localization mechanism of the UAV clusters for a single target. The method of resource allocation is analyzed. Combining the two can realize the passive localization of the UAV clusters for multiple targets.

But pay attention to (22), when using the greedy algorithm in resource allocation, we need to use the target position before passive positioning, which is what needs to be measured and estimated. Therefore, this paper proposes an algorithm and shows it in Table 2. That is, the released UAV cluster first uses the PLE method to coarsely locate each target. The estimated values of the PLE method are used to replace the real targets position. Then the greedy algorithm is adopted for UAVs resource allocation. The improved WPLE or MLE method is used for fine positioning.

Table 2. Multi-target passive localization algorithm

The PLE algorithm has low computational complexity and does not require iteration. But its positioning accuracy is poor. Therefore, other passive positioning methods (such as WPLE and MLE) need to be used to ensure the positioning accuracy. However, these methods generally have high time complexity and space complexity. The reasonable way for multi-target localization is: after resource allocation, the UAVs assigned to the same target use these methods for positioning so that the accurate passive localization of each target can be performed at the same time. The PLE method used in the algorithm for coarse positioning is only to quickly obtain the approximate position coordinates of the targets, and then the greedy algorithm can be used for resource allocation.

5 Simulation Result Analysis

5.1 System Performance Analysis

In the designed simulation experiment, K = 100, N = 4, that is, a cluster of 100 UAVs passively locates 4 targets. The initially released UAVs and targets are uniformly and randomly distributed in a two-dimensional plane (200x200), as shown in Fig. 2. The standard deviation of the bearing noise for each UAV is 8Π/180. According to the algorithm flow, 100 UAVs first adopt PLE method for coarse positioning of 4 targets in turn, and then adopt the greedy algorithm for resource allocation. The result of resource allocation is shown in Fig. 3, and each target is allocated 25 UAVs equally according to the constraint (23). Subsequently, the UAVs assigned to the same target use WPLE or MLE to accurately locate the target.

Fig. 2.
figure 2

Initial release of UAV cluster and targets

Fig. 3.
figure 3

UAV cluster and targets after resource allocation

The results of coarse positioning and fine positioning are shown in Table 3. The data is the deviation between the estimated value and the true position coordinate. In order to avoid the contingency of the experiment, both coarse positioning and fine positioning were performed 1000 times, and the average value of 1000 estimated values is used as the positioning result.

The UAV cluster passively locates each target again after allocating resources. From the data in Table 3, it can be seen that no matter whether the WPLE method or the MLE method is used, although the observations of a single target is reduced (reduced from 100 to 25), its positioning accuracy is greatly improved compared to the initial positioning accuracy of the PLE method for coarse positioning.

Noted from the calculation process, the time complexity of WPLE and MLE is related to the observation times. Due to the resource allocation, the number of observations is reduced and the fine positioning of each target is carried out at the same time. Therefore, the algorithm greatly reduces the time complexity in multi-target localization scenario.

Table 3. Coarse positioning and fine positioning results

5.2 Performance Comparison of Different Passive Localization Methods

In this part of the experiment, to compare the performance of different passive localization methods, 25 UAVs locate a single target using PLE, WPLE and MLE. Among them, it costs 0.000602 s using PLE method for a single run, while the WPLE method takes 0.012833 s, and the MLE method takes 0.017344 s.

Figure 4 shows the RMSE and deviations of the three passive localization methods under different bearing noise standard deviations. After fixing the bearing noise standard deviation, three passive localization methods were used to estimate the position of the target 1000 times to obtain the RMSE and deviation of the estimators. We noticed that as the standard deviation of the bearing noise increases, the estimation performance of the three passive localization algorithms is decreasing (the RMSE and the deviation are increasing). At the same time, it can be seen that the estimation performance of the PLE algorithm is the worst, and both the RMSE and the deviation are the largest; the RMSE of WPLE and MLE can approach CRLB, but the deviation of MLE is smaller than WPLE. It can be concluded that the positioning performance of MLE is better than the other two algorithms under the same scene conditions.

Figure 5 shows the effects of the three passive localization methods under different numbers of UAVs. The standard deviation of the bearing noise is 8Π/180, and the three passive localization methods are all performed 1000 times to obtain the RMSE and deviation of the estimators. We can find that RMSE gradually decreases as the number of UAVs increases, but the deviation is almost unchanged. Therefore, the increase in the number of UAVs can improve the estimation performance of the three positioning methods to a certain extent due to the decrease in RMSE.

Fig. 4.
figure 4

RMSE and deviation under different bearing noise standard deviations

Fig. 5.
figure 5

RMSE and deviation under different numbers of UAVs

6 Conclusion

The main contribution of this paper is to propose a multi-target passive localization algorithm based on UAV clusters and do theoretical and experimental analysis of the algorithm. The proposed algorithm uses PLE for coarse positioning, then uses greedy algorithm for resource allocation, and finally uses improved WPLE or improved MLE for fine positioning. Experiment results shows that the proposed algorithm not only achieves rapid multi-target localization, but also improves the positioning accuracy compared to the traditional PLE method. Therefore, the algorithm is feasible and effective. At the same time, among the three passive positioning methods used in the proposed algorithm, MLE has the excellent estimation performance that both the RMSE and the deviation are the smallest. Otherwise, the estimation performance of the three passive positioning methods improves as the standard deviation of bearing noise decreases and the number of UAVs increases.