Keywords

1 Introduction

Although the fifth generation (5G) mobile communication can meet the demand of Internet of Things (IoT) applications in hot spots, there is still urgent need for more economical and efficient network coverage in remote districts. Meanwhile, to build the space-air-ground integrated network (SAGIN) is proposed in the vision of the sixth generation (6G) mobile communication, which can achieve seamless network coverage and ubiquitous terminal access [1].

The SAGIN usually consists of the space-based segment, the air-based segment and the ground segment [2]. Air-based unmanned aerial vehicle (UAV) network has more advantages, such as lower constructive cost, higher line-of-sight (LoS) capability and flexible deployment characteristics, which can supply access for IoT terminals in remote districts, high altitude platform station (HAPS) ensures much wider coverage [3]. Paper [4] proposed the UAV-assisted SAGIN model, which mainly analyzed the performance of UAV-assisted network, and through network optimization, the throughput and spectral efficiency of the system were improved. It can be seen that heterogeneous AGIN has become an effective scheme to solve the network coverage problem and improve the system performance, which provides an important reference for the study of this paper.

Related studies had been carried out on NOMA technology [5, 6]. Since traditional orthogonal multiple access (OMA)-based technology is no longer adequate, non-OMA (NOMA) that can recover information using successive interference cancellation (SIC) at the receiving end and achieve better spectrum efficiency, has become the focus [7]. Paper [5] proposed a NOMA technology combined with orthogonal frequency division multiplexing (OFDM) technology, which reduced the bit error rate and improved the system performance based on cyclic stationary equalization technology. However, as the increasing of users’ number, the complexity of SIC greatly increases. Therefore, by dividing terminals into multiple clusters, clustered-NOMA (C-NOMA) is an effective solution to both improve system throughput while balancing spectrum efficiency and terminal complexity.

In heterogeneous networks, it is a significant concern to plan UAV trajectory to enhance the system performance considering the constrained on-board energy storage of UAV [8, 9, 10]. Paper [9] proposed an integrated air-ground network system supporting XAPS, which maximized the uplink reachable rate of the system through spectrum, power and position optimization of UAVs based on matching theory. Paper [10] proposed a UAV-assisted space-ground remote IoT network architecture, optimized the deployment of UAVs through successive convex approximation (SCA) method, and solved the energy-saving communication problem between UAVs and ground terminals. The above work mostly only considered communication energy consumption, however less likely to involve flight energy consumption, which have a greater effect on the capabilities of the UAV communication. So considering the flight trajectory and speed control, the propulsion energy consumption model of UAV is introduced to solve the optimization problem of trajectory and velocity of UAV, and achieve high EE and wide coverage of the system.

To cope with the above challenges for future remote IoT applications, we first put forward a C-NOMA AGIN model in this paper, including three segments of UAVs for massive access, HAPS for backhaul, and remote ground terminals. C-NOMA technology has stronger processing power and can achieve a balance between spectrum efficiency and complexity. Then, considering the finite battery of IoT terminals and limited energy storage on UAVs [11-13], we investigate the trajectory plan problem for maximizing system energy efficiency (EE). Solution is provided according to SCA and we obtain the near-optimal UAV trajectory. Simulations demonstrate that our approach is superior to others in terms of EE.

2 System Model

Fig. 1.
figure 1

C-NOMA-enabled AGIN 6G system model.

As shown in Fig. 1, we propose the C-NOMA-enabled AGIN 6G system model, including \(\mathcal{U}=\{{U}_{1}, {U}_{2},\dots ,{U}_{M}\}\) UAVs with flight altitude \({\widehat{h}}_{m}\) for access, which share a channel with bandwidth of \(W\), one HAPS with altitude \({H}_{A}\) for backhaul, and \(\mathcal{D}=\{{D}_{1},{D}_{2},\dots ,{D}_{N}\}\) terminal devices sharing a channel with bandwidth of \(W\). The frequency resource is evenly divided into \(M\) blocks so that each UAV occupies different resource block without interference, and each block is further divided into \(\mathcal{K}=\{{S}_{1}, {S}_{2},\dots ,{S}_{K}\}\) subchannels. UAV serves its associated IoT devices using C-NOMA technology by dividing terminals into clusters. The flight period involves \(\mathcal{T}=\left\{{T}_{1},{T}_{2},\dots ,{T}_{L}\right\}\) time slots, each of which is ∆T, and UAV flies back to the starting point when each cycle ends. Let \({\mathbf{q}}_{m}(t)=({x}_{m}(t),{y}_{m}(t))\), \({\mathbf{v}}_{m}\left(t\right)\triangleq \dot{{\mathbf{q}}_{m}}\left(t\right)\), and \({\mathbf{a}\mathbf{c}}_{m}\left(\mathrm{t}\right)\triangleq \ddot{{\mathbf{q}}_{m}}\left(t\right)\) respectively denote the location, speed and acceleration of the \(m\) th UAV in slot \(t\). The position of the \(n\) th terminal is represented by \({\mathbf{D}}_{n} =({x}_{n},{y}_{n},0)\). Communication between terminal and UAV, UAV and HAPS is dominated by LoS links, and channel fading follows Rice fading [4, 10].

2.1 Communication Model

The distance between UAV \(m\) and user \(n\) in slot \(t\) is.

$${d}_{n,m}\left(t\right)=\sqrt{{{\widehat{h}}_{m}}^{2}+{\Vert {\mathbf{q}}_{m}(t)-{\mathbf{D}}_{n}\Vert }^{2}},\forall n\in \mathcal{D},m\in \mathcal{U},t\in \mathcal{T}.$$
(1)

Then, the channel power gain occupying the \(k\) th subchannel can be expressed as.

$${h}_{n,m,k}(t)=\frac{{G}_{m}{G}_{n}{\beta }_{0}}{{d}_{n,m}^{2}(t)},\forall n\in \mathcal{D},m\in \mathcal{U},k\in \mathcal{S},t\in \mathcal{T},$$
(2)

where \({G}_{m}\) and \({G}_{n}\) are the directional antenna gains of UAV and user, respectively. \({\beta }_{0}\) represents the channel power gain at the reference distance \({d}_{0}=1\mathrm{ m}\). Since HAPS is far from UAVs, the distance can be replaced by \({H}_{A}\). Therefore, the channel gain from UAV \(m\) to HAPS is calculated by.

$${h}_{m}=\frac{{G}_{m}{G}_{2}{\beta }_{0}}{{{H}_{A}}^{2}},\forall m\in \mathcal{U},$$
(3)

where \({G}_{2}\) is the directional antenna gain of HAPS.

Suppose that in the \(t\) th slot, the transmission power from the \(n\) th user to the \(m\) th UAV is \({P}_{n,m,k}(t)\in {\mathbf{P}}^{\mathrm{D}\to \mathrm{U}}\), and the transmission power from the \(m\) th UAV to the HAPS is \({P}_{m}(t){\in \mathbf{P}}^{\mathrm{U}\to \mathrm{H}}\). UAV trajectory, velocity and acceleration are defined as \(\mathbf{Q}=\{{\mathbf{q}}_{m}\left(t\right),\forall m\in \mathcal{U},t\in \mathcal{T}\}\), \(\mathbf{V}=\{{\mathbf{v}}_{m}\left(t\right),\forall m\in \mathcal{U},t\in \mathcal{T}\}\), \(\mathbf{A}\mathbf{C}=\{{\mathbf{a}\mathbf{c}}_{m}\left(\mathrm{t}\right),\forall m\in \mathcal{U},t\in \mathcal{T}\}\), respectively. The channel selection indicator between terminal and UAV is \(\mathbf{A}=\left\{{a}_{n,m,k}\left(t\right),\forall n\in \mathcal{D},m\in \mathcal{U},k\in \mathcal{K},t\in \mathcal{T}\right\}\), i.e., if terminal \(n\) connects with UAV \(m\) through the \(k\) th subchannel in the \(t\) th time slot, \({a}_{n,m,k}\left(t\right)=1\); otherwise \({a}_{n,m,k}\left(t\right)\)=0.

Then, the SINR between UAV \(m\) and terminal \(n\) occupying the \(k\) th subchannel can be calculated by

$${\gamma }_{n,m,k}(t)=\frac{{P}_{n,m,k}(t){h}_{n,m,k}(t)}{{I}_{n,m,k}(t)+{N}_{0}},$$
(4)

where \({N}_{0}\) represents the noise power, \({I}_{n,m,k}\left(t\right)={\sum }_{j=n+1}^{N}{a}_{j,m,k}(t){P}_{j,m,k}(t){h}_{j,m,k}(t)\) represents interference from other users in the same channel. SIC technology can be used to decode signals from terminal devices occupying the same sub-channel.

Likewise, the SINR between UAV \(m\) and HAPS can be calculated by.

$${\gamma }_{m}(t)=\frac{{P}_{m}(t){h}_{m}}{{N}_{0}}.$$
(5)

In the \(t\) th time slot, the uplink rate of user \(n\) to HAPS can be denoted by.

$${R}_{n,m,k}(t)=\frac{W}{\mathit{MK}}{log}_{2}\left(1+\frac{{\gamma }_{n,m,k}(t){\gamma }_{m}(t)}{1+{\gamma }_{n,m,k}(t)+{\gamma }_{m}(t)}\right).$$
(6)

The system capacity can be expressed as.

$$R=\frac{W}{MK}\sum_{t=1}^{L}{\sum }_{k=1}^{K}{\sum }_{m=1}^{M}{\sum }_{\text{n} = 1}^{N}{a}_{n,m,k}(t){\mathit{log}}_{2}\left(1+\frac{{\gamma }_{n,m,k}(t){\gamma }_{m}(t)}{1+{\gamma }_{n,m,k}(t)+{\gamma }_{m}(t)}\right).$$
(7)

2.2 Power Consumption Model

UAV power consumption includes two parts. One is the power associated with communication, including radiation, signal processing, and power generated by other circuits. The other is propulsion power consumption to maintain flight and maneuverability [14]. Fixed-wing UAVs used in this paper normally have no sudden deceleration and do not require engine to generate reverse thrust for the forward movement. Therefore, propulsion power consumption is a function of trajectory \({\mathbf{q}}_{m}(t)\) denoted by

$$P={\sum }_{t=1}^{L}\left({P}_{d}(t)+{P}_{k}(t)\right),$$
(8)

where

\({P}_{d}\left(t\right)={\sum }_{m=1}^{M}\left({c}_{1}{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{3}+\frac{{c}_{2}}{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }\left(1+\frac{{\Vert {\mathbf{a}\mathbf{c}}_{m}\left(t\right)\Vert }^{2}-\frac{{\left({\mathbf{a}\mathbf{c}}_{m}^{T}\left(t\right){\mathbf{v}}_{m}\left(t\right)\right)}^{2}}{{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{2}}}{ {g}^{2}}\right)\right)\) represents the power required for overcoming drag, \(g\) denotes gravity, \({P}_{k}\left(t\right)=\sum_{m=1}^{M}{\Delta k}_{m}/\Delta T\), and \({\Delta k}_{m}=\frac{1}{2}{{m}_{m}}^{^{\prime}}\left({\Vert {\mathbf{v}}_{m}\left(L\right)\Vert }^{2}-{\Vert {\mathbf{v}}_{m}\left(1\right)\Vert }^{2}\right)\) denotes the kinetic energy of flight in a cycle. \({{m}_{m}}^{^{\prime}}\) is quality of UAV \(m\), and \({c}_{1}\), \({c}_{2}\) are two parameters related to the weight of aircraft, air density, etc.

Then, the system EE is denoted by

$$\lambda =\frac{R}{P}$$
(9)

3 Problem Formulation

The objective is to maximize the uplink system EE for remote IoT terminals in AGIN by designing UAV trajectory plan and speed control, formulated as

$$\mathbf{P}0:\genfrac{}{}{0pt}{}{max}{\mathbf{Q},\mathbf{V}}\lambda $$
$$C1:{\sum }_{m=1}^{M}{\sum }_{k=1}^{K}{a}_{n,m,k}\left(t\right){R}_{n,m,k}\left(t\right)\ge {R}_{min}\left(t\right)$$
(10)
$$C2:{\sum }_{k=1}^{K}{\sum }_{m=1}^{M}{\sum }_{n=1}^{N}{a}_{n,m,k}(t){R}_{n,m,k}(t)\le {R}_{max}^{H}(t)$$
(11)
$$C3:{V}_{min}\le \Vert {\mathbf{v}}_{m}\left(t\right)\Vert \le {V}_{max}$$
(12)
$$C4:\Vert {\mathbf{a}\mathbf{c}}_{m}\left(t\right)\Vert \le {ac}_{max}$$
(13)

Constraint (10) guarantees the system QoS. (11) represents the capacity constraint of the system with \({R}_{max}^{H}\), which guarantees that the total data transmitted cannot exceed the system capacity. (12), (13) denotes the speed and acceleration constraint of UAV, respectively.

4 Algorithm Analysis

The UAV trajectory, velocity and acceleration are all continuous variables, making \(\mathbf{P}0\) difficult to be solved directly. Therefore, we propose the algorithm for solving it based on SCA. The algorithm starts from line 3 with iteration index \(w\). Line 4–5 achieves the best trajectory \(({{\mathbf{q}}_{m}^{w}\left(t\right))}^{\boldsymbol{*}}\), speed \({({\mathbf{v}}_{m}^{w}\left(t\right))}^{\boldsymbol{*}}\), and acceleration \({({\mathbf{a}\mathbf{c}}_{m}^{w}\left(t\right))}^{\boldsymbol{*}}\) for the current loop. It continues until the increase of the objective value is below a threshold \(\varepsilon \). Details are shown below.

The denominator’s upper bound of the objective function \(\lambda \) obtained from (8), the lower bound of system EE is

$$\lambda \ge {\lambda }_{lb}=\frac{R}{\sum_{t=1}^{L}\sum_{m=1}^{M}\left({c}_{1}{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{3}+\frac{{\mathrm{c}}_{2}}{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }\left(1+\frac{{\Vert {\mathbf{a}\mathbf{c}}_{m}\left(t\right)\Vert }^{2}}{{\mathrm{g}}^{2}}\right)+{\Delta \mathrm{k}}_{m}/\Delta T\right)}.$$
(14)

We can obtain the sub-optimal result based on first-order and second-order Taylor expansion:

$${\mathbf{v}}_{m}\left(t+1\right)={\mathbf{v}}_{m}\left(t\right)+{\mathbf{a}\mathbf{c}}_{m}\left(t\right)\Delta T$$
(15)
$${\mathbf{q}}_{m}\left(t+1\right)={\mathbf{q}}_{m}\left(t\right)+{\mathbf{v}}_{m}\left(t\right)\Delta T+\frac{1}{2}{\mathbf{a}\mathbf{c}}_{m}\left(t\right){\Delta T}^{2}.$$
(16)

Notice that constraints (10)–(12) are non-convex. Both numerator and denominator of \(\mathbf{P}0\) are non-convex making the objective function non-convex. Therefore, we introduce the relaxation variable \(\{{\tau }_{m}\left(t\right)\}\) and restate \(\mathbf{P}0\) as

$$\mathbf{P}1: \begin{array}{c}max\\ \mathbf{Q},\mathbf{V}\end{array}\frac{R}{\sum_{t=1}^{L}\sum_{m=1}^{M}\left({c}_{1}{\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{3}+\frac{{\mathrm{c}}_{2}}{{\tau }_{m}(t)}+\frac{{{\mathrm{c}}_{2}\Vert {\mathbf{a}\mathbf{c}}_{m}\left(t\right)\Vert }^{2}}{{\mathrm{g}}^{2}{\tau }_{m}(t)}+{\Delta \mathrm{k}}_{m}/\Delta T\right)}$$
$${\text{s.t.}} C1-C4$$
$${ C5:\tau }_{m}(t)\ge {V}_{min}$$
(17)
$${C6:\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{2}\ge {{\tau }_{m}(t)}^{2}.$$
(18)

By this reformulation, the denominator of the objective function is convex for {\({\mathbf{v}}_{m}\left(t\right)\), \({\mathbf{a}\mathbf{c}}_{m}\left(t\right)\), \({\tau }_{m}(t)\)}, but new non-convex constraint (18) is created. To solve this non-convex constraint, we apply a locally convex approximation. With regard to any local point \({\mathbf{v}}_{m}^{r}\left(t\right)\) obtained in the \(r\) th iteration, we have

$${\Vert {\mathbf{v}}_{m}\left(t\right)\Vert }^{2}\ge {\Vert {\mathbf{v}}_{m}^{r}\left(t\right)\Vert }^{2}+2{{(\mathbf{v}}_{m}^{r}\left(t\right))}^{T}({\mathbf{v}}_{m}\left(t\right)-{\mathbf{v}}_{m}^{r}\left(t\right))\triangleq {\psi }_{lb}({\mathbf{v}}_{m}\left(t\right)).$$
(19)

Thus, we define new constraints,

$${\psi }_{lb}\left({\mathbf{v}}_{m}\left(t\right)\right)\ge {{\tau }_{m}}^{2}(t).$$
(20)

Then, for the sake of solving the non-concavity of the numerator, it is transformed into

$$R=\frac{W}{MK}\sum\nolimits_{t=1}^{L}\sum\nolimits_{k=1}^{K}\sum\nolimits_{m=1}^{M}\sum\nolimits_{n=1}^{N}{a}_{n,m,k}(t)\left({\widehat{\gamma }}_{n,m,k}\left(t\right)-{\widehat{\gamma }}_{n,m,k}^{^{\prime}}(t)\right)$$
(21)
$${\widehat{\gamma }}_{n,m,k}\left(t\right)={log}_{2}\left({\Omega }_{n,m,k}(t)+{P}_{n,m,k}{(t)h}_{n,m,k}(t){P}_{m}(t){h}_{m}+{\Theta }_{m}(t)\right)$$
(22)
$${\widehat{\gamma }}_{n,m,k}^{^{\prime}}(t)={log}_{2}\left({\Omega }_{n,m,k}(t)+{\Theta }_{m}(t)\right)$$
(23)

where \({\Theta }_{m}\left(t\right)={N}_{0}^{2}+{N}_{0}{P}_{m}(t){h}_{m}\)

$${\Omega }_{n,m,k}(t)={N}_{0}\sum\nolimits_{j=n+1}^{N}{\rho }_{j,m,k}\left(t\right){h}_{n,m,k}(t)+{N}_{0}{P}_{n,m,k}(t){h}_{n,m,k}(t)\sum\nolimits_{j=n+1}^{N}{\rho }_{j,m,k}\left(t\right){h}_{j,m,k}(t).$$

By using SCA method, the nonconvex problem is transformed into convex optimization problem. Owing to the space limitation, the derivation details are not shown, and similar derivation can be found in [6]. Finally, the numerator is concave, the denominator is convex, and all constraints are convex, making \(\mathbf{P}0\) transformed into a standard convex problem. Thus, using CVX can solve this problem.

figure a

5 Simulation Results

In this section, we make a comparison between the proposed algorithm and random trajectory method (Random) and OMA method using MATLAB. We consider 3 UAVs and 20 terminals randomly distributed in an area of 200 m × 120 m, each NOMA cluster contains two terminals.

Fig. 2.
figure 2

The system EE for different benchmark algorithms.

Figure 2 shows the system EE for different benchmark algorithms. It is shown that as the number of users increases, so does energy efficiency. Moreover, compared with OMA method and random trajectory method, our proposed method is always superior in terms of EE.

Fig. 3.
figure 3

Optimized trajectories for all UAVs.

Figure 3 shows the optimized flight trajectories for three UAVs. It is shown that with the optimized trajectory, the UAV can get close to each terminal device as much as possible, so as to achieve better system EE and provide superior QoS.

6 Conclusion

In this paper, a C-NOMA-based heterogeneous AGIN network is proposed and we formulate the problem of maximizing the system EE by trajectory optimization. We design an algorithm to achieve the maximum system EE. Both theoretical analysis and results show that the proposed method is superior to others in terms of EE.