1 Introduction

The use of wireless broadband services has risen significantly with the deployment of Long Term Evolution (LTE) networks and the generalization of smart phones, tablet computers and other new mobile devices. People make intensive use of these devices when they are on public transport vehicles such as buses, trams, or trains.

According to Ericsson’s mobility report [14], the number of mobile broadband subscriptions grew 10% year-on-year, increasing by 120 million in Q3 2019. The number of smartphone subscriptions is forecast to reach 7.4 billion in 2025, or 83% of all mobile subscriptions.

The quality of service in public transportation remains far from adequate. Vehicles are usually well shielded with coated windows, which leads to a rather high electromagnetic penetration loss between outdoor and in-vehicle. Commonly encountered, the User Equipments (UEs) moving within public transport vehicles are connected to Evolved Node B (eNB) via wireless links, in which the penetration loss critically attenuates the signal quality and downgrades the accomplishable data rate.

Public transportation is vital for the development of major cities, especially in the context of global warming. “Société du Grand Paris” (SGP) will create 200 km of subway lines called “Grand Paris Express”. The metros will connect the main economic and logistical activity zones, including the three Paris airports. As part of the service, SGP would like to give an incomparable customer experience regarding the internet access during their trip to the expected 2 million daily passengers. SGP is thus promoting the emergence of innovations to reach this goal.

Deploying mobile relays with both an outdoor antenna to communicate with the cellular network and an indoor antenna to provide good coverage to passengers is a natural solution to provide a high Quality of Service (QoS). However, the mobile relay is like a remote base station wirelessly connected to the network infrastructure, which should control it. Thus, there are signalling exchanges between the relay and the fixed network. The question is to check that the capacity gain brought by mobile relays is not compromised by additional signalling.

By performing real radio transmissions with a testbed in [22] we proved that a mobile relay architecture can be easily implemented with standard Evolved Packet Core (EPC) and with full compatibility with 3GPP recommendations. We evaluated the performance of this relay architecture by means of simulation in loaded conditions in [21]. In addition to a Voice over Internet Protocol (VoIP) service, we considered a simple client-server protocol. We found that the mobile relay architecture can improve the QoS but we did not simulate real handovers.

Our objective in this paper is to conduct a more in-depth study of mobile relay performance. From a general point of view, the bit rate of mobile networks is limited by the radio interface. With mobile relays, there are two radio interfaces: the access radio interface in the train is the same as for a standard network and can provide high bit rate due to short range transmissions, while the backhaul radio interface is specific to the relay solution. Furthermore, it gathers the links between all terminals and the fixed network infrastructure. The backhaul is thus the bottleneck of the system. Though we consider the whole network, we focus on the backhaul link. In addition to our previous work [21, 22], we study the signaling load on the backhaul, we consider a Web service instead of a simple client-server exchange and we integrate handover.

Our contribution is two-fold:

  • we propose a general analytical model to compute the signaling load generated by terminal disconnection and re-connection due to inactivity, useable both for standard mobile networks and networks with relays,

  • we evaluate the quality improvement provided by mobile relays, with a particular focus on the periods for which this quality is low (i.e. when the metro or the train enters a new cell).

The remainder of this paper is organized as follows: in Sect. 2 we present the state-of-the-art of mobile relay in LTE. In Sect. 3 we depict the architecture and the protocol stack when mobile relays are used. In Sect. 4 we study the impact of signaling in the relay architecture. In Sect. 5 we describe our simulation scenario, including the traffic models, and we present our QoS parameters. In Sect. 6 we present and analyze the simulation results. Section 7 concludes our paper.

2 State of the art on mobile relays

Several studies related to mobile relays for railways have been found in the literature. Since there are many sub fields of research related to mobile relays, we have classified this state-of-the-art into several families based on their common characteristics.

2.1 Mobile relay architectures

The concept of mobile wireless access points was proposed in the early 90’s in the first European projects [2]. However, there was neither real implementation nor deployment for 25 years. There has been a resurgence of interest with the advent of LTE-EPC.

The Third Generation Partnership Project (3GPP) considered several possible architectures for mobile relays in [18]. Some of the suggested solutions compel the modification of several protocols which leads to an eminent standardization effort. Nevertheless, the first alternative called Alt-1 described in [18] is founded on two facts: nodes of the EPC, such as Mobile eNB (eNBm), Mobile Management Entity (MME), just need Internet Protocol (IP) connectivity and, second, IP connectivity can be simply supplied by an LTE/EPC network.

A solution based on mobile relays was studied in [39]. Parameters such as Doppler frequency shift, penetration loss, system capacity planning and mobility management in high-speed railways were analyzed.

In [10] the relay node architectures Alt-1 and Alt-2 are studied. The authors provide a short analysis of Alt-1 and Alt-2 in terms of performance. The authors propose a Proxy Mobile IP (PMIP) mechanism with existing LTE Rel-10 relay architecture, in order to support mobile relay with minimum standardization impacts. In [15] an evaluation of the backhauling relay in LTE is performed. The backhaul is done by an LTE dongle that performs a second GPRS Tunneling Protocol (GTP) encapsulation. The authors claim that the self-backhauling network makes it possible to extend the radio coverage of the conventional LTE network and can improve the throughput of the network.

In [6] the authors study the effect of amplify-and-forward mobile relays on coverage under high mobility scenarios, as well as the loss in throughput for indoor UEs compared to outdoor UEs. They consider two scenarios: with mobile relays and without mobile relays. The performance of mobile relays in terms of coverage is evaluated. They show that the gain of mobile relays attains 17 dB and 12 dB for Reference Signal Received Power (RSRP) and Reference Signal Code Power (RSCP), respectively. On the other hand, throughput exhibits a large gap between outdoor and indoor UEs under the “no mobile relay” scenario.

A carrier aggregation-supported mobile relay for railway LTE-A networks is proposed in [4]. The performance of this scheme is evaluated in terms of average user throughput, packet loss and outage probability. The results presented by the authors show that better performance can be accomplished by applying dual backhaul links with a carrier aggregation scheme.

2.2 QoS and performance studies

The performance of multiuser Multiple-Input–Multiple-Output (MIMO) in base stations and moving relay nodes in LTE is evaluated in [26, 27, 34]. According to the authors, both on-train users and macro cell users get better throughput by deploying MIMO. In [7, 8] a performance study of mobile relays in LTE and LTE-A networks is presented for railway scenarios. The authors investigate the issue of mobile relay capacity improvement for on-board train users and its impact on the overall network performance. The authors show that the addition of mobile relay nodes can improve the overall cell network performance in the railway environment. In [11], stochastic geometry is used to compute the throughput and the energy consumption of a network with mobile relays. However, the analysis is based on snapshots and does not consider the dynamic nature of the traffic.

In [29], the authors present an experimental performance analysis in extreme traffic and density conditions for mobile relays. This article focuses on a load–stress test specifically designed for the two-hop architecture that enables on-board connectivity in high-speed trains. In [23] the authors propose a mechanism to efficiently select the appropriate number of mobile relay nodes and receive antennas in relation to the throughput performance and capital expenses in the train. According to the authors, the use of multiple mobile relay nodes and antennas on the high speed train enables spatial multiplexing gains and reduces the handover failure rate since multiple relay nodes will not perform handover processes at the same time. However, this method seems to be expensive in terms of having multiple relays in trains. There is also no information about the user load in their experiments. In [37] a study focusing on the resource allocation problem for local users (low-mobility users) and the mobile relay in a high-speed train in the same Orthogonal Frequency-Division Multiple Access (OFDMA) system is presented. Under this mobile relay architecture, passengers communicate with base stations via a mobile relay. Then all passengers in the train can be treated as one large user represented by the mobile relay. The authors focus on the problem in two cases where perfect Intercarrier Interference (ICI) cancellation and no ICI cancellation are applied to the mobile relay. A resource allocation for high-speed trains equipped with multiple moving relays in OFDMA systems is studied in [20]. The authors propose a solution to minimize the total transmit power consumption, while serving the passengers by meeting their required throughput. According to the authors, their proposed scheme using moving relays outperforms some previously proposed algorithms, as well as direct transmission. A recent study [38] focuses its research on an asymmetric mobile relay architecture for High-Speed Trains (HST) in 5G networks. In order to avoid high penetration losses of the direct link between the base station and the UEs inside carriages, a mobile relay is deployed at the HST. In the relay architecture the authors propose the use of sub-6 GHz frequency for the BS-relay link and the adoption of a millimeter wave frequency fot the relay-UE link.

2.3 Signaling load with mobile relays

To the best knowledge of the authors, there is no study on the signaling load in wireless networks when a mobile relay is used. More generally, there are only a few publications on signaling load in LTE/EPC. In [31], a mathematical model based on queuing networks is proposed in the case of virtual MME. Though the model could be adapted for standard MME, the reading time of a Web page is assumed to be exponentially distributed. Another close model is defined in [30] to analyze the performance of a dynamic auto-scaling algorithm of control plane resources. The authors assume that a terminal goes back to idle mode at each reading period, which is not true when the reading time is short.

2.4 Handovers in mobile relay nodes

Several studies focus on handover performance on mobile relays. In [28] a study tackles the handover problem within a relay architecture for high-speed rail in LTE-A networks. The authors propose a new handover scheme. Based on the fact that a train travels in fixed trajectories, the proposed measurement procedure can shorten the handover time, according to the authors. In [33] the authors study handover performance based on dual antennas and mobile relay for high speed railways. This work focuses on reducing the handover outage probability.

In [24] another study focuses on minimizing handover failure and link outage probabilities. According to the authors, the proposed mechanism can reduce unnecessary handovers, handover failure, and link outage probability. In [9], a handover authentication mechanism based on trajectory prediction for mobile relays is proposed. Compared with the current 3GPP standards and other related schemes, according to the authors this scheme effectively reduces handover delays and at the same time provides strong security protection.

A handover scheme for a dual-link system architecture is proposed in [36]. This mechanism aims to minimize the communication interruption during handover. The proposed scheme uses two antennas to prevent the communication from being interrupted during handover. The authors analyze their proposed mechanism in terms of handover probability, handover failure probability and communication interruption probability. In [25], the authors seek to find an appropriate power allocation scheme to achieve a trade-off at handover between the transmission rate and the delay. According to the authors, the proposed power allocation scheme can reduce delay at the cost of slightly reduced capacity.

The authors of [40] show that the communication interruption is longer than the time to switch the processes in the physical and the Medium Access Control (MAC) layers due to queuing mechanisms. They propose an improved switch mechanism in order to ensure the continuity of communication.

Although fields such as handover improvement, architecture modifications and allocation algorithms for mobile relays are proposed among the aforementioned studies, none of them has studied in detail the Alt-1 3GPP alternative in terms of signaling and quality of service. To the best of our knowledge, there are no studies related to the effect of this relay architecture on the main services used by passengers, such as Web browsing and VoIP calling.

3 Relay architecture

3.1 Overview of the architecture

Our mobile relay solution is based on 3GPP Alt-1 [17]. We chose this solution because the other ones (namely Alt-2 to Alt-4) need additional software development. Two EPC are used. The first one is called EPCt for track and all nodes are identified with the same suffix (eNBt, MMEt, SGWt, etc.). No user terminal can access this network. The second network is the general network (identified by suffix g, MMEg, SGWg, PGWg), which manages passenger terminals. The relay is made of a UE called Transport Operator terminal Equipment (TOE) that can access the EPCt and an embedded base station called eNBm (suffix m for mobile). It is important to highlight that eNBms have access to the general network through the track network: an eNBm is connected to a TOE and can thereby exchange messages with the Global Mo- bile Management Entity (MMEg) and the Global Serving Gateway (SGWg). In stations, some eNB provide coverage of the platform: they are directly connected to the general network and are thus called eNB connected to the global network (eNBg).

This architecture is \(100\%\) compatible with the 3GPP recommendations and does not require any modification of the protocol stacks or the hardware (Fig. 1). Note that any eNBm have an S1 interface with an SGWg and can have an X2 interface with an eNBg. Messages exchanged on these interfaces are transmitted on the backhaul link (between the TOE and the Track eNB (eNBt).

EPCt is used as a transport network for EPCg. Note that several EPCgs can be connected to the same EPCt and thus use the same shared relay infrastructure. EPCt can be deployed by a specific operator or by a mobile network operator as it is done with Radio Access Network (RAN) sharing.

One important benefit of mobile relay is that mobile management is no longer done on an individual terminal basis, but rather at the relay level. When a terminal is attached to the network through the relay, the terminal is considered as being under the coverage of the mobile eNB and is fixed as long as the user stays in the train. If the train moves to another cell, a single handover command is sent to the TOE but applies for all terminals connected to the mobile eNB.

Fig. 1
figure 1

Mobile relay architecture

3.2 Analysis of the header sizes

As shown in Fig. 2, the counterpart of mobile relays is additional headers. Due to the use of two EPC networks there is an additional level of encapsulation and thus extra-headers (right part of Fig. 2) compared to a standard architecture.

In a standard radio interface, a user IP packet is transported by Packet Data Convergence Protocol (PDCP) and the header size due to PDCP, Radio Link Control (RLC) and MAC is typically 6 bytes. With IPv4, the IP header is 20 bytes. The header size including IP is thus 26 byte. With a relay, the tunnel is set over the radio backhaul. The following headers are thus added to each tunneled packet: 12 bytes for GTP [16] (or 8 bytes if there is no numbering), 8 bytes for User Datagram Protocol (UDP) and 20 bytes for IP. The total header size is thus \(26+40=66\) bytes.

For Radio Resource Control (RRC) signaling messages, the header is also 6 bytes for a standard radio interface. Most signaling messages include a Non-Access Stratum (NAS) part and are thus exchanged between the terminal and the MME. When a mobile relay is used, they are exchanged between the terminal and the MMEg through the backhaul. The additional header is due to the use of IP, Stream Control Transmission Protocol (SCTP) and S1 Application Protocol (S1-AP) protocols. The structure of SCTP messages is complex [35]. We assume that a message includes the standard SCTP header (12 bytes), an acknowledgement chunk (16 bytes) and a data chunk (the header is 16 bytes), which gives a total of 44 bytes. From IP to S1-AP (excluded), the total header size is thus \(6+20+44=70\) bytes.

Fig. 2
figure 2

Protocol stack and different headers on the backhaul

4 Signaling analysis

4.1 Overview of the test-bed

4.1.1 General architecture

The test-bed is built on the Amarisoft solution [5], which is fully compatible with off-the-shelf terminals. Digital signal processing and all the protocol stacks in their entirety are pure software operations running on a standard PC. The high-frequency radio signal is generated and sampled by a radio module. We use several instances of Amarisoft: one on a PC, which acts as the eNBg; a second one with an LTE-dongle and a radio module, which acts as a relay (eNBm and ToE); and a third one, which acts as the eNBt and includes core network functions (MMEt, SGWt, PGWt, MMEg, SGWg and PGWg) and which is connected to the local fixed network and the eNBg (see Fig. 3). An additional computer is used as a controller to have automatic tests.

Fig. 3
figure 3

Implementation of the test-bed

The transmission is made in frequency division duplex mode in the 2.6 GHz band. In order to avoid interference between the access and the backhaul links, two separate carriers are used. The bandwidth for each carrier is 5-MHz to avoid processing overload in the computer.

4.1.2 Tested scenarios

Our focus is on the signaling load. The main part is due to mobility management. With mobile relay, there is only one handover for all terminals when a moving train enters a new cell. The signaling load is thus very low. Furthermore, this type of handover was analyzed in our previous work [22] and we do not consider it in this paper. However, at a station there are multiple mobility scenarios as shown in Fig. 4, which should be studied.

We thus consider the following scenarios:

  • a user with a terminal in idle mode enters a train,

  • a user with a terminal in idle mode leaves the train,

  • a user with an active session (terminal in connected mode) enters a train, known as an on-boarding handover,

  • a user with an active session leaves the train, known as an off-boarding handover,

  • a user can of course stay in the train but this does not in itself generate specific signaling.

The sporadic nature of data flows also generates signaling between an eNB and the core network. We thus consider the following cases:

  • a user does not use his/her terminal for a while and the eNBm triggers an RRC release, putting the terminal back in idle mode,

  • a user uses his/her terminal after a pause and the RRC is resumed, a procedure known as a Service Request.

Fig. 4
figure 4

Studied signaling scenarios

4.2 Size of signaling messages

Table 1 Signaling for on-boarding handover on the S1 (example)

We ran each procedure listed in Sect. 4.1.2 on the testbed. We analyzed the number of messages and the size of each message on the S1-interface between the eNBm and the MMEg or on the X2-interface between the eNBm and the eNBg. Note that we only consider messages transmitted on the backhaul link because it is the interface specific to mobile relaying. Signaling on other interfaces is exactly the same as that in a standard mobile network. An example is shown in Table 1 for on-boarding handover. For each procedure we count the number of S1AP and X2AP messages and the total number of bytes for downlink and uplink. We assume that pure SCTP messages (e.g. SACK) are piggy-backed by application messages. We thus assume that the final SACK shown in Table 1 for a given UE are included in signaling messages concerning other UEs. As explained in Sect.  3.2, the additional header size from the physical layer up to SCTP is 70 bytes. From Table 1, we get a total number of uplink signaling bytes for on-boarding handover of \(334+4\times 70=614\) (and \(1276+4\times 70=1556\) for downlink). The results for all procedures are computed with the same method and are shown in Table 1.

Table 2 Signaling sizes for different scenarios

4.3 Signaling for on-boarding and off-boarding

In this subsection, we evaluate the signaling load when passengers enter or leave a train at a station. As shown in Table 2, there is far less signaling for on and off boarding for a terminal in idle state than in connected state. We thus consider a scenario in which all passengers have a terminal in connected state, which is clearly a worst case scenario. Based on the estimation of SGP, the train stop was assumed to be 45 s. In peak hours, there are up to 1000 passengers in a train. Up to half of the people get off at an important station and the same amount get on. We thus assume 500 off-boarding and 500 on-boarding handovers.

The average signaling bit rate \(S_d\) in kbps on the downlink is thus (from Table 2):

$$\begin{aligned} S_d = 500 \times 8 \times \frac{614+ 1420}{1000 \times 45} = 181 \text { kbps.} \end{aligned}$$
(1)

Similarly, the bit rate \(S_u\) on the uplink is:

$$\begin{aligned} S_u = 500 \times 8 \times \frac{1556 + 534}{1000 \times 45} = 186 \text { kbps.} \end{aligned}$$
(2)

An upper bound of the signaling rate due to on and off boarding is thus 190 kbps both for the uplink and the downlink. This signaling is transmitted when the train is in a station and thus fixed. The link budget can be easily optimized. With a 20-MHz FDD radio interface and a \(2\times 2\) MIMO, the maximum bit rate is 150 Mbps. If we assume that the average capacity is 50% of the maximum bit rate, the amount of signaling is thus \(0.25\%\) of the capacity, which is quite low.

4.4 Activity model of terminals

In this sub-section, we characterize the signaling load due to the sporadic nature of data flows: there are frequent RRC disconnections and reconnections. We first present the analytical model and then use it to compute the signaling load.

4.4.1 Source traffic model

For any positive random variable \(t_{\text {X}}\), let \(f_{\text {X}}(t)\), \(F_{\text {X}}(t)\) and \({\overline{F}}_{\text {X}}(t)\) be its probability density function (PDF), cumulative distribution function (CDF) and complementary CDF (CCDF), respectively. Note that if \(\int _{0}^{\infty } {\overline{F}}_{\text {X}}(t) dt\) exists, then this integral gives the expectation \(E[t_{\text {X}}]\) of \(t_{\text {X}}\).

We consider the model inside a Web browsing session as illustrated in the top part of Fig. 5 (see also Fig. 7). The user downloads Web pages: the time to download is modelled as a random variable \(t_{\text {A}}\) (A stands for activity). The user then reads the page and we similarly define \(t_{\text {R}}\) (R stands for reading). The distributions of \(t_A\) and \(t_R\) can be general. The only assumption we make is that \(t_R\) and \(t_A\) are two independent variables: the value of \(t_A\) has no influence on \(t_R\).

As long as the download is progressing, an RRC connection is maintained. Just after the end of the loading phase, an inactivity timer is launched. Let \(T_{\text {O}}\) be the initial value of the timer. If another Web page is requested before the timer expires, the UE stays in RRC_Connected state. In other cases, a radio resource release procedure is triggered and the terminal switches to state I (for idle). As soon as a new transfer is activated, a Service Request Procedure is executed. An example of this scenario is given in Fig. 5.

A terminal can be in 3 states, namely A, B and I, as shown in Fig. 6. State B corresponds to at most the first \(T_{\text {O}}\) seconds after the end of the page transfer. Note that the terminal is in RRC_connected state for both A and B. We thus define state C as including both states A and B.

Our objective is to compute the probability for a terminal to be in idle state and the average number of service request procedures per time unit. Note that the average number of radio resource release procedures is equal to the average number of service request procedures at the equilibrium.

Fig. 5
figure 5

Illustration of activity and reading times

Fig. 6
figure 6

Activity model with related RRC states

4.4.2 Idle state probability and average service request rate

We first consider a UE that switches to idle mode. When entering state I, \(T_{\text {O}}\) of the reading phase has already been used in state B. Thus, \(t_{\text {I}}=t_{\text {R}}-T_{\text {O}}\). Hence, by using the formula of conditional probabilities

$$\begin{aligned} {\mathbb {P}}(t_{\text {I}}>t) = {\mathbb {P}}[t_{\text {R}}>t+T_{\text {O}}/t_{\text {R}}>T_{\text {O}}] = \frac{{\overline{F}}_{\text {R}}(t+T_{\text {O}})}{{\overline{F}}_{\text {R}}(T_{\text {O}})}. \end{aligned}$$
(3)

For any positive random variable, the mean is the integral of the CCDF if this integral converges. The mean idle time is thus:

$$\begin{aligned} E[t_{\text {I}}] = \int _{0}^{\infty } {\mathbb {P}}(t_{\text {I}}>t) dt = \frac{ \int _{T_{\text {O}}}^{\infty } {\overline{F}}_{\text {R}}(t) dt}{{\overline{F}}_{\text {R}}(T_{\text {O}})}. \end{aligned}$$
(4)

We now want to determine \(E[t_{\text {C}}]\). A UE can stay in state C while switching an unlimited number of times from state A to state B and reciprocally. The finite sate machine that corresponds to states A, B and C and the possible transitions is a memoryless process. We can use renewal theory [12]: let consider a terminal that was in idle state and for which a transmission phase starts. The initial state is A. The expected remaining time is \(E[t_{\text {C}}]\). The UE makes at least one transmission during \(E[t_{\text {A}}]\) and then

  • with probability \({\mathbb {P}}[t_{\text {R}} \ge T_{\text {O}}]={\overline{F}}_{\text {R}}(T_{\text {O}})\), the UE stays in state B during \(T_{\text {O}}\) and then switches to idle state,

  • with probability \({\mathbb {P}}[t_{\text {R}}<T_{\text {O}}]=F_{\text {R}}(T_{\text {O}})\), it stays in addition in state B during \(E[t_{\text {R}}/t_{\text {R}}<T_{\text {O}}]\) and then switches again to state A. Thus, the expected remaining time is \(E[t_{\text {C}}]\).

From the above, we get:

$$\begin{aligned} E[t_{\text {C}}]= & {} E[t_{\text {A}}] + {\overline{F}}_{\text {R}}(T_{\text {O}}) T_{\text {O}} + F_{\text {R}}(T_{\text {O}})\nonumber \\&\times \left( E[t_{\text {R}}/t_{\text {R}}<T_{\text {O}}] + E[t_{\text {C}}]\right) . \end{aligned}$$
(5)

By definition, \(E[t_{\text {R}}/t_{\text {R}}<T_{\text {O}}] = {\int _{0}^{T_{\text {O}}} t f_{\text {R}}(t) dt }/{F_{\text {R}}(T_{\text {O}})}\). By integration by parts, we get:

$$\begin{aligned} E[t_{\text {R}}/t_{\text {R}}<T_{\text {O}}] = \frac{-{\overline{F}}_{\text {R}}(T_{\text {O}})T_{\text {O}} + \int _{0}^{T_{\text {O}}} {\overline{F}}_{\text {R}}(t) dt }{F_{\text {R}}(T_{\text {O}})}. \end{aligned}$$
(6)

Combining (5) and (6) and after some elementary computational steps, we get

$$\begin{aligned} E[t_{\text {C}}] = \frac{E[t_{\text {A}}] + \int _{0}^{T_{\text {O}}}{{\overline{F}}_{\text {R}}(t) dt}}{{\overline{F}}_{\text {R}}(T_{\text {O}}) }. \end{aligned}$$
(7)

When we consider states I and C, we have a simple 2-state process where the terminal is alternately in one of the two states. The average number of transitions \(\lambda _t\) per unit time from RRC_idle state to RRC_connected state is thus \(\lambda _t = {1}/({E[t_{\text {C}}]+E[t_{\text {I}}]})\). From (4) and (7), we deduce

$$\begin{aligned} {\lambda _t = \frac{{\overline{F}}_{\text {R}}(T_{\text {O}})}{E[t_{\text {A}}]+E[t_{\text {R}}]}}. \end{aligned}$$
(8)

The probability \(\Pi _{\text {I}}\) of being in state I is \({E[t_{\text {I}}]}/({E[t_{\text {C}}]+E[t_{\text {I}}]})\). We get

$$\begin{aligned} {\Pi _{\text {I}} = \frac{ \int _{T_{\text {O}}}^{\infty }{{\overline{F}}_{\text {R}}(t) dt}}{E[t_{\text {A}}]+E[t_{\text {R}}]}}. \end{aligned}$$
(9)

The authors of [32] carried out a very large measurement study but on fixed computers. They found that reading time follows a log-normal distribution. The average reading time is 39.7 s and the standard deviation is 324.92 s. This corresponds to \(\mu =1.5717\) and \(\sigma = 2.0541\) and not to \(\mu =-0.495204\) and \(\sigma = 2.7731\) as indicated in [32]. From Sect. 6.1, we get \(E[t_{\text {A}}]=10\) s. This value was obtained for a moderate number of passengers (100). If there are more passengers, \(E[t_{\text {A}}]\) is larger and the number of disconnections/reconnections per unit time is smaller. Hence, regarding the signaling load, we consider the worst case.

We consider a range from 0 to 10 s for radio inactivity timer \(T_{\text {O}}\). Using (8) and (9), we compute the service request rate and the probability for a terminal to be in idle state, respectively, as shown in Table 3.

Table 3 Service request rate and idle-state probability for different timer values when the reading time is a log-normal random variable with mean 39.7 s and \(E[t_{\text {A}}]=10\) s

A commonly agreed value for \(T_{\text {O}}\) is 5 s. In that case, \(\lambda _{\text {C}}= 1/101\) service requests per second for a given terminal. For each connection request, there is a corresponding resource release procedure. For a train with 1000 passengers (all using their smart phones), there are \(2\times 1000/101=19.8\) signaling requests per second. The corresponding signaling bit rate is deduced from Table 2. We get \(1000\times 8\times (184+237)/101=33.4\) kbps on the downlink and \(1000\times 8\times (91+232)/101=25.6\) kbps on the uplink. Considering a a 20-MHz system and a 75-Mbps capacity, this represents less than 0.05% of the bit-rate.

To conclude, if we consider both user mobility (see 4.3) and the sporadic nature of the data flows (see 4.4), the signaling load represents less than 0.3 % of the backhaul capacity. It is thus quite low and does not have a major impact on the performance of the system. We thus neglect it in Sect. 5.

Fig. 7
figure 7

Web service model

Table 4 Parameters of the Web model

5 Performance analysis in loaded conditions

In this section, we analyze and test the achievable benefits brought by the deployment of mobile relays in loaded conditions. Our benchmark is a network without any relay, which we call direct mode. We consider two representative services, namely Web browsing and telephony.

5.1 Transmitted services

5.1.1 Web browsing

We consider a Web browsing service as shown in Fig. 7. The UE sends a request for a Web page to a server. The page generated by the server consists of one main object (which can be seen as files) and a random number of inline objects, which can be texts, photos or videos. Once the reading time is elapsed, a new request is sent to the server by the UE. The application protocol is HyperText Transfer Protocol (HTTP) over Transmission Control Protocol (TCP).

The Web browsing model is the one defined in [32]. The parameters we used for the simulation are given in Table 4 and are similar to the ones in table II of [32]. Note that the authors do not base the fitting between the measurements and the parameters of the r.v. on the mean and the standard deviation. We thus recompute the parameters to get exactly the same means and the standard deviations as in [32]. The average reading time is 39.7 s.

5.1.2 Voice calling service

In 4G networks, voice calling is based on VoIP. End users are generally sensitive to the one-way delay and the packet loss rate. In order to avoid any problems due to possible echoes and to keep an excellent interactivity between the two parties, a maximum of 150 ms one-way latency is recommended by the ITU-T G.114 [3].

Since this includes the entire voice path, part of which may be on the public Internet, the LTE/EPC network should have a transit latency down from 150 ms to guarantee a decent QoS. The GSM Enhanced Full Rate (EFR) is one of the most representative speech codecs [19]. A 244-bit data block is sent every 20 ms, this gives a rate of 12.2 kbps. There is generally a Cyclic Redundancy Check (CRC) that is added.

We simulate the aforementioned EFR codec or equivalently an AMR-WB codec in 12.65-kpbs mode. The source traffic generates a 32-byte payload every 20-ms. Voice is transmitted in Real-time Transport Protocol (RTP) packets. No voice activity detection is simulated. Therefore, the flow is constant bit rate. We consider the same header sizes as in Sect. 3.2: the total header size for PDCP, RLC and MAC is 6 bytes and for GTP, UDP and IP is 40 bytes. The RTP header is 12 bytes. The headers on each interface are shown in Fig. 8. We assume the calls are set up prior to the simulation and did not consider any call arrival and call release. No voice-call signaling is simulated.

Fig. 8
figure 8

VoIP packet headers in relay architecture

5.2 QoS parameters

For Web browsing, the main QoS indicator is the Web page load time but we also consider server response time and the throughput. All indicators are defined below. For VoIP, we consider end-to-end latency, jitter and packet loss ratio (with the standard definitions).

5.2.1 Server response time

This indicator is computed for each request u as

$$\begin{aligned} \eta _u = t_{\text {fp},u} - t_{\text {rq},u} \end{aligned}$$
(10)

where, \(t_{\text {fp},u}\) is the time stamp of the first packet of the main object received by the UE and \(t_{\text {rq},u}\) is the time stamp of the request sent by the UE to the server (see Fig. 7).

5.2.2 Page reception time

This indicator is computed as

$$\begin{aligned} \xi _u = t_{\text {lp},u} - t_{\text {fp},u} \end{aligned}$$
(11)

where \(t_{\text {lp},u}\) is the time stamp of the last packet received by the UE.

5.2.3 Web page load time

The Web page load time is merely the sum of the server response time and the page reception time:

$$\begin{aligned} \tau _u = \eta _u + \xi _u \end{aligned}$$
(12)

5.2.4 Throughput

This parameter measures the throughput per request. Its value is obtained when downloading a Web page. This parameter is computed as:

$$\begin{aligned} \alpha _u = \frac{B_u}{t_{\text {lp},u} - t_{\text {fp},u}} \end{aligned}$$
(13)

where \(B_u\) is the number of bytes received after request u.

5.2.5 Throughput in worst conditions

It is well known that user perception is mainly influenced by the lowest quality they experience. The worst case is represented by the transition from one cell to another because the received signal is weak and also because the short interruption due to handover may induce a reduction of the flow by TCP. We do not measure the handover duration because it is not significant in itself. However, we consider the throughput only for page transmissions that experience a handover. Let \(t_{\text {H0}}\) be the time at which a handover is made. We compute \(\alpha _u\) with (13) only if \(t_{\text {fp},u} \le t_{\text {H0}}\) and \(t_{\text {lp},u} \ge t_{\text {H0}}\).

5.3 Simulation environment

To perform our tests we used the network simulator version 3 (ns-3) [1]. The ns-3 LTE module simulates the LTE model and the EPC model. The LTE model includes the LTE Radio Protocol stack (RRC, PDCP, RLC, MAC, PHY) and the EPC model includes core network interfaces, protocols and entities. These entities and protocols reside within the SGW, PGW and MME nodes, and partially within the eNB nodes. We use the standard ns-3 LTE model with direct transmission between UEs in the train and the network as the reference mode.

We implemented the mobile relay architecture module in ns-3 based on the classic LTE module (see Fig. 1). The encapsulation and decapsulation mechanisms are entirely simulated. We tested the performance of the mobile relay architecture and evaluated it against a direct transmission scenario.

In our scenario a train moves in a straight line at a constant speed of 100 kmph. Base stations (eNBt) are uniformly deployed along the track and have a 600-m radius. The inter-site distance is thus 1200 m. X2-based handovers are activated only for eNbts. Terminals are fixed within the train. We do not simulate any stops at stations. This is necessary to have enough Web service requests in the same conditions and to guarantee the statistical significance of the results.

The eNbt and the eNbm transmit on different frequencies. Therefore, inter-cell interference is avoided. At each simulation the train goes through 20 eNBts.

Three in-train loads are considered: 20, 60 and 100 mobile users. 60% of the passengers browse the Web and the remaining 40% carry out VoIP communications. The simulation parameters are described in detail in Table 5.

Table 5 Simulation parameters

6 Numerical results

In order to analyse the obtained results, several figures represent the statistical distribution of values by using boxplots with the upper quartile, median and lower quartile. The points which fall more than 1.5 times the interquartile range above the third quartile or below the first quartile are known as outliers. The whiskers depict the maximum and minimum values excluding outliers

6.1 Web page load time

Figure 9 depicts the effect of the mobile relay in terms of page load time when the train contains 12, 36 and 60 users Web browsing (within a total of 20, 60 and 100 users). For 12 users the median is less than 1 s for both relay and direct modes. For 36 users the median is reduced from 10 to 5 s and is thus divided by 2. The maximum time is also divided by 2. For 60 users, the mean load time is drastically reduced from 20 to 7 s. The max value is reduced from 248 to 81 s. Users are generally very sensitive to such a reduction of the load time. In all cases, the maximum load time is always smaller in relay mode. An interesting fact to highlight is that the values of 60 users in relay mode are smaller than the direct mode with 36 users.

Fig. 9
figure 9

Boxplot of Web page load time (log scale)

6.2 Throughput

Figure 10a shows the reached throughput for Web browsing. When the train contains 12 users, direct mode shows better performance than relay mode (e.g. 3.1 Mbps vs 2.9 Mbps for median values). Nevertheless, the final user does not experience a QoS degradation since there are enough resources to serve all users. On the other hand, relay mode provides higher minimal throughput than direct mode (e.g. 0.4 Mbps vs 0.2 Mbps). When the train contains 36 users, relay mode shows better performance than direct mode (e.g. 3.1 Mbps vs 6.5 Mbps for median values). The maximum and minimum throughput values are also higher in relay mode.

When the train contains 60 users, the relay mode shows better performance than direct mode (e.g. 4.2 Mbps vs 2.1 Mbps for median values). The maximum and minimum throughput values are also higher in relay mode; however, several outliers show higher throughput in direct mode. Those outliers are due to transmissions performed when the train is very close to the eNBt.

Figure 10b depicts the throughput only for users that started the transmission before the handover and finished the transmission after the handover. Results show that users in relay mode reach a throughput three times as high as those in direct mode, e.g. 0.7 Mbps vs 2.41 Mbps, 0.21 Mbps vs 0.64 Mbps and 0.15 Mbps vs 0.45 Mbps (for median values) for 12, 36 and 60 users, respectively. These results confirm the benefit of deploying a mobile relay on the QoS of sessions that experience a handover (see Sect. 5.2.5).

Fig. 10
figure 10

Boxplots of throughput for Web traffic

6.3 Server response time

In order to have a better understanding of the behaviour of the system, we show the cdf of the server response time in Fig. 11. Due to the additional hop, the minimum response time is larger when a relay is included. The last-decile response time is also larger with a delay at low load as shown in Table 6. However, at high load (60 UEs), the response time is divided by 3. The relay mode is less influenced by the load. In direct mode, the server response time can reach 3 s (last centile) though it is limited to 74 ms in relay mode.

Fig. 11
figure 11

CDF of the http server response time (in log scale)

6.4 Web pages loaded

Figure 12 shows the average number of Web pages loaded in both relay and direct mode. Since the page load time is shorter in relay mode, the cycle period (see Fig. 11) is decreased for the same reading time. Users can make a larger number of requests (1021 vs 700 for 60 users). This reflects the fact that relay mode stimulates the use of mobile services. Our simulation results show an increase of 45%.

6.5 Latency of VoIP flows

Figure 13 shows the distribution of latency. Since packets have more hops in relay architecture, it is normal to have higher latency than in direct mode. However, the latency difference between the relay and the direct mode is reduced as the number of users increases. The difference in median latency is 5 ms for 8 users, 3 ms for 24 users and 1 ms for 40 users.

Since latency is lower down 40 ms, we can assume that a good voice calling interaction is guaranteed in view of the fact that the latency threshold is 150 ms. Furthermore, a simple Proportional Fairness (PF) scheduler is used. PF does not give any priority to VoIP packets. By using a dedicated bearer for VoIP packets on the backhaul, it is possible to grant a higher priority to VoIP packets and to limit the latency rise.

6.6 Jitter of VoIP flows

Figure 14 shows the jitter distribution. Relay mode demonstrates lower jitter. When the train has few users, there is not a big difference in jitter. Nevertheless, for 24 users, jitter is reduced from 16 to 13 ms and for 40 users, jitter is reduced from 27 to 24 ms in relay mode.

Table 6 Last decile and last centile of the http server response time (in ms)
Fig. 12
figure 12

Number of Web pages loaded

6.7 Packet loss ratio of VoIP flows

Figure 15 shows the packet loss ratio. In all cases, relay mode shows no packet loss (RLC mode is AM). On the other hand, direct mode shows a packet loss lower than 1% for 8 and 24 users, and up to 4% when the train is carrying 40 users.

For VoIP flow, both the packet loss and the jitter are reduced because the link budget is much better with a relay. In direct mode, the packet erasure rate for an individual transmission is not negligible. The retransmissions induce variable delays and thus jitter. In some cases, the packet cannot be decoded even after the maximum number of transmissions, which gives packet losses.

Fig. 13
figure 13

Boxplot of latency of VoIP flows

Fig. 14
figure 14

Boxplot of jitter of VoIP flows

Fig. 15
figure 15

Boxplot of packet loss ratio of VoIP flows

6.8 Short analysis of the handovers

Although this work does not focus on handover mechanisms in mobile relays, we noticed an important fact to highlight in addition to the presented results. Since in-train users are moving in the same direction, the direct connection between the eNBt and UEs leads to an increased handover failure rate. A Random Access (RA) procedure is performed at the beginning of the handover execution phase. During the preparation phase, the target cell allocates a unique RA preamble. This uniqueness guarantees a collision-free access. The preamble is released once the RA procedure is finished in the target cell. Each cell has 64 preambles available, but only a limited part is reserved for this contention-less RA. In the case of preamble starvation, handovers are queued.

When the cell is highly loaded (i.e more than 60 users), this causes a connection drop in the worst case (long queuing) and QoS degradation due to a delay increase in the best case. In the direct-mode simulation, we increase the number of preambles for contention-less access to avoid drops and to compare relay mode with the best possible direct mode.

In relay mode since all the users are grouped, the eNBt performs the handover only for the TOE (seen as only one user). This means that there will be only one preamble used per train when performing the handover, no matter the number of UEs in the train.

7 Conclusion

This paper evaluates the performance of a mobile relay architecture for public transportation systems. We base our evaluation on a testbed with real radio transmissions and consider both the signaling issue and the throughput experienced by users. A number of conclusions can be drawn.

The additional overhead due to multiple encapsulations has no significant impact on performance, both for data and for signaling. Signaling uses a negligible part of the bandwidth. As the bit rate is dramatically increased due to a much better link budget, deploying mobile relays is a very efficient way to increase the QoS in public transportation.

The page load time is reduced, especially in high load conditions, compared to a direct transmission. The server response time is also shorter despite the fact that there is an additional hop. We show that these improvements lead to an increase in the number of pages visited and are thus prone to stimulate the usage of mobile services. VoIP also takes advantage of mobile relays, especially regarding packet losses.

The relaying architecture proposed in this article is based on tunneling, which is reused in 5G networks without major modification of GTP in the user plane. Furthermore 4G and 5G technologies can be mixed: 5G base stations can be connected to a 4G core network, and an eNB can be connected to a 5G core network. The next step of our work is to study how the new features of 5G can be used with mobile relays to provide an unprecedented user experience for passengers on public transportation.