1 Introduction

The provisioning of location-based services [1] for users of mobile and mostly wireless devices requires proper knowledge about their current geographical positions. Because most mobile devices, such as notebooks, personal digital assistants, or cellular phones, are unable to determine their positions on their own, e.g., by means of a global positioning system, they rely on the support of the network for this task.

Several proposals have been made in recent years for the determination of device positions. These can be classified in range-based and range-free approaches (see [2]). For the range-based mechanisms, the distance (trilateration) or the angle (triangulation) between a mobile device and at least three of its currently available base stations is required. Based on this information and a precise knowledge of the positions of the stations, the location of the mobile device can be computed. However, determining such distances or angles can be rather difficult, and the inaccuracy might be quite high. Additionally, the realization of range-based mechanisms often requires costly enhancements of the mobile devices or base stations, like special location measurement units [3].

As long as no additional mechanisms are being deployed, the determination of the positions can be achieved by so-called range-free approaches. Statements about a position can be obtained via information about the base stations that can communicate with a mobile device at its current position, without relying on estimates of absolute distances or angles.

Range-free approaches use different algorithms and information about the base stations to give an estimate for the current location of a mobile device. A convenient mechanism uses approximations of base station coverage areas. These areas can be acquired by the mobile devices and provided by respective databases [4]. In other cases, the carriers of a wireless network possess detailed descriptions of the deployed network equipment being used for network planning [5], which can also be used to obtain estimates of the coverage areas. If a mobile device finds itself in the overlapping coverage areas of several base stations at once, the intersection of these areas will be geometrically computed, and the possible positions of the device can be limited to this intersection area.

Computing the intersection of base station coverage areas can be a complex task. Thus, it is desirable to have estimates about the potential refinement of the location information before performing the computation. Because we are interested in the uncertainty of location information, i.e., the entropy of the random variable associated with the exact position of a device, we will use the term information gain for the reduction of this entropy. In the sequel, we focus on analytical investigations of the information gain. Different approaches to the computation of the information gain are investigated, based on stochastic geometry and a Monte-Carlo method.

The rest of the paper is organized as follows: related work is presented in the next section. In Section 3, a model for the considered system is described. Then, in Section 4, the notion of information gain that will be used in the sequel is presented. In Section 5, several approaches for the determination of the information gain are displayed. Some significant numerical results are presented in Section 6, and the last section concludes the paper by giving an outlook into the next steps of our ongoing work.

2 Related work

If the information of a single station is used for range-free localization, the accuracy depends critically on the cell size. For certain technologies with small transmission ranges, like Bluetooth, it might be sufficient. However, other radio cells can cover large parts of a geographical territory, and using a single base station to localize a mobile device might lead to a very coarse position information.

The accuracy can be significantly enhanced by using the information from several stations. In this section, the basics of the existing range-free approaches and the mechanisms for the computation of the device positions will be presented. Some of these have originally been proposed in the context of wireless sensor or ad-hoc networks, but the algorithms are almost independent of the underlying technologies and can also be deployed in other wireless networks.

Centroid computation The authors of [6] propose a mechanism that relies on the deployment of location-aware nodes in a wireless sensor network. If another node wants to determine its whereabout, it will listen to beacon signals of all the location-aware nodes that can be received at its current position. The device is assumed to be located at the centroid of the positions of these nodes. However, computing such centroids yields no information regarding the accuracy of this positioning method. Figure 1 shows the deployment of the mechanism in a wireless network.

Fig. 1
figure 1

Using the centroids of location-aware nodes (e.g., base stations) for localization

Intersection area computation Using the intersection area of different coverage areas has been proposed in [7]. The authors present a distributed algorithm for the localization of nodes in ad hoc networks. Some of the nodes are assumed to be location-aware, other nodes determine their positions by computing the intersection of the coverage areas of the location-aware nodes. For computational simplicity, the algorithm uses a discretization of positions and rectangular coverage areas (see Fig. 2). In [8], an analytical investigation of the intersection protocol is presented, assuming circular coverage areas of the nodes (Fig. 3). Additionally, the authors present and analyze an enhancement of the protocol that achieves the same accuracy but halves the number of required location-aware nodes.

Fig. 2
figure 2

Simplified computation of an intersection area for localization

Fig. 3
figure 3

Intersection area based approach, using circular coverage areas

In [9], a cell-ID-based location-sensing method for wireless networks is proposed, using the intersection area of different base stations for localization. In this paper, the positioning accuracy of the mechanism is computed assuming the base stations are aligned in a hexagonal or mesh structure. Additionally, the optimal accuracy is computed when the transmission power of the stations can be modified for localization purposes. The authors use circular coverage areas of the base stations for their computation. All of the presented approaches for location determination assume circular coverage areas of the nodes or stations being used for the localization, without taking the interference between their radio signals into account.

Approximate point in triangulation test The authors of [2] present a localization algorithm that partitions the geographical territory into triangular regions with location-aware nodes as vertices. These nodes send beacons with their positions to other nodes whose positions should be determined. For localization purposes, a node uses the positions in the beacons that it can currently receive, determines the resulting triangles, and checks whether its current position lies inside the triangles. The intersection region of all triangles returning a “true” result is then computed, and the node is assumed to be located at the center of gravity of this region (see Fig. 4). A perfect test as to whether the current position lies within a given triangle could hardly be realized. Thus, the protocol relies on approximations of the test results.

Fig. 4
figure 4

Using the (approximate) point in triangulation test for localization

From a mathematical standpoint, previous work has been achieved most noticeably in [10]. This paper contains, amongst other results, a first mathematical study of intersections \(\left( C^A\cap C^B\right)\), where C A and C B are the cells covering a fixed reference point in two independent and stationary Poisson–Voronoi tessellations. However, statements regarding the probabilistic distribution of \(\left( C^A\cap C^B\right) \) are essentially limited to a computation of the mean value of its area.

3 System model

We consider a large geographical territory T having an area size |T|. The territory is covered by a wireless network. The network has a set of base stations S, and the members of the set are denoted by \(\mathit{s}\).

Coverage model To estimate the base station coverage areas, we use a simplified model (see [11]), which has been widely deployed for information theoretical analyses, e.g., in [12]. The pathloss, i.e., the weakening of the received signal power P RX between a base station located at position X i and a mobile device at position U during the transmission, is computed via

$$P_{RX}=P_{TX}\cdot\frac{K}{|U-X_i|^\alpha}$$
(1)

P TX denoting the power transmitted by the base station. The constant K and distance power gradient α approximate the attenuation characteristics of the considered environment (see [13]). As long as the strength of the received signal and the ratio between the signal and some received noise (SNR) remain above a certain threshold, a communication with the station is possible and the device is in its coverage area.

Assuming that the signals of the different base stations do not interfere at the receiver, the SNR can be computed via

$$\mathrm{SNR}=\frac{P_{RX}}{P_N}$$
(2)

P N denoting the equivalent power of the noise.

Network model In real scenarios, base stations can sometimes only be installed at a limited number of suitable locations, e.g., on certain rooftops. Such restrictions lead to spatial variability for the locations and density of the base stations.

Following [14], we model the positions of the base stations as realizations of a stationary Poisson point process to take this spatial variability into account. Accordingly, letting λ denote the intensity of the base stations process (mean number of base stations per square kilometer), the probability for the total number of base stations |S| in the territory to be n is given by

$$p(n)=\mathbb{P}[|S|=n]=\frac{e^{-\lambda^{}|{T}|} (\lambda^{}|{T}|)^n}{n!}$$
(3)

and, conditionally to |S| = n, the n positions of the base stations are distributed independently and uniformly over the territory T.

The set of base stations S consists of several subsets of stations S j  ⊂ S. All stations whose radio transmissions heavily interfere with each other belong to the same subset j (e.g., cochannel interference of base stations using the same frequency in IEEE 802.11 WLAN). For stations of different subsets, we assume that the interference between them is negligibly small due to respective characteristics of the filters in the transceivers.

The subsets of the base stations are modeled according to the entire set S. The spatial variability of the base stations of a subset is modeled by a Poisson point process with density λ j , where ∑  j λ j  = λ.

The territory under consideration is populated by a set of mobile devices M. If required, a certain device m ∈ M can determine the ID of the base station in whose coverage area it currently resides. For the interfering base stations of a single subset, we assume that only the station with best SNR can be determined.

If the parameters of all base stations are equal, the last assumption leads to a determination of the station of each subset being closest to the current position of a mobile device. The partitioning of the geographical territory by the coverage areas of the stations leads to a Voronoi tessellation [15] of the territory with the station positions as nuclei (see Fig. 5). Considering the model for the placement of the stations, such partitioning of the territory T is also called a Poisson-Voronoi tessellation.

Fig. 5
figure 5

Coverage areas of two different base station subsets using a Poisson–Voronoi model

4 The notion of information gain

Taking only the information of a first base station s A into account (“The user of device m finds himself in cell C A attached to base station s A ) allows one to consider that the exact position of the user is a random variable U that is uniformly distributed over the cell C A. The entropy of such a random variable is simply given as the logarithm of the total surface |C A| of the corresponding cell (see [16]).

Now, when one is also taking the information of a second base station s B into account (“The user of device m finds himself in cell C B attached to base station s B ), the exact position of the user becomes a random variable that is uniformly distributed over \(\left( C^A\cap C^B\right)\). Here again, the entropy attached to such a uniformly distributed random variable is simply the logarithm of the corresponding total surface, i.e., log(|C A ∩ C B|). This means that the information gain G obtained by taking both informations into account simultaneously (instead of just the first one) may be quantified as

$$G=\log(|C^A|)-\log(|C^A\cap C^B|)$$
(4)

Hence, C A and C B denoting the cells covering a given reference point in each network, one is particularly interested in the distribution of the random variable \(\log\left(\frac{|C^A|}{|C^A\cap C^B|}\right)\).

5 Computing the information gain

In this section, we present several approaches to the computation of the information gain, discussing first the use of random tessellations and then the implementation of Monte-Carlo methods.

In the sequel, we will be assuming that the networks under consideration are spatially stationary and that user localizations are being performed a large number of times (N times), so that if the user of device m k be reported to find himself in cell \(C_{m_k}^A\), as well as in cell \(C_{m_k}^B\), then

$$\frac{1}{N}\sum_{k=1}^N\log\!\left(\!\frac{|C_{m_k}^A|}{|C_{m_k}^A\cap C_{m_k}^B|}\!\right) \!\!\approx\mathbb{E}\left[\!\log\left(\!\frac{|C^A(O)|}{|C^A(O)\cap C^B(O)|}\!\right) \!\right],$$

where C A(O) stands for the cell in the first network that happens to cover an arbitrary fixed point, taken to be the origin O of the plane, and C B(O) stands for the corresponding cell in the second network. The above approximation uses the law of large numbers (the user locations are assumed to be independent and uniformly distributed over the territory) together with Wiener’s ergodic theorem, which holds true for any stationary random tessellation of the plane satisfying a mixing condition (see the Appendix for more precise statements and further references). In any of the cellular network models we shall be considering, be it the model described in Section 3 or a simplified Poisson–Voronoi tessellation model, or even some simpler model of the network at hand, we are thus using the fact that the cellular network may be viewed as a realization of a stationary random tessellation satisfying a mixing property, as well as the fact that the true locations of the successive users are mutually independent and uniformly distributed over the territory.

At any rate, this leaves us with the basic task of computing mean values such as \(\mathbb{E}\left[ \log(|C^A(O)|)\right]\) or \(\mathbb{E}\left[ \log(|C^A(O)\cap C^B(O)|)\right]\). As we shall see, there are several approaches to these computations. The choice of a relevant approach depends in an essential way on the model chosen for the cellular network.

5.1 Using a Poisson–Voronoi tessellation

Assume both cellular networks are being modelled as Poisson–Voronoi tessellations of the plane: the base stations of the first (resp. second) network are thus spread upon the territory according to a Poisson point process \(\left\{ X^A_i\right\}\) of intensity \(\lambda^{A} \big(\text{resp.} \big\{ X^B_j\big\}\text{of intensity}\) \(\lambda^{B}\big)\), and a device m located at the position U belongs to the cell \(C^A_i\) attached to the ith base station in the first network whenever its distance to \(X^A_i\) is smaller than its distance to any other base station in the first network (see Fig. 5). The cell \(C^B_j\) attached to the jth base station in the second network is defined similarly. For a typical realization of the Poisson random clouds \(\left\{ X^A_i\right\}\) and \(\big\{ X^B_j\big\}\), there is precisely one cell C A(O) in the first network covering the origin O in the plane, as well as one cell C B(O) covering the origin in the second network. Having in mind a computation of the stochastic mean values mentioned above, one would like to get a hold of the distribution functions attached to the random variables |C A(O)| and |C A(O) ∩ C B(O)|. Interestingly, some recent works [17] have shown that an explicit computation of the distribution function attached to the area |C A(O)| is possible. To give a brief description of this distribution function, let us present another kind of random cell commonly considered in such a context: the Palm cell \(\mathcal{C}^A\), constructed from a typical realization of the Poisson cloud \(\left\{ X^A_i\right\}\), may be defined to be the set of all points lying closer to O than to any other point in the cloud (so it is simply the new cell obtained by adding the point O to the original cloud). The construction shows that \(\mathcal{C}^A\) is smaller than the cell C A(O) considered precedingly. However, as a consequence of Campbell’s formula (see [18]), each of the main geometric characteristics of the random cells C A(O) and \(\mathcal{C}^A\) (area, perimeter, number of sides) has a distribution that may be seen to be closely related to each other via an “area bias.” For example, the first moment of the area |C A(O)| may be obtained by multiplying the second moment of \(|\mathcal{C}^A|\) through the intensity λ A:

$$\mathbb{E}\left[ |C^A(O)|\right]=\lambda^{A}\mathbb{E}\left[ |\mathcal{C}^A|^2\right] =\frac{\mathbb{E}\left[ |\mathcal{C}^A|^2\right]}{\mathbb{E}\left[ |\mathcal{C}^A|\right]}.$$

More generally, for moments of higher order, one has

$$\mathbb{E}\left[ |C^A(O)|^k\right]=\lambda^{A}\mathbb{E}\left[ |\mathcal{C}^A|^{k+1}\right],$$

and for the logarithms of the areas:

$$\mathbb{E}\left[ \log|C^A(O)|\right]= \lambda^{A}\mathbb{E}\left[ |\mathcal{C}^A|\cdot\log|\mathcal{C}^A|\right].$$
(5)

On the other hand, [17] provides us with an explicit expression for the density function of the random variable \(|\mathcal{C}^A|\). This in turn enables the computation of probabilities such as \(\mathbb{P}\left\lbrace |\mathcal{C}^A|\geq x\right\rbrace\) or \(\mathbb{P}\left\lbrace |\mathcal{C}^A|\leq x\right\rbrace\), and one then has:

$$\begin{aligned}&\mathbb{E}\left[ |\mathcal{C}^A|\cdot\log|\mathcal{C}^{A}|\right] \\ & \quad =\mathbb{E}\left[\int_1^{|\mathcal{C}^{A}|}(1+\log x)dx\right] \\ & \quad =\int_1^{+\infty}(1+\log x)\mathbb{P}\left\lbrace |\mathcal{C}^{A}|\geq x\right\rbrace dx \\ & \qquad -\int_0^{1}(1+\log x)\mathbb{P}\left\lbrace |\mathcal{C}^{A}|\leq x\right\rbrace dx. \end{aligned}$$
(6)

So, in the context of Poisson–Voronoi tessellations, the first half of our task, namely, computing the stochastic mean value of log|C A(O)|, may be achieved analytically. However, it turns out that the explicit expression of the density function attached to \(|\mathcal{C}^A|\) is not particularly easy to handle, its integration necessitating the use of numerical methods (see the Appendix).

More importantly, it seems that an explicit computation of the distribution associated with the random variable |C A(O) ∩ C B(O)| remains completely out of reach (reference [10] only has an explicit expression for the mean value of |C A(O) ∩ C B(O)|). A central difficulty here is that the compound tessellation, obtained by superimposing the cells of the second network to those of the first one, is of a truly new kind: some of the new cells thus obtained contain no base station at all, some other cells have just one base station (of the first or of the second network), some other cells contain two base stations (one of each network), and there is no obvious way in which one could come back to the earlier setting (e.g., by trying to attach one “virtual station” to each of the new cells). Having faced this essential difficulty, one may turn to a further simplification of the cellular network modelling, which allows for an explicit computation of the information gain via scaling arguments, or else opt for a numerical evaluation of the information gain, in which case some other network models may be implemented (see [12]). These two complementary approaches are presented in the following paragraphs.

5.2 Scaling arguments

One way of circumventing the difficulties alluded to above would be to change the notion of cell used in the modelling of the network, e.g. by replacing the Poisson–Voronoi tessellations through Poisson line tessellations: for a given realization \(\left\lbrace Y^A_i\right\rbrace \) of a Poisson random cloud, the corresponding line tessellation is defined by drawing a straight line \(\Delta^A_i\) through each point \(Y^A_i\), in such a way that the segment joining the origin O to \(Y^A_i\) be orthogonal to \(\Delta^A_i\) (see Fig. 6). This procedure yields a (very rough) model for the cells of the first network, and the cells in the second network could be modelled similarly, using a second random cloud \(\big\{ Y^B_j\big\} \) independent of the first one. Note, however, that these two clouds should be given intensity measures proportional to \(d\rho d\theta=\frac{dxdy}{\sqrt{x^2+y^2}}\) to ensure that the resulting tessellations are stationary (see the Appendix for more details). Letting, thus, λ A be the intensity measure of the first cloud and λ B be the intensity measure of the second one, we are then in the advantageous situation where the tessellation obtained through a superposition of the first line tessellation and of the second one is of the same nature: it is precisely the line tessellation associated with the cumulated cloud \(\big(\big\lbrace Y^A_i\big\rbrace\cup\big\{Y^B_j\big\}\big) \). This last cloud is, again, a Poisson random cloud, of intensity measure λ C , where λ C = λ A + λ B. Hence, each of the three Poisson random clouds \(\big\lbrace Y^A_i\big\rbrace, \big\lbrace Y^B_j\big\rbrace, \big(\big\lbrace Y^A_i\big\rbrace\cup\big\lbrace Y^B_j\big\rbrace\big)\) may be obtained as a dilation about the origin of an auxiliary Poisson cloud \(\big\lbrace Z_i\big\rbrace\) having the intensity measure , the dilation factors being \(1/\sqrt{\lambda^{A}},1/\sqrt{\lambda^{B}}\) and \(1/\sqrt{\lambda^{C}}=1/\sqrt{\lambda^{A}+\lambda^{B}}\), respectively. Letting \(C_O(\left\lbrace Z_i\right\rbrace)\) denote the cell covering the origin in the Poisson line tessellation associated with \(\left\lbrace Z_i\right\rbrace\), and using dilations about the origin to represent \(\left\lbrace Y^A_i\right\rbrace\), as well as \(\big( \big\lbrace Y^A_i\big\rbrace\cup\big\lbrace Y^B_j\big\rbrace\big)\), one then finds out that the corresponding information gain may be simply quantified as

$$\log\left(\frac{1}{\lambda^{A}}\right)-\log\left(\frac{1}{\lambda^{A}+\lambda^{B}}\right) = \log\left(1+\frac{\lambda^{B}}{\lambda^{A}}\right)$$
(7)

Note that such concise expression of the information gain could also be obtained in the context of Poisson–Voronoi tessellations, again via scaling arguments: one would simply need to replace the true compound Poisson–Voronoi tessellation \(\mathcal{T}^{Comp}\) by the Poisson–Voronoi tessellation \(\mathcal{T}\big( \big\lbrace X^A_i\big\rbrace\! \cup\! \big\lbrace X^B_j\big\rbrace\big)\) associated with the cumulated cloud \(\big( \big\lbrace X^A_i\big\rbrace \cup \big\lbrace X^B_j\big\rbrace\big)\). Actually, this kind of scaling argument may be significantly improved by taking into account the fact that the cells in the compound tessellation \(\mathcal{T}^{Comp}\) tend to be smaller than those of \(\mathcal{T}\big( \big\lbrace X^A_i\big\rbrace \cup \big\lbrace X^B_j\big\rbrace\big)\). More precisely, by attaching its center of gravity G k to each cell C k in the compound tessellation, one obtains a new point process { G k }, which is again stationary and has the intensity

$$\Lambda^{Comp}(\lambda^A,\lambda^B)=\lambda^A+\lambda^B+\frac{8}{\pi}\sqrt{\lambda^A\lambda^B}$$
(8)

(see [10] for a proof, upon which we expand in the Appendix). This new point process is non-Poissonian; however, the area of the cell about the origin in \(\mathcal{T}^{Comp}\) should be close to the area of the corresponding cell in a Poisson–Voronoi tessellation when the intensity of the underlying Poisson process is given the value ΛComp(λ A,λ B). This approximation and a scaling argument lead to the following expression:

$$\log\left(\Lambda^{Comp}(\lambda^A,\lambda^B)\right) \!-\!\log\lambda^A \!=\! \log\left( 1\!+\!\frac{\lambda^{B}}{\lambda^{A}}\!+\!\frac{8}{\pi}\sqrt{\frac{\lambda^B}{\lambda^A}}\right) \label{eq:ig_log} $$
(9)

for the information gain.

Fig. 6
figure 6

A Poisson line tessellation of the territory

Similarly, considering three independent, homogeneous Poisson clouds of respective intensities λ A, λ B, λ C, and superimposing the cells associated with their Voronoi tessellations, one obtains a new compound tessellation where cells come up with the intensity

$$\Lambda^{Comp}(\lambda^A,\lambda^B,\lambda^C)\!=\!\lambda^A\!+\!\lambda^B\!+\!\lambda^C + \frac{8}{\pi}\left(\sqrt{\lambda^A\lambda^B}\!+\!\sqrt{\lambda^A\lambda^C}\!+\!\sqrt{\lambda^B\lambda^C}\right)$$
(10)

Accordingly, approximating this new compound tessellation through a Poisson–Voronoi tessellation of intensity ΛComp(λ A,λ B,λ C) and using a scaling argument enables one to quantify the corresponding information gain as

$$\begin{aligned}&\log\left( \Lambda^{Comp}(\lambda^A,\lambda^B, \lambda^C)\right) -\log\lambda^A \\ & \quad = \log\left( 1+\frac{\lambda^{B}}{\lambda^{A}}+\frac{\lambda^{C}}{\lambda^{A}}\right.\\ & \qquad\qquad\;\; \left. +\frac{8}{\pi}\left(\sqrt{\frac{\lambda^B}{\lambda^A}}+\sqrt{\frac{\lambda^C}{\lambda^A}} +\frac{\sqrt{\lambda^B\lambda^C}}{\lambda^A}\right)\right)\end{aligned}$$
(11)

However, such expressions for the information gain were obtained by approximating compound tessellations through Poisson–Voronoi tessellations of equal cell intensity, and the corresponding values of the information gain should be carefully compared with those obtained through Monte-Carlo simulations.

5.3 Using a Monte-Carlo method

Using a Monte-Carlo method for the computation of the information gain provides the best accuracy because the model of the network can be directly used for the computation without requiring further simplifications.

According to the system model presented in Section 3, we are assuming that the base stations of each network are spread out on the territory as several Poisson clouds, e.g., \(\big\{X^A_i\big\}\), \(\big\{X^B_j\big\}\) having the intensities λ A, λ B. Each base station \(X^A_i\) in the first network has a random variable \(P^A_{\textrm{TX,}i}\) standing for transmission power, as well as technology and environment specific values K, P N and α.

A given mobile device located at U is said to lie in the cell attached to base station \(X^A_{i_0}\) in the first network whenever the product \(\frac{P^A_{\mathrm{TX,}i_0}}{P_{\mathrm{N}}}\cdot\frac{K}{|U-X^A_{i_0}|^{\alpha}}\) is greater than any of the analogous quantities \(\frac{P^A_{\mathrm{TX,}i}}{P_{\mathrm{N}}}\cdot\frac{K}{|U-X^A_{i}|^{\alpha}}\) for i ≠ i 0. We denote by \(\hat{C}^A_i\) this new notion of a cell attached to the base station located at \(X^A_i\), and by \(\hat{C}^B_j\) the cell attached to \(X^B_j\) in a similar way in the second network, using some further family \(\big\{P^B_{\mathrm{TX,}j} \big\}\) of random variables.

These families of random variables are naturally assumed to be independent of each other. \(\big\{ P^A_i\big\} \big(\text{resp.}\) \(\big.\big\{ P^B_j\big\}\big)\) may be modeled, e.g., as an i.i.d. sequence of uniform or Beta distributed random variables.

Using again the spatial ergodicity underlying the network model at hand, one may define the long-term information gain as the difference of stochastic mean values

$$\mathbb{E}\left[ \log(|\hat{C}^A(O)|)\right]- \mathbb{E}\left[ \log(|\hat{C}^A(O)\cap \hat{C}^B(O)|)\right],$$
(12)

where \(\hat{C}^A(O)\) (resp. \(\hat{C}^B(O)\)) is defined as the unique cell covering the origin O in the first network (resp. the second network). Recalling that the above mean values may be simply expressed as integrals featuring the distribution functions of the relevant areas, so that, e.g.,

$$\begin{array}{rll} \mathbb{E}\left[ \log(|\hat{C}^A(O)|)\right]&=& \int_1^{+\infty}\frac{1}{x}\mathbb{P}\left\lbrace |\hat{C}^A(O)|\geq x\right\rbrace dx \\ &&-\int_0^{1}\frac{1}{x}\mathbb{P}\left\lbrace |\hat{C}^A(O)|\leq x\right\rbrace dx, \end{array}$$
(13)

one is left with the task of finding a satisfactory way of approximating probabilities such as

$$p(x)=\mathbb{P}\left\lbrace |\hat{C}^A(O)|\geq x\right\rbrace.$$
(14)

A Monte-Carlo simulation can be performed along the following steps:

  1. 1.

    Draw a random natural number n according to the Poisson distribution having parameter \(\left( \lambda^{A}\cdot |T|\right) \) and then n i.i.d. points X 1,X 2,..., X n distributed uniformly in the territory T (simulation of the base station locations in the first network).

  2. 2.

    For i = 1,...,n, generate the i.i.d. r.v. P TX,i . Choose P N and K according to the investigated technology.

  3. 3.

    For i = 1,...,n, evaluate the quantity \(\frac{P_{\mathrm{TX,}i}}{P_{\mathrm{N}}}\cdot\frac{K}{|X_i|^{\alpha}}\), and denote by i 0 the index for which this quantity is maximal (so that the origin lies in the cell attached to the tower located at \(X_{i_0}\)).

  4. 4.

    Simulate a (large) number U 1,U 2,...,U N of i.i.d. points (mobile device positions) distributed uniformly in the territory T.

  5. 5.

    For i = 1,...,n and j = 1,...,N, evaluate the quantities \(\frac{P_{\mathrm{TX,}i}}{P_{\mathrm{N}}}\cdot\frac{K}{|U_{\!\!j}-X_i|^{\alpha}}\) and attach a variable ξ j to the point U j in such a way that

    $$\xi_j= \begin{cases} 1&\text{ if $\frac{P_{\mathrm{TX,}i}}{P_{\mathrm{N}}}\cdot \frac{K}{|U_{\!\!j}-X_i|^{\alpha}}$ is maximal for $i=i_0$,}\\ 0&\text{ otherwise.} \end{cases}$$
  6. 6.

    Evaluate the proportion \(\kappa=\frac{1}{N}\sum_{j=1}^N\xi_j\), and define Υ ∈ ℝ by

    $$ \Upsilon=\log(\kappa\cdot|T|)=\log(|T|)-\log\left(\frac{N}{\sum_{j=1}^N\xi_j}\right) $$

    Store Υ and go back to step 1, unless this step has been repeated sufficiently many times (M times).

  7. 7.

    Once steps 1–6 have been repeated sufficiently many times, yielding the successive outputs Υ12,...,Υ M , take the approximation

    $$\mathbb{E}\left[ \log(|\hat{C}_O(\Phi)|)\right]\approx\frac{1}{M}\sum_{k=1}^M\Upsilon_k$$

The stochastic mean value of \(\log(|\hat{C}^A(O)\cap \hat{C}^B(O)|)\) may be computed similarly. In parallel to the first base station set, steps 1 to 3 are performed for the second set using the corresponding parameters, i.e., we draw n B using λ B, select the transmission powers, and finally determine the index \(i_{0}^{B}\) of the base station from the second set which covers the origin. Step 4 remains unchanged. In step 5, ξ j takes up the value “1” only if the quantities for both sets are maximal; otherwise, value “0” is taken. Step 6 is performed based on this ξ j . In step 7, the desired stochastic mean value is obtained.

6 Numerical results

In this section, we present some numerical results that have been obtained by the different mechanisms for the computation of the information gain. First, we will investigate the gain that can be achieved by a computation of the intersection areas of the base stations. Secondly, the results for the approximation of the information gain via Voronoi cells and scaling arguments will be presented and the deviations between the different approaches will be shown. Afterwards, the distribution of the different cell sizes and their intersection areas will be presented for some distinct scenarios.

6.1 Average information gain

Its very definition shows that the information gain depends on the number of base station sets being used, as well as on the sizes of the cells in each set.

First, we consider a territory with a size of 1×1 km, covered by two different base station sets. The expected station number of each set varies between 10 and 200, reflecting different scenarios like indoor, urban, and rural. The transmission power of the stations is uniformly distributed between [0.25 P TX, Max;P TX, Max] with P TX, Max = 100mW, i.e., we have chosen values being typical for IEEE 802.11 WLANs.

In Fig. 7, the average information gain is presented for different base station densities λ B when λ A is fixed. The results have been obtained by the Monte-Carlo method. For each point in the plots, the mean value has been computed based on 250 random base station constellations. The intervals being displayed in the graph denote 95% confidence intervals.

Fig. 7
figure 7

Average information gain obtained by the Monte-Carlo method for fixed base station densities λ A

The resulting information gain when both base station densities vary is displayed in Fig. 8. Twenty different base station intensities have been investigated for each set. Again, a mean value has been computed based on 250 random base station constellations for each point on the surface in the figure. As expected, the information gain increases as the number of base stations in the second set increases, and the largest gain is achieved when the number of base stations in the first set is relatively small.

Fig. 8
figure 8

Average information gain if two base station sets with different densities exist

For Fig. 9, we considered the case where location information is obtained via three different base station sets. In addition to the previous scenario, a third set with fixed λ C = 100 is used for the positioning, leading to an overall increase of the information.

Fig. 9
figure 9

Average information gain if a third base station set with λ C = 100 is added

6.2 Approximations and resulting deviations

Let us now consider the average information gain being obtained if the coverage areas of the base stations are approximated by Voronoi cells and a scaling argument is used.

Figure 10 shows the average information gain being computed based on the Voronoi cell assumption. When a large number of base stations is considered, the computed information gain is very close to that of the Monte-Carlo approach. On the other hand, when the first set of base stations is small, this deviation becomes rather large. The relative deviation of the information gain using the Monte-Carlo and Voronoi approaches is presented in Fig. 11. The average deviation results is 10.1% in the considered parameter range.

Fig. 10
figure 10

Average information gain obtained by the approximation via Voronoi cells

Fig. 11
figure 11

Relative deviation between the original information gain and the approximation via Voronoi cells

In Fig. 12, the resulting deviation of the information gain for three base station sets is displayed. Like in Fig. 9, the density λ C = 100 has been chosen for the third set. One can clearly see that the deviation is quite small in comparison to the results for two base station sets.

Fig. 12
figure 12

Relative deviation between the original information gain and the approximation via Voronoi cells for three base station sets

The relative deviation of the Monte-Carlo numerical results respective to the values obtained via Eq. 9 are displayed in Fig. 13. These two sets of results are closer to each other when λ A is kept small, but on the whole, the average deviation is higher (31.2%) compared with the Voronoi-based approximation.

Fig. 13
figure 13

Relative deviation between the original information gain and the one being computed via a scaling argument

6.3 Distributions of the cell sizes

Next, we were interested in the distributions of the cell sizes and the intersection areas. In Fig. 14, the distribution of the original cell sizes and the intersection areas is presented. We started with one base station set with λ A = 30. Afterwards, a second set with λ B = 100 was added. The corresponding histogram for the two sets shows the size of the intersection areas. Roughly 65% of the intersection areas are smaller than 10 000 m2. If an additional third set with λ C = 100 is used for the location determination, the possible whereabouts of a mobile device can be theoretically limited to an area of 5 000 m2 and below in 75% of the cases.

Fig. 14
figure 14

Original cell sizes and intersection areas. The first histogram shows the distribution of the cell sizes from one set (λ A = 30). The second histogram shows the distribution of the intersection areas of this set with a second set (λ B = 100). The last one presents the distribution of the intersection areas of these two sets with a third set (λ C = 100)

In Fig. 15, the distributions of the cell sizes and the intersection areas for the Voronoi-based approximations are shown, using the same intensities as before. All histograms are generated based on 250 samples of the cell sizes and intersection areas.

Fig. 15
figure 15

Voronoi cell sizes and intersection areas. The figure shows the distributions of the cell sizes and intersection areas of the Voronoi cells, using the same parameters as in Fig. 14

7 Conclusion

In this paper, we presented novel approaches to the computation of the information gain that can be achieved by intersection-based range-free localization. We applied methods from Stochastic Geometry to obtain approximations of the information gain that can be achieved for different base station densities. In contrast to previous work, we modeled the interference of different stations because the assumption of noninterfering stations may hardly be valid for the investigated networks. Additionally, we assumed spatial variability of the station placements, which had rarely been investigated up to now.

The displayed numerical results show that the quality of the approximation increases with the complexity of the techniques involved. Scaling arguments may be used to estimate the order of magnitude of the gain, while approximations based on Voronoi cells lead to more precise results.

Generally speaking, the possible whereabouts of a mobile device can be significantly reduced by using the information from different base stations. For instance, in an indoor scenario with a person in distress, the number of rooms that need to be scanned by a rescue team can be extremely reduced.

An application area of the presented approach has been presented in [19]. Location information of mobile devices is normally provided by so-called location servers in a carrier access network. If the position of a device is to be provided, the servers trigger the respective location determination processes. However, the location determination in a network can be quite costly and the outcome might not meet the expectations of the requester, especially if a request should be used to refine existing location information. To avoid unnecessary workload, the present approach offers requesters the possibility to add the desired accuracy or information gain before they send their queries to the location servers. By using the approaches that have been presented in this paper and information about the deployed base stations, the servers can estimate the accuracy or the gain that can be achieved. Based on these estimates, a server can determine in advance whether the procedure should be started. Regarding the next steps of our ongoing work, we will investigate the effect of different coverage models, suited mathematical approximations, and the correctness of the resulting information gain.