Keywords

1 Introduction

A large collection of densely deployed, spatially distributed, and autonomous devices (or nodes) that communicate via wireless signals and cooperatively monitor physical or environmental conditions is called wireless sensor network (WSN) [1]. In surveillance applications, sensors are deployed in a certain field to detect and report events like presence, movement, or intrusion in the monitored area [2]. Generally speaking, the battery of sensors is unchangeable or the uncertainty of the deployed area makes recharging impossible. In consequence, maximizing the lifetime of WSNs is always a vital issue of this topic. In this paper, we mainly conserve the network energy from the following two steps [2, 3]:

  1. 1.

    Energy-efficient scheduling of sensor states (active or sleep);

  2. 2.

    Energy-efficient clustering and routing among sensors;

We assume that the sensor deployment is dense enough that in any certain area there is more than one sensor. That allows sensors take turns to monitor the overlapped area. By scheduling some redundant nodes into sleep states, the network lifetime is extended while the coverage and connectivity is preserved.

Clustering has been admitted as one of the energy-efficient way for WSNs [4, 1417]. In a cluster-based WSN, some sensors are elected as CHs (cluster head). The duty of CHs is to receive the information collected from non-CHs and send to the sink node (a special node which has infinite energy and all the information feedbacks to this node). One main advantage of clustering is that it could solve some potential problems like bandwidth limits thus increasing the capacity of the system [5]. Furthermore, CHs could function as routers. Since a long-distance transmitting costs much more energy than a short one [7], the nodes from the relatively far location could send the information through a node chain.

Our target is to seek an optimal planning to extend the network lifetime. While maintaining the coverage and connectivity of the whole network, we try to find a synthesize scheme including sensor state scheduling, clustering, and routing to maximize the lifetime of the network. Compared with Chamam’s work in [6], a similar design to schedule sensors acting for different roles, we further consider all possible energy consumption in a WSN, and further introduce a reduction method to reduce the problem scale. Thus our designs are more accurate and time-efficient. We also provide various numerical experiments and comparisons to validate the efficiency of our design.

The rest of this paper is organized as follows: In Sect. 2, we introduce some related work about energy-efficient planning of sensor network. In Sect. 3, we give necessary definition, address energy consumption model for single node, put forward the optimal planning problem, and state all assumptions. In Sect. 4, we show how to formulate this model into an integer linear programming problem. In Sect. 5, we describe how to reduce the problem scale to improve the computing efficiency. In Sect. 6, we display the simulation results and comparisons with previous literature. In Sect. 7, we make the conclusion.

Fig. 1.
figure 1

Multi-hop clustering sensor network

2 Related Work

Much progress has been made on the study of sensor state scheduling, clustering, and routing. For sensor state scheduling, Xing et al. [8] proposed Coverage Configuration Protocol (CCP), which can provide different degrees of coverage requested by applications and nodes’ states can be alternated according to the coverage degree. Heinzelman et al. [7] proposed LEACH (Low-Energy Adaptive Clustering Hierarchy), a clustering-based protocol that put forward the concept of cluster head and utilizes randomized rotation of CHs to balance the energy load among the sensors in WSN. The protocol proved to be energy-efficient and convenient to adapt to practical situations. Lindsey et al. [9] proposed PEGASIS (power-efficient gathering in sensor information systems), an improved, near optimal chain-based protocol. In PEGASIS, each node communicates only with a close neighbor and takes turns transmitting to the base station, thus reducing the amount of energy spent per round. However, the transmission distance is limited to only one hop and routing is not considered. Manjeshwar et al. [12, 13] proposed TEEN (Threshold-sensitive Energy Efficient sensor Network protocol) and APTEEN (Adaptive Periodic Threshold-sensitive Energy Efficient sensor Network protocol). In TEEN, the data transmission is not transmitted as frequently as sensing activities. A cluster head sensor sends its members the threshold value of the sensed attribute and a small change in the value of the sensed attribute that triggers the node to switch on its transmitter and transmit. Rodoplu et al. [10] proposed Small Minimum Energy Communication Network (MECN). The protocol identifies a relay region for every node, which consisting of nodes in a surrounding area where it is more energy efficient to transmit through those nodes than direct transmission. But still it does not prove its routing strategy is the optimal strategy (Fig. 1).

Most works concentrate on raising new mechanisms or protocols to conserve network’s energy, while only a few considered trying to address a linear programming model to formulate the problem and to find the optimal solution. Chamam et al. [6] put forward an optimal configuration problem. The authors classified nodes into three levels (cluster, active, sleep), and set a standard energy constant for each state. The target is to determine the state of each sensor in the next round. Nevertheless, the energy consumption rate in the real situation varies significantly among CHs and active nodes. The number of connected nodes and link ways could largely affect the energy consumption rate, and the communication cost is overly underestimated. The authors suggest that the balance of retained energy among nodes are critical and use balance as the target function of ILP, without giving any proof or reference. Also, the model ensures the existence of the spanning tree but the specific optimal routing scheme is not considered.

3 Problem Statement and Assumptions

3.1 Network Configuration

Consider a cluster-based WSN deployed in a certain flat area, nodes can collect information from the environment, transmit data to others, receive data from others, or sleep. It is important to alternate sensors that are redundant to sleep. According to [3], the power consumption in sleeping mode is generally about 5 % \(\sim \)10 % of active mode.

Besides, sensors in our network are divided into clusters. Sensors connect to cluster heads (CHs) to send message to sink. The election of CHs is critical since CHs form the backbone of the network. CHs usually dissipate energy much faster than non-CHs, for its frequent data exchange with others. Thus usually we choose those who have more energy left to become CHs. On the other hand sensors should also choose wisely about which cluster to join. A cluster of more nodes can have more potential CHs but after deaths of some nodes on the border, the whole area could be cut off from the network leaving a coverage hole.

Directly sending data from sensors to sink has been proved inefficient. Instead of simple sink-to-CH communication, we seek some intermediate CHs to act as routers. Message will be transferred from CH to CH and the delivering path follows some rules. Each time there is only a very short distance to pass data so the energy consumption could be lowered. Still there is a tradeoff between transferring times and transferring distance because compared to sending data directly, intermediate CHs need to process this package and spend extra energy. Thus it is unwise to use routers as much as it can. Also, we need to ensure that for each CH there will be a path for it to communicate with sink. To achieve this, there should be a spanning tree in the topology.

Now we give the definition of coverage. We use the basic Disk model to judge if an area is covered by a set of sensors, which means a point can only be covered or not. The covered area is a disk with the sensor as its center. We assume all sensors having the same sensing range. In this article we require the monitored area fully covered.

Definition 1

Sensor set S covers the area A if and only if \(\forall \) point P \(\in \) A,\( \exists \) sensor i \(\in \) S, the distance between P and i is less than the sensing range of sensor i.

Then we define the connection between sensors. To make sure communication between sensors is stable and reliable, we assume only sensors within safe distance can establish connection.

Definition 2

Sensor i can reach to sensor j in one hop if and only if the distance between i and j is less than the communication range of sensor i.

Our optimal solution consisting three layers (state scheduling, clustering, routing) above, and it has to meet the following demands:

  1. 1.

    The energy consumption is minimized.

  2. 2.

    The whole area should be fully covered.

  3. 3.

    Every active sensor is connected to one and only one CH.

  4. 4.

    There must be a spanning tree including all CHs.

We made the following assumptions:

  1. 1.

    All sensors are homogenous and fixed. Sensors can collect data from all directions.

  2. 2.

    The location and energy information of all sensors are known to sink. Each sensor is unique and can be recognized by sink. We do not consider the initial step of the network construction and neither its energy consumption.

  3. 3.

    Sensors are randomly deployed and the network is dense enough.

  4. 4.

    Sensors can function well as long as retained energy is not used up. The lifetime of their components is not considered.

  5. 5.

    Only CHs can act as routers. CHs receive and transmit data to other CHs or sink.

  6. 6.

    The amount of data a sensor collects from the environment in a period is steady. Here we treat the size of message package every sensor needs to pass to sink as a constant.

3.2 Energy Consumption Estimation

We address the SC model to estimate the energy consumption.

In the SC model, we only consider two major energy dissipations: Standalone cost and Communication cost. The standalone cost includes the energy a sensor uses for gathering data from environment and data processing. It is relatively stable and in the most common situation it will not change. While the communication cost relate to the amount of transmitted data and transmitting range. For example, to transmit a \(k\text {-}bit\) message a distance d needs:

$$E_{Tx}(k,d)=E_{elec}\cdot k+\epsilon _{amp}\cdot k \cdot d^2.$$

\(E_{elec}\) and \(\epsilon _{amp}\) are constants. We assume \(E_{elec}= 50nJ/bit\), \(\epsilon _{amp}=100pJ/bit \cdot m^2\) [5]. d means the distance between communicating nodes.

To receive this message, the radio expends:

$$E_{Rx}(k)=E_{elec}\cdot k.$$

During the period \(\delta t\), the energy a sensor consumes:

$$ \delta E = (E_S+\sum {kt_j \cdot (E_{elec} +\epsilon _{elec} \cdot d_j^2)}+\sum kr_j \cdot E_{elec} )\cdot \delta t. $$

\(E_S\) is a constant, which means standalone cost rate. kt and kr mean bits transmitted and received in a time unit.

Now we can calculate the energy consumption of any node in the next period if we already have the sensor scheduling, in other words, clustering and routing scheme for the next round. If a node is active but not a CH in the next period, the energy consumption includes standalone cost and transmitting cost for sending collected data to CH. If a node is a CH, the energy consumption should not only include the above two costs, but also the receiving cost for gathering data from the cluster, the receiving and transmitting cost for being a routing node.

Undoubtedly we want the total sum of \(\delta E\) as small as possible, which means the whole energy consumption is minimized. But this does not necessarily lead to the network lifetime maximization. For example, some sensors die out quickly while others are nearly as healthy as newborns. In this situation the network is becoming equivalently sparse, which leads to the coverage holes and disagrees with our assumption. Keeping the balance of retained energy among the sensors is of vital importance. Thus, instead to minimize total sum of \(\delta E\), we choose to minimize the sum of ratio between predicted consumed energy and retained energy of sensors \(\delta E/E_i\).

4 Modeling and Definitions

Our work is on the basis of research of Chamam et al. [6]. After taking communication costs into consideration, we formulate the optimal planning problem into an integer linear programming problem.

First we define all the sets, constants, variables we will use later in Table 1. Here S, C, \(d_{ij}\), \(\rho _{ic}\) are constants, while \(X_{i}\), \(Y_{i}\), \(Z_{ij}\), \(V_{ij}^{kl}\) are variables we want to compute.

Table 1. Set table

The domain of variables ijklc is \( i,j,k,l \in \{1..N\}, c \in \{1..M\} \).

The problem is modeled below. The meaning of the target function has been described in the last section, namely the sum of the predicted consumption rate of all sensors, including standalone cost and communication cost. Constraint (1) is coverage requirement that the monitored area should be fully covered. Constraint (2) means a sensor has to be active if it is a CH. Constraint (3) guarantees that there must be one sensor that can reach the sink (though we did not include sink in the sensor table, we can regard sink as a special node). Constraint (4) guarantees that a sensor has to be active if it is connected to CHs, a sensor has to be a CH if there are other nodes connecting to it and the distance requirement for the connection between CHs, which we talked in the definition of connection. Constraint (5) guarantees that one sensor can only join one cluster to avoid redundancy and disorder, and the CHs have the maximum connection limits taking bandwidth of sensors into consideration. Constraint (6) means this is a 0–1 programming problem. Constraints (7) (8) guarantee that the network contains a spanning tree, which ensures the connectivity of network. However, Constraint (7) is not linearized. Thus we use Constraints (9) to help to linearize it. In the end, we get the linear programming model.

$$\begin{aligned} min&\sum _{i=1}^{|S|} X_i \cdot \frac{E_S}{E_i} + \sum _{i=1}^{|S|}\sum _{j=1}^{|S|} Z_{ij}\cdot \frac{E_{elec} + \epsilon _{elec} \cdot dist_{ij}^2}{E_i} \nonumber \\&+ \sum _{i=1}^{|S|}\sum _{j=1}^{|S|}\left( Z_{ji} \cdot \frac{E_{elec}}{E_i} + \sum _{k=1}^{|S|}\sum _{l=1}^{|S|} (V_{kl}^{ij}\cdot \frac{E_{elec} +\epsilon _{elec} \cdot dist_{ij}^2}{E_i}+V_{kl}^{ji}\cdot \frac{E_{elec}}{E_i})\right) \nonumber \\ s.t.&\forall ~c = 1..|C|,~\sum _{i=1}^{|S|}X_i\cdot \rho _{ic} \ge 1 \end{aligned}$$
(1)
$$\begin{aligned}&\forall ~i = 1..|S|, ~Y_i \le X_i \end{aligned}$$
(2)
$$\begin{aligned}&\sum _{i=1}^{|S|}Y_i\cdot d_{i0} \ge 1\end{aligned}$$
(3)
$$\begin{aligned}&\forall ~i,j = 1..|S|, ~j \ne i ,~Z_{ij} \le X_i-Y_i ,~Z_{ij} \le Y_j ,~Z_{ij} \le d_{ij}\end{aligned}$$
(4)
$$\begin{aligned}&\forall ~i = 1..|S|, \sum _{j=1,j \ne i}^{|S|} Z_{ij}+Y_i = X_i, ~Z_{ij} \le N_{max}\end{aligned}$$
(5)
$$\begin{aligned}&X,Y \in \{0,1\}^{|S|}, ~Z \in \{0,1\}^{|S|^2},~V \in \{0,1\}^{|S|^4}\end{aligned}$$
(6)
$$\begin{aligned}&\forall ~i,j,k = 1..|S|,~j \ne i, \sum _{l \in |S|,l\ne k} V_{ij}^{kl}-\sum _{l \in |S|,l\ne k} V_{ij}^{lk}=\left\{ \begin{array}{lll} 0~~~~~~&{}\text {if }k\ne i \text { and } k \ne j \\ Y_i\cdot Y_j~~~~~~&{}\text {if }k = i \\ -Y_i\cdot Y_j~~~~~~&{}\text {if }k = j \\ \end{array} \right. \end{aligned}$$
(7)
$$\begin{aligned}&\forall ~i,j,k,l = 1..|S|,~j \ne i, ~k \ne l, ~V_{ij}^{kl} \le Y_i,~V_{ij}^{kl} \le Y_j, ~V_{ij}^{kl} \le Y_k, ~V_{ij}^{kl} \le Y_l, ~V_{ij}^{kl} \le d_{kl}\end{aligned}$$
(8)
$$\begin{aligned}&\forall ~i,j = 1..|S| ,~ Y_i \cdot Y_j \le Y_i,~ Y_i \cdot Y_j \le Y_j,~ Y_i \cdot Y_j \ge Y_i + Y_j-1 \end{aligned}$$
(9)

5 Problem Scale Reduction

Though the size of the problem grows with the number of sensors in a polynomial speed, for \(10^2\) sensors there should at least be \(10^8\) variables to optimize since \(V \in \{0,1\}^{|S|^4}\). We can eliminate some computing redundancy to make the problem smaller. For \(V_{ij}^{kl}\), it makes sense only if sensor k and l can reach each other, which is written as : \(d_{ij}=1\).

Also, we can strictly control the use of routing, for example, sensor i,k,l,j should gradually approach the sink in sequence. The constraint is written as:

$$d_{i0}\ge d_{k0} > d_{l0}\ge d_{j0}.$$

6 Simulation Results

We perform our simulation in Gurobi [11], a commercial optimization solver, which has a superior performance dealing with large-scale constraints and variables. For convenience we use “EAS" to refer to our model.

6.1 Performance Evaluation: Impact of Problem Scale Reduction

Variation of the Number of Variables and Constraints with the Number of Sensors: In a settled map, the larger the number of sensors is, the denser the network becomes and the diversity of network structure has more potential. We fix the map size as \(100 \cdot 100\), the sensing and communication ranges as 30, and change the number of sensors to see the impact. The result is shown in Table 2. The number of variables and constraints is increasing w.r.t. sensor number, but the reduced model has gained a small-scale advantage and a slower growing speed.

Table 2. Number of variables and constraints vs number of sensors
Table 3. Number of variables and constraints vs communicating ranges

Variation of the Number of Variables and Constraints with the Communicating Ranges: The communicating ranges has a great impression on network planning. A large communicating range means a cluster could potentially contain more nodes, while a small communicating range needs routing technique more. We fix the size of the map as \(100 \cdot 100\), the number of sensors as 30. We change the communicating ranges to see the impact. The result is shown in Table 3. Since our reduction technique takes the connecting possibility between nodes into consideration, the communicating ranges affect the degree of reduction and the smaller the communicating ranges become, the better the reduction behaves.

Variation of the computation time with the problem scale: Let us take a closer look at the impact of the problem scale on the computation time. We fix the size of the map as \(100 \cdot 100\), the sensing range and communication ranges 50, and change the number of sensors to see the impact. The result is shown in Table 4. We can see that there is a positive correlation between the computation time and the problem scale. Also, when we tried to solve the unreduced model, we run out of memory resources on our machine, which reveals the necessity of reduction.

Variation of Objective Function Values Obtained by Two Models: Objective function values are positively correlate to energy consumption in the whole network. We fix the size of the map as \(100 \cdot 100\), the sensing range and communication ranges as 30, and change the number of sensors to see the impact. The result is shown in Table 5. We can see that after reduction, the obtained objective function value will not inevitably cause the decrease of solution quality. On the contrary, it helped to find a near-optimal solution.

Table 4. Variation of the computation time with the problem scale
Table 5. Variation of the objective function with the problem scale

6.2 Comparison Between Reduced OPT-ALL-RCC and EAS

Here we choose to compare our model to the reduced OPT-ALL-RCC( generated by doing the same reduction to OPT-ALL-RCC as to our EAS model) instead of OPT-ALL-RCC for sake of time efficiency. We fix the size of the map as \(100 \cdot 100\), the sensing range and communication ranges as 50, and change the number of sensors to see the impact.

Fig. 2.
figure 2

Computation time using reduced OPT-ALL-RCC and EAS

Fig. 3.
figure 3

Network lifetime using reduced OPT-ALL-RCC and EAS

Computation Time: The result is shown in Fig. 2. For the same problem set, the reduced OPT-ALL-RCC and EAS spend similar time to find a near-optimal solution because after reduction these two models have the exact same problem scale. In other words EAS needs to spend less time than the original versional of OPT-ALL-RCC.

Network Lifetime: The result is shown in Fig. 3. There is an obvious network lifetime lift from OPT-ALL-RCC to EAS. Especially when the scale of network gets large, our model could effectively lengthen the network lifetime.

7 Conclusion

In this article we propose a model that based on communication cost estimation to calculate the energy consumption of sensor network. On top of this we put forward a mechanism to find an optimal or near-optimal configuration for sensor network to maintain coverage and connectivity and extend lifetime of sensor network. The simulation shows that our mechanism could prolong the lifetime of network with the computation time remains nearly constant. As future research directions, we plan to further decrease the problem scale and improve the energy consumption model.