1 Introduction

A WSN is a self-organized network which contains a large number of tiny sensor nodes deployed in the sensing area [1]. Sensor nodes sense the environment, process the data and send it to the BS either directly or through intermediate nodes. All the nodes are capable of acting as sensing node as well as relay node. Environment monitoring, healthcare, agriculture, military and smart home are few important applications of WSN. The battery power of the sensor node is a major constraint in these networks. The proper utilization of this power is necessary to prolong network lifetime [20]. Searching of various paths from sensing node to the BS and selection of a path among them for transmitting the sensed data in an efficient manner is collectively known as routing [9]. Various protocols have been designed for this purpose depending on the objectives of the WSN. In [31], these protocols are categorized into four groups, namely (1) communication model, (2) network structure, (3) network topology and (4) reliable routing. Network structure based routing protocols are further sub-categorized into two classes, namely flat protocols and hierarchical protocols.

Hierarchical routing protocols are designed to build network energy efficient that can improve the overall network lifetime. In these protocols, deployed nodes are split into groups termed as clusters. A node within the cluster works as the CH and the remaining as cluster members (CMs). CMs communicate with the CH and CH further interacts with the BS using single-hop or multihop transmission. The energy depletion rate of the CH is always high with respect to CMs due to the extra load which result is unequal energy consumption in the network. The CH rotation is a common method deployed to balance this uneven energy depletion. To optimize the load balance in a WSN, various optimization techniques have been used to find the optimal number of clusters and CHs [13, 27, 30]. Earlier, most of the clustering protocols have used single hop communication from CH to the BS, which consumes a lot of energy because the required energy for signal propagation increases proportionally to the square of the propagation distance upto a certain distance after that it will be power of four [7, 33]. Recently, multihop communication gets more attention for CH to the BS communication in WSN due to energy efficiency [16]. In spite of various advantages, the major drawback of a multihop routing protocol is the hot spot problem. In multihop mechanism, the nodes closer to the BS are prone to dissipate their energy rapidly as compared to the other nodes due to forwarding of more packets. Hence, these nodes will generally die early and network may get isolated. This is known as hot spot problem [5]. The approaches like unequal clustering techniques [12] and mobile data mule or mobile sinks [26] are mainly used to resolve this issue. However, most of the unequal clustering schemes proposed for the solution of the hot spot uses smaller cluster size near the BS and the size of the cluster increases as we go far from the BS. The size of a cluster is inversely proportional to its distance from the BS. The clusters near the BS accommodate a higher number of nodes which help in effective load sharing. The proposed scheme for size allocation and size variation of clusters results in reduction of the frequency at which a particular node becomes CH. This helps in maintaining the overall connectivity and prevents network isolation. In this manner, the hot spot problem is minimized. One major issue regarding unequal clustering is cluster size increasing or decreasing ratio, which is clearly not discussed in existing schemes. We have resolved this issue by finding the relationship between various sizes of the clusters, discussed in Sect. 5.2.

This research also endeavours at determining a near-optimal time to change the CH as discussed in Sect. 5.1. Hence, in this paper we have taken a CH change factor f and fixed it after 7th round as discussed in Sect. 5.1. The path traversed by data mules have been proposed in several research works [2, 4, 8]. However, most of them make it necessary for the mule to visit every node resulting in unwanted delay. Successive research works introduced the concept of using multiple mules for data collection to decrease this delay. However, this increases the overheads. Our proposed trajectory for the mule attempts at providing a balance in this trade-off between delay and high overheads.

Data mules [34] are dedicated mobile nodes deployed in the network to support the BS in data collection. They primarily operate as a carrier in between sensor nodes and the static BS and they have no resource issues. Data mule collects data from individual sensor node or CHs or relay nodes, when it comes within their transmission range. It collects data in two different contact schemes: mule motion contact and sojourn point (SP) contact. In mule motion contact, a mule is always moving and periodically transmits beacon messages when it detects any node, contact has been made and it accumulates data from the node. In SP contact, mule stays at a predetermined location for a fixed amount of time called sojourn time and in that duration mule collects data [17]. When it reaches near the BS it transfers all its gathered data to the BS. We have considered SP of contact in this work for collecting data from CH.

The early death of sensor nodes near the BS owing to the hot spot and unfair load distribution is a serious issue in static WSN. And then the hot spot problem during the multihop routing needs to be discussed. An energy efficient multihop routing for uniform energy sharing is needed to increase the lifetime of WSN. We are concerned with resolving the hot spot problem in WSN. This can be done by minimizing the communication overheads of the BS’s nearby nodes. In this scheme, we propose an energy efficient protocol, which provides the solution for hot spot problem and maximizes the lifetime of the network. The scheme comprises of unequal rectangular clusters with a data mule, which moves with a constant speed for collecting data from CHs. The operating cost of cluster formation, CH selection and its rotation is minimized as our protocol is centralized in nature and these roles are performed by the BS. The BS uses fixed clustering approach and partition the network into rectangular grids of varying size called clusters. The size of the cluster increases while moving towards the SP. The BS handles the movement path and time schedule of the mobile data mule. The data mule moves in a pre-defined path and halts for a predefined time at SP to collect data. It transfers the collected data to the BS when reaches in the BS range. The unequal clustering and use of data mule in our protocol makes it more energy effective and results in the mitigation of the hot spot problem.

The remaining part of the paper is organized in the following manner: Sect. 2 summarizes the current related literatures. The proposed scheme is explained in detail with system model in Sect. 3. Section 4 gives an overview of the simulation environment and Sect. 5 discussed the evaluation of the proposed work with the help of simulation results. The overall discussion about the contribution and findings is presented in Sect. 6 Finally, Sect. 7 conclude the paper with some future works.

2 Related Works

The paper considers unequal grid based clustering with a mobile data mule. Hence in the related works first we will discuss few basic clustering techniques and after that unequal clustering will be talked about. Further, we will explore research works related to grid-based clustering as well as data mule based protocols.

Hienzelman et al. [15] introduced first cluster based routing protocol LEACH which is distributed in nature. Each node generates a number between 0 and 1 on a random basis at the beginning of each round. The nodes, whose generated number is less than a threshold value, declare themselves as CH for the round. Nodes send join request to their nearest CH based on the received signal strength (RSS). CH sends an acknowledgment message and TDMA time slot for data transmission in reply. The major drawbacks of the protocol are unequal size of clusters in different rounds, no consideration of energy level of nodes in the CH selection process and single-hop transmission between the CH and the BS. They extended their previous works and proposed LEACH-Centralized (LEACH-C) [14] protocol to overcome the limitations of LEACH. The number of CHs for each round is fixed in LEACH-C. The protocol reduces the overhead of CH selection from the nodes as it is centralized in nature. It offers better performance than LEACH but also suffers from problems such as single-hop transmission and centralized CH selection, which is not good for the large area network. A lot of successors of LEACH have been proposed for performance improvement till date [36]. Bandyopadhyay and Coyle [6] proposed Energy-Efficient Hierarchical Clustering (EEHC) algorithm. It is a layer based clustering protocol in which network is segmented into a hierarchy of layers. The data transmission goes from lower layer towards the BS. The CHs at the lowest layer transmit data to the apparent upper layer CHs. This process persists till data is received by the BS. Hybrid Energy-Efficient Distributed Clustering (HEED) [39] is a distributed protocol which uses intra-cluster communication cost and residual energy of the node as parameter for CH selection. In this protocol, CH selection takes place after a defined number of rounds instead of each round. Jin et al. [19] enhanced the EEHC protocol and introduced a new protocol Energy-Efficient Multilevel Clustering (EEMC). This protocol reduces energy consumption and latency of the network by selecting CH based on the residual energy of the node and the distance from the BS. The main demerit of this protocol is that the BS requires information of all the nodes for the selection of CH in each round. All the nodes send information to the BS and receive the reply message. This increases the message overhead and energy depletion of the network. MR-LEACH [10] is circular layer based protocol in which the BS is positioned at the center. Each lower layer CH sends data to its apparent upper layer nearest CH for communicating to the BS. Most of the multihop routing protocols discussed above suffer from the hot spot problem. Goa et al. [11] have proposed a fuzzy multiple criteria decision making (MCDM) CH selection scheme to increase the lifetime of the network. In this scheme, CH selection is optimized by the use of hierarchical fuzzy integral and trapezoidal fuzzy AHP with the help of energy status, QoS impact and location as the main factors. This protocol improves the lifetime of the network but its perform poor for large area network due to single hop communication. Mantri et al. [28] proposed a Two Tier Cluster based Data Aggregation Algorithm (TTCDA) to reduce the overall communication cost by minimizing the number of transmission from sensor nodes to the BS. They have used additive and divisible data aggregation functions at the CH and number of packets generated by a node with variable rate, are maintained by spatial and temporal correlation. This protocol performs better in terms of bandwidth utilization and throughput but it suffers from hot spot problem due to two tier cluster communication. This work has been extended by using a mobile BS and heterogeneous nodes and they named it mobile sink and heterogeneous nodes aware cluster-based data aggregation algorithm (MHCDA) [29]. It minimizes the communication and computation cost by aggregating the packets at a predefined region. The main drawback of this protocol is hot spot problem which affects the network lifetime.

PRODUCE [21] is an unequal clustering based protocol which is semi-centralized in nature. It provides better performance in small networks but it suffers from scalability problem. It balances the intra-cluster and inter-cluster data load by varying the size of the cluster. The transmission distance restricted up to the threshold transmission distance of the radio energy model. A location-based unequal clustering algorithm (LUCA) [23] proposed by Lee et al. reduces the overall energy consumption in the network. The main idea in this scheme is to keep the larger cluster size farther from the BS and small size clusters nearer to the BS. This location based cluster formation minimizes the intra and inter cluster communication but due to the unequal cluster its size is not optimal which results in a high energy consumption. Low Power Grid based Cluster Routing Algorithm (LPGCRA) [24] is a grid based routing protocol which addresses hot spot problem. It selects the maximum residual energy node within the cluster as the CH without considering its distance from the remaining nodes of the cluster. This outcomes in more energy consumption in intra-cluster communication as the overall transmission distance of the cluster is not downplayed. It also uses single-hop transmission for communicating with the BS. Grid Based Routing (GBR) [3] uses data delivery time as a parameter for the CH selection. The nodes nearest to the BS in each cluster are selected as CH. This increases the overall intra-cluster transmission distance which results in shorter network lifetime. Another drawback of this protocol is that it segments the network into fixed number of clusters without considering the size of the network. This leads to the higher rate of energy depletion in larger network as the transmission distance exceeds from a certain limit. In [18], Authors have proposed a grid based protocol GFTCRA to reduce hot spot problem. It uses overall transmission distance within the cluster as a parameter for CH selected. It also limits the number of CHs for which a CH acts as a relay node in order to balance uneven load of the clusters nearer to the BS.

Shah et al. [22] have proposed three tier architecture for sparse sensor network which is based on functionality of data mule. Data mules are mobile devices which go in the network for the accumulation of data from the sensor nodes. Authors in [41] have presented a message ferrying scheme in order to increase the network lifetime and reduce hot spot problem. It specifies a limited route of the data mule for collecting information from the static nodes. In [37], Authors have discussed the vital topics of data mule such as speed control at the time of communication with the sensor nodes, the velocity with which it prompts in the network. In order to minimize data latency time, Luo et al. [25] have introduced an algorithm which utilizes multiple data mules for data collection that reduces the total tour time of each mule. Prince et al. [32] have proposed an unequal clustering based routing protocol using data mule in order to reduce hot spot problem. The protocol specifies a true way for data mule traversal which reduces the complexity of incorporation of the data mule in the routing protocols. The protocol balances the intra-cluster data and inter-cluster data by varying the cluster size. Singh et al. [35] have presented an odd-even level routing in a square grid based clustering with two mobile mules. Authors have divided the whole sensing area into levels and further levels are divided into square grids/clusters. Established along the round number, the mule visits all the levels vertically and collects data from the borderline CHs by stopping at the anchor points and eventually transfer the collected data to the BS. If the round number is even, CHs transmit their data from left to right direction to the CHs in their level and vice versa in case of odd round number. The odd-even level routing reduces the hot spot problem by not forwarding data vertically towards the BS which enhanced the lifetime of the nodes nearer to the BS. The main drawback of this approach is scalability due to the centralized nature.

Nevertheless, most of the above protocols have not discussed about the energy efficiency, hot spot problem and load balancing issues together. In our strategy, we have looked at all these matters and likewise attempted to settle them. A comparative analysis of discussed protocols in this section has been presented in Table 1.

Table 1 Comparative analysis of related work protocols

The proposed scheme has the following advantages over the existing similar type protocols.

  1. 1.

    It considers an unequal clustering where the size of the cluster linearly increases as it goes towards the SP.

  2. 2.

    It has ability to handle the hot spot problem.

  3. 3.

    The distribution of the CH is better.

  4. 4.

    The CH does not rotate in every round. Hence, a significant amount of energy saves due to minimum overhead.

  5. 5.

    It proposed a relationship between different cluster sizes with the help of a factor (r) and also finds the round number after which CH will change (f).

  6. 6.

    The mobile data mule collects data from CHs and delivers it to the BS.

3 Proposed Work

This section, describes our proposed scheme which ensures the equal energy distribution among sensor nodes in the network and extends the network lifetime. The proposed work is divided into four phases- (1) Cluster Formation, (2) Cluster head selection, (3) Route Discovery and (4) Data Forwarding. All four levels are centrally managed by the BS. Each sensor node has a synchronized clock to know the start time of each round. In the cluster formation phase, the sensing area is divided into unequal rectangular grids termed as cluster in this article. The clusterhead selection phase is primarily responsible for selection of CH. The route discovery phase constructs the multihop routing path from CH to the SP. In data forwarding phase, the TDMA slots are allocated to each node for communication.

3.1 Network Model

The network contains a BS, a data mule and n number of sensor nodes, which are scattered in the environment in a random manner. Each node has a unique id denoted by \(\hbox {S}_i\). We have made some assumptions regarding the network model which are presented in this section. The assumptions are quite realistic and can be easily achieved in a deployment.

  • The sensor nodes are homogenous and static.

  • The BS is fixed and positioned outside the network.

  • The BS has information about all sensor nodes regarding their node ids and locations.

  • The data mule moves in a predefined path with constant velocity and stop at SPs.

The different notations used in this article are tabulated in Table 2.

Table 2 Notations used in the protocol

3.2 Cluster Formation Phase

The main objective of this phase is to partition the target area into rectangular grids or clusters. The BS performs the grid (cluster) formation only once after deployment. The grid is fixed for the lifespan of the network. Each grid holds a unique id with fixed number of nodes. Algorithm 1 outlines the working of this phase.

The BS divides the deployment area into m number of horizontal lanes of equal width (w) called levels. Each level is then subdivided into blocks giving the whole network, a grid like structure. The largest cluster which is nearer to the SP is \(w \times w\) meters. To balance the load of CH, each node is selected as CH atleast once during its life cycle. The CHs are selected from each cluster. It may happen that the CHs of two clusters are lying at diagonally opposite ends in the worst case. For this type of situation, we can fix the maximum length and width of a cluster by formulating Eqs. 12 and 3 as shown in Fig. 1.

$$\begin{aligned} \sqrt{w^2 + (2w)^2}=Z \end{aligned}$$
(1)

where \(2w \le w_{next} + w\), therefor

$$\begin{aligned}&\sqrt{w^2+(2w)^2} \le Z \end{aligned}$$
(2)
$$\begin{aligned}&w=\frac{Z}{\sqrt{5}} \end{aligned}$$
(3)

The largest cluster is constructed near to the SP. The width of the cluster decreases with a square exponent formula build with a constant factor r as we move away from the SPs. The parameter \(r=0.5\) is evaluated and found 0.5 to be near-optimal for our simulation as discussed in Sect. 5. The bigger size of the cluster near to the SP can accommodate more number of sensor nodes, which decreases the frequency of a particular node to be selected as CH in multiple rounds. This distributes the traffic load and prolongs the lifetime of the nearer nodes and hence hot spot problem does not arise in the network. The difference in widths of any two clusters belonging to consecutive cluster levels has been expressed as shown in Eq. 4. Based on this relation, the dimension of the sensing area is fixed where, the length is twice of the width. For example: suppose the length and width of the network is 500 and 250 m respectively. Therefore, based on the Eq. CSize, the total number of levels in y axis is 8 and the cluster width decreasing sequence is 40, 39.5, 38, 35.5, 32, 27.5, 22 and 15.5 where, 40 and 15.5 are the maximum and minimum width of a cluster respectively. In x axis approximately 12 levels will formed, since cluster length is uniform and size of the cluster length is \(\frac{Z}{\sqrt{5}}\) which is about 40. Hence, the total number of clusters in this network will be \(12\times 8 = 96\). In this relationship, we have used square power function which falls the value of width by 1 linearly except the first level clusters. We have also looked into this relationship with without power and power three by mathematical computation as well as simulation. In without power case, the difference between two clusters or levels is only 0.5 and with this difference width size is linearly decreasing. Simulation results are also not that much effective, due to this we have not used without power relationship. In font of power three, the deviation between two clusters is very high which is not suited for our system.

$$\begin{aligned} w_{next} = w - ((level_{id})^2)*r \end{aligned}$$
(4)

The BS maintains a record of each cluster after complete execution of aforementioned phase. This record contains cluster id, number of sensor nodes in the cluster and its constituent sensor nodes id, location and energy level.

figure d
Fig. 1
figure 1

Grid formation with maximum w width to restrict the communication in free space radio model

3.3 Cluster head (CH) Selection

Selection of the most suitable node as a CH in each cluster and finding the round in which CH will change are the key functionalities of this phase. These algorithms are executed by BS at the start of each round of operation. The working of this phase is split into two sections: CH change and CH selection. The CH change algorithm determines whether the CH of a cluster will change or not? If yes, the CH selection algorithm is triggered to select new CH for that cluster. Initially, all nodes are having equal energy, hence the BS selects the node as a CH which is nearest to the centroid of the deployed sensor nodes. Thereafter the CH change and CH selections are accomplished by Algorithms 2 and 3 respectively. The BS checks the status of each node by comparing its energy level to \(E_{min}\) given in Eq. 5. If the energy level of a node is less than \(E_{min}\), it is considered as dead or inactive node. The threshold energy \(E_{th}\) for each cluster is calculated in each round. The \(E_{th}\) energy is the minimum required energy for a node of a cluster, which is going to participate in the CH selection process. Different clusters have different \(E_{th}\) due to the variation of nodes and data packets. The \(E_{th}\) of a cluster for a particular round depends on the size of its own cluster data as well as data forwarded to it by the neighboring clusters. The value of \(E_{th}\) depends on the minimum energy required to receive and transmit data as presented in Eq. 8. The minimal energy needed for receiving data \(E_ {recv}\) of a cluster can be derived by estimating the minimal energy required to receive information from its own cluster as well as neighboring cluster as mentioned in Eq. 6. The minimum energy required to transmit in a cluster is based on the distance between the communicating nodes and it can be given by Eq. 7.

figure e
$$\begin{aligned} E_{min}& = {} msg_{length}*E_{elect}+msg_{length}*e_{freespc}*Z^2 \end{aligned}$$
(5)
$$\begin{aligned} E_{recv}& = {} (data_{ownClst}+data_{nbrClst})*E_{elect} \end{aligned}$$
(6)
$$\begin{aligned} E_{trans}& = {} (data_{ownClst} + data_{nbrClst})*E_{elect} + (data_{ownClst} + data_{nbrClst} )*e_{freespc} *Z^2 \end{aligned}$$
(7)
$$\begin{aligned} E_{th}& = {} E_{recv}+ E_{trans} \end{aligned}$$
(8)

The current CH status depends on the factor f and \(E_{th}\). Where, f is the optimal round number after which CH will change and goes for new CH selection. If either \((E(CH_{(level_{id},i)})< E_{th} (level_{id},i))\) or (Round number \(\le f\)) as mentioned in line number 19 in the Algorithm 2 and explains in details in Sect. 5, then CH will change else the current CH remains unchanged. If current CH status has changed then the new CH selection process starts by executing Algorithm 3 for that cluster. The BS updates the status of newly selected CH and previous CH. Algorithm 3 outlines the working of the CH selection process. In this algorithm, the BS first calculates the centroid of the deployed nodes in each cluster by averaging x co-ordinates and y co-ordinates of the participating members of the cluster as presented in line number 8 in Algorithm 2. It also calculates the average energy of the active member nodes within the cluster. Line numbers 11–21 of Algorithm 3 describe the complete process for a node to become CH. The set of nodes having higher energy than the average energy as well as \(E_{th}\) of the cluster are selected as initial CH. The node nearest to the centroid is selected as final CH of that cluster. This minimizes the intra-cluster communication cost due to the minimum transmission distance within the cluster which minimizes the overall energy consumption.

figure f

3.4 Route Discovery Phase

The main objective of this phase is to discover the data forwarding route of the network. Algorithm 4 explains the complete working procedure of this phase.

figure g

The BS performs this process in each round after setup phase. It selects \(CH_{(level_{id},i)}\) of the cluster \(\hbox {C}_{(level_{id},i)}\). It searches the \(\hbox {CH}_{(level_{id},i+1)}\) of the apparent cluster \(\hbox {C}_{(level_{id},i+1)}\) towards the \(\hbox {SP}_{(level_{id})}\). If \(\hbox {CH}_{(level_{id},i+1)}\) is present, it selects this CH to forward the data of \(\hbox {CH}_{(level_{id},i)}\). Otherwise, it first searches the apparent upper level \(\hbox {CH}_{(level_{id+1},i+1)}\) and then the apparent lower level \(\hbox {CH}_{(level_{id-1},i+1)}\). This process is repeated till all the CHs have multiple data forwarding path. The BS maintains a routing path table of the current round after the execution of this phase.

3.5 Data Forwarding Phase

The steady phase allocates the TDMA slots to CMs, CHs and data mule for collision free communication during transmission and reception. The BS finds the cluster, which has maximum number of nodes and determines the corresponding number of nodes (\(\hbox {max}\_\hbox {numSC}_{{level}_{id},i}\)). \(\hbox {t}_{CM}\) is the maximum time required for communication for transmission and reception between the CM and its CH. Hence, the time assigned to each cluster for intra-cluster communication is equal to \(\hbox {t}_{CM} * (\hbox {max}\_\hbox {numSC}_{{level}_{id},i} -1)\). For instance, if the number of CMs is 10 then 9 \(\hbox {t}_{CM}\) time slots are assigned to each cluster. A cluster having 6 CMs utilize 5 time slots and remaining slots are idle for that cluster. After the intra-cluster time allocation, inter-cluster time allocation takes place. If the maximum time taken in transmission and reception of data, including HELLO message exchange between two CHs is \(\hbox {t}_{CH}\), time assigned to each CH for transmitting the received data is equal to \(\hbox {t}_{CH}\). The data mule traverses with a velocity V m/s and waits for time \(\hbox {t}_{wait}\) Sec at each SP to collect the data. The time period of a complete round \(t_{round}\) is calculated based on the above mentioned time units

$$\begin{aligned} t_{round}& = {} t_{CPkt}+t_{CM}*(max\_numSC_{{level}_{id},i}-1)\nonumber \\&\quad +(t_{CH}*Var_{CL})+\frac{(Y-m)}{V}+t_{wait}*m \end{aligned}$$
(9)

Time \(t_{CPkt}\) is allotted to each round as an overhead for the exchange of control packets before the actual data transmission occurs. The time slots \(t_{round}\), \(t_{CPkt}\), \(t_{CM}\), \(t_{CH}\) and \(t_{wait}\) are constant values for the complete network lifetime.

The BS determines the time period of the round by using Eq. 9. It creates a table for each cluster containing the information such as cluster id, \(t_{round}\), number of nodes within the cluster, node id belonging to the cluster, their corresponding location, their TDMA slots, status (active or inactive), role (CH or CM), relay CH node ids and its location. In the first round BS sends a copy of these tables to the corresponding CHs. The CH stores it and forwards it to each CM. Later on, when \(\hbox {t}_{CPkt}\) time lapses, transmission of sensed data takes place according to the allotted time slots. In the subsequent rounds, nodes wait for a \(\hbox {t}_{CPkt}\) time period for the control packet. If it receives a control packet, it updates its cluster table and data transmission startes based on the updated table. The BS sends control packet to a cluster if its CH changes or any node dies. In case the CH changes, BS sends control packet to newly selected CH which transmits this packet to its CMs. Nodes update the role and time slot of the previous CH and the new CH in the table after receiving the packet. The time slots allotted to the new CH and the previous CH are interchanged in the updated table. The BS also sends the control packet to the CHs for which the previous CH is acting as a relay node. These CHs update relay node id and location in their tables.

After the elapse of \(\hbox {t}_{CPkt}\) time period, CMs transmit data to the CH in the allotted time-slot concurrently in each cluster. During the time period \(\hbox {t}_{CM} * (\hbox {max}\_\hbox {numSC}_{{level}_{id},i} -1)\), CH collects the data from all the CMs. All the CHs of the vertical lane, which are farthest from the BS, transmit HELLO message to their relay CH node. After getting the acknowledgment message, they start sending data concurrently to their relay nodes in the allotted time interval \(\hbox {t}_{CH}\). During the time period \(\hbox {t}_{CH} * \hbox {Var}_{CL}\), data are collected by the CHs of vertical lane nearest to the BS. These CHs first establish the connection with the data mule by exchanging HELLO message as it reaches to the SP of the corresponding level and then transmit data to it. Data mule collects the data at all SPs in sequence and delivers it to the BS.

4 Simulation Environment

In order to evaluate the performance of our scheme, we have simulated it in OMNeT++ [38] on Intel Core i7 3.6 GHz CPU and 8GB RAM running on Microsoft Windows 8.1 professional platform. A deployment area of \(500\times 250 \, \hbox{m}^2\) is chosen during simulation where 500 and 750 normal nodes are randomly deployed. The mule moves in the simulation area from one end of the deployment area to the BS. We have simulated 10 different mule movement paths and selected two best paths. The results of mule moving across the boundary and moving in the centre of simulation area are found better than other mule movements. So, the results of mule movements of the above two cases are reported in the article. For each scenario the experiment was performed 40 times and the average result is reported here. The values of the parameters used during simulation are mentioned in Table 3.

Table 3 Simulation parameters

5 Simulation Results and Analysis

The results of the proposed algorithms are compared against well known existing works LPGCRA [24], GBR [3] and GFTCRA [18] protocols. Comparisons and performances of the protocol in both the scenarios are discussed in the subsequent sections based on the simulation results.

First of all, we evaluated the near-optimal frequency of the CH change for both the scenarios. We have set the values of round number after which CH will change and the width decreasing factor of grids as \(f=7\) and \(r=0.5\) respectively through out the simulations. The justification for taking these values and their testified graphs with respect to first node dies (FND) is discussed below.

5.1 Selection of Near-Optimal CH Change Factor (f)

The f represents the round number after which the current CH should leave its role and a new CH should take over. We repeated the experiment with f values ranging from 1 to 10 and noted the round number when we first observed the dead node. The round after which first node died is highest at f value 7 for both scenarios as shown in Fig. 2. The nodes are dying at a very slow rate and uniformly for f value 7 as can be seen from Tables 4 and 5. Upto round number 1750, only 173 nodes died for f value 7 whereas for all other case it is higher than that. This is because the CH change at this rate facilitates the nodes to drain their energy at almost equal rate. However after frequency value of 7, the number of dead nodes in early round starts increasing as shown in Tables 4 and 5. This indicates that for our current setting a frequency value of 7 for CH change is better. Although, after 2000 rounds of operations for frequency of 7, the dead nodes increases dramatically compared to other frequency value. This indicates that the energy of nodes are decreasing uniformly for frequency value of 7 compared to other frequency value. In other cases, some nodes are dying too early due to their depletion of energy indicating that energy drainage is non-uniform.

Fig. 2
figure 2

FND at different f factor with respect to round number

Tables 4 and 5 represents the average number of dead nodes after certain round of operations for the network, where the mule moves in the middle of the network. In a random deployment, the node distribution is the key issue for varying dead nodes in a round. In both the cases, we have considered the average dead nodes of more than 20 different randomly deployed sensor nodes in the target area.

Table 4 Number of Dead Nodes with 500 nodes in the network
Table 5 Number of dead nodes with 750 nodes in network

5.2 Selection of Near-Optimal Width Decreasing Factor (r)

Though our clusters are in unequal size and its width is decreasing in a square exponent as mentioned in Eq. 4, but for this, we should have to take the optimal value of r. In order to efficiently balance the load in clusters and across the network, r should be efficiently set. To analyze the influence of r parameter on network lifetime, we have considered the range of r from 0 to 1 and simulated it on 10 different values between 0 and 1. The simulation is done with respect to round number and range of r to find out the round number when first node has died in the network. As we can seen from the Fig. 3 Range at \(r=0.5\) and 0.6 the FND after 740 rounds but the 0.5 is optimal and best in all cases and for other values like 0.2, 0.4 and 0.8, the FND within 700 rounds. Hence, we have set \(r=0.5\) in our scheme.

Fig. 3
figure 3

The impact of r on FND with respect to round number

5.3 Simulation Results of WSN-Scenario#1

In WSN-Scenario#1, the data mule moves in the middle of the deployment area in a predefined straight line path as shown in the Fig. 4. This straight line path is defined by connecting all the SPs positioned in the middle of each level. The position of the sojourn points \(SP_0\) to \(SP_5\) are (125, 20), (125, 60), (125, 100), (125, 140), (125,180) and (125, 220) respectively.

Fig. 4
figure 4

WSN-scenario#1

5.3.1 Comparison Results in Terms of Dead Nodes

One of the primary motivation in WSN is to prolong the life of operations. Hence, number of dead nodes after a certain round of operations is one of the primary metric evaluated by researchers.

The number of dead nodes versus round number of the proposed work is compared against existing literatures LPGCRA [24], GBR [3] and GFTCRA [18]. The results are plotted in Figs. 5 and 6. As can be seen from the Figs. 5 and 6 that our protocol has the least number of dead nodes throughout the operation of the network compared to other protocols. The first nodes dies (FND) and last node dies (LND) are the key parameters for measuring lifetime of the network. FND may lead to a isolation in the network and LND indicates the total lifetime of the network. It is observed from several simulations that the first node died after the 721 rounds in our protocol, whereas in case of GFTCRA, GBR and LPGCRA, the first node died after round numbers 596, 5 and 7 respectively with 500 nodes as shown in the Table 6. The optimized CH rotation, cluster size selection based on threshold transmission distance and the centroid distance for CH selection are the key factors for the better performance. Uneven clustering is a major lawsuit for the balanced energy depletion of the nodes. This increases the lifetime of the nodes near the BS and minimizes the hot spot problem. It also conserves transmission energy loss of nodes and increases the network lifetime.

Fig. 5
figure 5

Number of dead nodes v/s number of rounds for 500 nodes

Fig. 6
figure 6

Number of dead nodes v/s number of rounds for 750 nodes

Table 6 Round numbers for dead node % in 500 nodes deployment

The same effect is presented in a different representation as shown in Tables 6 and 7 which shows the round number when a fraction of the total nodes are dead. It is seen that our proposed scheme has the maximum rounds of operations before a fraction say 25, 50, and 65% of total nodes are dead as compared to the other existing protocols.

Table 7 Round numbers for dead node % in 750 nodes deployment

5.3.2 Comparison Results in Terms of Residual Energy of the Network

Figures 7 and 8 illustrate the average residual energy (mJ) of the network against the different round numbers for the deployment of 500 and 750 nodes respectively. The comparison results clearly show that the residual energy of the network in our proposed protocol is more than the GFTCRA and even more than the GBR in both deployment cases. This is due to considering the centroid distance and \(E_{th}\) energy for CH selection in our scheme, whereas GFTCRA only considered transmission range and residual energy, and LPGCRA and CBR considered only the residual energy of the sensor nodes. As the number of nodes increases the residual energy decreases as shown in Figs. 7 and 8.

Fig. 7
figure 7

Residual energy (mJ) v/s number of rounds for 500 nodes

Fig. 8
figure 8

Residual energy (mJ) v/s number of rounds for 750 nodes

5.4 Simulation Results of WSN-Scenario#2

In WSN-Scenario#2, the data mule moves along with the vertical boundary of the deployment area towards the BS in a predefined straight line as shown in the Fig. 9. This path is defined by connecting all the SPs located at the border of each level. The position of the sojourn points \(SP_0\) to \(SP_5\) are (250, 20), (250, 60), (250, 100), (250, 140), (250, 180) and (250, 220) respectively. In this scenario we have again simulated with 500 and 750 nodes which are randomly deployed in the area of 500 X 250 meters. The size of the cluster increases from left to right as we move towards SP. The SPs are located at the middle of the right edge of each cluster. The data load of the clusters increases while moving towards the SP but due to the large cluster size and more number of sensor nodes, a particular node will not became CH frequently. From Table 7 we can see that the round numbers against the FND are more in case of 500 nodes as compared to 750 nodes. This is mainly because the network bears higher overheads and has more data to handle in case of higher nodes. However, in this case its performance is far better than other three protocols.

Fig. 9
figure 9

WSN-Scenario#2

5.4.1 Comparison Results in Terms of Dead Nodes

The performance in terms of dead nodes versus round number in both types of node distribution: 500 and 750 slightly changes from the WSN-Scenario#1. Figures 10 and 11 demonstrate the comparison graph of the dead nodes with respect to the number of rounds with other three similar existing schemes. It is seen that the proposed protocol performs better and it has less dead nodes even in the higher round numbers as compared to existing protocols LPGCRA, GBR and GFTCRA. In this scenario, data mule moves at a uniform speed and collects data from the borderlined CH directly nearer to the BS, which cuts down the load of nodes which are nearer to the BS. This increases the overall lifetime of the network and it is evident for the reduction of the hot spot problem in our protocol.

Fig. 10
figure 10

Number of dead nodes (mJ) v/s number of rounds for 500 nodes

Fig. 11
figure 11

Number of dead nodes v/s number of rounds for 750 nodes

5.4.2 Comparison Results in Terms of Residual Energy of the Network

The average residual energy of the entire network is plotted in Figs. 12 and 13 against the number of rounds by simulating the proposed scheme with LPGCRA, GBR and GFTCRA algorithms. The energy consumption in this scenario is higher as compared to the WSN-Scenario#1 mainly because of more overheads and data is transmitted by additional nodes

From Figs. 12 and 13, it is clearly observed that our scheme performs far better than the LPGCRA, GBR and GFTCRA algorithms even beyond 2000 rounds. This means that energy consumption per round of the network is less as compared to other existing protocols. The hot spot problem is also reduced in this scheme due to survival of more number of the nodes that are near the BS.

Fig. 12
figure 12

Residual energy (mJ) v/s number of rounds for 500 nodes

Fig. 13
figure 13

Residual energy (mJ) v/s number of rounds for 750 nodes

6 Discussion

The major contribution of this research is determining the size of unequal clusters in order to divide a large area sensor network. We established the relationships between different sized clusters as presented in Sect. 3.2. In this way, we have formed larger size clusters near the BS which prevented the early death of sensor nodes and minimized the occurrence of hot spot problem. The other major finding of this research is the near-optimal time to change the CH as evaluated in Sect. 5.1. Since CH have not been changed in every round, overheads are reduced and a significant amount of energy has been saved. Several path planning schemes of data mules have been presented in research [2, 4, 8], but most of them tried to make the mule visit each and every node. But this work has used a single mule and at the same time, communication is done only with vertical boundaries CH. This reduces the mule movement complexity and increases the lifetime of sensor nodes. It is evident from the simulations for scenario#1 in the results section that a mule saves more energy if it moves through the middle of the network as compared to borderline movement. Although, we have considered a BS which is located outside the network, our scheme should work efficiently even for scenarios where the BS is placed in the centre of the network. In such cases, the same scheme can be applied after dividing the network into two or four equal parts.

The proposed protocol performed better for WSN-Scenario#1 than WSN-Scenario#2 in terms of dead nodes, average residual energy and network lifetime. In both scenarios, hot spot is also slimmed as our protocol sustains upto more than 3000 rounds, whereas other protocols sustain below 2000 rounds. Also the proposed protocol provides better performance over the existing protocols like LPGCRA, GBR and GFTCRA in both the scenarios.

The worst case time complexity of our routing scheme is O(p) per CH which is similar to the [18], where p is the maximum number of paths of the CH. In route discovery algorithm, p processing time is required for picking all alternative paths. However, we have increased the lifetime of the network as compared to [18] by keeping the routing complexity same. Another complexity that we have calculated is for overheads due to the flow of control messages which is founds to be O(M) as discussed in Lemma 2. The complexity is similar to the [40] which proves the simplicity of our scheme. For a reliable data communication in a multihop routing, the connectivity between CH with other CHs or relay nodes with alternative paths is very essential. The Lemma 1 proved that our proposed multihop routing is reliable and adequate to handle cases where relay nodes or CHs’ fail.

Lemma 1

Each CH ensures that its connectivity with other adjacent CHs in the network is in between \(\ge (1 - \frac{1}{2^3})\) and \((1 - \frac{1}{2^2})\).

Proof

As mentioned earlier, the target area is partitioned into number of unequal size grids. The CHs are appointed among the normal nodes of each grid. The connectivity of a CH with all its adjacent CHs is certain with this type of grid partition. The maximum distance between two consecutive grids is the threshold distance of the free space model. Hence, suppose a \(CH_i\) positioned at any corner of the four neighbouring grids will be free to communicate with one of its neighbor CH, say \(CH_j\), which is also situated at the corner. That means, both CHs are placed at diagonally opposite corner of the radio range Z in the worst case as shown in the Fig. 1. A \(CH_i\) is always within the communication range of maximum 4 CHs and minimum 2 CHs(for corner). However, maximum 3 neighbour CHs can forward their data towards the SP.

A CH is either dead or alive in the network hence, its probability will be \(\frac{1}{2}\). So, the probability of minimum one alive CH for forwarding data towards the BS is \(\ge (1 - \frac{1}{2^3}) \approx 0.87\) which is very high. If the CH is at any corner of the network, then the maximum neighbour in CHs are 2 and the probability of the connectivity is \(>= (1 - \frac{1}{2^2}) = 0.75\) which is also very high.

This proves that our grid partitioning method has better percentage of CHs connectivity.

Lemma 2

The complexity of the overhead due to control messages is O(M) in the network.

Proof

In this scheme, the BS is doing maximum work like network partition, CH selection and route setup. So, the nodes only receive the broadcast packets of BS and send their status (residual energy) information through data packets to the BS in each round. Suppose, M sensor nodes are placed in the network, so the total broadcast packets received by nodes are M. In each round, member nodes broadcast a join message in each cluster and each CH broadcasts a CH message, a schedule message and a route setup message. If in a network N number of CHs are selected, then the total number of join message is \(M-N\) and number of messages between the CHs is N. So, the maximum control messages in the network is: \(M+(M-N)+N+N+N = 2M+2N\). Therefore, The complexity of the overhead due to control messages is O(M) in the network.

7 Conclusion and Future Work

In this paper, we have proposed a centralized unequal clustering based routing protocol for mitigating the hot spot problem of the multihop routing. A data mule, which moves in a predefined path and time interval, is used to gather the data from the network and deliver it to the BS. The simple, straight path of the data mule reduces the complexity of the protocol as compared to other protocols. Due to centralized nature of the protocol, maximum of the tasks are performed by the BS. This ultimately preserves the energy of nodes and prolongs the overall network lifetime. The well organized size allocation and size variation of clusters reduces the frequency at which a particular node becomes CH, which results uniform energy consumption in the whole network. The simulation results show that the proposed protocol is better than the existing protocols LPGCRA, GBR and GFTCRA based on number of dead nodes, average energy depletion and lifetime of the network in both the scenarios. The near-optimal value of r and f, which we have suggested in this paper is also a major cause of the performance enhancement in all aspects. The simulation results clearly indicate the lifespan of the whole network is more than 3000 rounds, which evident that the hot spot problem is reduced significantly in the network. Further, the results shows that the protocol performs better when the data mule moves in the middle section of the network as compared to movement at the outskirts of the network towards the BS.

In future, we will attempt to implement this scheme in distributed nature and analyze the performance at different data mule speeds and different data generation rates of the sensor nodes.