1 Introduction

In recent times, the area of microelectronics has become more advanced and lead the research and development on low-cost wireless sensor nodes, resource constraint devices, and tiny nodes. Wireless sensor network (WSN) plays a vital role in many WSN-assisted IoT applications [1]. WSN-assisted IoT has a broad range of applications that includes a smart parking system, industrial wireless network, healthcare monitoring system, border security monitoring, and animal monitoring system [2, 3]. The sensor nodes, actuation, radio frequency identification (RFID), and many more devices are combined and form a WSN-assisted IoT network to obtain some common objective. The participate nodes in WSN-assisted IoT make the physical object conscious about the various real attributes in the deployed network, such as hearing, monitoring, feeling, and trigger an event with the coordination of other devices [4]. The generated sense data in a network are transmitted to the Base station (BS) by the routing protocol. Most of the participating devices in WSN-assisted IoT networks are run on limited non-rechargeable energy sources such as a battery. Generally, WSN-assisted IoT applications deployed in the rugged and dense environment, and it is complicated to replace or add the energy sources in the sensor. Therefore, efficient energy management is a critical issue in WSN-assisted IoT [5, 6].

The WSN-assisted IoT has brought many applications and makes daily human life more convenient. The deployed network in some of the WSN-assisted IoT applications may remain static or dynamic, depending on the application requirements. For example, border monitoring and environment monitoring supports static network whereas, smart healthcare monitoring system and animal monitoring system support dynamic network. In a dynamic network, nodes move freely from one location to another. The movement of the nodes varies with respect to the time, along with their velocity, pause time, and acceleration. Therefore, in the past various mobility models were proposed to mimic the user mobility patterns, and their results can vary with respect to each other. By review of earlier reported work, we found that Random walk and Random waypoint is the most popular mobility model for simulating the mobile network of WSN-assisted IoT. In the Random walk mobility model, the node selects their destination with random speed and direction with zero pause interval. It is a memoryless model, and information of previous states does not affect the future state. The node in the Random waypoint mobility model is very similar to Random walk with a small variation. The nodes in Random waypoint can choose their destination with a pause interval under some defined range, and future state depends on the previous state [7, 8].

The WSN-assisted IoT networks are deployed at a vast area as compared to WSN; also, numbers of nodes (mostly resource constraint) are large as compared to the WSN network. Hence, traditional WSN methods may not be effective in scalable WSN-assisted IoT network. To overcome the scalability and network lifetime issue, many researchers adopted the cluster-based hierarchical framework [9, 10]. The prime focus in cluster-based routing protocols is on the efficient Cluster Head (CH) selection and organizing of remaining nodes in the clusters. The nodes within the cluster send their data to the CH. The CH has accountability for the data aggregation of local cluster member and conveys the aggregated data to the base station or adjacent CHs toward the Base Station (BS). This results as extra load on CH which leads to more energy depletion as compare to normal nodes [11, 12]. The CH near to BS depletes their energy faster as compare to other nodes because it also acts as a Relay Node (RN) for aggregated data sent by distant CHs. To address all these issues, in this paper, a multi-tier based clustering framework and subdivision technique are proposed for network deployment along with efficient routing protocol that aims to achieve load balancing in the network and transform the long-distance communication into the shorter multi-hop distance communications so that network lifetime can be increased. The major contribution of this paper can be summarized as follow:

  1. (A)

    A multi-tier hierarchical framework based on subdivision technique for node deployment that aims to reduce long distance communication. The proposed multi-tier framework can support load balancing, and scalability.

  2. (B)

    An effective RN and CH selection method for the proposed framework has been introduced. The proposed approach supports load balancing in the network.

  3. (C)

    In this work three node deployment scenarios have been considered to make multi-tier hierarchical framework more realistic as various applications of WSN-assisted IoT. The considered scenarios are: (a) Sensor nodes will be static in the network (b) Senor nodes are allowed to move within their local cluster boundary only (c) Sensor node are able to move within the entire network area.

  4. (D)

    The proposed work in this paper has resulted in improved network lifetime along with lesser energy consumption when compared with LEACH, M-LEACH, EA-CRP, TDEEC, DEEC, SEP, and MIEEPB.

The structure of a paper is as follows: literature review of static and dynamic WSN routing protocols are given in Sect. 2. The system framework, mobility scenarios, subdivision algorithm and energy model are formulated in Sect. 3. The Scalable and Energy-Efficient routing protocol (SEEP) for the proposed framework is presented in Sect. 4. Section 5 will discuss the simulation environment. Section 6 discusses about the simulation results of the proposed framework and their comparative analysis for static and mobile node scenarios under various mobility models. Later, the conclusion has been discussed.

2 Related work

The challenge in WSN-assisted IoT is slightly different from the traditional wireless and wired network due to vast and scalable networks. The classical routing protocol is based on IP based protocol and does not match the requirements of WSN-assisted IoT due to less support of scalability. Hence, many effective routing methods have been introduced to cope with the demands of WSN-assisted IoT. Most of the routing protocols can be classified into three categories: hierarchical or location-based routing and data-centric. In hierarchical routing, nodes are organized into the clusters, and some of the optimal nodes in the cluster are promoted as a Cluster Head (CH). The local member of clusters sends the data to their CH. CH gathers the data into one packet through the data aggregation scheme and relays the data to the base station (BS). The main job of normal nodes is to sense the network area and transmit the sensed data to their CH.

One of the most inspiring and popular hierarchical routing protocols is known as Low Energy Adaptive Cluster Hierarchy (LEACH). LEACH) is a multi-hop routing protocol, where the node in the network is grouped into clusters. The network lifetime of LEACH is divided into rounds, where each round is subsequently divided into the set-up and steady-state phase. In the set-up phase, the network is organized into clusters. Some of the nodes in the cluster are promoted as CH by the probability function (based on random function, which varies between 0 to 1). In a steady-state phase, CH aggregates the sensed data into one packet and consequently conveys to BS directly [13]. LEACH is not suitable for the WSN-assisted IoT network due to some drawbacks. LEACH does not include any relaying method between CH and BS. When the network is deployed in a large area, the distance between CH and BS is increased, and it leads to massive energy depletion. Secondly, the CH selection method in LEACH is randomized and does not assure the best node selection as CH. Mhatre and his colleague come with the variant form of LEACH, which is known as a multi-hop form of LEACH (M-LEACH). In the steady-state phase of M-LEACH, CH sends the data to the BS via other intermediate CHs in a multi-hop way instead of direct transmission as LEACH. M-LEACH gains a better network lifetime and supports uniform energy depletion [14]. Loscri and his colleagues introduced the two-level hierarchy routing protocol (TL-LEACH). TL-LEACH based on the random rotation of primary and secondary cluster heads and made a two-level hierarchy. TL-LEACH distributes the energy load in a better way, specifically when the density of the network is large [15]. To overcome from the LEACH drawbacks, centralized from of LEACH (LEACH-C) is introduced. In LEACH-C, BS has the accountability to discover the optimal node as CHs, along with the best node to handle this selection process. The CH selection method of LEACH-C is slightly better as compared to LEACH random selection method. Before forming the cluster in LEACH-C, BS requests to all nodes in the network to send their location and current remaining energy in the set-up phase of every round. The direct communication between sensor nodes and BS puts a heavy burden on the nodes [16]. The authors in [17] came with a better solution and proposed a Fixed Low Energy Adaptive Clustering Hierarchy Protocol (LEACH-F). In LEACH-F, the set-up overhead phase is eliminated, and BS elects CHs and their local member one time until the network becomes dead. CH role rotates with other cluster members to provide uniform energy consumption, which is resulted in a better network lifetime. EESAA protocol [18] added residual energy as one more parameter for interchange between sleep and active mode to conserve energy. The Hybrid Energy-Efficient Distributed Clustering protocol (HEED) introduced a method for CH selection, which selects according to the hybrid of node residual energy and the node degree [19]. After going through the above-reported protocols, we identified many issues for Cluster Head (CH) selection and their arrangement, which are not adequate as per the basic requirements of the WSN-assisted IoT network. Robustness is an essential feature to maintain network connectivity and data transmission. But due to the early death of nodes near to BS, the protocols reported in this section may not maintain network connectivity. These routing protocols work on the common objective as energy optimization. But they do not account for other WSN-assisted IoT features like scalability, load balancing, etc. Therefore, they are not preferred for scalable and real-time WSN-assisted IoT application.

Huang et al. proposed a novel deployment scheme for green IoT network [20]. The proposed scheme is based on a hierarchical framework, along with communication constraints. The sensor nodes are not allowed to communicate with each other directly, and they can communicate via RN. The sensor sends the sensed data to the RN, and RN transmits the sensor data to the base station. When the network area is vast, direct communication between RN to base station cause the massive energy depletion and decreases the network lifetime.

Rani et al. introduced Minimum Energy Consumption Chain-based Cluster Coordinator Protocol (ME-CBCCP) [21] protocol for WSN-assisted IoT network. The MB-CBCCP consider hierarchical cluster framework for node placement along with some communication constraints to prolong the network life. This technique selects the RNs randomly among the shortest distance nodes and does not consider any distance parameter for the next RN selection. So, some of RNs can be overlapped in each other sensing area. It causes additional RN participation to cover the respective cluster. Hence, this cause additional system cost, and resource utilization is inefficient. The MB-CBCCP pre-decides the possible number of nodes as RNs randomly, which is not preferable. The number of RN selection can vary from cluster to cluster, and it should depend on the node density of the clusters.

The authors in [22] proposed an improved version of LEACH and called Power-Efficient Gathering in Sensor Information Systems (PEGASIS). PEGASIS makes the chain of node and sensor node transmit and receive data from their nearest neighbours. In all chains, one node in the network is selected randomly as a leader node. All nodes in the network communicate to each other in such a way that data reaches the base station by leader node. PEGASIS provides a better lifetime as compared to LEACH due to equal distribution of power consumption, which results in less energy depletion per round in network. Jafri and his colleagues modified the PEGASIS protocol and included the multi-head chain along with the sink mobility concept. The sink is allowed to move in a fixed trajectory from one region to the other. It waits for a sojourn time at the sojourn location (the location where the sink temporarily stays for data collection). Sojourn location is predefined before the network deployment. Sojourn time can be defined as the time interval for which sink stays at a specific decided position and collects data from the chain leaders. This approach increases the network lifetime as compare to PEGASIS [23].

To prolong the network lifetime, many researchers introduced the heterogeneity-aware protocol by using a hierarchical clustering scheme. SEP, DEEC, and TDEEC are one of them. SEP is an extension of LEACH, and it works on two levels of heterogeneity, and the level of heterogeneity cannot be increased by more than two. SEP increases the stability period (time interval before the first node dies) due to more powerful heterogeneous nodes in terms of energy [24]. Distributed Energy-Efficient clustering (DEEC) is an improved protocol over SEP, where it can take two or multi-level heterogeneity. The Cluster Heads (CHs) in DEEC are selected based on dynamic probability. It can be calculated by the ratio of residual energy of every node and the average energy of the entire nodes in the network. Node with the highest energy becomes a CH [25]. Threshold Distributed Energy-Efficient Clustering (TDEEC) is an enhanced form of DEEC. The CH selection in TDEEC is based on the modified threshold value, and it is calculated based on the ratio of node’s residual energy and average energy of network with concern to the optimal number of CH [26].

Darabkh and his colleagues proposed an energy-efficient EA-CRP routing protocol. It is based on the layering and clustering framework. EA-CRP performs well for the scalable network, but when the network size is increased, it forms less number of layers in the network and which is resulted in long-distance communication between the layers. Therefore, it provides less network lifetime for large network area [11]. Priyan et al. proposed a framework for the Internet of Vehicle (IoV) based on the Random waypoint mobility model. This healthcare monitoring system considered the simulation scenario, which includes mobile ambulances, mobile doctors, and patience. The Performance Rank (PR) is used for the selection of best ambulance (mobile node) among all the available ambulances. PR is calculated for every ambulance by these parameters: Euclidean distance from the patient to the mobile ambulance, medical capacity, and the number of patients currently using this ambulance. The ambulance with minimum PR is selected for a needy patient [27]. The prime focus of the above-discussed protocols is to achieve energy-efficient communication to increase the network lifetime for the static network. These routing protocols are not suited for WSN-assisted IoT applications due to complex nature and less support for scalability. In this paper, Scalable and Energy-Efficient routing protocol (SEEP) has proposed along with multi-tier based clustering framework. In this work, a strategy for the zone and cluster formation is introduced, which is varied along with the size of the network area to meet the requirement of an energy-efficient, scalable network. Furthermore, proposed framework considered static and mobile scenarios by applying Random walk and Random waypoint model for node mobility while evaluations to make it more realistic as the various application of WSN-assisted IoT.

3 System model

In this section, an overview of the deployment framework with various scenarios is presented. Subsequently, the sub-area division algorithm will be discussed to organize the network into zones and clusters. Later, various assumptions related to communication constraints and energy model will be covered.

3.1 Proposed framework

Figure 1 represents the multi-tier based clustering framework. The nodes in this framework either can be static or mobile, which depends on the application requirement. The node mobility concept brings by applying Random walk (RW) and Random waypoint (RWP) mobility model. The proposed framework can be used for those WSN-assisted IoT applications, where participating nodes in the network are static and dynamic, for example, Border security force [21], Animal Tracking and Monitoring System [28], Smart healthcare system [27]. In the multi-tier framework, the rectangular shape network area is considered. The top layer is configured as a base station layer, and the network area is divided into various zones. Every zone is divided into subzones, and it is denoted as a cluster. The areas of zones are reduced toward the BS, and the numbers of clusters are increased toward the BS to balance the network load uniformly. The Lower zone (zone 1) has the maximum area with the minimum number of clusters, and the upper zone (zone 3) has a minimum area with the maximum number of clusters. The reason can be listed here that the upper zone not only sends their aggregated data towards the BS, but it also relaying traffic of lower zones. In other words, the intracluster communication distance is reduced as moving towards the BS by increasing the number of clusters with decreasing zone area. The clusters have the same size within the local zone.

Fig. 1
figure 1

A multi-tier based clustering framework

In this work, a smart health care application is considered for the multi-tier based clustering framework (Fig. 1). In a smart healthcare system, persons are equipped with IoT wearable devices to collect their clinical data continuously. The clinical data is transmitted to the base station (BS) via a Relay Node (RN) and Cluster Heads (CHs). The clinical data is sent to a local cluster RN, where the RN forwards this data to their CH, and CH sends the data to the upper zone CH. This process is repeated until the data is received by the BS from the topmost layer CH. Inter-Zone communication is based on some constraints, which will be discussed in the upcoming section. When the patient’s health is critical, the alert message will be forwarded to the Base Station (BS), and BS broadcast this message with all mobile ambulance along with patient id, zone id, and their current cluster id so that the ambulance can reach within the minimum time. The wearable or implanted IoT devices in the patient body cannot be recharged or changed on a daily basis, so there is a need of efficient communication method to avoid inconvenience.

3.2 Different scenario of deployed network

The proposed a framework covers various WSN-assisted IoT applications. It is extension of work carried out by Darabkh et al. [11] for static IoT network. We modified and enhanced their work for mobile WSN-assisted IoT application also. Three different practical network deployment scenarios are considered, and they are discussed below.

  • Scenario 1 In this deployment, nodes remain static after network deployment and their pseudo code is given in Algorithm 2. This type of topology used in many WSN-assisted IoT applications such as environment monitoring [21].

  • Scenario 2 In this scenario we considered the case where nodes are free to move after network deployments, but node movements are constrained within their respective zone boundary. The pseudo code for this scenario and their graphical representation are given in Algorithm 3 and Fig. 2 respectively. This type of network topology is used in animal monitoring application of WSN-assisted IoT, where the animal can roam in their respective terrain [29]. We assumed that animals are attached with some wearable IoT device for monitoring purpose and it is represented in Fig. 2.

    Fig. 2
    figure 2

    Node movements within their respective zone (Scenario 2)

  • Scenario 3 In this deployment, nodes are free to move from one zone to another zone across entire network area and their pseudo code to deploy this network is given in Algorithm 4. This type of topology used in Smart healthcare monitoring system based on WSN-assisted IoT, where the users can move in entire city [30]. We have taken an assumption that humans are attached with medical sensors and represented in Fig. 3.

    Fig. 3
    figure 3

    Nodes movement in the entire network (Scenario 3)

3.3 Network area subdivision algorithm

After many numbers of simulations, the proposed framework division techniques have been derived to increase the network lifetime. Moreover, the proposed framework is generic and simple so that it can be applicable for many WSN-assisted IoT applications.

Let R is the sensor radius, DA is the diameter of RN sensing area, DG, and S is the diagonal and side length of the maximum size rectangle within the RN sensing area. Here, width represents the network area width.

The number of zones will be formed in the network as follows:

$$N_{Z} = \frac{Width }{R} + 1$$
(1)

We know that

$$\begin{aligned} & {\text{D}}_{\text{A}} = 2{\text{R}} \\ & {\text{D}}_{\text{G}} = {\text{S}} \times \sqrt 2 \\ \end{aligned}$$

We can see form the Fig. 4 that diagonal and diameter have same length. Hence,

Fig. 4
figure 4

Maximum size rectangle in RN sensing area

$$\begin{aligned} & {\text{D}}_{\text{A}} = {\text{D}}_{\text{G}} \\ & 2{\text{R}} = {\text{S}} \times \sqrt 2 \\ & {\text{S}} = 2{\text{R}}/\sqrt 2 \\ \end{aligned}$$

The area of rectangle is:

$$\begin{aligned} & {\text{S}} \times {\text{S}} \\ & (2{\text{R/}}\sqrt 2 ) \times (2{\text{R/}}\sqrt 2 ) \\ & 2{\text{R}}^{2} \\ \end{aligned}$$

Let assume X and Y are the length and width of the last zone respectively.

The area of zone is X × Y.

The maximum numbers of clusters (\(N_{{C_{1} }}\)) will be formed to cover the bottom most zone as follows:

$$N_{{C_{1} }} = \left\lceil {\frac{{X \times {\text{Y}} }}{{2R^{2} }}} \right\rceil$$
(2)

Bearing in the mind that the numbering of zones starting from 1 at the bottom most layer. Number of clusters created in the next following layers is defined as follows:

$$N_{{C_{i} }} = N_{{C_{i - 1} }} + 1,\quad 1 < i \le N_{Z}$$
(3)

The width of zones is formulated as following:

$$Width_{{Z_{L} }} = \left\lfloor {\frac{R}{\sqrt 3 }} \right\rfloor$$
(4)
$$Width_{{Z_{M} }} = \left\lceil {\frac{{Total width - Width_{{Z_{L} }} }}{{N_{Z} - 1}}} \right\rceil$$
(5)
$$Width_{{Z_{K} }} = Width_{{Z_{K} - 1}} + \left\lfloor {\frac{{Width_{{Z_{m} }} - Width_{{Z_{L} }} }}{M - 1}} \right\rfloor$$
(6)
$$Width_{{Z_{1} }} = Total width - \mathop \sum \limits_{i = 2}^{{N_{Z} }} Width_{{Z_{i} }}$$
(7)

here ZL represents the upper zone (nearest to BS), Z1 represents the lower zone, ZM represents the next zone to the middle one (M = Round (NZ/2) + 1), ZK represents the zone apart from define above (1 < K < M or M < K < ZL).

It can be seen from Eq. 4, width of ZL zone is proportional to \(\frac{R}{\sqrt 3 }\). After many numbers of experiments for different network area and different sensing radius, we got the better network lifetime as compare to other value. Therefore, in this work, we considered the \(\sqrt 3\) in denominator instead of other value.

3.4 System constraints

In this framework, all nodes can perform the data reception, data aggregation, and transmission. But the nodes, which are Normal Nodes (NNs) can transmit the data only; whereas other types serve nodes (RNs and CHs) can perform the data reception, aggregation, and transmission to reduce the energy consumption. Communication between NNs can take place through local cluster RNs only. RN forwards the aggregated data to the local CH. However, cluster heads of local zone (CHLZ) can communicate only two nearest neighboring cluster head of upper zone (CHUZ), where zone id of CHUZ is exactly one greater than zone id of CHLZ. CH transmits the aggregated data to the BS via intermediate CHUZ. Communication constraints of SEEP is summarized in the following points:

  1. (a)

    Normal Node (NN) to Normal Node (NN)

    NNs are not allowed to communicate directly in SEEP framework, even if NNs belong to same cluster or zone.

  2. (b)

    Normal Node (NN) to Relay Node (RN)

    NNs can send the data to their local cluster RN. But, NN are not allowed to communicate to other clusters RN.

  3. (c)

    Relay Node (RN) to Cluster Head (CH)

    RN can send their aggregated data to the local cluster CH and RNs are not allowed to communicate to other cluster CH or upper zones CH.

  4. (d)

    Cluster head of upper zone (CHUZ) to Cluster head to local zone (CHLZ)

    CH of lower zone can send their data to upper zone neighbor CH, which is discussed in detail in Sect. 4.3. If the upper zone CH become dead due to low energy, then lower zone CH can send the packet directly to BS.

Let p and q two nodes; they will be called neighbor nodes if they can communicate with each other via above discussed communication constraints. Let N(p) represents the set of p neighbors node, and A represents the adjacency matrix. The adjacency matrix is used in energy depletion equation.

$$A = \left[ {\begin{array}{*{20}c} {A_{11} } & {A_{12} } & \ldots & {A_{1\left| N \right|} } \\ {A_{21} } & {A_{22} } & \ldots & {A_{2\left| N \right|} } \\ \vdots & \vdots & \ddots & \vdots \\ {A_{\left| N \right|1} } & {A_{\left| N \right|2} } & \ldots & {A_{\left| N \right|\left| N \right|} } \\ \end{array} } \right]$$
(8)

where Apq= 1 if \(q\epsilon N(p)\), otherwise Apq = 0.

3.5 Energy model

The main source of energy depletion in IoT network is data transmission and reception. Sensor nodes deplete the energy in sensing and processing is very less. In this work, the first order radio model is followed [31] for energy depletion in data communication, which is given below:

$$E_{s } = L \times \left( {E_{elec} + \in_{X} \times d^{2} } \right)$$
(9)
$$E_{r} = L \times E_{elec}$$
(10)
$$E_{l} = L \times \left( {E_{elec} + \in_{X} \times d^{4} } \right)$$
(11)

The energy depletion in short distance and long-distance communication is identified by Eqs. (9) and (11) respectively. Data reception at RN and CH nodes and their utilized energy in this task is computed by Eq. (10). Energy depletion per unit time for each node is calculated by:

$$E_{a} = \mathop \sum \limits_{b \in RN} A_{ab} .F_{ab} .\left( {E_{elec}^{NN} + \in_{1} . d_{ab}^{2} } \right) \quad \forall a \in NN$$
(12)
$$E_{b} = \mathop \sum \limits_{a \in NN} A_{ab} .F_{ab} .E_{elec}^{RN} + \mathop \sum \limits_{{c \in CH_{LZ} }} A_{bc} . F_{bc} .\left( {E_{elec}^{RN} + \in_{2} . d_{bc}^{2} } \right)\quad \forall b \in RN$$
(13)
$$E_{c} = \mathop \sum \limits_{b \in RN} A_{bc} .F_{bc} .E_{elec}^{{CH_{LZ} }} + \mathop \sum \limits_{{d \in CH_{UZ} }} A_{cd} . F_{cd} .\left( {E_{elec}^{{CH_{LZ} }} + \in_{3} . d_{cd}^{2} } \right)\quad \forall c \in CH_{LZ}$$
(14)
$$E_{d} = \mathop \sum \limits_{{c \in CH_{LZ} }} A_{cd} .F_{cd} .E_{elec}^{{CH_{UZ} }} + \mathop \sum \limits_{{e \in CH_{UZ} U BS}} A_{de} . F_{de} .\left( {E_{elec}^{{CH_{UZ} }} + \in_{4} . d_{de}^{2} } \right)\quad \forall d \in CH_{UZ}$$
(15)
$$E_{e} = \mathop \sum \limits_{{d \in CH_{LZ} U CH_{UZ} }} A_{de} .F_{de} .E_{elec}^{BS} \quad \forall e \in BS$$
(16)

Ea, Eb, Ec, Ed, Ee denote the energy depleted by NN, RN, CHLZ, CHUZ, and BS respectively in data transmission and reception. The symbols \({\text{E}}_{\text{elec}}^{\text{NN}}\),\({\text{E}}_{\text{elec}}^{\text{RN}}\), \({\text{E}}_{\text{elec}}^{{CH_{LZ} }} ,\)\({\text{E}}_{\text{elec}}^{{CH_{UZ} }}\), \({\text{E}}_{\text{elec}}^{\text{BS}}\) indicate the energy consumption in the radio electronics of NN, RN, CH, CH and BS. \(\epsilon_{1},\epsilon_{2} ,\epsilon_{3} ,\epsilon_{4}\) are used as node amplifier for NN, RN, CHLZ and, CHUZ respectively. Fab represents the data rate between node a and b. Equation (12) denotes the energy consumed in data transmission processes between NN to RN within the intra cluster. The energy consumed in data receiving processes from NN to RN and sending the data from RN to CHLZ (cluster head of local cluster) by RN is computed by Eq. (13). The energy consumed by CHLZ for the data reception from RN and transmission to the CHUZ (upper zone nearest neighbor’s CH) is represented by Eq. (14). Equation (15) expresses the energy consumed by CHUZ for the data transmission process to upper zone CH or the BS and the data receiving processes by the CHLZ. Equation (16) shows the consume energy at the BS layer. All of the above equations exclude the energy depletion in signaling data by the BS to nodes, because it is negligible as compared to data transmission and receiving [20, 21].

3.6 Aggregation model

In this framework, all nodes (except NN) can perform data aggregation, which significantly reduces the energy depletion of the sensor nodes. Figure 5 represents the simple scenario of data aggregation, where nodes are located in a network. Each Node sends the data packet to the BS. In the first scenario, node start the communication without data aggregation, and total identified number of packets is n*(n + 1)/2. In the second scenario, nodes send the packets with data aggregation and the total observed packet is n. Each node aggregates its data with the preceding node’s received packet. So, the total number of reduced packets would be (n + 1)/2 with data aggregation in the network [31].

figure c
Fig. 5
figure 5

a Data communication without data aggregation and b with data aggregation

4 Proposed scalable and energy-efficient routing protocol (SEEP)

This section introduces the proposed routing protocol. SEEP is a self- sustained protocol with multi-tier based clustering framework that aims to increase the network lifetime by utilizing load balancing and uniform energy depletion among the nodes in the scalable WSN-assisted IoT network. The description of SEEP includes various phases. The first phase is network set up phase followed by network scenario 1, 2 and 3. Further RN selection is explained and finally CH selection is discussed.

figure d
figure e
figure f

4.1 SEEP routing protocol

In SEEP, each node in the network sends their data to the BS in a multi-hop way. In algorithm 1 from line 1-11, BS computes the various divisions related to network. After that SEEP will deploy the network for the selected scenario (Scenario will be selected by the user according to application requirements). Furthermore, CH and RN selection will be done via BS with algorithm 5 and 6 for the selected scenario. After the high responsibility node (RN and CH) selection, sensor nodes transmit their sensed data to the local cluster’s RN. RN aggregate the neighbor data into its own message and a single message forwards to local CH. CH relays the aggregated data to preferable neighbor CH towards the BS. Preferable CH is selected based on the ratio of residual energy and their distance to lower zone CH. CH with more ratio will be selected as a neighbor CH for relaying the data to the upper zone (line 12-32). To make the system effective, high responsibility node’s energy is compared with the energy threshold and if it is found to be less than the new eligible node will be selected as high responsibility nodes (line 33–46). The whole processes will be iterated repeatedly until all nodes in the network become dead. CH and RN selection will be done by base station, whenever nodes have residual energy less than the threshold energy. The control flow of SEEP represents in Fig. 6.

Fig. 6
figure 6

Flow chart depicting the working of SEEP

4.2 Set up phase

In the first phase of SEEP, sensor nodes are deployed in a rectangular shape area. It is good to keep in mind that BS is aware of network attributes such as length and width of a network, and node position (by using global position system) [11, 21]. Furthermore, BS performs the computation at once related to various divisions such as the number of zones, cluster per zone, and width of every zone in the network by Eqs. 1 to 7. The zone and cluster id are assigned to every zone and cluster. Subsequently, BS sends the detailed message to every node in network related to their zone id, cluster id, and neighbors list. Nodes reside in the same cluster are neighbors. Nodes behaviour remains either static or mobile according to the nature of various WSN-assisted IoT applications.

figure g

4.3 CH selection

After the network set up phase, CH selection will be invoked by algorithm 5 for the selected scenario. The CH with more remaining energy can support the uniform energy depletion in the network. On the other end, CH near to BS can decrease the communication distance in between and prolong the network lifetime. In algorithm 5 from line 3 to 8, the Euclidean distance and average distance are calculated between nodes and BS. Subsequently, the value of an energy threshold is calculated to check the eligibility of nodes to become a CH. The energy consumed for sending the data for an average distance will be Threshold, and it can be calculated by Eqs. 9 and 11. In the proposed framework, lower zone cluster’s CH send the data to upper zone nearest neighbor CH. But in the worst case, if the entire intermediate cluster CHs runs out of energy, then lower zone CH can communicate directly to BS. In order to increase the effectiveness of the network, the average distance between node and BS is considered for threshold calculation (Algorithm 5, line 8). After that, the CE (chance of election) is computed for every node in the cluster whose residual energy is higher than the calculated ThresholdCH. In CE, the ratio of nodes residual energy and initial energy along with their distance to BS is considered (Algorithm 5, line 10-16). The node with more residual energy and less distance from BS will have more chance to become CH. In line 17–18, the nodes are sorted in descending order based on their CE factor and node with highest CE promoted as CH.

4.4 RN selection

After the CH selection in every zone, RN selection will take place. It can be noticed from one radio energy model [32]; energy expenditure is proportional to the communication distance. Therefore, in this phase, three parameters are considered for RN selection: (1) node with maximum remaining energy; (2) node with the shortest distance to other neighbor nodes (3) node with the minimum distance to CH. The pseudo code of RN selection is given in algorithm 6. Firstly, in algorithm 6 from line 3 to 9, the average distance between every node in a cluster and their CH are calculated. Subsequently, the energy threshold for RN is calculated. The node with more energy than the threshold will qualify for tentative RN. The ThresholdRN for RN is computed based on RN functionality. The job of an RN is to perform data reception from all their neighboring nodes. After the data reception, RN aggregates the data along with neighbors data into a single fixed message and forwards it to the CH. Therefore, we considered the required energy consumption to accomplish these tasks for threshold evaluation and formula is given from line 10–11. To make RN selection effective, the CE (chance of election) is evaluated for every node. CE is a combination of the node’s residual energy, the average distance to neighbor nodes, and distance to CH. Nodes with maximum CE will be promoted as RN (line 12–25).

figure h

5 Experimental set-up

In this work, we followed the network parameters as given in Table 1. To validate the performance of the proposed protocol in terms of network lifetime, we considered four metrics for network lifetime comparison, which includes First Node Dead Statistics (FND_Statistics), Half Node Dead Statistics (HND_Statistics), Last Node Dead Statistics (LND_Statistics) and the number of dead nodes after each round. The number of packets received by BS through hierarchical topology (NN → RN → CH → BS) is denoted as one round. FND_Statistics, HND_Statistics, and LND_Statistics is the duration between the rounds, when the communication begins and the rounds where the first node, half number of nodes and last node become dead in the network [33, 34].

Table 1 Simulation parameters

The simulation of a multi-tier based clustering framework is represented in Fig. 7 and simulation of the proposed framework along with SEEP has been performed in MATLAB. The network area in figure is divided into 4 zones. It can be noticed from the figure that Zone 1 has one cluster and as we move close to BS, the width of every zone is decreasing, and number of clusters are increasing. The reason is that lower zone CH sends the aggregated data to upper zone neighbor CH only. However, upper zone CH sends the aggregated data of local cluster and also relaying the lower zone data towards the BS. The relaying traffic becomes higher near the BS zones. In this figure, communication between NN to RN represented in red dotted line. RN forwards the aggregated data into one packet to their local zone CH (Blue line). CH relays the data to selected optimal neighbor CH towards the BS (Green Line).

Fig. 7
figure 7

Simulation of multi-tier based clustering framework

6 Result analysis and discussion

In this section, the effectiveness of our proposed approach is analysed via results obtained and compared the performance with existing protocols in literature as SEP [24], DEEC [25], LEACH [13], M-LEACH [14], TDEEC [26], MIEEPB [23], ME-CBCCP [21] and EACRP [11].

6.1 Network lifetime

In this section results are discussed for the various network lifetime metrics (FND, HND, LND and total number of dead nodes after each rounds) to analyse the significance of the proposed approach.

6.1.1 Network lifetime comparison for static techniques

In this section, the performance of proposed work is analysed for static routing protocols under increasing network size have been increased and observed that the proposed approach prolongs the network lifetime and performs well for scalable network. In this work, FND, HND, and LND metrics are considered to examine the performance for scalable network.

It can be noticed from Fig. 8a that static protocol supports the better network lifetime, and the last node died in the network approximately after 1500 rounds for different network dimensions. When network size was increased from 60 × 120 m2 to 100 × 200 m2, the performance of static SEEP almost remains stable due to more number of zones and cluster formation against increased network size. The large communication distance is divided into small multi-hop communication, and additional number of CH distributes the load of the network. The network lifetime of SEEP is slightly decreased when network size reaches 160 × 320 m2, whereas the performance of SEP, M-LEACH, DEEC are decreasing continuously due to large distance communication between nodes for increased network size, which cause higher energy consumptions. LEACH, ME-CBCCP, TDEEC, and EA-CRP remains almost stable for increased network size but supports less network lifetime as compared to SEEP, for LND metric. In the compared protocols, SEP performs well and gained network lifetime up to 1500 rounds for network size of 60 × 120 m2, but as soon as the network area increases, its performance decreases.

Fig. 8
figure 8

Network lifetime Comparison for the static routing protocols under various network topologies: Area = 60 × 120 m2, 80 × 160 m2, 100 × 200 m2, 160 × 320 m2, Nodes = 200

In this work, the EA-CRP framework is modified to meet the requirement of a scalable WSN-assisted IoT network. The number of zone formation should be increased with the increasing network area so that short distance communication, as well as load balancing, can be achieved to enhance network lifetime. When the network area extends from 60 × 120 m2 (7200 m2) to 60 × 120 m2 (51,200 m2), EA-CRP divides the network into 4 to 5 zones, which is very less as compared to the ratio of the extended network area. Therefore, EA-CRP performance is less effective for increased network size. It can be seen from Table 2 that the number of zone is increasing in SEEP with the increasing network size as compared to EA-CRP, which results in a better network lifetime.

Table 2 Zone formation for increasing network area

6.1.2 Network lifetime comparison for dynamic techniques

In this section, the impact of mobility is analysed on the performance of the proposed framework with increased network size. The behaviour of node movement for Random waypoint and Random walk is depicted in Figs. 9 and 10. In figures, the pattern of the mobility using the Random waypoint mobility model as per the parameters given in Table 1.

Fig. 9
figure 9

Mobility model: random waypoint

Fig. 10
figure 10

Mobility model: Random walk

It can be observed from Figs. 9 and 10 that the node starts at position 1 and ends at position 50, followed by multiple steps. The node stays at every step for some amount of time between 4 and 6 s (pause interval). This can be seen from Fig. 9 that some positions are represented by two numbers. The reason is that the node stays at these positions for some amount of time, which depends on the pause interval. Figure 10 shows the traveling pattern of the mobile node using the Random walk mobility model. The distance covered by a mobile node in Fig. 10 is more as compared to Fig. 9. It is due to the pause interval is zero in Random walk, and nodes remain moving until the simulation times expire.

Figure 11 shows the network lifetime comparison for scenario 2, 3, and MIEEPB for four network sizes: 60 × 120 m2, 80 × 160 m2, 100 × 200 m2, and 160 × 320 m2. It can be seen from Fig. 11a, b, d, e that the proposed framework supports better network lifetime with FND and HND metrics as compare to MIEEPB. The nodes in scenario 2 and 3 are mobile in the network and may change their position after every round. Therefore, after each round new CH and RN selection may be done with respect to changed network as per new nodes position, and it leads to uniform energy depletion. One interesting point we observe, if the mobile nodes cover a large distance in the network, then it supports a better network lifetime.

Fig. 11
figure 11

Network lifetime Comparison for the dynamic routing protocols under various network topologies: Area = 60 × 120 m2, 80 × 16 m2, 100 × 200 m2, 160 × 320 m2, Nodes = 100

As we mentioned earlier, while discussing the deployment scenarios, nodes in scenario 2 can move within their zones only, whereas nodes in scenario 3 can move across the network. So, nodes in scenario 3 will travel more distance. Furthermore, as we saw in Figs. 9 and 10 that, nodes with the Random walk model travelled more distance as compared to the Random waypoint mobility model. If the covered distance by the mobile nodes is large, it will change the network as per the new position of mobile nodes. The nodes as high responsibility node (RN or CH) in the previous round (before the mobility) will have less probability to be reselected as high responsibility node in the new round. Hence, in every round new node (most of the time) will be selected as RN and CH by the CE (as discussed in Algorithm 5 and 6). Based on the above discussion, it can conclude that energy depletion in the entire network will be uniformed. Also, it can be noticed from Fig. 11 that scenario 3 performs better as compared to scenario 2. Further, scenarios with Random walk supports better network lifetime as compared to Random waypoint.

6.1.3 Number of alive nodes versus round number

The dead node trend of compared protocols represented in Fig. 12. It can be observed from Fig. 12a that static SEEP (scenario 1) performs better and supports maximum lifetime with respect to other static protocols.

Fig. 12
figure 12

Dead node statistics for network topology: Area = 200 × 200, m2, Nodes = 200

To make it more clear, network lifetime is also compared with FND, HND, and LND statistics in Fig. 13. In Figs. 12b and 13b, a comparison has been made between dynamic protocol MIEEPB, scenario 2, and 3 with Random walk and Random waypoint model. We observed that SEEP with scenario 3 under RWP gains a better network lifetime. SEEP also analysed for all considered scenarios in Figs. 12c and 13c. The plots illustrate that Scenario 2 and 3 perform better as compared to scenario 1 with FND and HND statistics because of the reselection of nodes as CH and RN for new node structure in every round due to mobility. As we discussed earlier, high responsibility nodes (CH and RN) perform data reception, aggregation, and transmission, whereas normal nodes only send the data to RN; therefore, RN and CH deplete more energy. Further, due to mobility, node location changes after every round. The result supports the fact stated earlier in Sect. 6.1.2 that the mobility of nodes results in uniform energy consumptions and leading to better network lifetime. It can be noticed from Figs. 12c and 13c that as the number of the round is increased, scenario 1 performs better in terms of network lifetime for LND statistics.

Fig. 13
figure 13

FND, HND, and LND statistics for network topology: Area = 200 × 200, m2, Nodes = 200

In Tables 3 and 4, we have summarized the network lifetime gain of SEEP compared to static and dynamic protocols under FND, HND, and LND statistics, and it is evaluated by Eq. 17. It can be noticed from Table 3 that SEEP (Scenario 1) performs remarkably well as compared to standard static routing protocols. DEEC performs slightly better with FND statistics due to heterogeneity (nodes in the network is heterogeneous in terms of energy), and the total initial energy of the network is more as compared to SEEP, which will be discussed in the next section. In Table 4, it can be noticed that SEEP (Scenario 3, RWP) performs better as compared to other dynamic protocol with FND, HND, and LND statistics.

Table 3 Performance gain of SEEP (Scenario 1) over static protocols
Table 4 Performance gain of SEEP (Scenario 3, RWP) over dynamic protocols
$${\text{Performance}}\,{\text{gain}} = \frac{SEEP - Other\,Protocol}{Other\,Protocol} \times 100\%$$
(17)

6.2 Energy consumptions

In this section, we analyse the energy consumption of SEEP for static and dynamic technique for both mobility model against standard routing protocol as reported in literature.

6.2.1 Total energy consumptions

It is described as the total energy depletion by NNs, RNs, and CHs in each round. The Total Energy Depletion is defined as:

$$TED = \mathop \sum \limits_{ROUNDS} \left[ {E_{Tx}^{NN} + E_{Tx,Rx}^{RN} + E_{Tx,Rx}^{CH} } \right]$$
(18)

Figure 14a–c represents the total energy depletion of the compared protocols for static, dynamic, and all scenarios of SEEP from the first round to last round. It can be observed from Fig. 14a that DEEC and SEP have more energy in the initial round due to the heterogeneous nature that means initial energy of node are different. The plot illustrates that the proposed protocol has a uniform energy depletion curve as compared to other routing protocols till 1500 rounds. The static SEEP has a better lifetime (1530 rounds) because of the energy consumption per round is minimal as compared to other static and dynamic protocols which have maximum network lifetime (1440-1490 rounds). In dynamic protocols, SEEP (scenario 3) with RWP gained a better network lifetime (1490 rounds). It can be noticed from Fig. 14b that SEEP (scenario 3) with RWP also depletes its energy uniformly (straight curve) till the last round.

Fig. 14
figure 14

Total Energy consumption statistics for network topology: Area = 200 × 200, m2, Nodes = 200

6.2.2 Energy balancing

To analyse energy balancing, 200 nodes were deployed randomly in 200 × 200 m2 network area. The residual energy of sensor nodes after 200 rounds is represented in Fig. 15a–g for static routing protocols. When the nodes are deployed in a large network area, then the communication distance between nodes is increased. In a scalable WSN-assisted IoT network, energy balancing [35] plays an essential role in increasing the network lifetime. It can be observed in Fig. 15a that SEEP (scenario 1) supports energy balancing efficiently. In LEACH, SEP, ME-CBCCP and EA-CRP, some of the nodes have depleted their energy completely, which is depicted in red colour. Among all compared protocol, LEACH supports minimum energy balancing (many of the nodes become dead). It can be noticed in Fig. 15d, g that M-LEACH and DEEC got less energy efficiency as compared to SEEP.

Fig. 15
figure 15

Remaining energy of sensors after 200 rounds for static routing protocols, Network Area = 200 × 200 m2, Nodes = 200

The residual energy of sensor nodes for dynamic routing protocols has been represented in Fig. 16a–e. It can be observed from Fig. 16 that SEEP with scenarios 2 and 3 (with both mobility models) got better energy efficiency as compared to MIEEPB.

Fig. 16
figure 16

Remaining energy of sensor after 200 rounds for dynamic routing protocols, Network Area = 200 × 200 m2, Nodes = 200

Tables 5 and 6 bring the insight view of results and proof that SEEP for scenario 1 and SEEP with RWP for scenario 3 perform better as compared to static and dynamic protocols, respectively. We split the energy into 10 levels with the 0.05 J step. It can be noticed from Table 5 that 99 nodes are recorded after 200 rounds in the second level of energy followed by EA-CRP (98 nodes), ME-CBCCP (68) and DEEC (99 nodes). But in EA-CRP 10 nodes belong to the last level, and DEEC and SEP have more initial energy due to heterogeneity (as discussed in Sect. 6.2.1). Among them, LEACH performance is at least and 190 nodes in the last level.

Table 5 Energy distribution level after 200 rounds for static routing protocols
Table 6 Energy distribution level after 200 rounds for dynamic routing protocols

It can be observed from Table 6 that Scenario 3 performed better with both mobility models as compared to scenarios 1, 2, and MIEEPB. 199 and 193 nodes come in the second level of SEEP (scenario 3) with RW and RWP mobility model. These results assure that SEEP with scenario 1 and SEEP with scenario 3 have better network lifetime as compared to static and dynamic protocols, respectively.

6.2.3 Stability factor

Section 6.2.2 talked about energy balancing specifically up to some particular round (200). To evaluate the variability of energy depletion in every round, we compute the summation of standard deviation for every node up to 600 rounds and term it as stability factor (SF): [35]

$${\text{Stability}}\,{\text{factor}}\,\left( {\text{SF}} \right) = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{N} (E_{avg} - E\left( i \right)_{Current round} )^{2} }}{N}}$$
(19)

Where N is the total number of nodes, Eavg is the average energy depletion of the nodes in the current round, and E(i) is the consumed energy by the ith node in the current round.

We have calculated the SF for static, dynamic, and all scenarios of SEEP and which are represented in Fig. 17a–c. It can be observed from the figure that SEEP (scenario 1) performs better as compared to other static protocols due to more zone and cluster formation for increasing network size. As we discussed in Sect. 3.1, SEEP transforms the long-distance communication into small multi-hop distance communication and support load balancing, which is resulted in less and consistent Stability Factor (SF).

Fig. 17
figure 17

Stability factor for network topology: Area = 200 × 200, m2, Nodes = 200

In LEACH and M-LEACH, CH sends the packet directly to BS. The CHs, which are located at a large distance from the BS, run out of energy in the earlier round due to large distance communication. In ME-CBCCP protocol, the high responsibility nodes (RN, CH and Cluster Coordinator) selection are random, which cause large distance communication distance between nodes, and resulted in less network lifetime. SEP, DEEC, and TDEEC work on the heterogeneity (some nodes have more initial energy) concept in order to increase network lifetime. But in these techniques, CH attempts packet transmission to BS via nearest neighbor, but if they are unable to find a desire neighbor, then the direct communication takes place from CH to BS. Therefore, the discussed protocol suffers from the SF problem. EA-CRP performs better as compared to the discussed protocol (LEACH, M-LEACH, ME-CBCCP, SEP, DEEC, TDEEC, MIEEPB) and use multi-hopping communication between node and BS. But it does not perform well for increased network size.

It can be noticed from Fig. 17b that MIEEPB SF obtained a low curve as compared to mobile scenarios of SEEP. It is due to the base station mobility concept used in MIEEPB for data collection, as we discussed in Sect. 2. In MIEEPB nodes remain static, and BS moves in the entire network, which minimize the distance between sensor nodes and base station. Therefore, MIEEPB obtained less SF. Generally, in IoT applications, this kind of scenario is not considerable, and BS remains stable at some specific area.

7 Conclusion

Sensor nodes are equipped with small batteries and obtain energy from these batteries is very limited. Energy conservation is one of the most critical matters in the development of WSN-assisted IoT network at large scale. In this paper, a novel energy-efficient routing protocol based on the subdivision algorithm along with a multi-tier based clustering framework has proposed. We also introduced a strategy for zone and cluster formation, which is varied along with the size of the network area to meet the requirement of energy-efficient, scalable network. The significant advantage of our proposed protocol is summarized as communication between inter clusters at a small distance in a multi-hopping way for large area network. SEEP has been analysed for different deployment scenarios, mobility models, and performance metrics, which mainly includes network lifetime and energy depletion related statistics. Furthermore, it has been compared with existing standard protocols, and experimental results validate the significance of SEEP in terms of mentioned performance metrics. The performance gain of SEEP over the traditional method summarize below.

  1. 1.

    The proposed SEEP for static network receives network lifetime better by 7.19% than DEEC, 16.43% than EACRP, 133.8% than LEACH, 66.84% than MOD-LEACH, 56.12% than TDEEC, 24.42% than ME-CBCCP and 54.29% than SEP protocols.

  2. 2.

    The proposed SEEP (for scenario 3, RWP) receives network lifetime better by 63.06% than scenario 2 with RW, 37.32% than scenario 2 with RW, 4.32% than scenario 3 with RW, and 44.29% than MIEEPB.

  3. 3.

    We analysed the proposed framework for all considered scenarios; among them, the static approach gains maximum network lifetime.

  4. 4.

    The energy consumption is uniform in nature in our proposed approach that leads to increase network lifetime. Almost 84% of nodes are having residual energy in the range of [0.30–0.45 J] after completing 200 rounds (as given in Table 5).

  5. 5.

    We also analysed both the mobility model for our framework and from simulation results, it can be concluded that Random waypoint performs slightly better as compare to Random walk.