BERA: a biogeography-based energy saving routing architecture for wireless sensor networks

Lalwani, Praveen; Banka, Haider; Kumar, Chiranjeev

doi:10.1007/s00500-016-2429-y

Methodologies and Application
Published: 16 November 2016

Volume 22, pages 1651–1667, (2018)
Soft Computing

Praveen Lalwani¹,
Haider Banka¹ &
Chiranjeev Kumar¹

Abstract

Biogeography-based optimization (BBO) is a relatively new optimization paradigm whose potential for solving complex optimization problems has yet to be fully explored. In wireless sensor networks (WSNs), optimal cluster head selection and routing are two well-known optimization problems. Researchers often use hierarchical cluster-based routing, in which the power consumption of cluster heads (CHs) is very high because of their extra functions, such as receiving and aggregating data from member sensor nodes and transmitting the aggregated data to the base station (BS). Therefore, CHs must be selected carefully to extend the life of the network. After the clusters are formed, data are routed to the BS in an inter-cluster fashion to further extend the life of the WSN. In this paper, a biogeography-based energy saving routing architecture (BERA) is proposed for CH selection and routing. The biogeography-based CH selection algorithm uses an efficient habitat encoding scheme and a novel fitness function based on residual energy and distance. A BBO-based routing algorithm is also proposed, with its own efficient habitat encoding scheme and a fitness function that considers node degree in addition to residual energy and distance. To evaluate BERA, it is extensively compared with existing routing algorithms such as DHCR, Hybrid routing and EADC, and with the bio-inspired algorithms GA and PSO. Simulation results confirm the superiority/competitiveness of the proposed algorithm over existing techniques.

1 Introduction

Wireless sensors are widely used for data gathering applications such as monitoring systems and automation. Each sensor node collects data from its target area and forwards it to the base station (BS). In this process, sensors consume energy in gathering, processing and transmitting the data to the BS. One complete cycle of this process is called a round. In each round, the energy of the sensors depletes, and after a certain number of rounds the sensors are exhausted for lack of residual energy (Liu et al. 2010; Taheri et al. 2012). Since each sensor node is powered by a non-rechargeable and non-replaceable battery, conserving the energy of sensor nodes is the most crucial issue in designing any WSN. To enhance the performance of the network, various routing protocols have been devised (Heinzelman et al. 2000; Jian et al. 2010; Kundra et al. 2009; Rao and Banka 2016). One of them is hierarchical cluster-based routing, which divides the network into several subgroups called clusters (Jian et al. 2010; Li et al. 2013; Mao et al. 2013). Each cluster contains a head node known as the cluster head (CH). The CH collects data from its members (non-CHs) and forwards it to another CH or the BS, as shown in Fig. 1.

In CH selection, if m nodes are selected as CHs out of n nodes, a total of $^nC_m$ possible combinations exist. Hence, the size of the search space grows exponentially with the network size. The problem is NP-hard, and it is difficult to solve using heuristic approaches (Agarwal and Procopiuc 2002). Due to the limited energy resources of sensor nodes, direct data transmission from CHs to the BS is not feasible for large-scale WSNs. Therefore, multi-hop inter-cluster communication is essential. In routing, the computational complexity also grows exponentially with the size of the network. For example, if on average k CHs lie within the communication range of each of the m CHs, then the computational complexity becomes $k^m$. As the size of the network increases, the value of m also increases. Therefore, finding the shortest path for a large-scale sensor network becomes an NP-hard problem (Dorigo et al. 2006). We consider scalable networks ranging from small to large, so the computational complexity of selecting a route grows exponentially with the network size. Therefore, meta-heuristic approaches (Song and Cheng-Lin 2011; Bhari et al. 2009; Yu and Xiaohui 2011) can approximate the solution to such problems better than heuristics.
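To give a feel for how quickly the two search spaces above grow, the following minimal sketch evaluates $^nC_m$ and $k^m$ for a few network sizes. The function names, the 10% CH ratio used in the loop and the value k = 3 are illustrative assumptions, not values taken from the simulations in this paper.

```python
from math import comb


def ch_selection_space(n: int, m: int) -> int:
    """Number of ways to choose m cluster heads out of n nodes (nCm)."""
    return comb(n, m)


def routing_space(k: int, m: int) -> int:
    """Number of next-hop assignments when each of m CHs has k candidates (k^m)."""
    return k ** m


if __name__ == "__main__":
    # Hypothetical settings: 10% of the nodes act as CHs, k = 3 candidates per CH.
    for n in (50, 100, 200):
        m = n // 10
        print(n, m, ch_selection_space(n, m), routing_space(3, m))
```

Even for n = 100 and m = 10, exhaustive enumeration of the CH sets is already impractical, which is the motivation for a population-based search.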

Biogeography-based optimization (BBO) is a new optimization paradigm (Simon 2008) that has proved instrumental in solving complex problems in a wide variety of domains, such as sensor selection (Simon 2008), CT scan image segmentation (Chatterjee et al. 2012), power system optimization (Rarick et al. 2009; Roy et al. 2010; Bhattacharya and Chattopadhyay 2011), parameter estimation (Wang and Xu 2011), satellite image classification (Panchal et al. 2009), optimal meter placement (Jamuna and Swarup 2011) and groundwater detection (Kundra et al. 2009). BBO shares certain features with existing bio-inspired algorithms such as GA and PSO in the way information is shared among solutions. In GA, if a parent is less fit than its child, it has a low probability of surviving into the next generation, while in PSO and BBO such a solution survives into the next generation. In PSO, solutions are more likely to clump together in similar groups, while in BBO and GA solutions do not necessarily cluster. A PSO solution is updated via its velocity, whereas a BBO solution is updated directly. Since BBO has performed better than GA and PSO in certain cases (Simon 2008), we were motivated to develop BBO-based algorithms for CH selection and routing in WSNs. To the best of our knowledge, this is the first such attempt.

In the present work, two BBO-based optimization algorithms are proposed, one for CH selection and another for routing in WSNs. The CH selection algorithm identifies near-optimal CHs among the available sensor nodes. To achieve this objective, a novel fitness function is designed together with an effective encoding scheme. The fitness function is derived from residual energy, intra-cluster distance and the distance from the CHs to the BS. Afterward, non-CH nodes are assigned to the CHs based on distance. Second, the routing algorithm is devised to find a near-optimal path from every CH to the BS. To accomplish this task, a new fitness function and an efficient encoding scheme are presented. The fitness function for the routing algorithm consists of energy, distance and node degree. In the simulations, the proposed work (BERA) is first compared with some recent existing approaches. Afterward, well-established algorithms (GA, PSO) are executed on the same problems and compared with the proposed work in terms of performance metrics.

The main contributions of this paper are as follows:

BBO-based CH selection algorithm with a novel fitness function and efficient encoding scheme.
BBO-based multi-hop routing algorithm with a new fitness function and a novel encoding scheme.
In performance analysis, proposed algorithm BERA is compared with some of the existing conventional methods and also tested with well-established bio-inspired techniques (GA, PSO).

The rest of the paper is organized as follows. The next section summarizes the clustering and routing literature of WSNs. Important preliminaries on the network model, energy model and BBO are discussed in Sect. 3. The proposed cluster head selection and routing algorithms are discussed in detail in Sect. 4. Simulation analysis against some existing conventional and population-based algorithms is presented in Sect. 5. Finally, Sect. 6 concludes the paper.

2 Literature review

In this section, recent advances in clustering and routing algorithms are presented.

2.1 Clustering

A large number of clustering algorithms have been devised for WSNs (Bagci and Yazici 2010; Ran et al. 2010; Singh et al. 2013; Rao and Banka 2015; Rao et al. 2016). LEACH is very popular among them. Its objective is to reduce the energy consumption in a WSN (Heinzelman et al. 2000). Each sensor node transmits data to the BS via its respective CH. To balance the energy consumption in the network, the CH role rotates randomly over time. However, a limitation of LEACH is that it may choose a sensor node with low residual energy as a CH, which hampers the performance of the network. Lindsey and Raghavendra (2002) proposed an improved version of the LEACH protocol. In its communication process, each node receives data from its neighbor, and one node selected from the chain transmits the data to the BS. However, the single leader dissipates energy rapidly because it is regularly involved in transmission. Heinzelman et al. (2002) proposed the LEACH-centralized (LEACH-C) protocol. In this protocol, the BS collects the residual energy and location information from all the sensors and then forms clusters using a simulated annealing algorithm. However, it ignores the distance between sensors and their respective CHs in the cluster formation process, which decreases the life of the network. Wang et al. (2012) enhanced the performance of LEACH by considering the residual energy of nodes in the CH selection process. In the data transmission phase, however, each CH communicates directly with the BS, which is not feasible for a large WSN. Chang and Ju (2012) proposed an energy saving clustering architecture. It extends the life of the network through uniform cluster formation, in which the average distance and the center point are taken as input. In the CH selection process, the residual energy of nodes within the cluster is taken as input. However, it does not consider node degree in the cluster formation process. Yang and Ju (2014) presented a communication protocol.
In its cluster formation process, the BS first collects the location and residual energy information of the sensor nodes. Afterward, a tree structure is formed within each cluster to connect the sensor nodes to their respective CH. In this process, it ignores residual energy while joining sensor nodes to their respective CH. Bagci and Yazici (2013) proposed an unequal clustering algorithm. It calculates the competitive radius of each cluster using fuzzy logic with energy and distance as input, which increases the life of the WSN. However, it does not consider the average distance between sensors and their respective CH in the cluster formation process. Lee and Cheng (2012) proposed a distributed algorithm for CH selection that is executed in two phases. In the first phase, clusters are formed using the LEACH algorithm. In the second phase, a CH is selected within each cluster using fuzzy logic, with residual energy and expected residual energy as input. However, it does not take the distance to the BS or node centrality into consideration for CH selection. Kumar et al. (2011) proposed a fuzzy-based clustering algorithm. In the CH selection process, a fuzzy inference system is used with distance, node density and battery level as input. The parameters taken into consideration increase the life of the network. However, it is unable to provide adaptive multi-hop communication.

2.2 Routing

In a large WSN, direct communication between the CHs and the BS is not feasible because of the high communication cost. Every node needs to communicate with the BS at the minimum possible cost. Thus, various routing techniques have been devised by researchers (Younis and Fahmy 2004; Senouci et al. 2012; Lai et al. 2012; Abdulla et al. 2012). Among them, one of the most popular algorithms is HEED (Younis and Fahmy 2004). Its CH selection process is based on residual energy. When a tie occurs in choosing the CHs, the ARMP cost function is used to break it. Each CH transmits data to the BS in a multi-hop fashion, which increases the life of the network. Even so, it does not consider distance in the CH selection process. Senouci et al. (2012) proposed an improved version of the HEED algorithm. In the cluster formation process, it uses a mechanism similar to HEED. In addition, each cluster is divided into zones based on the distance between non-CH and CH nodes, which reduces the energy consumption of each CH member. Lai et al. (2012) proposed an unequal cluster formation mechanism. The size of every cluster is computed using the load on its respective CH. Therefore, it avoids cluster reconfiguration and increases the life of the network. Abdulla et al. (2012) proposed a hybrid routing algorithm with the objective of removing the hot-spot problem. In the communication process, it performs flat routing inside the hot-spot zone and hierarchical routing outside it. However, it does not analyze the effect of the hybrid boundary on network performance. Yu et al. (2012) proposed a routing algorithm (EADC), which scouts the routing path between every CH and the BS using the energy and node degree of CHs. Therefore, it increases the life of the path. Even so, it does not consider the distance between a CH and its next hop when finding the routing path.
Maryam and Reza (2015) proposed an enhanced version of EADC by adding one more parameter for scouting the communication path, namely the transmission power. Song and Cheng-Lin (2011) proposed a routing algorithm. In the cluster formation process, it uses fuzzy logic to find the competitive radius of each cluster and to estimate the chance of a sensor node becoming a CH, with density, distance and energy as input parameters. Thereafter, the routing path for every CH is computed using ant colony optimization (ACO), which enhances the performance of the network. Bhari et al. (2009) proposed a genetic algorithm-based routing algorithm for large-scale WSNs. They derived the fitness function based on network lifetime. However, it does not consider other essential parameters such as node degree and distance to the BS. Elhabyan and Yagoub (2015) proposed a particle swarm optimization-based clustering and routing algorithm. In the clustering process, the fitness function consists of energy, cluster quality and network coverage. In the routing algorithm, the fitness function is derived using energy and link quality. However, the fitness function derived for routing does not consider power control.

2.3 Advantages of proposed work over existing works

Classical approaches mentioned in the related work cannot tackle the CH selection and routing problems for large-scale networks (Younis and Fahmy 2004; Senouci et al. 2012; Lai et al. 2012), as both problems have been proven to be NP-hard (Agarwal and Procopiuc 2002; Dorigo et al. 2006). The stochastic approaches mentioned in the literature do not provide solutions of better quality because essential parameters are not considered in the derivation of the fitness function (Elhabyan and Yagoub 2015). In the proposed work, biogeography-based optimization is used with proper consideration of essential parameters such as energy, distance and node degree, so that solutions of better quality are achieved compared to existing works.
A newly devised bio-inspired technique has been adopted and compared with well-established techniques (GA, PSO) using the same fitness function. In contrast, the bio-inspired techniques adopted in Song and Cheng-Lin (2011) and Elhabyan and Yagoub (2015) were not compared in simulation with existing well-established bio-inspired techniques (GA, PSO).

3 Preliminaries

This section describes the notation, network model, energy model and biogeography-based optimization.

3.1 Notation

Table 1 introduces the notations and abbreviations used in this study.

Table 1 Description of notations

3.2 Network model

In this paper, a hierarchical routing architecture is proposed with the following properties. All nodes are homogeneous, i.e., all nodes have equal initial energy, processing and communication capabilities. Distance calculation is based on received signal strength (Xu et al. 2010). Initially, all sensors are deployed randomly in the target area, and the positions of the sensors are fixed after deployment. All nodes then transmit their residual energy and location information to the BS. Based on this information, m CHs are selected out of the n nodes by our proposed CH selection algorithm (see Sect. 4.1). Finally, the proposed routing algorithm is executed to establish the path from every CH to the BS (see Sect. 4.2).

3.3 Energy model

The first-order radio model is used for energy computation (Heinzelman et al. 2000). The energy dissipated in transmitting L bits of data over a distance $d_t$ is given in Eq. 3.1, where $E_{\mathrm{ele}}$ is the energy dissipation in the transmitter circuitry, $E_{\mathrm{amp}}$ is the energy dissipation in amplification and $d_{\mathrm{o}}$ is the threshold distance that separates the two cases.

$$\begin{aligned} E_{\mathrm{Tx}} = {\left\{ \begin{array}{ll} E_{\mathrm{ele}} *L + E_{\mathrm{amp}} *L*d_t^2 &{} \quad \text {if} \quad d_t<d_{\mathrm{o}} \\ E_{\mathrm{ele}} *L + E_{\mathrm{amp}} *L*d_t^4 &{} \quad \text {if} \quad d_t \ge d_{\mathrm{o}} \end{array}\right. } \end{aligned}$$

(3.1)

The energy dissipated in receiving L bits of data is given in Eq. 3.2.

$$\begin{aligned} E_{\mathrm{Rx}}=E_{\mathrm{ele}} \times L \end{aligned}$$

(3.2)
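As a minimal sketch, Eqs. 3.1 and 3.2 can be written as two small functions. The default values for `e_ele`, `e_amp` and `d_o` below are placeholder assumptions chosen only to make the example runnable; they are not the simulation parameters of this paper.

```python
def tx_energy(bits, d_t, e_ele=50e-9, e_amp=100e-12, d_o=87.0):
    """Energy (J) to transmit `bits` over distance d_t, following Eq. 3.1.

    e_ele, e_amp and d_o are assumed placeholder values, not taken from the paper.
    """
    # Path-loss exponent is 2 below the threshold distance d_o and 4 otherwise.
    exponent = 2 if d_t < d_o else 4
    return e_ele * bits + e_amp * bits * d_t ** exponent


def rx_energy(bits, e_ele=50e-9):
    """Energy (J) to receive `bits`, following Eq. 3.2."""
    return e_ele * bits


if __name__ == "__main__":
    packet = 4000  # hypothetical packet size in bits
    for distance in (10.0, 50.0, 100.0):
        print(distance, tx_energy(packet, distance), rx_energy(packet))
```

Because the transmission cost grows with $d_t^2$ or $d_t^4$, shortening the hops between nodes is the main lever for saving energy, which is why distance appears in both fitness functions of Sect. 4.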

3.4 Biogeography-based optimization

Biogeography-based optimization was devised by Simon (2008). It is inspired by biogeography, the study of the geographical distribution of biological species. Each geographical zone (habitat) is characterized by an index known as the habitat suitability index (HSI). The variables that characterize habitability, such as the area of the habitat and its living conditions, are called suitability index variables (SIVs). The fitness of each habitat is analogous to its HSI value and number of species. To improve a low-HSI solution, it accepts features from high-HSI solutions. This mechanism is known as biogeography-based optimization (BBO).

The model of species abundance in a single habitat is shown in Fig. 2, where the immigration rate is $\lambda $ and the emigration rate is $\mu $. On the immigration curve, the maximum immigration rate I occurs when the habitat contains zero species. The curve also shows that $\lambda $ decreases as the number of species increases. The maximum number of species in the habitat is $S_{\mathrm{max}}$, at which point the immigration rate is zero.

On the emigration curve, the maximum emigration rate E occurs when the habitat contains the maximum number of species ($S_{\mathrm{max}}$). The curve shows that the emigration rate increases as the number of species in the habitat increases, and that the emigration rate is zero when there are no species in the habitat. At the equilibrium point ($S_0$), the immigration and emigration rates are equal.

From the straight-line curves shown in Fig. 2, the immigration and emigration rates are as follows:

$$\begin{aligned} \mu =\frac{E\times k}{n_{\mathrm{s}}},\quad \lambda =I \left( 1-\frac{k}{n_{\mathrm{s}}}\right) \end{aligned}$$

(3.3)

where k is number of species in the habitat, and $n_{\mathrm{s}}$ is the maximum number of species.
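A short sketch of Eq. 3.3 is given below. The function name and the default maxima I = E = 1 are assumptions made for illustration.

```python
def migration_rates(k, n_s, I=1.0, E=1.0):
    """Return (immigration rate lambda, emigration rate mu) from Eq. 3.3.

    k is the species count of the habitat and n_s the maximum species count.
    The defaults I = E = 1 are an assumption for illustration.
    """
    if not 0 <= k <= n_s:
        raise ValueError("k must lie between 0 and n_s")
    mu = E * k / n_s
    lam = I * (1 - k / n_s)
    return lam, mu


if __name__ == "__main__":
    for k in range(6):
        print(k, migration_rates(k, 5))
```

Setting $\lambda = \mu$ in Eq. 3.3 gives the equilibrium species count $k = n_{\mathrm{s}} I/(I+E)$, which reduces to $n_{\mathrm{s}}/2$ when I = E.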

The working principle of BBO is as follows.

3.4.1 Migration

Consider an optimization problem and a population of candidate solutions, where each solution is represented by an n-dimensional vector known as a habitat. Each dimension of the habitat is considered to be an SIV. The goodness of a habitat is analogous to its HSI value and number of species. To improve the solutions, a low-HSI solution receives information from a high-HSI solution (similar to GA and PSO), and the sharing is governed by the immigration ($\lambda $) and emigration ($\mu $) rates. In this process, two habitats are chosen from the population. First, a habitat ($H_i$) is selected based on its immigration rate ($\lambda _i$), and another habitat ($H_j$) is selected using its emigration rate ($\mu _j$). Afterward, randomly selected SIVs migrate from $H_j$ and appear in $H_i$.

3.4.2 Mutation

In a geographical region, natural disasters can suddenly change the HSI of a habitat and cause it to deviate from its equilibrium position. BBO models this effect with a mutation operation, which is driven by the species count of each habitat as shown in Eqs. (3.4, 3.5). Each habitat is assigned a species-count probability $P_S$. If $P_S$ is high, there is less chance of mutation and the solution is nearer to the optimized solution. If $P_S$ is low, there is a high chance of mutation and the solution is far from the optimized solution.

$$\begin{aligned}&P_S^h= {\left\{ \begin{array}{ll} -(\lambda _S + \mu _S)P_S + \mu _{S+1}P_{S+1},&{} \quad S=0\\ -(\lambda _S + \mu _S)P_S + \lambda _{S-1}P_{S-1} &{}\\ \quad + \mu _{S+1}P_{S+1}, &{}\quad 1\le S \le S_{\mathrm{max}}-1\\ -(\lambda _S + \mu _S)P_S + \lambda _{S-1}P_{S-1}, &{} \quad S=S_{\mathrm{max}} \end{array}\right. } \end{aligned}$$

(3.4)

$$\begin{aligned}&m(S)=m_{\mathrm{max}}\left( \frac{1-P_S}{P_{\mathrm{max}}}\right) \end{aligned}$$

(3.5)

where $m(S)$ is the mutation rate of a habitat with S species, $m_{\mathrm{max}}$ is the maximum mutation rate and $P_{\mathrm{max}}$ is the maximum mutation probability.
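The inverse relation between $P_S$ and the mutation rate in Eq. 3.5 can be illustrated with a few lines of code. This is only a sketch: the probabilities passed in are hypothetical values rather than solutions of Eq. 3.4, and `m_max = 0.05` is an assumed setting.

```python
def mutation_rates(p, m_max=0.05):
    """Apply Eq. 3.5 literally: m(S) = m_max * (1 - P_S) / P_max.

    p is a list of species-count probabilities P_S, one per habitat.
    m_max = 0.05 is an assumed value for illustration.
    """
    p_max = max(p)
    if p_max <= 0:
        raise ValueError("at least one probability must be positive")
    return [m_max * (1 - p_s) / p_max for p_s in p]


if __name__ == "__main__":
    # Hypothetical probabilities, highest for the middle habitat.
    for p_s, m in zip([0.1, 0.2, 0.4, 0.2, 0.1],
                      mutation_rates([0.1, 0.2, 0.4, 0.2, 0.1])):
        print(p_s, m)
```

The habitat with the largest $P_S$ receives the smallest mutation rate, matching the description in the text.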

The merits of the mutation operation are as follows: (i) it increases the diversity of the population; (ii) it makes high-HSI solutions less likely to be disrupted; (iii) it can improve both high- and low-HSI solutions.

The demerit of the mutation operation is that it may degrade a solution.

4 BERA: the proposed approach

The proposed approach consists of two phases. In the first phase, the BBO-based cluster head selection algorithm is executed to select some nodes as CHs (see Sect. 4.1). In the second phase, the proposed routing algorithm is used to compute the data transmission path from each CH to the BS (see Sect. 4.2).

4.1 BBO-based cluster head selection algorithm

It selects near-optimal nodes as CHs among all sensor nodes using residual energy, intra-cluster distance and distance between CHs and the BS.

4.1.1 Representation of habitat

In BBO, a potential solution is called a habitat. In the CH selection phase, a habitat represents a set of sensor nodes to be selected as CHs. The dimension of each habitat is equal to the number of CHs in the network.

4.1.2 Initialization of habitat

Each habitat position is initialized with a random node_id between 1 and n. Let $H_i = (H_{i,1}(t), H_{i,2}(t),\ldots , H_{i,m}(t))$ be the $i\hbox {th}$ habitat, where each habitat position $H_{i,d}, 1\le d\le m$ represents a node_id between 1 and n in the network.

Illustration of Fig. 3: let the number of sensor nodes be 100 and the number of CHs be 10% of the total number of nodes, so that the dimension of each habitat equals the number of CHs, i.e., 10. Each habitat position $H_{i,d},1\le d\le 10$ is then initialized with a random number between 1 and 100, i.e., a node_id.
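The initialization described above can be sketched as follows. The function names and the population size are assumptions; each position is drawn independently as in the text, so a node_id may appear more than once in a habitat.

```python
import random


def init_habitat(n, m, rng):
    """Return one habitat: m positions, each a random node_id in 1..n."""
    return [rng.randint(1, n) for _ in range(m)]


def init_population(pop_size, n, m, rng):
    """Return a list of pop_size independently initialized habitats."""
    return [init_habitat(n, m, rng) for _ in range(pop_size)]


if __name__ == "__main__":
    rng = random.Random(42)        # fixed seed only for repeatability
    n = 100                        # number of sensor nodes (as in Fig. 3)
    m = n * 10 // 100              # 10% of the nodes act as CHs
    for habitat in init_population(5, n, m, rng):  # pop_size = 5 is assumed
        print(habitat)
```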

4.1.3 Derivation of fitness function

The fitness function is derived using the following parameters:

(a)
Residual energy of cluster head: in the communication process, the energy consumption of a CH is high because of its tasks, such as receiving data from its members, performing aggregation and then transmitting the data to another CH or the BS. Therefore, a sensor node with higher residual energy is a more preferable choice as a CH, which enhances the life of the network. So, our first objective in terms of residual energy is $f_1$, which is minimized as follows:

Objective 1

$$\begin{aligned} \hbox {Minimize} \quad f_1=\sum \limits _{i=1}^{m}\frac{1}{E_{\mathrm{CH}_{i}}} \end{aligned}$$

(4.1)

(b)
Intra-cluster distance: it is the average distance between a CH and its respective members. The energy dissipation of a sensor node depends on the transmission distance, as described in Sect. 3.3. The smaller this distance, the lower the energy consumption. So, our second objective in terms of intra-cluster distance is $f_2$, which is minimized as follows:

Objective 2

$$\begin{aligned} \hbox {Minimize} \quad f_2= \sum \limits _{j=1}^m \left( \sum \limits _{i=1}^{I_j} \hbox {dis}\left( s_i,\hbox {CH}_j \right) {/}I_j\right) \end{aligned}$$

(4.2)

(c)
Distance between CH and the BS: it is the distance from each CH to the BS. The energy consumption of a sensor node depends on the transmission distance, as described in Sect. 3.3. In the data transmission process, CHs transmit data to the BS. Therefore, CHs at minimum distance from the BS are a more preferable choice. So, our third objective is $f_3$, which is minimized as follows:

Objective 3

$$\begin{aligned} \hbox {Minimize}\quad f_3=\sum \limits _{i=1}^{m} \hbox {dis}\left( \hbox {CH}_i,\hbox {BS}\right) \end{aligned}$$

(4.3)

The objectives stated above do not conflict strongly with each other. Therefore, the weighted sum approach is applied and all objectives are combined into a single objective function as shown in Eq. 4.5, where $\alpha _1$, $\alpha _2$ and $\alpha _3$ are the weights assigned to the objectives. Since the objectives have different units and ranges, min–max normalization is applied to each objective using Eq. 4.4.

$$\begin{aligned} F(x)=\frac{f_i -f_{\mathrm{min}}}{f_{\mathrm{max}}- f_{\mathrm{min}}} \end{aligned}$$

(4.4)

where $f_i$ is the value of the function, $f_{\mathrm{min}}$ is minimum value, $f_{\mathrm{max}}$ is maximum value and F(x) is the normalized value between 0 and 1.

$$\begin{aligned} \begin{aligned}&\hbox {Minimize fitness}=\alpha _1 \times f_1+\alpha _2 \times f_2+\alpha _3 \times f_3,\\&\hbox {where } \sum \limits _{i=1}^{3} \alpha _i=1;\quad \hbox {and}\quad \alpha _i \in (0,1) \end{aligned} \end{aligned}$$

(4.5)
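The following sketch shows one way Eqs. 4.1 to 4.5 could be evaluated for a population of habitats. Several details are assumptions rather than the authors' implementation: non-CH nodes join their nearest CH, duplicate node_ids inside a habitat are counted once, the minimum and maximum in Eq. 4.4 are taken over the current population, and the weights (0.4, 0.3, 0.3) are arbitrary.

```python
import math


def objectives(habitat, pos, energy, bs):
    """Return (f1, f2, f3) of Eqs. 4.1-4.3 for one habitat (list of CH node_ids)."""
    chs = list(dict.fromkeys(habitat))  # assumption: duplicate ids count once
    f1 = sum(1.0 / energy[c] for c in chs)
    member_dists = {c: [] for c in chs}
    for node in pos:
        if node in member_dists:
            continue  # CHs are not members of any cluster
        nearest = min(chs, key=lambda c: math.dist(pos[node], pos[c]))
        member_dists[nearest].append(math.dist(pos[node], pos[nearest]))
    # Sum of per-cluster average member distances; empty clusters contribute 0.
    f2 = sum(sum(d) / len(d) for d in member_dists.values() if d)
    f3 = sum(math.dist(pos[c], bs) for c in chs)
    return f1, f2, f3


def min_max(values):
    """Min-max normalization of Eq. 4.4 (all zeros if the values are equal)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def population_fitness(population, pos, energy, bs, weights=(0.4, 0.3, 0.3)):
    """Weighted sum of Eq. 4.5 for every habitat; lower values are better."""
    raw = [objectives(h, pos, energy, bs) for h in population]
    normalized = [min_max(column) for column in zip(*raw)]
    return [sum(w * f for w, f in zip(weights, row)) for row in zip(*normalized)]


if __name__ == "__main__":
    # Hypothetical three-node network with the BS at the origin.
    pos = {1: (0.0, 0.0), 2: (100.0, 0.0), 3: (1.0, 0.0)}
    energy = {1: 2.0, 2: 1.0, 3: 1.0}
    print(population_fitness([[1], [2]], pos, energy, (0.0, 0.0)))
```

In this toy network, choosing node 1 as the CH is better on all three objectives, so it receives the lower fitness value.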

4.1.4 Habitat migration

In the migration process, a habitat ($H_i$) is first selected probabilistically based on its immigration rate ($\lambda _i$). Thereafter, another habitat ($H_j$) is selected probabilistically based on its emigration rate ($\mu _j$). After the two habitats are selected, some SIVs from $H_j$ appear in $H_i$, i.e., node_ids of the high-HSI solution appear in the low-HSI solution. To do so, one position is randomly generated between 1 and m. From the generated position to the last position, all node_ids from $H_j$ are copied into $H_i$. In this way, all habitats are updated until the best solution is achieved.

Illustration of Fig. 4: it shows all the steps from habitat initialization to the migration process. In Fig. 4a, all the habitats are initialized randomly by generating node_ids between 1 and n. Afterward, the HSI of each habitat is calculated using Eq. 4.5 as shown in Fig. 4b, and species are distributed accordingly as shown in Fig. 4c. The immigration rate ($\lambda _i$) and emigration rate ($\mu _j$) of each habitat are calculated based on the number of species as shown in Fig. 4d. In the migration process, habitat $H_4$ is first selected because of its high immigration rate ($\lambda _4=0.99$), and $H_5$ is selected because of its high emigration rate ($\mu _5=0.77$). Afterward, a random position (say the $5\hbox {th}$) is chosen among all the positions. Then, from the $5\hbox {th}$ position onward, all the SIVs (i.e., node_ids) from $H_5$ appear in habitat $H_4$, as shown in Fig. 4e.
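The migration step can be sketched as below. Roulette-wheel selection via `random.choices` is an assumed way of selecting habitats "probabilistically", and the immigration rates in the usage example are derived from the emigration rates under the assumption I = E = 1.

```python
import random


def select_pair(lam, mu, rng):
    """Pick immigrating habitat i by lambda and emigrating habitat j != i by mu."""
    indices = list(range(len(lam)))
    i = rng.choices(indices, weights=lam)[0]
    # Exclude i so that a habitat never migrates into itself.
    mu_without_i = [0.0 if j == i else w for j, w in enumerate(mu)]
    j = rng.choices(indices, weights=mu_without_i)[0]
    return i, j


def migrate(h_i, h_j, rng):
    """Copy the tail of h_j (from a random position to the end) into h_i."""
    start = rng.randrange(len(h_i))  # 0-based cut position
    return h_i[:start] + h_j[start:]


if __name__ == "__main__":
    rng = random.Random(1)
    mu = [0.02, 0.13, 0.07, 0.01, 0.77]   # emigration rates quoted for H1..H5
    lam = [1 - x for x in mu]             # assumes I = E = 1
    i, j = select_pair(lam, mu, rng)
    h_i = [12, 45, 7, 88, 3, 61, 29, 54, 90, 16]   # hypothetical habitats
    h_j = [5, 73, 38, 91, 22, 67, 10, 49, 84, 33]
    print(i, j, migrate(h_i, h_j, rng))
```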

4.1.5 Mutation

For example, consider 100 sensor nodes in a target area with node_ids ranging from 1 to 100. In mutation, a habitat is selected according to its mutation probability. Afterward, a random position (SIV) is chosen in the habitat and its value is replaced with a node_id generated randomly between 1 and 100.

Illustration of Fig. 5: let the emigration rates of habitats $H_1$–$H_5$ be [0.02, 0.13, 0.07, 0.01, 0.77], as shown in Fig. 5a; the corresponding mutation probabilities are calculated and shown in Fig. 5b. Suppose habitat $H_4$ is selected for mutation and a random number (rand) is generated between 0 and 1. If rand is less than the mutation probability ($M_4$), then mutation is performed: a random position (say the $6\hbox {th}$) is selected within the habitat, and the value at that position is replaced with a newly generated random node_id between 1 and 100. In Fig. 5c, $N_3$ is replaced with $N_{91}$.
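A minimal sketch of this mutation step follows. The function name and the example habitat are hypothetical, and the mutation probability is passed in directly rather than computed from Eq. 3.5.

```python
import random


def mutate(habitat, mutation_prob, n, rng):
    """Replace one random position with a random node_id in 1..n.

    The replacement happens only if a uniform random number falls below
    mutation_prob; otherwise the habitat is returned unchanged.
    """
    if rng.random() < mutation_prob:
        position = rng.randrange(len(habitat))
        mutated = habitat.copy()
        mutated[position] = rng.randint(1, n)
        return mutated
    return habitat


if __name__ == "__main__":
    rng = random.Random(7)
    h4 = [12, 45, 7, 88, 3, 61, 29, 54, 90, 16]  # hypothetical habitat
    print(mutate(h4, 0.5, 100, rng))              # 0.5 is an assumed probability
```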

4.1.6 Pseudo-code of BBO-based cluster head selection algorithm

4.2 BBO-based routing algorithm

In the second phase, the near-optimal route from each CH to the BS is computed based on residual energy, distance and node degree of CH.

4.2.1 Representation of habitat

In routing, each habitat represents the data forwarding path from every CH to the BS. The dimension of each habitat is equal to the total number of CHs in the network.

4.2.2 Initialization of habitat

Here, the dimension of the habitat is equal to the number of CHs in the network. Let $H_{i}= (H_{i,1}(t), H_{i,2}(t),\ldots ,H_{i,m}(t))$ be the $i\hbox {th}$ habitat, where each position $H_{i,d}, 1\le d \le m$ denotes next-hop ($\hbox {CH}_j$) toward the BS as shown in Fig. 7.

Example 1

Let the number of CHs be 10 and let the BS be denoted by Id 11, as shown in Fig. 6. Therefore, the dimension of a habitat is 10. Every position $H_{i,d}, 1\le d \le 10$ is initialized with a randomly generated next-hop within the range of the corresponding CH, as shown in Fig. 7. The routing path from every CH to the BS is given in Table 2.
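The next-hop encoding can be sketched as follows, with CHs numbered 1..m and the BS given Id m+1 as in Example 1. Two rules in this sketch are assumptions that go beyond the text: a candidate next-hop must be strictly closer to the BS than the current CH (so that traced routes cannot loop), and a CH with no candidate in range falls back to the BS.

```python
import math
import random


def init_routing_habitat(ch_xy, bs_xy, comm_range, rng):
    """Return a habitat whose d-th position is the next-hop Id of CH d."""
    bs_id = len(ch_xy) + 1
    habitat = []
    for d, p in enumerate(ch_xy, start=1):
        d_bs = math.dist(p, bs_xy)
        # Assumed rule: next-hop CHs must be in range and closer to the BS.
        candidates = [j for j, q in enumerate(ch_xy, start=1)
                      if j != d
                      and math.dist(p, q) <= comm_range
                      and math.dist(q, bs_xy) < d_bs]
        if d_bs <= comm_range:
            candidates.append(bs_id)
        # Assumed fallback: transmit directly to the BS if nothing is in range.
        habitat.append(rng.choice(candidates) if candidates else bs_id)
    return habitat


def trace_route(habitat, ch_id):
    """Follow next-hop entries from ch_id until the BS Id is reached."""
    bs_id = len(habitat) + 1
    path = [ch_id]
    while path[-1] != bs_id:
        path.append(habitat[path[-1] - 1])
    return path


if __name__ == "__main__":
    rng = random.Random(0)
    chs = [(10.0, 0.0), (20.0, 0.0), (30.0, 0.0)]  # hypothetical CH positions
    habitat = init_routing_habitat(chs, (0.0, 0.0), 12.0, rng)
    print(habitat, trace_route(habitat, 3))
```

Because every hop moves strictly closer to the BS under the assumed rule, `trace_route` always terminates.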

4.2.3 Derivation of fitness function

The fitness function is formulated from the following parameters: residual energy, Euclidean distance and node degree.

(a)
Residual energy of next-hop node: our first objective considers the residual energy of the next-hop (NH) node used to relay the data toward the BS. If the residual energy of an NH node is high, it is a more preferable choice to receive, aggregate and transmit data to the next CH or the BS. So, our first objective in terms of residual energy is $g_1$, which is maximized as follows: