1 Introduction

As compared to centralized bulk power generation, transmission and distribution of power at the load centers constitute major challenges for the utilities. Hence, distributed generation (DG) is a viable option where power is generated near the load centers. DG comprises both fossil fuel-based conventional and nonconventional energy sources like solar power, hydro-, biofuel, geothermal, etc., ranging from few kilowatts to about 50 MW (Singh et al. 2009). Competitive markets, environmental issues and reliability of power sources are some of the major criteria for the selection of energy sources. Depletion of fossil fuel sources and continuous improvement in the area of nonconventional ones have been the motivating factors for the utilities to go for DG.

Proper siting and sizing of DGs is crucial for better distribution system performance. Optimal sizing and placement of DGs play a crucial role in the reduction of power losses, line loading and system operating costs along with improvement in system reliability and prevention of voltage collapse. Earlier, for the voltage profile improvement in power systems, capacitor banks and synchronous condensers were predominantly used (Nagrath and Kothari 2007). But nowadays, to improve the performance of power systems, both active and reactive power injections are carried out using DGs. Therefore, finding the optimal location and size of DGs has been a global challenge for both the academia and the industry.

Several comprehensive research works have been reported in the area of optimal allocation of DGs to improve distribution system performance. A noniterative analytical method for single DG placement in both radial and meshed systems has been presented in Wang and Nehrir (2004) to minimize system power losses for both time-variant- and -invariant-type loads in the IEEE 6-bus and IEEE 30-bus distribution systems. A very efficient analytical method employing BIBC approach for siting and sizing of DGs in the IEEE 12-bus, 34-bus and 69-bus system has been proposed in Gözel and Hocaoglu (2009). However, Gözel and Hocaoglu (2009) is limited to placement of a single DG. An analytical method for multiple DG placements based on the active and reactive components of the branch currents has been reported in Gopiya Naik et al. (2015). Hung et al. (2010) have presented an improved analytical (IA) method for single DG placement and has highlighted its superiority over existing methods based on the assumption that the DG power factor is same as the aggregate load power factor. This method reduces the computational time significantly while considering various types of DGs to minimize power losses and improve the voltage profile of the network. Although analytical methods for siting and sizing of DGs have fast convergence, with increase in the types and number of DGs, the computational complexity increases and may lead to nonoptimal solutions.

Advances in soft computing techniques have led to the development of several optimization algorithms for optimal allocation of DGs. In this perspective, Singh et al. (2008), Ameli et al. (2014), Gomez-Gonzalez et al. (2012), Moradi and Abedini (2012), Niknam et al. (2011), Saha and Mukherjee (2016), Abu-Mouti and El-Hawary (2011), Sheng et al. (2016), Mohanty and Tripathy (2016) and Martín García and Gil Mena (2013) have presented some comprehensive research works on the optimal allocation of Type 1 DGs. Singh et al. (2008) have carried out DG placement using genetic algorithm (GA) in the IEEE 16-bus, 37-bus and 75-bus radial distribution networks with different loading conditions. However, GA requires more computational time while suffering from premature convergence. Multi-objective particle swarm optimization (PSO) for DG placement in the IEEE-33-bus radial distribution system (RDS) has been proposed by Ameli et al. (2014) for power loss reduction, improvement of voltage profile and stability. Gomez-Gonzalez et al. (2012) have employed discrete PSO for optimal DG allocation to reduce power losses in the IEEE 30-bus radial distribution network. A hybrid GA-PSO-based algorithm for optimal DG allocation to reduce power losses and improve voltage profile and stability has been presented in Moradi and Abedini (2012). A multi-objective, modified honey bee mating optimization algorithm for siting and sizing of DG has been reported in Niknam et al. (2011). The proposed method avoids strong local convergence as compared to the original honey bee mating optimization algorithm. Several nature-inspired algorithms have also been proposed by researchers for siting and sizing of DGs. Some notable ones include the chaotic symbiotic organism search (CSOS) algorithm (Saha and Mukherjee 2016) and the artificial bee colony (ABC) algorithm (Abu-Mouti and El-Hawary 2011). Sheng et al. (2016) have implemented multi-objective harmony search algorithm (HSA) for optimal DG allocation in the 33-bus RDS. TLBO method for the optimal siting and sizing of DGs has been reported in Mohanty and Tripathy (2016). A multi-objective, modified teaching–learning-based optimization algorithm for the optimal allocation of DGs in the 33-bus, 69-bus and 118-bus RDS has been presented in Martín García and Gil Mena (2013).

1.1 Motivation and aim

Although analytical methods have no convergence issues, with an increase in the number and type of DGs, their complexity increases, with a consequent increase in the computational time. This is particularly true for multi-objective formulations with a large number of equality and inequality constraints. Analytical approaches for optimal DG allocation in distribution systems require more robust algorithms (Arqub and Abo-Hammour 2014; Abu Arqub et al. 2016; Arqub et al. 2017; Abu Arqub 2017) to solve differential and nonlinear equations. In this respect, heuristic methods for siting and sizing of DGs do not involve differential equations. However, the algorithms need to be tuned properly to reach the global solution for DG sizing and placement. Recently, TLBO method (Rao et al. 2011) has been developed which is almost independent of the optimization parameters and possesses excellent convergence characteristics for lesser number of variables. However, it has the tendency to be trapped in strong local minima when the number of variables increases. A modified TLBO algorithm for DG placement has been reported in Martín García and Gil Mena (2013) where additional mutation phase is considered for finding the global solution. Sultana and Roy (2014) have presented a quasi-oppositional TLBO (QOTLBO) for DG placement in RDS. However, these features add complexity to the TLBO and increase the computational time. An improved TLBO has been reported in Kanwar et al. (2015) in which a crossover rate and a crossover parameter have to be specified. However, both of them need to be tuned to achieve a satisfactory convergence. Harmony search (HS) is another swarm intelligence algorithm which shows good exploration capability but poor exploitation capability for global solutions. Several HS variants have been suggested to solve complex optimization problems, such as SGHS (Self-Adaptive Global Harmony Search Algorithm) (Pan et al. 2010), IHS (Mahdavi et al. 2007), ITHS (Intelligent Tuned Harmony Search algorithm) (Yadav et al. 2012), EHS (Enhanced Harmony Search) (Das et al. 2011), NGHS (Nobel Global Harmony Search) (Zou et al. 2010), DIHS (Tuo et al. 2015), NDHS (Chen et al. 2012) and DSHS (Kattan and Abdullah 2013). However, these improved versions of HS are also unable to handle complex optimization problems of high dimensionality and modality. IHS lacks in precise solution. It is observed that NGHS, SGHS and NDHS get easily trapped into strong local minima/maxima. Although solutions provided by EHS and DSHS are satisfactory, they require more convergence time for global solution with high-dimensional problems. It is observed that TLBO exhibits good search space exploitation capability while HS has good exploration capability. So, a proper integration of the merits of TLBO and HS would result in a better optimization technique for high-dimensional and multimodality problems.

To get a good balance of exploration and exploitation of the search space, a new optimization algorithm based on the hybridization of HS and TLBO is presented in this paper. In this algorithm, at first the HS algorithm is utilized to explore the search space with a high probability of finding the global solution. Subsequently, TLBO carries out exploitation of the search space for the global solution. For choice of TLBO or HS, the proposed algorithm uses self-adaptive selection probability.

In this work, at the outset, the performance of the proposed algorithm is tested on standard mathematical benchmark functions. Subsequently, a multi-objective problem of optimal DG allocation in the IEEE 33-bus, 69-bus and 118-bus RDS is solved by the ɛ-constraint method using HTLBO. The proposed HTLBO is compared with existing TLBOs and QOTLBO. The results validate the proposed method.

The contribution of the paper can be summarized as follows:

  • Development of a HTLBO algorithm which is competent in handling continuous as well as discrete variables with constrained and unconstrained optimization problems.

  • HTLBO utilizes the exploration quality of HS and the exploitation quality of TLBO. It has the capability to escape strong local maxima/minima convergence, thus yielding a global solution.

  • A comparative analysis for solving multi-objective problems based on two approaches—the weighted sum and the ɛ-constraint method—is presented. The later is independent of penalty coefficients.

  • HTLBO is validated with standard mathematical benchmark functions (both single- and multi-objective). The results are compared with global best ABC (g-ABC), parallel ABC (P-ABC) and improved TLBO (ITLBO) for single-objective functions. On the other hand, for multi-objective functions, the results are compared with NSGA-II.

  • The impact of parameter variations of the proposed algorithm on the solution is investigated, which shows that proper tuning of algorithm parameters is required for global solution.

  • Optimal siting and sizing of DGs is analyzed in the IEEE 33-bus, 69-bus and 118-bus radial distribution test systems to improve the network active power losses, voltage profile and voltage stability index.

1.2 Paper layout

The paper is organized as follows. Section 2 presents the mathematical problem formulation of the work carried out in this paper. Section 3 illustrates the basics of TLBO and HS algorithms. Section 4 presents several modifications carried out in the TLBO and HS algorithms to improve their performance. Section 5 illustrates the flowchart of the proposed HTLBO. Section 6 demonstrates the effectiveness and validity of the proposed HTLBO on mathematical benchmark function. The simulation results of DG allocation in various distribution networks are also presented in this section. Finally, conclusions are presented in Sect. 7.

2 Problem formulation

The proposed work has considered the following assumptions:

  1. a.

    All RDS are balanced.

  2. b.

    Unity power factor DGs have been employed in RDS.

  3. c.

    Nominal load level and constant power load model is assumed.

  4. d.

    The outputs of all DGs are time invariant.

2.1 Single-objective function (SOF)

The objective of allocation of DGs in RDS is to reduce the network active power losses and enhance the voltage profile and VSI while satisfying all working constraints. Descriptions of these SOF are given below:

2.1.1 Real power loss P loss

The real power loss of the network is the primary concern while allocating DGs in RDS. Various formulations (Das et al. 1994; Teng 2003; Goswami and Basu 1991) exist in the literature for load flow studies of RDS. The \( P_{\text{loss}} \) can be calculated as shown below (Gözel and Hocaoglu 2009):

$$ P_{\text{loss}} = \mathop \sum \limits_{j = 1}^{\text{nb}} I_{j}^{2} R_{j} \quad {\text{where}} \quad I_{j} = {{\mathop \sum \limits_{k = 1}^{m} \left( {\left( {P_{L,k} - P_{DG,k} } \right)^{2} + Q_{L,k}^{2} } \right)}/ {\left| {V_{k}^{2} } \right|}} $$
(1)

So, SOF to minimize real power loss is shown below:

$$ F_{1} = {\text{Minimize}}\;P_{\text{loss}} $$
(2)

In Eq. (1), PL,k and QL,k are the real and reactive load demand at the kth bus, PDG,k is the real power injected by the DG at the kth bus, ‘\( \left| {V_{k} } \right| \)’ is the voltage magnitude at the kth bus, ‘Rj’ is the resistance of the jth line, ‘n’ is the total number of buses in the network, ‘nb’ is the total number of branches in the network and ‘m’ is the number of buses beyond branch ‘j.

2.1.2 Voltage profile

It is desirable to keep the voltage magnitude (Vk) at a bus between 1.05 p.u. (maximum) and 0.95 p.u. (minimum). Real power injections by DGs allocated optimally enhance the voltage profile of the RDS. So, SOF (F2) to minimize voltage deviation index (VDI) is formulated as shown below:

$$ F_{2} = \mathop \sum \limits_{k = 1}^{n} \left( {V_{k} - V_{\text{rated}} } \right)^{2} , $$

where ‘Vk’ and ‘\( V_{\text{rated}} \)’ are expressed in p.u.

2.1.3 Voltage stability index (VSI)

The characterization of voltage profile of RDS is analyzed by its VSI. For stable operation of the network, the VSI must be greater than zero. So, VSI should be maximum to ensure maximum stability of RDS. The VSI formulation of a RDS is represented by Eq. (4) (Chakravorty and Das 2001) and is shown in Fig. 1.

$$ VSI_{k + 1} = \left| {V_{k} } \right|^{4} - 4\left\{ {P_{k + 1} X_{j} - Q_{k + 1} R_{j} } \right\}^{2} - 4\left\{ {P_{k + 1} R_{j} + Q_{k + 1} X_{j} } \right\}\left| {V_{k} } \right|^{2} $$
(4)

The SOF (\( F_{3} \)) to maximize the VSI is shown below:

$$ F_{3} = {\text{maximize}}\,{\text{VSI,}} $$
(5)

where \( {\text{VSI}}_{k} \) and \( {\text{VSI}}_{k + 1} \) are the VSI of the kth and (k + 1)th bus, respectively and ‘Rj’ and ‘Xj’ denote the resistance and reactance, respectively, of the branch ‘j’ incorporated between the kth and the (k + 1)th nodes as shown in Fig. 1a. Similarly, ‘Pk+1’ and ‘Qk+1’ are the real and reactive power demands, respectively, at the (k + 1)th node of the RDS.

Fig. 1
figure 1

Equivalent circuit of the jth branch of the network between buses ‘k’ and ‘(k + 1)’

2.2 Multi-objective function

A multi-objective function (MOF) considers all the objective functions simultaneously for either maximization or minimization, while satisfying the equality and inequality constraints. In this paper, a MOF (Sultana and Roy 2014) simultaneously minimizes power loss (F1) and VDI (F2) and maximizes VSI (F3) by the ɛ-constraints method (Deb 2001), satisfying operational constraints like bus voltage limits, line thermal limits, DGs penetration limits and network power balance as given below:

$$ {\text{MOF}} = {\text{Minimize}}\;f\mu \left( x \right) $$
(6)
$$ {\text{Subject}}\,{\text{to}}\;f_{m} \left( x \right) \le \varepsilon_{m } \quad m = 1,2,3 \ldots . \;{\text{and}}\;m \ne \mu ; $$
$$ g_{j} \left( x \right) \ge 0, \quad j = 1,2,3 \ldots .; $$
$$ h_{l} \left( x \right) = 0,\quad l = 1,2,3 \ldots .; $$
$$ x_{i}^{\left( L \right)} \le x_{i} \le x_{i}^{\left( U \right)} , i = 1,2,3 \ldots .; $$

where one of the objectives of the MOF is maximized/minimized, keeping the remaining objectives as constraints. Here, \( f\mu \left( x \right) \) is the power loss while \( f_{m} \left( x \right) \) comprises both the VDI and the inverse of VSI as constraints. The parameter \( \varepsilon_{m } \) represents the upper bound of the value of \( f_{m} \). \( g_{j} \) comprise the bus voltage limits, line thermal limits, DGs penetration limits while \( h_{k} \) constitutes equality constraints (i.e., power balance), respectively. \( x_{i}^{\left( L \right)} \) and \( x_{i}^{\left( U \right)} \) are the variables’ lower and upper limits, respectively.

2.2.1 Equality constraints

The SOF and the MOF are subject to the following constraints for the optimal allocation of DGs in the RDS as given below.

2.2.2 Active and reactive power balance constraints

$$ P_{\text{sub}} + P_{\text{DG}} = P_{\text{loss}} + P_{\text{D}} \quad {\text{and}}\quad Q_{\text{sub}} + Q_{\text{DG}} = Q_{\text{loss}} + Q_{\text{D}} $$
(7)
$$ Q_{\text{loss}} = \mathop \sum \limits_{j = 1}^{\text{nb}} I_{j}^{2} X_{j} $$
(8)

where \( P_{\text{sub}} /Q_{\text{sub}} \) are the active/reactive powers supplied by the substation, \( P_{\text{D}} /Q_{\text{D}} \) are the total active/reactive power demands of the load and \( Q_{\text{loss}} \) is the reactive power loss in the distribution network.

2.2.3 Voltage limit constraint

The voltage magnitudes (\( V_{k} \)) of all the buses of RDS must be within limits of \( V_{{\max} } \;(1.05 \,{\text{p}} . {\text{u}} .) \)) and \( V_{ {\min} } \;(0.95\,{\text{p}} . {\text{u}} . ) \)

$$ V_{{\min} } \le V_{k} \le V_{{\max} } \quad k = 1, 2, 3, 4 \ldots n $$
(9)

2.2.4 Thermal limit (Kanwar et al. 2017)

$$ I_{j} \le I_{j}^{{\max} } \quad j = 1,2, 3, 4 \ldots {\text{nb}} $$
(10)

where \( I_{j} \,{\text{and}}\,I_{j}^{{\max} } \) are the actual and the permissible loading of the branch ‘j’ of RDS.

2.2.5 Real power limit and reactive power limit (Moravej and Akhlaghi 2013)

$$ P_{DG,k}^{{\min} } \le P_{DG,k} \le P_{DG,k}^{{\max} } $$
(11)
$$ Q_{DG,k}^{{\min} } \le Q_{DG,k} \le Q_{DG,k}^{{\max} } $$
(12)

where \( P_{{{\text{DG}},k}}^{{\min} } \) and \( P_{{{\text{DG}},k}}^{{\max} } \) are the minimum and maximum injected active power limits, respectively, of the kth DG while \( Q_{{{\text{DG}},k}}^{{\min} } \) and \( Q_{{{\text{DG}},k}}^{{\max} } \) are the minimum and maximum injected reactive power limits, respectively, of the kth DG.

2.2.6 Maximum penetration of DG units in the system (Zhang et al. 2015)

$$ \mathop \sum \limits_{k = 1}^{n} P_{{{\text{DG}},k}} = \% J* \mathop \sum \limits_{k = 1}^{\text{nb}} P_{L,k} , $$
(13)

where J is maximum penetration of DGs in the RDS.

2.3 TLBO algorithm

TLBO algorithm was first introduced by Rao et al. (2011). TLBO is a nature-inspired algorithm and is comprised of two stages, i.e., Teaching phase and Learning Phase. In ‘teaching phase’, the best learner, i.e., teacher, transfers his knowledge to the other remaining learners to enhance their knowledge. Subsequently, in the ‘learning phase’, each learner interacts with other fellow learners to further improve his/her own knowledge. These two stages are repeated till the TLBO proceeds toward the global best knowledge (global solution).

2.3.1 Teaching phase

The teacher keeps on trying to transfer his knowledge to the remaining learners, to the best of his capacity. Transfer of the teacher’s knowledge can be utilized to improve the remaining learners/variables. It is mathematically expressed by Eq. (14). This can be illustrated with an example that to fulfill the objective of improving the knowledge of the students of a particular class in a particular subject. The student having the maximum marks is considered as the teacher (\( X_{{{\text{Teacher}},i}} ) \). The teacher’s duty is to improve the old marks (\( X_{{{\text{old}},i}}^{\text{HMS}} ) \) of other students to new marks \( (X_{{{\text{new}},i}}^{\text{HMS}} ) \) toward the mean marks (\( M_{i} \)) of that particular subject/variables utilizing his own marks (\( X_{{{\text{Teacher}},i}} \)). So, a random process takes place to improve the marks of the remaining students of the class. For each individual, new marks are generated by:

$$ X_{{{\text{new}},i}}^{\text{HMS}} = X_{{{\text{old}},i}}^{\text{HMS}} + {\text{rand}}*\left( {X_{{{\text{Teacher}},i}} - T_{\text{F}} M_{i} } \right), $$
(14)

where ‘TF’ is defined as ‘Teaching factor’ which is randomly selected as either 1 or 2 (Rao et al. 2011). The newly generated solution vector (Xnew) is accepted if its fitness (marks) is better than the old solution vector. In Eq. (14), \( X_{{{\text{old}},i}}^{\text{HMS}} \) and \( X_{{{\text{new}},i}}^{\text{HMS}} \) are the old and the new variables, respectively, while ‘rand’ is a randomly generated number between 0 and 1.

2.3.2 Learner phase

Learners’ knowledge can be further improved by one’s own effort. Students of the class coordinate with each other randomly which enhances their perceptive about a particular subject. Mathematically, the learner phase can be explained as per Eqs. (15, 16):

$$ X_{{{\text{new}},i}}^{\text{HMS}} = X_{{{\text{old}},i}}^{\text{HMS}} + {\text{rand}}*\left( {X_{j} - X_{k} } \right)\quad {\text{if}}\;F\left( {X_{j} } \right) < F\left( {X_{k} } \right), $$
(15)
$$ X_{{{\text{new}},i}}^{\text{HMS}} = X_{{{\text{old}},i}}^{\text{HMS}} + {\text{rand}}*\left( {X_{k} - X_{j} } \right)\quad {\text{if}}\;F\left( {X_{j} } \right) > F\left( {X_{k} } \right), $$
(16)

where ‘i’, ‘j’, ‘k’ are different learners in the class. F(X) shows the fitness/knowledge (marks) of a particular learner in a subject. If the fitness corresponding to the set of ‘\( X_{{{\text{new}},i}}^{\text{HMS}} \)’ is better than that corresponding to the set of ‘\( X_{{{\text{old}},i}}^{\text{HMS}} \)’, a set of \( X_{{{\text{new}},i}}^{\text{HMS}} \) is considered; otherwise, it is discarded.

So these two phases, i.e., teaching and learning phases, constitute an iteration. Several iterations of knowledge transferring and sharing ensure the global solution/best knowledge by the TLBO algorithm. Although this algorithm is almost parameter independent compared to other population-based optimization techniques, premature convergence may occur due to strong local maxima/minima trappings for multimodal problems.

2.4 Harmony search algorithm (HSA)

HSA mimics the improvisation of music players. The sounds for better aesthetic estimation can be improved through more and more practice just as the value of an objective function improves iteration by iteration.

The steps of the HSA are as follows:

  • Step 1 Initialize the harmony memory (HM) and algorithm parameters

At the beginning, a matrix of HM is formed, where each row represents a set of decision variables (\( X_{{{\text{new}},i}}^{\text{HMS}} \)) for the objective function F (\( X_{{{\text{new}},i}}^{\text{HMS}} \)) defined for the optimization problem. Each decision variable \( X_{{{\text{new}},i}}^{\text{HMS}} \) is computed using Eq. (17).

$$\begin{aligned} X_{{\text{new}},i}^{\text{HMS}} & = X_{i\,{\min}} +{\text{rand}}* \left({X_{i\,{\max}}} - X_{i\,{\min}}\right)\\ &\quad \forall i, = 1,2,3 \ldots N\;\left({\text{number}}\,{\text{of}}\,{\text{variables}}\right), \end{aligned} $$
(17)

where \( X_{i\,{\min} } \) and \( X_{i\,{\max} } \) are the minimum and the maximum value of marks (variable), respectively.

Harmonic memory size (HMS) represents numbers of the set of decision variables in HM, harmony memory consideration rate (HMCR) represents the probability of the new value of decision variables for HM, pitch adjustment rate (PAR) represents the probability of shifting the decision variables to neighboring values within the possible range and NI represents the stopping criterion.

$$ {\text{HM}} = \left[ {\begin{array}{*{20}l} {X_{1}^{1} } \hfill & {X_{2}^{1} } \hfill & {X_{3}^{1} } \hfill & \cdots \hfill & {X_{{N - 1}}^{1} } \hfill & {X_{N}^{1} } \hfill \\ {X_{1}^{2} } \hfill & {X_{2}^{2} } \hfill & {X_{3}^{2} } \hfill & {} \hfill & {X_{{N - 1}}^{2} } \hfill & {X_{N}^{2} } \hfill \\ \vdots \hfill & \vdots \hfill & \vdots \hfill & \cdots \hfill & \vdots \hfill & \vdots \hfill \\ {X_{1}^{{{\text{HMS}} - 1}} } \hfill & {X_{2}^{{{\text{HMS}} - 1}} } \hfill & {X_{3}^{{{\text{HMS}} - 1}} } \hfill & \cdots \hfill & {X_{{N - 1}}^{{{\text{HMS}} - 1}} } \hfill & {X_{N}^{{{\text{HMS}} - 1}} } \hfill \\ {X_{1}^{{{\text{HMS}}}} } \hfill & {X_{2}^{{{\text{HMS}}}} } \hfill & {X_{3}^{{{\text{HMS}}}} } \hfill & {} \hfill & {X_{{N - 1}}^{{{\text{HMS}}}} } \hfill & {X_{1}^{{{\text{HMS}}}} } \hfill \\ \end{array} } \right] $$
(18)
  • Step 2 Improvise new harmony

A new set of decision variables is generated based on the three parameters: (1) HMCR, (2) PAR and (3) random selection. The generation of the new decision variable of HM is known as improvisation. The probability of HMCR varies between 0 and 1. It is the probability of choosing the variable value from the HM, while (1-HMCR) is the probability of selecting the variable from the possible range of values, as given below:

$$ \begin{aligned} & {\text{if}}\;{\text{rand}} < {\text{HMCR}} \\ & \quad X_{{{\text{new}},i}}^{\text{HMS}} \leftarrow X_{{{\text{old}},i}}^{\text{HMS}} \in \left\{ {\begin{array}{*{20}c} {X_{i}^{1} } & {\begin{array}{*{20}c} {X_{i}^{2} } & \cdots & {\begin{array}{*{20}c} \cdots & \cdots & {X_{i}^{\text{HMS}} } \\ \end{array} } \\ \end{array} } \\ \end{array} } \right\} \\ \end{aligned} $$
(19)
$$ \begin{aligned} & {\text{else}} \\ & \quad X_{{{\text{new}},i}}^{\text{HMS}}= X_{i\,{\min} } + {\text{rand}}*\left( {X_{i\,{\max} } - X_{i\,{\min} } } \right)\quad {\text{where}}\,\\ & i, = 1,2,3 \ldots N \\ & {\text{end}} \\ \end{aligned} $$
(20)

HMCR decides whether to adjust the pitch of each decision variable or not. After considering the decision variable by HMCR, the pitch adjustment is decided by PAR as given below:

$$ \begin{aligned} & {\text{if}}\; {\text{rand}} < {\text{PAR}} \\ & \quad \quad \quad X_{{{\text{new}},i}}^{\text{HMS}} \leftarrow X_{old,i}^{HMS} + rand*BW \\ \end{aligned} $$
(21)
$$ \begin{aligned} & {\text{Else}} \\ & \quad \quad \quad X_{{{\text{new}},i}}^{\text{HMS}} \leftarrow X_{{{\text{old}},i}}^{\text{HMS}} - {\text{rand}}*{\text{BW}} \\ & {\text{end}} \\ \end{aligned} $$
(22)

where BW is known as the bandwidth. The value of BW lies between 0 and 1.

  • Step 3 Update harmony memory

If the new set of decision variables, i.e.,

$$ X_{{{\text{new}}}}^{\text{HMS}} = \left\{{\begin{array}{*{20}c} {X_{1}^{\text{HMS}} } & {\begin{array}{*{20}c} {X_{2}^{\text{HMS}} } & \cdots & {\begin{array}{*{20}c} \cdots & \cdots & {X_{N}^{\text{HMS}} } \\ \end{array} } \\ \end{array}}\\ \end{array}}\right\} $$
(23)

gives better objective function F (\( X_{{{\text{new}}}}^{\text{HMS}} \)) values than the worst set of decision variables in HM, the worst set of decision variables is replaced by the new set of decision variables.

  • Step 4. Termination

The HSA keeps on repeating steps 1, 2 and 3, till maximum number of improvisation (NI) is met, which is set during the parameter initialization process.

There are several modifications proposed in TLBO and HS, so that the individual algorithm has better exploration and exploitation capabilities before hybridization of TLBO and HS is carried out.

3 Alterations in the TLBO and HS algorithms

3.1 Alterations in the TLBO algorithm

The following alterations are adopted to enhance the global convergence of the TLBO and evade strong local minima/maxima trapping.

3.1.1 Alteration in teaching phase

In basic TLBO, the new set of variables is based on the mean marks of the particular subject and the teacher’s knowledge. In the suggested algorithm, the solution vector with the worst fitness is considered instead of the mean (Mi). This ensures knowledge improvement of the weakest student of the class, resulting in better knowledge transfer to every student of the class. Thus, the new generated vector ‘\( X_{{{\text{new}},i}}^{\text{HMS}} \)’ will have a higher probability of reaching the global solution/knowledge.

This modification can be mathematically represented by Eq. (24):

$$ X_{{{\text{new}},i}}^{\text{HMS}} = X_{{{\text{old}},i}}^{\text{HMS}} + {\text{rand}}*\left( {X_{{{\text{Teacher}},i}} - T_{\text{F}} X_{{{\text{worst}},i}} } \right), $$
(24)

where \( X_{{{\text{worst}},i}} \) is the vector having the worst fitness function within the population.

3.1.2 Alteration in T F

\( T_{F} \)’ is randomly considered as 1 or 2; i.e., knowledge transferred from the best learner (teacher) to the remaining learner is either 0% or 100%, respectively. But practically knowledge transfer could be in between 0–100%. Hence, the ‘TF’ is altered between 0 and 1, to account for practical knowledge transfer, as given below:

$$ T_{\text{F}} = \left( {{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 {\text{rand}}}}\right.\kern-0pt} \!\lower0.7ex\hbox{${\text{rand}}$}}} \right)^{a} , $$
(25)

where ‘\( a \)’ is defined as the teaching factor rate. High value of ‘\( a \)’ ensures larger search space which increases the probability of reaching the global solution. It is observed from several case studies (Sect. 6) that the value of ‘\( a \)’ between 0–5 ensures global solution.

3.1.3 Alteration in the HS algorithm

In basic HSA, the parameters HMCR, PAR and BW are constant values. The proposed algorithm adopts dynamic strategies to select the values of HMCR and PAR (Yadav et al. 2012). Dynamic selection of HMCR and PAR provides better balance in exploration and exploitation of search space in the hybrid algorithm:

$$ {\text{HMCR}} = {\text{HMCR}}_{{\max} } - \left( {{\text{HMCR}}_{{\max} } - {\text{HMCR}}_{{\min} } } \right)*\left( {\frac{{{\text{current}}\,{\text{iteration}}}}{{{\max} \,{\text{iteration}}}}} \right), $$
(26)
$$ {\text{PAR}} = {\text{PAR}}_{{\max} } - \left( {{\text{PAR}}_{{\max} } - {\text{PAR}}_{{\min} } } \right)*\left( {\frac{{{\text{current}}\,{\text{iteration}}}}{{{\max} \,{\text{iteration}}}}} \right), $$
(27)
$$ {\text{BW}} = {\text{BW}}_{{\max} } *\exp \left[ {\ln \left( {\frac{{{\text{BW}}_{{\max} } }}{{{\text{BW}}_{{\min} } }}} \right)*\frac{{{\text{current}}\,{\text{iteration}}}}{{{\max} \,{\text{iteration}}}} } \right], $$
(28)

where \( {\text{HMCR}}_{{\min} } \) and \( {\text{HMCR}}_{{\max} } \) are the minimum and the maximum values of HMCR, respectively, \( {\text{PAR}}_{{\min} } \;{\text{and}}\;{\text{PAR}}_{{\max} } \) are the minimum and the maximum values of PAR, respectively, while \( {\text{BW}}_{{\min} } \) and \( {\text{BW}}_{{\max} } \) are the minimum and maximum values of BW, respectively.

4 Proposed HTLBO algorithm

It is observed that the characteristics of TLBO and HS algorithms are complimentary to each other; i.e., HSA has excellent exploratory behavior but slow convergence while TLBO exhibits very good exploitation characteristics and fast convergence. Hence, the proposed method aims to utilize the merits of both algorithms and reach the global solution quickly. So, for the maximum exploration of the search space, initially, HSA is employed and then exploitation of the search space is done with the help of TLBO. In the proposed method, selection of the teaching phase of the TLBO algorithm and local pitch adjustment by HMCR of HSA are based on the ratio autoselection rate (ASR) defined as

$$ {\text{ASR}} = \frac{{{\text{Best}}\,{\text{fitness}}\,{\text{of}}\,{\text{HM}}}}{{{\text{Worst}}\,{\text{fitness}}\,{\text{of}}\,{\text{HM}}}} $$
(29)

As the value of ASR changes in each iteration, ASR is dynamic in nature and is self-adaptive for the selection of either of these two algorithms for the improvisation of the harmony vector. For minimization problems, as the value of best fitness will always be less than the worst fitness value, ASR will be less than 1. For maximization problems, ASR is obtained by minimizing the inverse of the fitness function values, which will also lead to ASR less than 1. Initially, ASR will be low which gives preference to the HS algorithm for better exploration of the search space. As ASR moves toward unity, teaching phase is selected for better tuning of the variables.

To minimize the demerits of HS and TLBO, the HTLBO initially utilizes the exploration characteristics of HS. Subsequently, the TLBO algorithm is implemented for better tuning and fast convergence of the solution vector. The following steps are considered for the HTLBO algorithm.

  1. A.

    Target vector selection: \( X_{\text{new}}^{\text{HMS}} = X_{\text{old}}^{\text{HMS}} \) is the harmony vector.

  2. B.

    Generation of target vector \( X_{{{\text{new}},i}}^{\text{HMS}} \) in each iteration is comprised of four steps.

  • Step 1 Autoselection rate

The selection of teaching phase or HMCR for the generation of target vector \( X_{\text{new}}^{\text{HMS}} \) depends upon the autoselection rate. ASR is found using Eq. (29).

  • Step 2 Teaching phase/harmony memory consideration rate and local pitch adjustment/mutation phase

When ASR is less than ‘r’, (‘r’ is a number between 0 and 1), the teaching phase is selected and Eq. (24) is used to generate \( X_{{{\text{new}},i}}^{\text{HMS}} \). If ASR is more than ‘r,’ HS algorithm is selected for the generation of \( X_{\text{new}}^{\text{HMS}} \). The probability of tuning the target vector is decided by the HMCR. If the random number is less than HMCR, local pitch adjustment of \( X_{{{\text{new}},i}}^{\text{HMS}} \) is carried out by using Eq. (27); otherwise, mutation phase is carried out according to Eq. (20) to generate the new target vector \( X_{\text{new}}^{\text{HMS}} \) in the feasible space. If the fitness of the target vector \( X_{\text{new}}^{\text{HMS}} \) is better than that of the worst fitness vector, then \( X_{{{\text{worst}},i}} \) is replaced by \( X_{\text{new}}^{\text{HMS}} \).

  • Step 3 Learning phase

This phase enhances the knowledge (fitness) of the target vector \( X_{{{\text{new}},i}}^{\text{HMS}} \) by interacting with the randomly selected harmony vector from harmony memory. Improvement of \( X_{\text{new}}^{\text{HMS}} \) is done according to Eqs. (15, 16). If the fitness of the target vector \( X_{\text{new}}^{\text{HMS}} \) is better than the fitness of the worst fitness vector, then \( X_{{{\text{worst}},i}} \) is replaced by \( X_{\text{new}}^{\text{HMS}} \).

  • Step 4 Dynamic adjustment of parameters

For the efficient operation of the HTLBO algorithm, the HMCR, PAR and the BW parameters are dynamically tuned, in each iteration, as per Eqs. (26), (27) and (28).

C. If the target vector generated has better fitness, then the HM is updated; otherwise, the next iteration is started. If the number of iterations satisfies the condition for termination, the algorithm is terminated and the best solution vector of HM is selected as the solution of the objective function.

The HTLBO algorithm parameters are detailed in Table 5.

The above steps of the proposed algorithm can be well understood by the flowchart shown in Fig. 2.

Fig. 2
figure 2

Flowchart for the proposed HTLBO algorithm

5 Case studies and results

The suggested HTLBO algorithm is first validated on standard mathematical benchmark functions. The HTLBO algorithm is tested for 20, 30 and 50 variables, and the results are shown in Table 1. Subsequently, the suggested HTLBO was used for optimal allocation of Type 1 DGs in the IEEE 33-bus, 69-bus and 118-bus RDS using ɛ-constraints method for the MOF. The MOF considers all the three SOFs simultaneously and results in better F1, F2 and F3 than TLBO and QOTLBO, respectively, as shown in subsequent tables. The proposed HTLBO algorithm was implemented in MATLAB R2015a environment on a Intel i5-4570, 3.2 GHz processor, 4 GB RAM, desktop PC.

Table 1 Mathematical benchmark functions (Rao et al. 2011)

5.1 Test case 1: Mathematical benchmark validation

The performance of the proposed HTLBO algorithm is validated on 11 mathematical benchmark test functions (Martín García and Gil Mena 2013) as given in Table 1. A comparison of the proposed HTLBO vis-à-vis other algorithms (Rao et al. 2011) is shown in Table 2. It is observed that the proposed method gives the best results with Penalized, Penalized 2 and Rosenbrock functions, while it yields results identical to ITLBO (improved TLBO) with Sphere, Schwefel 2.22, Schwefel 1.2, Griewank, Ackley and Rastrigin functions. However, it is observed that for the Schwefel 2.21 and Step 2 functions, ITLBO gives the best result. For the purpose of demonstration, Pareto sets are obtained for some multi-objective test functions (unconstrained and constrained) shown in Table 3, when HTLBO is integrated with non-dominating sorting algorithm (Deb 2001). From Fig. 3a, it is observed that for unconstrained problems, HTLBO yields similar results with SCH and KUR, inferior results with ZDT3 and better results with ZDT4, as compared to NSGA-II (Deb 2001). For constrained problems, HTLBO gives similar result with TNK, while better results are obtained with BNH and OSY than NSGA-II, as shown in Fig. 3b. Table 4 shows the impact of variations of HTLBO parameters on the solution of the Sphere function. The parameters considered are the teaching factor rate ‘a’, HMS, maximum iteration (NI) and the number of variables (X). It is observed from Table 4 that for HMS = 10 and maximum iteration = 6000, HTLBO algorithm gives best results for all ‘a.’ Similarly, for HMS = 20 and 30, HTLBO gives best results with ‘a = 0.2.’ For HMS = 40 and 50, HTLBO gives best results with ‘a = 0.1’ while considering number of variables = 20. On the other hand, with number of variables = 30 and 50, ‘a = 0.2’ gives better result.

Table 2 Comparative results of HTLBO algorithm with other algorithms (Rao et al. 2011) over 30 runs
Table 3 Mathematical benchmark functions for multi-objective functions (Deb 2001)
Fig. 3
figure 3

a Comparative results of unconstrained multi-objective functions by NSGA-II and HTLBO for [1] SCH. [2] KUR. [3] ZDT3. [4] ZDT4. b Comparative results of constrained multi-objective functions by NSGA-II and HTLBO for [1] BNH. [2] TNK. [3] OSY

Table 4 Impact of algorithm parameters on mathematical test functions over 30 runs

5.2 Test case 2: Selection of optimal size and location of distributed generation (DG) resources in different RDSs

5.2.1 IEEE 33-bus RDS

The suggested HTLBO is first implemented for optimal allocation of DGs in the IEEE 33-bus RDS using two methods. The first one is the weighted sum method (Sultana and Roy 2014), where the MOF is formulated as a weighted sum of the three SOFs as detailed in ‘Appendix.’ The second approach comprises the modification of the MOF into a SOF using the ɛ-constraints method (Deb 2001). In both the methods, the objective is simultaneous minimization of the network active power losses and voltage deviation together with VSI maximization. The detailed network data are given in Baran and Wu (1989). It has 33 nodes, three laterals, 37 branches with five tie switches normally kept open. The nominal voltage rating is 12.66 kV. The nominal load demand of the RDS is 3.72 MW and 2.3 MVAr, respectively. The base case total active power losses are 210.998 kW while the total reactive power losses are 143 kVAr. The base case VSI of this RDS is 0.667168 (Injeti and Prema Kumar 2013). The network base KVA is 1000 KVA (Baran and Wu 1989). Three DGs (Type 1) are selected for optimal allocation in this RDS. The HTLBO parameters are selected as shown in Table 5.

Table 5 HTLBO algorithm parameters

The weight factors ‘b1’, ‘b2’ and ‘b3’ (detailed in ‘Appendix’) corresponding to the three SOFs are shown in Table 6 (Sultana and Roy 2014). As shown in Table 6, when b1 = 1.0 b2 = 0.6 and b3 = 0.0.35, the suggested HTLBO shows reduction in the active power losses to 82.815 kW (from 210.998 kW in the base case). On the other hand, results of MOF with ɛ-constraints method give simultaneous improvement in all three SOFs, i.e., F1, F2 and F3 (97.5330 kW, 0.0009 p.u. and 0.9653 p.u., respectively), in comparison with both TLBO and QOTLBO, as shown in Table 6. Figure 4a shows the voltage profile enhancement of the IEEE 33-bus RDS with DGs vis-a-vis that without DG. From Fig. 4a, minimum network voltage magnitude of 0.9836 p.u. at node 33 is noticed in the presence of DGs. The network voltage profile shows a remarkable improvement than that without DG (voltage magnitude of 0.9038 p.u. at bus 18).

Table 6 Simulation results for optimal DG allocation in 33 BUS RDS using TLBO, QOTLBO and HTLBO
Fig. 4
figure 4

a Bus voltage profile of the 33-bus radial distribution system without and with DG. b Convergence characteristics of 33-bus RDS for multi-objective function for TLBO, QOTLBO and HTLBO

Figure 3b shows the convergence characteristics of the MOF corresponding to the TLBO, QTLBO and the proposed HTLBO algorithm. It is observed from Fig. 4b that the proposed algorithm (HTLBO) has the lowest fitness value, i.e., 0.4590 p.u., and the fastest convergence rate.

5.2.2 69-bus RDS

The proposed algorithm (HTLBO) is now implemented on the IEEE 69-bus RDS for optimal allocation of DGs. Again, three DGs (Type 1) have been considered. Similar to the case of the IEEE 33-bus RDS above, the MOF (for minimization of the network active power losses and voltage deviation together with maximization of VSI) is evaluated using both the weighted sum method and the ɛ-constraints method. The HTLBO parameters are again selected as shown in Table 5. The detailed network data are given in Das et al. (1994). It has 69 nodes, seven laterals, 73 branches with five tie switches normally kept open. The nominal voltage rating is 12.66 kV. Nominal load demand on the RDS is 3.8 MW and 2.69 MVAr. The base case real and reactive power losses are 224.9 kW and 102.13 kVAr, respectively. The base case VSI of this RDS is 0.6833 (Injeti and Prema Kumar 2013). The network base is 1000 KVA (Baran and Wu 1989).

As shown in Table 7, when b1 = 1.0, b2 = 0.6 and b3 = 0.0.35, HTLBO shows reduction in active power losses to 76.938 kW and improvement in the VDI to 0.0006 p.u. On the other hand, results of MOF with ɛ-constraints method give simultaneous improvement in all the three SOFs (F1, F2 and F3) to 79.431 kW, 0.0003 p.u. and 0.9770 p.u., respectively, in comparison with both TLBO and QOTLBO, as shown in Table 6. Figure 5a shows the voltage profile enhancement of the IEEE 69-bus RDS with DGs over that without DG. From Fig. 5a, it is observed that the minimum network voltage magnitude of 0.9942 p.u. occurs at bus 56 in the presence of DGs. Thus, with DGs, the network voltage profile shows a remarkable improvement than that without DG (0.9092 p.u. at bus 65). Figure 5b shows the convergence characteristics of the MOF corresponding to the TLBO, QTLBO and the proposed HTLBO algorithm. It is observed from Fig. 5b that the proposed algorithm (HTLBO) has the lowest fitness value, i.e., 0.4389 p.u., and the fastest convergence rate.

Table 7 Simulation results for MOF using TLBO, QOTLBO and HTLBO of 69-bus RDS
Fig. 5
figure 5

a Voltage distribution of 69-bus radial distribution system with and without DG. b Convergence characteristics of 69-bus RDS for multi-objective function for TLBO, QOTLBO and HTLBO

5.2.3 118-bus RDS

The proposed algorithm (HTLBO) is now implemented on the IEEE 118-bus RDS for optimal allocation of DGs. Seven DGs (Type 1) have been considered. Similar to the case of the IEEE 33- and 69-bus RDS above, the MOF (for minimization of the network active power losses and voltage deviation together with maximization of VSI) is evaluated using both the weighted sum method and the ɛ-constraints method. The HTLBO parameters are again selected as shown in Table 5. The detailed network data are given in Zhang et al. (2007). It has 119 nodes, 16 laterals, 132 branches with 15 tie switches normally kept open. The nominal voltage rating is 11 kV. Nominal load demand on the RDS is 22.709 MW and 17.041 MVAr. The base case real and reactive power losses are 1298.0916 kW and 978.736 kVAr, respectively. The base case VSI of this RDS is 0.569734 (Injeti and Prema Kumar 2013). The network base is 100 MVA (Zhang et al. 2007).

As shown in Table 8, when b1 = 1.0, b2 = 0.6 and b3 = 0.0.35, HTLBO shows improvement in both VDI and VSI as compared to both TLBO and QOTLBO, although the active power losses are inferior to both TLBO and QOTLBO. On the other hand, results of MOF with ɛ-constraints method give simultaneous improvement in all the three SOFs (F1, F2 and F3) to 658.756 kW, 0.0225 p.u. and 0.8978 p.u., respectively, in comparison with both TLBO and QOTLBO, as shown in Table 8. Figure 4a shows the voltage profile enhancement of the IEEE 118-bus RDS with DGs over that without DG.

Table 8 Simulation results for MOF using TLBO, QOTLBO and HTLBO of 118-bus RDS

From Fig. 6a, it is observed that the minimum network voltage magnitude of 0.97344 p.u. occurs at node 56 in the presence of DGs. The network voltage profile again shows a remarkable improvement over that without any DG (voltage magnitude of 0.86879 p.u. at bus 80). Figure 6b shows the convergence characteristics of the MOF corresponding to the TLBO, QTLBO and the proposed HTLBO algorithm. It is observed from Fig. 6b that the proposed algorithm (HTLBO) has the lowest fitness value of 0.4104 p.u.

Fig. 6
figure 6

a Voltage distribution of 118-bus radial distribution system with and without DG. b Convergence characteristics of 118-bus RDS for multi-objective function for TLBO, QOTLBO and HTLBO

6 Conclusions

A HTLBO technique has been presented in this paper. Its computational capability with continuous variables is first demonstrated on standard mathematical benchmark functions in the form of mean value and standard deviations. Subsequently, the proposed algorithm is implemented for optimal allocation of multiple DGs in the IEEE 33-, 69- and 118-bus RDS to validate their capability with mixed-integer variables (i.e., DGs sizes belong to continuous domain while DG locations are in the discrete domain). The allocation of DGs in RDS is carried out using two methodologies, i.e., weighted sum approach and ɛ-constraints method, to solve the MOF, which includes the minimization of real power losses and voltage deviations along with maximization of VSI. The ɛ-constraints approach yields better results than the weighted sum approach in all the objectives over TLBO and QOTLBO. The results authenticate the global convergence competence of the HTLBO. The impact of the various parameters of the proposed algorithm is also investigated. It is observed that for a certain level of DG penetration, a proper tuning of the algorithm parameters ensures global solution. It is also observed that optimal allocation of DGs results in substantial performance improvement of RDS, along with better network congestion management.