1 Introduction

Tuberculosis (TB) is a chronic bacterial infectious disease caused by Mycobacterium tuberculosis which pose a major health, social and economic burden globally, especially in low and middle income countries [17]. It is the second deadliest disease due to a single infectious agent only after HIV/AIDS [22, 64]. The surge in HIV-TB co-infection and growing emergence of multidrug-resistant TB (MDR-TB) and extensively drug resistant TB (XDR-TB) strains has further fuelled TB epidemic [62, 64]. TB usually affects the lungs (pulmonary TB) but it can also affect other sites as well (extra-pulmonary TB). Tuberculosis is transmitted by tiny airborne droplets which are expelled into the air when a person with active pulmonary TB coughs or talks [26].

Estimate show that TB has infected one-third of the world’s population with the most infections occurring in Africa and Asia [64]. In 2015, there were an estimated 10.4 million new TB cases (incidence) in worldwide, as well as an estimated 1.4 million TB-induced deaths, and an additional 0.4 million deaths resulting from TB disease among persons living with HIV [64]. Furthermore, over 95% of these deaths occurred in low- and middle-income countries where the cost of diagnosis and treatment is high, and not readily accessible [64].

Diagnosis of latent TB infections (LTBI) and prompt treatment of active cases remains an important component of effective TB control [5, 14, 30, 32, 39]. On the other hand, undetected TB infection and delay in the treatment of active TB cases leads to more severe disease conditions in the infected person which could result in wider disease spread in the community [2, 31, 51, 55, 63]. Such delays also contribute to increased infectivity in the community [5], whereby, the infected individuals unknowingly continue to serve as a reservoir for the pathogen (M. tuberculosis). Hence, this could lead to increased risk of disease transmission in the community. In fact, the effects of diagnosis of LTBI and delay in the treatment of active TB cases on the incidence and prevalence of TB were issues considered by some presenters at the 45th World Conference on Lung Health held in 2014 [25].

Health literacy, i.e., knowledge and education related to tuberculosis, as well as socio-cultural factors such as gender roles and status in the family has been identified as having considerable influence on undetected latent TB infection and delay in the treatment of active cases [31, 33, 65]. TB knowledge includes the ability to identify causes and understand the transmission path of the disease, recognize disease symptoms, and be aware of available treatment regimens such as directly observed treatment short course (DOTS); which is the TB treatment strategy recommended by the World Health Organization (WHO) [9, 20, 27, 44]. On the other hand, ignorance on the part of infected individuals about TB usually leads to postponement in seeking appropriate medical attention, and in some cases such persons will rather adopt alternative approaches, such as seeking traditional healers, before consulting DOTS facility, thereby delaying diagnosis and treatment of TB [36, 58].

A recent study in China identified financial difficulty as well as ignoring early TB symptoms or not understanding the meaning of such symptoms on the part of actively-infected persons as being responsible for delay in seeking appropriate medical attention [31]. Besides, some infected individuals are scared of undergoing TB diagnosis because of fear of having to pay for a prolong TB treatment regimen (6–8 months), which ultimately could increase the possibility of developing multidrug-resistant TB strains [10].

In some developing countries, the distance and additional transportation cost to and from a DOTS facility could also discourage actively-infected persons from seeking appropriate medical attention as soon as possible [7]. In the same vain, the life style and habit of some infected individuals have been identified as being also responsible for undetected latent TB infection and delay in the treatment of active TB cases. For example, a person who smokes cigarette may continue to trivialize early TB symptoms such as cough, thereby delaying prompt diagnosis and treatment of the disease [6].

Stigmatization and discrimination towards individuals infected with TB is another factor that could deter persons from seeking prompt TB diagnosis and treatment since it fosters social exclusion and saddens the infected person and members of their family [44, 47]. And because of the social rejection and stigmatization, some patients would rather postpone seeking appropriate medical attention due to the fear of getting to find out about their positive TB status [13].

Health system delay (HSD) in the diagnosis and treatment of LTBI as well as active TB case is usually connected with an infected patient’s visit to an health care center, but without receiving accurate diagnosis. HSD is often caused by unavailability of up-to-date diagnostic laboratory equipments, not adhering to the diagnostic procedures based on DOTS strategy recommended by the WHO [27], and difficulties in identifying TB symptoms in an infected person especially if such symptoms coexist with other chronic lung diseases and/or severe cough [3, 49, 61]. Infected individuals can also experience diagnostic delays as a result of low clinical suspicion of tuberculosis on the part of health-care practitioners. A possible explanation for such wrong or delayed diagnosis on the part of health-care provider is that pulmonary TB can manifest itself with symptoms that are very similar to other diseases such as community-acquired pneumonia (CAP), which is often treated with antibiotics like fluoroquinolones [60]. It has been shown that FQs also have antimicrobial activity against M. tuberculosis [60]. This results in a seeming improvement in the TB symptoms of the infected individual which may ultimately delay the positive diagnosis of the disease [53].

Several mathematical models have been developed and used to study the transmission dynamics of TB in a population [4, 12, 23, 40, 41, 43, 45, 52]. For example, Okuonghae [40] presented a deterministic TB model with genetic heterogeneity in susceptibility and disease progression. Zhang and Feng [67] formulated and globally analyzed a dynamical model to investigate the spread of TB in a community with isolation and incomplete treatment. Trauera et al. [54] presented a deterministic model to describe TB spread in highly endemic regions in Asia-Pacific regions. The model incorporates features such as partial and temporary vaccine efficacy, likelihood of re-infection during the latent state and after treatment, decreasing risk of acute disease after infection. Mishra and Srivastava [37] proposed a mathematical model to study the transmission of TB in human population for both sensitive and drug-resistant subjects.

The purpose of this article is to formulate and rigorously analyze a new model for the transmission dynamics of TB which provides insights into the effects of diagnosis of latent and active TB infection, as well as delay in the treatment of some active cases, on the disease burden in a population. The paper is organized as follows. In Sect. 2, the treatment model is formulated and its basic properties explored. In Sect. 3, the local asymptotic stability of the disease-free equilibrium of the model is examined. The existence of endemic equilibrium point (EEP) for a special case is investigated in Sect. 4. Backward bifurcation analysis is presented in Sect. 5. The existence and global asymptotic stability of EEP for a special case is shown in Sect. 6. Uncertainty and sensitivity analysis and numerical simulation of the model are presented in Sect. 7. Finally, a discussion of the findings from this work is presented in Sect. 8.

2 Model formulation

The total population at time t, denoted by N(t), is subdivided into eight mutually-disjoint compartments of susceptible (S(t)), new latently-infected \((E_1(t))\), diagnosed latently-infected \((E_2(t))\), undiagnosed latently-infected \((E_3(t))\), undiagnosed actively-infected (I(t)), diagnosed actively-infected with prompt treatment \((J_1(t))\), diagnosed actively-infected with delay in treatment \((J_2(t))\), and treated individuals (T(t)). Let \(\Lambda \) be the recruitment rate of individuals into the susceptible class (S). We assume that \(\mu \) is the per capita natural death rate in all epidemiological classes; hence \(1/\mu \) is the average life span of individuals in the population. Let \(\delta _i, i=1,2,3\) be the TB-induced death rates, with \(\delta _1\) being the rate for undiagnosed actively-infected persons (I), \(\delta _2\) being the rate for diagnosed actively-infected persons with prompt treatment \((J_1)\), and \(\delta _3\) being the rate for diagnosed actively-infected persons with delay treatment \((J_2)\); hence, it is reasonable to assume that \(\delta _1 \ge \delta _3 \ge \delta _2\). Susceptible individuals make contact with actively-infected persons and become infected at the rate \(\lambda \) (called the force of infection), with \(\beta \) being the transmission rate, \(\eta _1\) and \(\eta _2\) are modification parameters that accounts for the reduced likelihood of disease transmission by individuals who are diagnosed with active TB and promptly treated (\(J_1\)) and those who delay in receiving treatment (\(J_2\)), respectively, in comparison to actively-infected individuals who were not diagnosed (I); hence we assume that \(0< \eta _1 \le \eta _2 < 1\) [66].

We assume that \(p ~ (0< p < 1)\) is the fraction of individuals with new infections who develop TB fast per unit of time [48]. Hence a fraction, \((1-p)\), of the newly infected individuals (\(\lambda S\)), and previously effectively treated individuals (\(b_3\lambda T\)) who become re-infected are assumed to slowly progress in their disease condition and are moved to the latent class, \(E_1\), with \(b_3 ~ (b_3 > 1)\) being a modification parameter which accounts for the increased likelihood of infection by the previously treated individuals in comparison to wholly-susceptible individuals [59]. The assumption that \(b_3 > 1\) makes biological sense since Verver et al. [59] showed that the rate of tuberculosis reinfection after successful treatment is higher than the rate of a new TB infection.

New latently-infected individuals \((E_1)\) are either diagnosed or remain undetected, at the rate \(\sigma \), in a given unit of time [1, 5]. Hence \({1}/({\sigma + \mu })\) will be the time spent in the “newly” infected state, which is determined by how often screening is done in order to detect new infections. A fraction, \(n ~ (0< n <1)\), of new latently-infected individuals are diagnosed \((E_2)\) and placed on TB treatment at a rate \(r_0\), while the remaining fraction, \(1 - n\), remain undiagnosed \((E_3)\). Undiagnosed latently-infected persons are individually diagnosed at a rate \(\alpha \). Diagnosed latently-infected and undiagnosed latently-infected individuals are both exogenously re-infected at the rates \(b_1~ (0< b_1 <1)\) and \(b_2~ (0< b_2 <1)\), respectively. Individuals with undiagnosed latent TB infections progress to active TB at a rate k. Individuals with undiagnosed active TB cases (I) are detected at a rate \(\kappa \), with a fraction, \(q ~ (0< q < 1)\), of these detected active cases being promptly treated at a rate \(r_1\), while the remaining fraction, \(1 - q\), experiencing some delay before the commencing treatment at a rate \(r_2\); hence it is makes biological sense to assume that \(r_1 \ge r_2\).

Based on the assumptions above, our formulated TB model is given by the following system of nonlinear ordinary differential equations in (2.1), and a flow diagram of the model is given in Fig. 1. The associated variables and parameters used for the formulation are defined in Table 1, while the values and ranges of the parameters used for carrying out numerical simulation of the treatment model (2.1) are listed in Table 2

$$\begin{aligned} \begin{aligned} \frac{dS}{dt}&= \Lambda - \lambda S - \mu S, \\ \frac{dE_1}{dt}&= (1 - p)\lambda (S + b_3T) - (\sigma + \mu )E_1, \\ \frac{dE_2}{dt}&= \sigma n E_1 +\alpha E_3 - b_1 \lambda E_2 - (r_0+\mu )E_2, \\ \frac{dE_3}{dt}&= \sigma (1 - n) E_1-b_2\lambda E_3 - (\alpha + k+\mu )E_3,\\ \frac{dI}{dt}&= p\lambda (S + b_3T)+ b_1\lambda E_2 + b_2 \lambda E_3 + k E_3 - (\kappa +\delta _1+\mu )I, \\ \frac{dJ_1}{dt}&= \kappa q I - (r_1+\delta _2+\mu )J_1,\\ \frac{dJ_2}{dt}&= \kappa (1 - q) I - (r_2+\delta _3+\mu )J_2, \\ \frac{dT}{dt}&= r_0 E_2 +r_1J_1 +r_2J_2 - b_3\lambda T - \mu T, \end{aligned} \end{aligned}$$
(2.1)

where the force of infection \(\lambda \) is given by

$$\begin{aligned} \lambda = \frac{\beta \big (I+\eta _1J_1 + \eta _2J_2 \big )}{N}. \end{aligned}$$
(2.2)
Fig. 1
figure 1

Schematic diagram of the treatment model (2.1), where \(\lambda \) is the force of infection given in (2.2)

Since the model (2.1) monitors human population, it is important that all its state variables and associated parameters are non-negative for all time, t. Hence, the following non-negativity result holds for the state variables in the model (2.1).

Theorem 2.1

Let the initial data for the TB model (2.1) be \(S(0)>0, E_1(0)>0, E_2(0)>0, E_3(0)>0, I(0)>0, J_1(0)>0, J_2(0)>0~\) and \(~T(0)>0~\). Then the solutions \((S(t), E_1(t), E_2(t), E_3(t), I(t), J_1(t), J_2(t), T(t))\) of the model (2.1), with positive initial data, will remain positive for all time \(t>0.\)

Proof

Let \(t_1 = \sup \{t>0 : S(0)>0, E_1(0)>0, E_2(0)>0, E_3(0)>0, I(0)>0, J_1(0)>0, J_2(0)>0, T(0)>0\}.\)

Table 1 Description of variables and parameters of the TB model (2.1)
Table 2 Baseline values and ranges of the parameters of the treatment model (2.1), with the total population (N) of Nigeria as of 1 January 2016 estimated at 184,635,279 [16]

From the first equation of model (2.1), it follows that

$$\begin{aligned} \frac{dS(t)}{dt} = \Lambda - (\lambda + \mu ) S(t) \end{aligned}$$
(2.3)

which can be rewritten as

$$\begin{aligned} \frac{d}{dt} \Big [S(t) \exp \Big \{\mu t + \int _0^t \lambda (\tau ) d\tau \Big \}\Big ] = \Lambda \exp \Big \{\mu t + \int _0^t \lambda (\tau ) d\tau \Big \}. \end{aligned}$$
(2.4)

Thus,

$$\begin{aligned} S(t_1) \exp \Big \{\mu t_1 + \int _0^{t_1} \lambda (\tau ) d\tau \Big \} - S(0) = \int _0^{t_1} \Lambda \Big [ \exp \Big \{\mu y + \int _0^y \lambda (\tau ) d\tau \Big \} \Big ]dy,\quad \end{aligned}$$
(2.5)

So that,

$$\begin{aligned} S(t_1)&= S(0)\exp \Big [-\mu t_1 - \int _0^{t_1} \lambda (\tau ) d\tau \Big ] + \Big [\exp \Big \{-\mu t_1 - \int _0^{t_1} \lambda (\tau ) d\tau \Big \}\Big ]\nonumber \\&\quad \times \int _0^{t_1} \Lambda \Big [ \exp \Big \{\mu y + \int _0^y \lambda (\tau ) d\tau \Big \} \Big ] dy > 0. \end{aligned}$$
(2.6)

Similarly, it can be shown that \(E_1(t)>0, E_2(t)>0, E_3(t)>0, I(t)>0, J_1(t)>0, J_2(t) > 0\) and \(T(t) > 0\) for all time \(t>0.\) Thus, all solutions of the system (2.1) remain positive for all non-negative initial conditions. \(\square \)

Lemma 2.2

The closed set

$$\begin{aligned} \mathcal {D} = \Big \{ (S, E_1, E_2, E_3, I, J_1, J_2, T) \in \mathbb {R}^8_+ : N \le \frac{\Lambda }{\mu } \Big \} \end{aligned}$$
(2.7)

is positively-invariant and attracts all positive solutions of the treatment model (2.1).

Proof

Adding all the equations of model (2.1) gives

$$\begin{aligned} \frac{dN}{dt} = \Lambda - \mu N - (\delta _1 I + \delta _2 J_1 + \delta _3 J_3 ). \end{aligned}$$
(2.8)

Since \(\frac{dN}{dt} \le \lambda - \mu N,\) it follows that \(\frac{dN}{dt} \le 0\) if \(N(t) \ge \frac{\Lambda }{\mu } \). Hence, a standard comparison theorem [28] can be used to show that \(N(t) \le N(0) e^{-\mu t}+\frac{\Lambda }{\mu }(1-e^{-\mu t}).\) In particular, if \(N(0) \le \frac{\Lambda }{\mu }\), then \(N(t) \le \frac{\Lambda }{\mu }\) for all \(t>0.\) Hence, the domain \(\mathcal {D}\) is positively invariant. Furthermore, if \(N(0)>\frac{\Lambda }{\mu },\) then either the solution enters the domain \(\mathcal {D}\) in finite time or N(t) approaches \(\frac{\Lambda }{\mu }\) asymptotically as \(t \rightarrow \infty .\) Hence, the domain \(\mathcal {D}\) attracts all solutions in \(\mathbb {R}^8_+.\)\(\square \)

Since the domain \(\mathcal {D}\) is positively-invariant, it is enough to investigate the dynamics of the flow generated by the model (2.1) in \(\mathcal {D}\). Hence the model (2.1) is both mathematically and epidemiologically well posed.

3 Local asymptotic stability of disease-free equilibrium

The disease-free equilibrium (DFE) of the model (2.1) is given by

$$\begin{aligned} \mathcal {E}_0 = \left( S^0, E_1^0, E_2^0, E_3^0, I^0, J_1^0, J_2^0, T^0\right) = \left( \frac{\Lambda }{\mu },0,0,0,0,0,0,0 \right) . \end{aligned}$$
(3.1)

The local asymptotic stability (LAS) of the DFE is shown using the next generation operator method [18, 57]. Following similar procedure as in [57], the computed effective reproduction number associated with the model (2.1) is given by

$$\begin{aligned} \mathcal {R}_T = \beta \frac{ \big [ a_{11} a_{33} p + k(1-n)(1-p)\sigma \big ] \big [ a_{55}a_{66} + a_{66} \kappa q \eta _1 + a_{55} \kappa (1-q) \eta _2 \big ]}{a_{11} a_{33} a_{44}a_{55} a_{66}}, \end{aligned}$$
(3.2)

where \( a_{11}=(\sigma +\mu )\), \(a_{22}=(r_0+\mu )\), \(a_{33}=(\alpha +k+\mu )\), \(a_{44}=(\kappa +\delta _1+\mu )\), \(a_{55}=(r_1+\delta _2+\mu )\), and \(a_{66}=(r_2+\delta _3+\mu )\).

Using Theorem 2 in [57] we have the following result:

Lemma 3.1

The DFE (\(\mathcal {E}_0\)) of the model (2.1), given by (3.1), is locally asymptomatically stable (LAS) if \(\mathcal {R}_T < 1\) and unstable if \(\mathcal {R}_T > 1.\)

The threshold quantity, \(\mathcal {R}_T,\) measures the average number of new TB infections generated by a single actively-infected individual in a wholly susceptible populations where there exist control measures such as treatment. The epidemiological implication of Lemma 3.1 is that TB can be eliminated from the community when the effective reproduction numnber is less than unity (\(\mathcal {R}_T < 1\)) if the initial sizes of the subpopulations of the model (2.1) are in the basin of attraction of the DFE (\(\mathcal {E}_0\)). That is, a small influx of TB-infected persons into the community will not generate a large TB outbreak in the community, and the disease will eventually die out in time.

3.1 Analysis of reproduction number (\(\mathcal {R}_T\))

Using the threshold quantity, \(\mathcal {R}_T,\) in (3.2), we wish to investigate the effect of diagnosis of latently-infected individuals and delay in the treatment of active TB cases on the dynamics of the disease in a population, and thus glean effective control strategies involving parameters that characterize these effects.

It can be seen from (3.2), that

$$\begin{aligned}&\lim _{\begin{array}{c} n \rightarrow 1 \\ \sigma \rightarrow \infty \end{array}} \mathcal {R}_T \nonumber \\&\quad = \frac{\beta p [(r_1+\delta _2+\mu ) (r_2+\delta _3+\mu ) + \kappa q (r_2+\delta _3+\mu ) \eta _1 + \kappa (1-q) (\delta _2+r_1+\mu ) \eta _2 ]}{(\kappa +\delta _1+\mu ) (r_1+\delta _2+\mu ) (r_2+\delta _3+\mu )}. \end{aligned}$$
(3.3)

The limit in (3.3) implies that a TB control programme that focuses on correct diagnosis of a large fraction of new latent TB infections (\(n \rightarrow 1\)) at a high rate (\(\sigma \rightarrow \infty \)) can lead to effective TB control provided it results in making the right-hand sides of (3.3) less than unity.

From (3.2), it can also be seen that

$$\begin{aligned} \lim _{\begin{array}{c} q \rightarrow 1 \\ \kappa \rightarrow \infty \end{array}} \mathcal {R}_T = \frac{\beta [p (\sigma +\mu ) (\alpha +k+\mu ) + \sigma k (1-n)(1-p) ] \eta _1}{ (\sigma +\mu ) (\alpha +k+\mu ) ( r_1+\delta _2 +\mu ) }. \end{aligned}$$
(3.4)

The limit in (3.4) implies that a TB programme that combines prompt treatment of a large fraction of detected active TB cases (\(q \rightarrow 1\)) at a high rate (\(\kappa \rightarrow \infty \)) can lead to effective TB control provided it results in making the right-hand sides of (3.4) less than unity.

In the same vein, we can see that

$$\begin{aligned} \lim _{\begin{array}{c} n \rightarrow 1 \\ q \rightarrow 1 \end{array}} \mathcal {R}_T = \frac{\beta p [ ( r_1+\delta _2 +\mu ) + \kappa \eta _1]}{(\kappa +\delta _1+\mu ) ( r_1+\delta _2 +\mu ) }. \end{aligned}$$
(3.5)

The limit in (3.5) implies that a TB control programme that focuses on correct diagnosis of a large fraction of new latent TB cases (\(n \rightarrow 1\)) and prompt treatment of a large fraction of detected active TB cases (\(q \rightarrow 1\)) can lead to effective TB control provided it results in making the right-hand sides of (3.5) less than unity.

Furthermore, we see that

$$\begin{aligned}&\lim _{\begin{array}{c} \sigma \rightarrow \infty \\ \kappa \rightarrow \infty \end{array}} \mathcal {R}_T \nonumber \\&\quad = \frac{\beta [p (\alpha +k+\mu ) + k (1-n)(1-p) ] [q (r_2+\delta _3+\mu ) \eta _1 + (1-q)(r_1+\delta _2 +\mu ) \eta _2 ]}{(\alpha +k+\mu ) ( r_1+\delta _2 +\mu ) (r_2+\delta _3+\mu )}. \end{aligned}$$
(3.6)

The result in (3.6) implies that a TB control programme that focuses on high rate of diagnosis of latent TB cases and high rate of detection of active TB cases (\(\kappa \rightarrow \infty \)) can lead to effective TB control provided it results in making the right-hand sides of (3.6) less than unity.

Finally, using (3.2), it can be seen that

$$\begin{aligned} \lim _{\begin{array}{c} n \rightarrow 1 \\ q \rightarrow 1 \\ \sigma \rightarrow \infty \\ \kappa \rightarrow \infty \end{array}} \mathcal {R}_T&= \frac{\beta p \eta _1}{ (r_1+\delta _2 +\mu )} > 0. \end{aligned}$$
(3.7)

The limit in (3.7) implies that a TB control programme that focuses on effective combination of diagnosis of a large fraction of new latent TB infections, detection of a large fraction of active TB cases with prompt treatment, high rate of diagnosis of latent TB infections, and high rate of detection of active TB cases (i.e., \(n \rightarrow 1, q \rightarrow 1, \sigma \rightarrow \infty , \kappa \rightarrow \infty \)) can lead to effective TB control if it results in making the right-hand side of (3.7) less than unity. Hence, control strategies that results in any of the limiting expressions in (3.3) to (3.7) to be less than unity will lead to a reduction in the TB burden in the population.

Further sensitivity analysis on some key parameters associated with diagnosis and treatment of latent TB infections and active TB cases (i.e., \(n, q, \kappa , r_1,\) and \( r_2\)) in the model (2.1) are carried out by computing the partial derivatives of \(\mathcal {R}_T\) with respect to these parameters.

The partial derivative of \(\mathcal {R}_T\) with respect to the fraction of new latent TB infections who got diagnosed (n) yields

$$\begin{aligned}&\frac{\partial \mathcal {R}_T }{\partial n} \nonumber \\&\quad = - \frac{\beta k (1-p) \sigma [(r_1+\delta _2 +\mu ) (r_2+\delta _3+\mu ) + q \kappa (r_2+\delta _3+\mu ) \eta _1 + \kappa (1-q) (r_1+\delta _2 +\mu ) \eta _2 ]}{(\sigma +\mu ) (\alpha +k+\mu ) (\kappa +\delta _1+\mu ) (r_1+\delta _2 +\mu ) (r_2+\delta _3+\mu )}. \end{aligned}$$
(3.8)

Clearly, it follows that \(\frac{\mathcal {R}_T}{\partial n} < 0\) unconditionally. Hence, increasing the fraction of new latent TB who got diagnosed, will have a positive impact in reducing TB burden in a population, regardless of the values of other parameters in the expression for \(\mathcal {R}_T\).

Considering the rate of change of (3.2) with respect the fraction of detected and promptly treated active TB cases (q), we have

$$\begin{aligned}&\frac{\partial \mathcal {R}_T }{\partial q} \nonumber \\&\quad = \frac{\beta [p (\sigma +\mu ) (\alpha +k+\mu ) + k(1-n)(1-p) \sigma ][ \kappa (r_2+\delta _3+\mu ) \eta _1 - \kappa (r_1+\delta _2 +\mu ) \eta _2 ]}{(\sigma +\mu ) (\alpha +k+\mu ) (\kappa +\delta _1+\mu ) (r_1+\delta _2 +\mu ) (r_2+\delta _3+\mu ) }. \end{aligned}$$
(3.9)

It follows from (3.9) that \(\frac{\mathcal {R}_T}{\partial q} < 0\) if

$$\begin{aligned} \eta _1 < \triangle _1 = \frac{(r_1+\delta _2 +\mu ) \eta _2}{(r_2+\delta _3+\mu )} \quad \text {or} \quad \eta _2 > \triangle _2 = \frac{(r_2+\delta _3+\mu ) \eta _1}{(r_1+\delta _2 +\mu )}. \end{aligned}$$
(3.10)

Equation (3.10) implies that the detection (and prompt treatment) of a fraction of active TB cases will have a positive impact in reducing TB burden in the community only if \(\eta _1 < \triangle _1\) (or \(\eta _2 > \triangle _2\)). Such control strategy will fail to reduce TB burden in the community if \(\eta _1 = \triangle _1\) (or \(\eta _2 = \triangle _2\)), and will have a detrimental impact in the community if \(\eta _1 > \triangle _1\) (or \(\eta _2 < \triangle _2\)), since this will lead to an increase in the reproduction number \( \mathcal {R}_T\). This results are summarized below:

Lemma 3.2

The detection (and prompt treatment) of a fraction of actively-infected TB cases will have a positive impact in reducing TB burden in the community only if \(\eta _1 < \triangle _1\) (or \(\eta _2 > \triangle _2\)), no impact if \(\eta _1 = \triangle _1\) (or \(\eta _2 = \triangle _2\)), and a detrimental impact if \(\eta _1 > \triangle _1\) (or \(\eta _2 < \triangle _2\)).

Considering the rate of change of (3.2) with respect to the rate of detection of active TB cases (\(\kappa \)), it follows that

$$\begin{aligned} \frac{\partial \mathcal {R}_T }{\partial \kappa }&= \frac{\beta [p a_{11} a_{33} + k(1-n)(1-p) \sigma ]}{a_{11}a_{33}a_{55}a_{66} (\kappa + \delta _1 + \mu )^2} \nonumber \\&\quad \times \Big [ (\delta _1+\mu ) [ q a_{66} \eta _1 + a_{55} (1-q) \eta _2 ] - a_{55}a_{66} \Big ] , \end{aligned}$$
(3.11)

where \(a_{11}, a_{22}, a_{33}, a_{44}, a_{55}\), and \(a_{66}\) are as defined in (3.2). From (3.11) it can be seen that \(\frac{\mathcal {R}_T}{\partial \kappa } < 0\) if

$$\begin{aligned} \frac{1}{(\delta _1 + \mu )}&> \triangle _3 = \triangle _{3_1} + \triangle _{3_2}, \quad \text {where} \quad \triangle _{3_1} = \frac{q \eta _1 }{ r_1+\delta _2+\mu }, \quad \text {and} \nonumber \\ \triangle _{3_2}&= \frac{ (1-q) \eta _2}{ r_2+\delta _3+\mu }. \end{aligned}$$
(3.12)

The expression in \(\triangle _{3_1}\) is the product of the fraction of detected active TB cases who receive prompt treatment (q) together with the modification parameter \((\eta _1)\) that accounts for the reduced likelihood of disease transmission by individuals who are diagnosed with active TB and promptly treated (\(J_1\)) in comparison to actively-infected individuals who were not detected (I), and the average time spent in the \(J_1\) class. The expression in \(\triangle _{3_2}\) is the product of the fraction of detected active TB cases with delay in commencing treatment \((1-q)\), the modification parameter \((\eta _2)\) that accounts for the reduced likelihood of disease transmission by individuals who are diagnosed with active TB but delay in commencing treatment (\(J_2\)) in comparison to actively-infected individuals who were not detected (I), and the average time spent in the \(J_2\) class. The expression in (3.12) implies that detection of active TB cases will have a positive impact in reducing TB burden if \(\frac{1}{(\delta _1 + \mu )} > \triangle _3\). Such a strategy will have no effect in reducing TB burden if \(\frac{1}{(\delta _1 + \mu )} = \triangle _3\), and will have a detrimental impact in the community if \(\frac{1}{(\delta _1 + \mu )} < \triangle _3\). This result is summed up in the lemma below:

Lemma 3.3

The detection of active TB cases will have a positive impact in reducing TB burden in the community only if \(\frac{1}{(\delta _1 + \mu )} > \triangle _3\), no impact if \(\frac{1}{(\delta _1 + \mu )}= \triangle _3\) and a negative impact if \(\frac{1}{(\delta _1 + \mu )} < \triangle _3\).

Note that \(\frac{1}{\delta _1 + \mu }\) is the average time spent in the class of undetected active TB cases when there is no diagnosis of active TB in the population. If this time spent is greater than the expression in the right hand side of (3.12), then detection (and diagnosis) of active TB cases will have a positive effect on TB population level control.

If we consider the rate of change of \(\mathcal {R}_T\) with respect to the rate at which active TB cases are promptly treated (\(r_1\)), we have

$$\begin{aligned} \frac{\partial \mathcal {R}_T }{\partial r_1 } = -\frac{\beta \Big [p (\sigma + \mu ) (\alpha +k+\mu ) + k(1-n)(1-p) \sigma \Big ] \kappa q \eta _1}{(\sigma + \mu ) (\alpha +k+\mu ) (\kappa +\delta _1+\mu ) (r_1+\delta _2+\mu )^2}. \end{aligned}$$
(3.13)

Obviously, it follows that the partial derivative in (3.13) is unconditionally less than zero, i.e., \(\frac{\mathcal {R}_T}{\partial r_1} < 0\). Hence prompt treatment of active TB cases will go a long way in reducing the TB burden in the population, regardless of the values of other parameters in the expression for \(\mathcal {R}_T\).

Similarly, if we consider the rate of change of \(\mathcal {R}_T\), (3.2) with respect to the rate at which active TB cases with delayed treatment (\(r_2\)), we have

$$\begin{aligned} \frac{\partial \mathcal {R}_T }{\partial r_2 } = -\frac{\beta \Big [p (\sigma + \mu ) (\alpha +k+\mu ) + k(1-n)(1-p) \sigma \Big ] \kappa (1-q) \eta _2}{(\sigma + \mu ) (\alpha +k+\mu ) (\kappa +\delta _1+\mu ) (r_2+\delta _2+\mu )^2}. \end{aligned}$$
(3.14)

Although the partial derivative in (3.14) is also unconditionally less than zero, i.e., \(\frac{\mathcal {R}_T}{\partial r_2} < 0\) with respect to the treatment of active TB cases with delay in treatment, however delay in treatment of active TB cases should be discouraged since this could lead to increase TB incidence in the population [31, 51, 55]. Hence, treatment of active TB (even if there is a delay) will always have a positive population-level impact on TB control, albeit, at a different rate compared to active cases promptly treated.

To further investigate the impact of diagnosis and treatment of both latent and active cases in TB control [7], contour plots of the reproduction number (\(\mathcal {R}_T\)), as a function of the fraction of new latent TB cases that got diagnosed (n) and the fraction of detected active TB cases that received prompt treatment (q), at different treatment levels, are given in Figs. 2 and 3. The parameter values used in producing the contour plots are the baseline values given in Table 2.

Figure 2 shows that at low treatment level (i.e., \(r_0 = 1.5, r_1 = 2.5, r_2 = 1.5\)), it is not possible to eradicate TB from the population since any control strategy based on such low treatment level cannot bring the reproduction number to a value less than unity.

Fig. 2
figure 2

A 2D contour plot of \(\mathcal {R}_T\) as a function of n and q at low treatment level: \(r_0 = 1.5, r_1 = 2.5, r_2 = 1.5\), and \(\beta = 15.557\)

As depicted in Fig. 3, we observe that at treatment levels that are 10 times the low treatment values above (i.e., \(r_0 = 15, r_1 = 25, r_2 = 15\)), eradicating tuberculosis from the population is achievable. For example, if almost \(100\%\) of latently-infected individuals are diagnosed (and treated) along with about \(60\%\) of active TB cases, then it is possible to eliminate TB from the population. However, this strategy will be very difficult to implement in developing countries since a TB control programme that requires diagnosis of a large percentage of latent TB will not be easy to implement in such countries with limited resources.

Fig. 3
figure 3

A 2D contour plot of \(\mathcal {R}_T\) as a function of n and q at moderate treatment level: \(r_0 = 15, r_1 = 25, r_2 = 15\), and \(\beta = 15.557\)

4 Existence of endemic equilibrium point: special case

The existence of endemic equilibria of the treatment model (2.1) is now investigated for a special case when the disease-induced death rates are assumed to be negligible (i.e., \(\delta _1 = \delta _2 = \delta _3 = 0\)). This can fits setting such as most countries in Western Europe, Canada, the United States, Australia and New Zealand where the average number of TB-induced deaths is less than 1 per 100,000 population [64].

Adding up all equations in the model (2.1), and setting the disease-induced death rates to zero yields the following equation for the rate of change of the total population,

$$\begin{aligned} \frac{dN}{dt} = \Lambda - \mu N. \end{aligned}$$
(4.1)

Hence, \(N(t) \rightarrow \frac{\Lambda }{\mu } \) as \(t \rightarrow \infty \). Using the limiting value \(N(t) = \frac{\Lambda }{\mu }\) and substitution \(\delta _1 = \delta _2 = \delta _3 = 0\) in the model (2.1), we have the following system of equations:

$$\begin{aligned} \begin{aligned} \dot{S}&= \Lambda - \tilde{\lambda } S - \mu S, \\ \dot{E_1}&= (1-p) \tilde{\lambda } (S+b_3T) - (\sigma + \mu ) E_1, \\ \dot{E_2}&= \sigma n E_1 + \alpha E_3 - b_1 \tilde{\lambda } E_2 - (r_0 + \mu ) E_2, \\ \dot{E_3}&= \sigma (1-n) E_1 - b_2 \tilde{\lambda } E_3 - (\alpha +k+\mu ) E_3,\\ \dot{I}&= p \tilde{\lambda } (S + b_3T) + b_1 \tilde{\lambda } E_2 + b_2 \tilde{\lambda } E_3 + k E_3 - (\kappa +\mu ) I, \\ \dot{J_1}&= \kappa q I - (r_1+\mu ) J_1,\\ \dot{J_2}&= \kappa (1-q) I - (r_2+\mu ) J_2, \\ \dot{T}&= r_0 E_2 + r_1J_1 + r_2J_2 - b_3 \tilde{\lambda } T - \mu T, \end{aligned} \end{aligned}$$
(4.2)

where \(~\tilde{\lambda } = \frac{\beta \mu }{\Lambda } (I+\eta _1J_1 + \eta _2J_2 \big ) \) is the force of infection.

Let

$$\begin{aligned} \mathcal {E}_2 = \left( S^{**}, E_1^{**}, E_2^{**}, E_3^{**}, I^{**}, J_1^{**}, J_2^{**}, T^{**}\right) \end{aligned}$$

denote an EEP of the model (4.2). The equations in (4.2) are solved in terms of the force of infection (\(\tilde{\lambda }^{**}\)), at steady-state, given by

$$\begin{aligned} \tilde{\lambda }^{**} = \frac{\beta \mu }{\Lambda } \left( I^{**} + \eta _1 J_1^{**} + \eta _2 J_2 ^{**}\right) . \end{aligned}$$
(4.3)

It follows that the associated reproduction number corresponding to the model (4.2), denoted by \(\mathcal {R}_{T1}\), is:

$$\begin{aligned} \mathcal {R}_{T1}= & {} \mathcal {R}_{T} |_{ \left( \delta _1 = \delta _2 = \delta _3 = 0\right) } \nonumber \\= & {} \beta \Big [ \frac{ \big [ a_{11} a_{33} p + k(1-n)(1-p)\sigma \big ] \big [ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1 + \tilde{a}_{55} \kappa (1-q) \eta _2 \big ]}{a_{11} a_{33} \tilde{a}_{44} \tilde{a}_{55} \tilde{a}_{66}} \Big ].\nonumber \\ \end{aligned}$$
(4.4)

where \( a_{11}=(\sigma +\mu )\), \(a_{22}=(r_0+\mu )\), \(a_{33}=(\alpha + k+\mu )\), \(\tilde{a}_{44}=(\kappa +\mu )\), \(\tilde{a}_{55}=(r_1+\mu )\), and \(\tilde{a}_{66}=(r_2+\mu )\).

Setting the right-hand sides of the equations in (4.2) to zero and solving for the state variables, in terms of the force of infection at steady state, yields:

$$\begin{aligned} S^{**}&= \frac{\Lambda }{\tilde{\lambda }^{**}+\mu }, \quad E_1^{**} = \frac{\tilde{\lambda }^{**} \Lambda (1-p)}{a_{11}(\tilde{\lambda }^{**}+\mu )} \Big [ 1 + \frac{\tilde{\lambda }^{**} b_3 \sigma r_0 (1-p)A_{11}}{A_{33}} \Big ]\nonumber \\&\quad + \frac{\tilde{\lambda }^{**} b_3 (1-p) (\tilde{\lambda }^{**}b_1+a_{22}) (\tilde{\lambda }^{**}b_2+a_{33}) }{\tilde{a}_{55} \tilde{a}_{66} A_{33}} I^{**}, \nonumber \\ E_2^{**}&= \frac{\tilde{\lambda }^{**} \Lambda \sigma (1-p) A_{11}}{a_{11}(\tilde{\lambda }^{**}+\mu ) (\tilde{\lambda }^{**}b_1+a_{22}) (\tilde{\lambda }^{**}b_2+a_{33})} \Big [ 1 + \frac{\tilde{\lambda }^{**} b_3 \sigma r_0 (1-p)A_{11}}{A_{33}} \Big ] \nonumber \\&\quad + \frac{\tilde{\lambda }^{**} b_3 \sigma (1-p) A_1 A_2 }{\tilde{a}_{55} \tilde{a}_{66} A_{33}} I^{**}, \nonumber \\ E_3^{**}&= \frac{\tilde{\lambda }^{**} \Lambda \sigma (1-n)(1-p)}{a_{11} (\tilde{\lambda }^{**}+\mu ) (\tilde{\lambda }^{**} b_2+a_{33})} \Big [ 1 + \frac{\tilde{\lambda }^{**} b_3 \sigma r_0 (1-p)A_{11}}{A_{33}} \Big ] \nonumber \\&\quad + \frac{\tilde{\lambda }^{**} b_3 \sigma (1-n) (1-p) (\tilde{\lambda }^{**}b_1+a_{22}) A_{22} }{\tilde{a}_{55} \tilde{a}_{66} A_{33}} I^{**}, \nonumber \\ I^{**}&= \frac{(\tilde{\lambda }^{**})^6 M_6 + (\tilde{\lambda }^{**})^5 M_5 + (\tilde{\lambda }^{**})^4 M_4 + (\tilde{\lambda }^{**})^3 M_3 + (\tilde{\lambda }^{**})^2 M_2 + \tilde{\lambda }^{**} M_1}{(\tilde{\lambda }^{**})^6 N_6 + (\tilde{\lambda }^{**})^5 N_5 + (\tilde{\lambda }^{**})^4 N_4 + (\tilde{\lambda }^{**})^3 N_3 + (\tilde{\lambda }^{**})^2 N_2 + \tilde{\lambda }^{**} N_1 + N_0}, \nonumber \\ J_1^{*}&= \frac{q \kappa }{\tilde{a}_{55}}I^{**}, ~ J_2^{**} = \frac{(1-q) \kappa }{\tilde{a}_{66}}I^{**}, \quad T^{**} = \frac{\tilde{\lambda }^{**} \Lambda \sigma r_0 (1-p) A_{11}}{ (\tilde{\lambda }^{**}+\mu ) A_{33}}\nonumber \\&\quad + \frac{a_{11} (\tilde{\lambda }^{**} b_1+a_{22}) (\tilde{\lambda }^{**}b_2+a_{33}) }{\tilde{a}_{55} \tilde{a}_{66} A_{33}} I^{**}, \end{aligned}$$
(4.5)

where

$$\begin{aligned} \begin{aligned} A_{11}&= n (\tilde{\lambda }^{**} b_2 + a_{33} )+ \alpha (1-n), A_{22} = \tilde{a}_{66}r_1 \kappa q + \tilde{a}_{55} r_2 \kappa (1-q), \\ A_{33}&= a_{11} (\tilde{\lambda }^{**}b_1+a_{22}) (\tilde{\lambda }^{**}b_2+a_{33}) (\tilde{\lambda }^{**} b_3+ \mu ) - \tilde{\lambda }^{**} b_3 \sigma r_0 (1-p) A_1, \\ M_1&= \Lambda \mu a_{11} a^2_{22} a^2_{33} \tilde{a}_{55} \tilde{a}_{66} [p a_{11} a_{33} + k(1-n)(1-p)\sigma ], M_2, \ldots , M_6, \\ N_0&= \mu ^2 a^2_{11} a^2_{22} a^2_{33} \tilde{a}_{44} \tilde{a}_{55} \tilde{a}_{66}, N_1, \ldots , N_5, \\ N_6&= a_{11} b_1^2 b_2^2 b_3 \Big [ a_{66} \Big ( \big (a_{11} a_{44} a_{55}+q \kappa ((1-p) \sigma + p a_{11} \big ) r_1 \Big )\\&\quad + \kappa (1-q) \Big ( (1-p) \sigma + p a_{11} \Big ) a_{55} r_2 \Big ]. \end{aligned} \end{aligned}$$
(4.6)

The remaining coefficients \(M_2, \ldots , M_6\) and \(N_1, \ldots , N_5\) are quite large, and have been omitted for convenience sake.

By substituting the expressions for the EEP in (4.5) [and noting (4.6)] into the force of infection at the steady state given in (4.3), it follows (after several algebraic manipulations) that the endemic equilibria of the model (4.2) satisfies the following polynomial (in terms of \( \tilde{\lambda }^{**} \)),

$$\begin{aligned} \left( \tilde{\lambda }^{**}\right) ^6 F_6 + \left( \tilde{\lambda }^{**}\right) ^5 F_5 + \left( \tilde{\lambda }^{**}\right) ^4 F_4 + \left( \tilde{\lambda }^{**}\right) ^3 F_3 + \left( \tilde{\lambda }^{**}\right) ^2 F_2 + \tilde{\lambda }^{**} F_1 + F_0 = 0. \end{aligned}$$
(4.7)

where

$$\begin{aligned} \begin{aligned} F_0&= \Lambda \mu ^2 a^2_{11} a^2_{22} a^2_{33} \tilde{a}_{44} \tilde{a}^2_{55} \tilde{a}^2_{66} \left( 1 - \mathcal {R}_{T1}\right) , \quad F_1 = N_1 \Lambda \tilde{a}_{55} \tilde{a}_{66} - \beta \mu M_2 A_{23}, \\ F_2&= N_2 \Lambda \tilde{a}_{55} \tilde{a}_{66} - \beta \mu M_3 A_{23}, \\ F_3&= N_3 \Lambda \tilde{a}_{55} \tilde{a}_{66} - \beta \mu M_4 A_{23}, \quad F_4 = N_4 \Lambda \tilde{a}_{55} \tilde{a}_{66} - \beta \mu M_5 A_{23}, \\ F_5&= N_4 \Lambda \tilde{a}_{55} \tilde{a}_{66} - \beta \mu M_6 A_{23}, \quad F_6 = N_6 \Lambda \tilde{a}_{55} \tilde{a}_{66}. \end{aligned} \end{aligned}$$
(4.8)

The components of the EEP are then obtained by solving for \(\tilde{\lambda }^{**}\) from the polynomial (4.7), and substituting the positive values of \(\tilde{\lambda }^{**}\) into the expressions in (4.5) [noting (4.6)]. Furthermore, it follows from (4.8) that the coefficient \(F_6\), is always positive and \(F_0\) is positive (negative) if \(\mathcal {R}_{T1}\) is less (greater) than unity. The following results can be deduced.

Theorem 4.1

The TB model (4.2) with \(\delta _1 = \delta _2 = \delta _3 = 0\) has:

  1. (i)

    six or four endemic equilibria if \(F_5< 0, F_4> 0, F_3< 0, F_2 > 0, F_1 < 0\) and \(\mathcal {R}_{T1} < 1\),

  2. (ii)

    four or two endemic equilibria if \(F_5> 0, F_4< 0, F_3> 0, F_2 < 0, F_1 > 0\) and \(\mathcal {R}_{T1} < 1\),

  3. (iii)

    two endemic equilibria if \(F_5> 0, F_4> 0, F_3< 0, F_2 < 0, F_1 > 0\) and \(\mathcal {R}_{T1} < 1\),

  4. (iv)

    no endemic equilibrium otherwise, when \(\mathcal {R}_{T1} < 1\).

Items (i)–(iii) of Theorem 4.1 suggests the possibility of backward bifurcation in the treatment model (4.2) with negligible TB-induced deaths (i.e., \(\delta _1 =\delta _2 = \delta _3 = 0\)) when \(\mathcal {R}_{T1} < 1\). The phenomenon of backward bifurcation is characterised by the co-existence of a stable DFE and a stable EEP when the associated reproduction number of the model is less than unity. The epidemiological implication of backward bifurcation is that the classical requirement of having the the reproduction number less than unity, while necessary, is no longer sufficient for effective TB control. Such control measures will now be dependent on the initial sizes of the sub-population of the model [24].

5 Bifurcation analysis

In this section, we characterize the type of bifurcation exhibited by the models (4.2). We claim the following result:

Theorem 5.1

The model (4.2) with \(\delta _1 = \delta _2 = \delta _3 = 0~\) exhibit backward bifurcation at \(\mathcal {R}_{T1}=1\) whenever a bifurcation coefficient, denoted by a [given by (5.9)], is positive.

Proof

Let

$$\begin{aligned} \mathcal {E}_a = \left( S^{**}, E_1^{**}, E_2^{**}, E_2^{**}, E_3^{**}, I^{**}, J_1^{**}, J_2^{**}, T^{**} \right) \end{aligned}$$
(5.1)

represent an arbitrary endemic equilibrium point of the complete model system in (2.1). The existence of backward bifurcation is investigated using the Center Manifold Theory [11, 12, 19, 57]. Before applying this theory, it is convenient to carry out the following change of variables.

Let \(S = x_1\), \(E_1 = x_2\), \(E_2 = x_3\), \(E_3 = x_4\), \(I = x_5\), \(J_1 = x_6\), \(J_2 = x_7\), and \(T = x_8\), so that the total population \(N = \sum ^8_{i=1} x_i\).

It follows, that the model (2.1) can be re-written as

$$\begin{aligned} \begin{aligned} \dot{x}_1&\equiv f_1 = \Lambda - \lambda x_1 - \mu x_1, \\ \dot{x}_2&\equiv f_2 = (1 - p) \lambda (x_1 + b_3 x_8) - (\sigma + \mu ) x_2, \\ \dot{x}_3&\equiv f_3 = \sigma n x_2 + \alpha x_4 - b_1 \lambda x_3 - (r_0 + \mu ) x_3, \\ \dot{x}_4&\equiv f_4 = \sigma (1-n) x_2 - b_2 \lambda x_4 - (\alpha + k + \mu ) x_3, \\ \dot{x}_5&\equiv f_5 = p \lambda (x_1+ b_3 x_8) + b_1 \lambda x_3 + b_2 \lambda x_4 + kx_4 - (\kappa +\mu ) x_5, \\ \dot{x}_6&\equiv f_6 = \kappa q x_5 - ( r_1 + \mu ) x_6, \\ \dot{x}_7&\equiv f_7 = \kappa (1-q) x_5 - ( r_2 + \mu ) x_7, \\ \dot{x}_8&\equiv f_8 = r_0x_3 + r_1x_6+ r_2x_7 - b_3 \lambda x_8 - \mu x_8. \end{aligned} \end{aligned}$$
(5.2)

with

$$\begin{aligned} \lambda = \frac{\beta (x_5 + \eta _1 x_6 + \eta _2 x_7)}{\sum ^{8}_{i=1} x_i}, \end{aligned}$$
(5.3)

Consider the case with \(\beta = \beta ^*\), a bifurcation parameter. Solving for \(\beta = \beta ^*\) from \(\mathcal {R}_T = 1\) yields

$$\begin{aligned} \beta = \beta ^* = \frac{a_{11} a_{33} \tilde{a}_{44} \tilde{a}_{55} \tilde{a}_{66}}{ \big [ a_{11} a_{33} p + \sigma (1-n)(1-p) k \big ] \big [ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} q \kappa \eta _1 + \tilde{a}_{55} \kappa (1-q)\eta _2 \big ]},\nonumber \\ \end{aligned}$$
(5.4)

where \( a_{11}= (\sigma +\mu ), a_{22}= (r_0+\mu ), a_{33}= ( \alpha + k + \mu ), \tilde{a}_{44} = (\kappa +\mu ), \tilde{a}_{55} = ( r_1 + \mu )\), and \(\tilde{a}_{66} = ( r_2 + \mu )\).

The Jacobian of the transformed system (5.2), evaluated at the DFE (\(\mathcal {E}_0\)) with \(\beta = \beta ^*\), is given by

$$\begin{aligned} J^*= & {} J(\mathcal {E}_0) |_{\beta = \beta ^*} \nonumber \\= & {} \begin{pmatrix} -\mu &{}\quad 0 &{}\quad 0 &{}\quad 0 &{}\quad -\beta ^* &{}\quad -\beta ^* \eta _1 &{}\quad - \beta ^* \eta _2 &{}\quad 0 \\ 0 &{}\quad -a_{11} &{}\quad 0 &{}\quad 0 &{}\quad \beta ^* (1-p) &{}\quad \beta ^* (1 - p) \eta _1 &{}\quad \beta ^* (1 - p)\eta _2 &{}\quad 0 \\ 0 &{}\quad \sigma n &{}\quad -a_{22} &{}\quad \alpha &{}\quad 0 &{}\quad 0 &{}\quad 0 &{}\quad 0 \\ 0 &{}\quad \sigma (1-n) &{}\quad 0 &{}\quad -a_{33} &{}\quad 0 &{}\quad 0 &{}\quad 0 &{}\quad 0\\ 0 &{}\quad 0 &{}\quad 0 &{}\quad k &{}\quad \beta ^*p - \tilde{a}_{44} &{}\quad \beta ^*p \eta _1 &{}\quad \beta ^* p\eta _2 &{}\quad 0 \\ 0 &{}\quad 0 &{}\quad 0 &{}\quad 0 &{}\quad \kappa q &{}\quad - \tilde{a}_{55} &{}\quad 0 &{}\quad 0\\ 0 &{}\quad 0 &{}\quad 0 &{}\quad 0 &{}\quad \kappa (1-q) &{}\quad 0 &{}\quad - \tilde{a}_{66} &{}\quad 0\\ 0 &{}\quad 0 &{}\quad r_0 &{}\quad 0 &{}\quad 0 &{}\quad r_1 &{}\quad r_2 &{}\quad -\mu \\ \end{pmatrix}.\nonumber \\ \end{aligned}$$
(5.5)

The matrix \(J^*\) has a simple zero eigenvalue (a center) and all other eigenvalues having negative real part (thus, the Center Manifold Theory can be applied [11, 12, 19, 57]. Moreover, \(J^*\) has a right eigenvector given by \(\mathbf {w} = (w_1, w_2, \ldots , w_8)^T\), where

$$\begin{aligned} \begin{aligned} w_1&= - \beta ^* \Big ( \frac{ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1+ \tilde{a}_{55} \kappa (1-q) \eta _2}{ \tilde{a}_{55} \tilde{a}_{55}\mu } \Big ) w_5 < 0,\\ w_2&= \beta ^* (1-p) \Big ( \frac{ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1+ \tilde{a}_{55} \kappa (1-q) \eta _2 }{a_{11} \tilde{a}_{55} \tilde{a}_{66} } \Big ) w_5> 0,\\ w_3&= \beta ^* \sigma (1-p) [n a_{33} + \alpha (1 - n) ] \Big ( \frac{ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1+ \tilde{a}_{55} \kappa (1-q) \eta _2 }{a_{11} a_{22} a_{33} \tilde{a}_{55} \tilde{a}_{66} } \Big ) w_5> 0, \\ w_4&= \beta ^* \sigma (1 - n) (1 - p) \Big ( \frac{ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1+ \tilde{a}_{55} \kappa (1-q) \eta _2 }{a_{11} \tilde{a}_{55} \tilde{a}_{66} } \Big ) w_5> 0,\\ w_5&= w_5> 0, \quad w_6 = \frac{ \kappa q}{ \tilde{a}_{55} }w_5> 0, \quad w_7 = \frac{ \kappa (1-q) }{ \tilde{a}_{55} }w_5> 0,\\ w_8&= \beta ^* \sigma r_0 (1-p) [n a_{33} + \alpha (1 - n) ] \Big ( \frac{ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1+ \tilde{a}_{55} \kappa (1-q) \eta _2 }{ a_{11} a_{22} a_{33} \tilde{a}_{55} \tilde{a}_{66} \mu } \Big ) w_5 \\&\quad + \Big ( \frac{ \tilde{a}_{66} r_1 \kappa q + \tilde{a}_{55} r_2 \kappa (1 - q) }{ \tilde{a}_{55} \tilde{a}_{66} \mu } \Big ) w_5 > 0. \end{aligned} \end{aligned}$$
(5.6)

Similarly \(J^*\) has a left eigenvector \(\mathbf {v} = (v_1, v_2, \ldots , v_8)\), satisfying \(\mathbf {v.w}=1\), with

$$\begin{aligned} \begin{aligned} v_1&= 0, \quad v_2 = v_2> 0, \quad v_3 = 0, \quad v_4 = \frac{a_{11}}{\sigma (1-n)}v_2> 0, \quad v_5 = \frac{a_{11} a_{33}}{\sigma k(1-n)}v_2> 0,\\ v_6&= \beta ^* \Big (\frac{ \sigma (1 - n)(1 - p) k + p a_{11} a_{33} }{\tilde{a}_{55} \sigma k (1-n) } \Big ) \eta _1 v_2> 0,\\ v_7&= \beta ^* \Big (\frac{ \sigma (1 - n)(1 - p) k + p a_{11} a_{33} }{\tilde{a}_{66}\sigma k (1-n) } \Big ) \eta _2 v_2 > 0,\\ v_8&= 0. \end{aligned} \end{aligned}$$
(5.7)

It follows from Theorem 4.1 in [12], by computing the the associated non-zero partial derivatives of f(x) (evaluated at the DFE (\(\mathcal {E}_0\)) with \(\beta = \beta ^*\)) that the associated bifurcation coefficients, a and b, are given by

$$\begin{aligned} a = \sum ^n_{k,i,j=1} v_k w_i w_j \frac{\partial ^2f_k}{\partial x_i \partial x_j}(0,0), \quad \text {and} \quad b = \sum ^n_{k,i=1} v_k w_i \frac{\partial ^2f_k}{\partial x_i \partial \beta ^*}(0,0), \end{aligned}$$
(5.8)

which leads to

$$\begin{aligned} a = \frac{2 \beta ^* \mu v_5}{\Lambda } \big [ b_1 Q_{22} + b_2 Q_{33} \big ] - \frac{2 \beta ^* \mu }{\Lambda } \Big [ (1 - p) v_2 Q_{11} + p v_5 Q_{11} + b_2 v_4 Q_{33} \Big ],\nonumber \\ \end{aligned}$$
(5.9)

and

$$\begin{aligned} b = (w_5 + w_6\eta _1 + w_7\eta _2) \big [ p v_5 + (1 - p) v_2 \big ] > 0. \end{aligned}$$
(5.10)

with

$$\begin{aligned} \begin{aligned} Q_{11}&= w_2w_5 + w_2w_6\eta _1 + w_2w_7\eta _2 + w_3w_5 + w_3w_6\eta _1 + w_3w_7\eta _2 + w_4w_5\\&\quad + w_4w_6\eta _1 + w_4w_7\eta _2 + w_5w_5 + w_5w_6(1 + \eta _1) \\&\quad + w_5w_7(1 + \eta _2) - w_5w_8(b_3 - 1) + w_6w_6 \eta _1 + w_6w_7(\eta _1 + \eta _2)\\&\quad - w_6w_8(b_3 - 1)\eta _1 + w_7w_7\eta _2 - w_7w_8(b_3 - 1)\eta _2 \\ Q_{22}&= w_3 w_5 + w_3 w_6 \eta _1 + w_3 w_7 \eta _2, \quad Q_{33} = w_4 w_5 + w_4 w_6 \eta _1 + w_4 w_7 \eta _2. \end{aligned} \end{aligned}$$
(5.11)

Clearly, \(b > 0\) for all biologically feasible parameters. Hence, the direction of the bifurcation at \(\beta = \beta ^*\) (i.e., \(\mathcal {R}_T = 1\)) depends only on the sign of a. Looking at (5.11), we have that \(Q_{22}\) and \(Q_{22}\) are clearly positive. It can easily be shown that \(Q_{33}\) is also positive.

Hence, we have that the existence of of backward bifurcation occurs if and only if the rates of exogenous reinfection, \(b_1\) and \(b_2\), are large enough such that the positive part of (5.9) dominates the negative part. Thus backward bifurcation occurs if and only if:

$$\begin{aligned} \frac{2 \beta ^* \mu v_5}{\Lambda } \big [ b_1 Q_{22} + b_2 Q_{33} \big ] > \frac{2 \beta ^* \mu }{\Lambda } \Big [ (1 - p) v_2 Q_{11} + p v_5 Q_{11} + b_2 v_4 Q_{33} \Big ].\nonumber \\ \end{aligned}$$
(5.12)

\(\square \)

The backward bifurcation exhibited by the model (4.2) is shown in Fig. 4.

Fig. 4
figure 4

Backward bifurcation diagram for the TB model (4.2), showing the force of infection against the effective reproduction number (\(\mathcal {R}_{T1}\))

5.1 Non-existence of backward bifurcation

Consider the special case of the model (4.2) with \(b_1 = b_2 = 0\) (i.e., negligible exogenous re-infection). It can be shown that, in this case, the backward bifurcation coefficient, a, given by (5.9) (and noting that \(\beta ^*>0\) and that all parameters in the model (2.1) are positive) reduces to:

$$\begin{aligned} \begin{aligned} a = - \frac{2 \beta ^* \mu }{\Lambda } \Big [ pv_5 Q_1+ (1-p)v_2 Q_1\Big ] < 0, \end{aligned} \end{aligned}$$
(5.13)

where

$$\begin{aligned} \begin{aligned} Q_1&= w_2w_5 + w_2w_6\eta _1 + w_2w_7\eta _2 + w_3w_5 + w_3w_6\eta _1 + w_3w_7\eta _2 + w_4w_5 + w_4w_6\eta _1\\&\quad + w_4w_7\eta _2 + w_5w_5 + w_5w_6(1 + \eta _1) \\&\quad + w_5w_7(1 + \eta _2) - w_5w_8(b_3 - 1) + w_6w_6 \eta _1 + w_6w_7(\eta _1 + \eta _2)\\&\quad - w_6w_8(b_3 - 1)\eta _1 + w_7w_7\eta _2 - w_7w_8(b_3 - 1)\eta _2 \end{aligned} \end{aligned}$$
(5.14)

and \(w_i, i=1,\ldots ,8\) are given above in Sect. 5.

Hence, it follows from Theorem 4.1 in [12] that the model (4.2), does not undergo a backward bifurcation if \(b_1 = b_2 = 0\) (i.e., in the absence of exogenous re-infection). Hence, this study has confirmed that the presence of exogenous re-infection induces backward bifurcation in the transmission dynamics of TB. This corroborate results from other TB models, that exogenous re-infection can cause the backward bifurcation phenomenon [12, 23, 41, 45].

6 Global asymptotic stability of EEP: special case

Consider the special case of the model (2.1) with \(\alpha = p = b_1 = b_2 = b_3= \delta _1 = \delta _2 = \delta _3 = 0\), given by

$$\begin{aligned} \dot{S}&= \Lambda - \lambda S - \mu S, \nonumber \\ \dot{E_1}&= \lambda S - (\sigma +\mu ) E_1, \nonumber \\ \dot{E_2}&= \sigma n E_1 - (r_0+\mu ) E_2,\nonumber \\ \dot{E_3}&= \sigma (1 - n) E_1 - ( k + \mu ) E_3,\nonumber \\ \dot{I}&= k E_3 - (\kappa +\mu ) I, \nonumber \\ \dot{J_1}&= \kappa q I - ( r_1 + \mu ) J_1,\nonumber \\ \dot{J_2}&= \kappa (1 - q) I - ( r_2 + \mu ) J_2, \nonumber \\ \dot{T}&= r_0 E_2 +r_1J_1 +r_2J_2 - \mu T, \end{aligned}$$
(6.1)

where \(\lambda = \frac{\beta (I+\eta _1J_1 + \eta _2J_2 )}{N}\) is the force of infection. It follows that the associated reproduction number corresponding to the model (6.1), denoted by \(\mathcal {R}_{T2}\), is given by

$$\begin{aligned} \mathcal {R}_{T2}= & {} \mathcal {R}_{T} |_{ ( \alpha = p = b_1 = b_2 = b_3 = \delta _1 = \delta _2 = \delta _3 = 0 ) }\nonumber \\= & {} \beta \Big ( \frac{ \sigma k (1-n) \big [ \tilde{a}_{55} \tilde{a}_{66} + \tilde{a}_{66} \kappa q \eta _1 + \tilde{a}_{55} \kappa (1-q) \eta _2 \big ]}{a_{11} a_{33} \tilde{a}_{44} \tilde{a}_{55} \tilde{a}_{66}} \Big ), \end{aligned}$$
(6.2)

Let

$$\begin{aligned} \mathcal {E}_3 = \left( S^{**}, E_1^{**}, E_2^{**}, E_3^{**}, I^{**}, J_1^{**}, J_2^{**}, T^{**}\right) , \end{aligned}$$
(6.3)

denote an EEP of the model (6.1).

Since \( \delta _1 = \delta _2 = \delta _3 = 0,\) then the total population N(t) in model (6.1) at any time t tends to a limiting value \(\frac{\Lambda }{\mu }\), i.e., \(N(t) \rightarrow \frac{\Lambda }{\mu }\) as \(t \rightarrow \infty \). Let \(\tilde{\beta } = \frac{\mu \beta }{\Lambda }\) so that

$$\begin{aligned} \begin{aligned} \tilde{\lambda } = \tilde{\beta } \left( I + \eta _1 J_1 + \eta _1 J_2\right) . \end{aligned} \end{aligned}$$
(6.4)

is the force of infection for the model (6.1). The system of equations in (6.1) are solved in terms of the force of infection (4.3) to obtain

$$\begin{aligned} \begin{aligned} S^{**}&= \frac{\Lambda }{ \tilde{\lambda }^{**}+\mu }, \quad E_1^{**} = \frac{\tilde{\lambda }^{**}\Lambda }{a_{11}\left( \tilde{\lambda }^{**}+\mu \right) }, \quad E_2^{**} = \frac{\tilde{\lambda }^{**} \Lambda \sigma n }{a_{11}a_{22}\left( \tilde{\lambda }^{**}+\mu \right) }, \\ E_3^{**}&= \frac{\tilde{\lambda }^{**} \Lambda \sigma \left( 1-n\right) }{a_{11}a_{33}\left( \tilde{\lambda }^{**}+\mu \right) },\quad I^{**} = \frac{ \tilde{\lambda }^{**} \Lambda \sigma k \left( 1-n\right) }{a_{11}a_{33}\tilde{a}_{44}\left( \tilde{\lambda }^{**} +\mu \right) },\\ J_1^{**}&= \frac{ \tilde{\lambda }^{**} \Lambda k \sigma q \kappa \left( 1-n\right) }{a_{11}a_{33}\tilde{a}_{44}\tilde{a}_{55} \left( \tilde{\lambda }^{**} +\mu \right) }, \quad J_2^{**} = \frac{ \tilde{\lambda }^{**} \Lambda k \sigma \kappa \left( 1-n\right) \left( 1-q\right) }{a_{11}a_{33}\tilde{a}_{44}\tilde{a}_{66}\left( \tilde{\lambda }^{**} +\mu \right) }, \\ T^{**}&= \frac{ \tilde{\lambda }^{**} \Lambda \sigma \big [ n r_0 a_{33} \tilde{a}_{44} \tilde{a}_{55} + k q \kappa r_1 a_{22} \tilde{a}_{66} \left( 1-n\right) + k r_2 \kappa a_{22} \tilde{a}_{55} \left( 1-n\right) \left( 1-q\right) \big ]}{a_{11}a_{22}a_{33} \tilde{a}_{44}\tilde{a}_{55} \tilde{a}_{66} \mu \left( \tilde{\lambda }^{**} + \mu \right) }, \end{aligned} \end{aligned}$$
(6.5)

where \( a_{11}= (\sigma +\mu ) \), \(a_{22}= (r_0+\mu ) \), \(a_{33}= ( k + \mu ) \), \(\tilde{a}_{44}= (\kappa +\mu ) \), \(\tilde{a}_{55}= (r_1 + \mu ) \), \(\tilde{a}_{66}= ( r_2 + \mu ) \), and \(\tilde{\lambda }^{**} = \frac{\tilde{\beta } \big (I^{**} + \eta _1J_1^{**} + \eta _2J_2^{**} \big )}{N}\) is the force of infection at the EEP. Substituting the expressions in (6.5) into the force of infection at the EEP yields a unique value for \(\tilde{\lambda }^{**}\), which is:

$$\begin{aligned} \begin{aligned} \tilde{ \lambda }^{**} = \mu ( \mathcal {R}_{T2} - 1 )> 0, \quad \text {whenever} \quad \mathcal {R}_{T2} > 1. \end{aligned} \end{aligned}$$
(6.6)

Hence, the following result has been established.

Lemma 6.1

The model (6.1), with \(\alpha = p = b_1 = b_2 = b_3= \delta _1 = \delta _2 = \delta _3 = 0\), has a unique EEP, given by \(\mathcal {E}_3\), whenever \(\mathcal {R}_{T2} > 1\).

Furthermore, let [the stable manifold of the DFE of the model (6.1)] be

$$\begin{aligned} \mathcal {D}_0 = \left\{ \left( S, E_1, E_2, E_3, I, J_1, J_2, T \right) \in \mathcal {D} : E_1 = E_2 = E_3 = I = J_1 = J_2 = 0\right\} . \end{aligned}$$
(6.7)

We claim the following result.

Theorem 6.2

The unique EEP, \(\mathcal {E}_3\), of the reduced model (6.1), with \(\alpha = p = b_1 = b_2 = b_3= \delta _1 = \delta _2 = \delta _3 = 0\), is GAS in \(\mathcal {D} {\setminus } \mathcal {D}_0\) whenever \(\mathcal {R}_{T2} > 1\).

Proof

Consider the model (6.1) with (6.4). Further, let \( \mathcal {R}_{T2} > 1\) (so that the endemic equilibrium \(\mathcal {E}_4\) exists, in line with Lemma 6.1). Consider the following non-linear Lyapunov function

$$\begin{aligned} \begin{aligned} \mathcal {Z}&= S - S^{**} - S^{**} \ln \left( \frac{S}{ S^{**}} \right) + E_1 - E_1^{**} - E_1^{**} \ln \left( \frac{E_1}{ E_1^{**}} \right) \\&\quad + E_2- E_2^{**} - E_2^{**} \ln \left( \frac{E_2}{ E_2^{**}} \right) \\&\quad + E_3 - E_3^{**} - E_3^{**} \ln \left( \frac{E_3}{ E_3^{**}}\right) + L_1 \Big [ I - I^{**} - I^{**} \ln \left( \frac{I}{ I^{**}} \right) \Big ] \\&\quad + L_2 \Big [ J_1 - J_1^{**} - J_1^{**} \ln \left( \frac{J_1}{J_1^{**}} \right) \Big ] + L_3 \Big [ J_2 - J_2^{**} - J_2^{**} \ln \left( \frac{J_2}{ J_2^{**}} \right) \Big ], \end{aligned} \end{aligned}$$
(6.8)

where

$$\begin{aligned} L_1= & {} \tilde{\beta }S^{**} \left( \frac{\tilde{a}_{55} \tilde{a}_{66} + q \kappa \tilde{a}_{66} \eta _1 + \tilde{a}_{55} \kappa (1-q)\eta _2 }{ \tilde{a}_{44} \tilde{a}_{55} \tilde{a}_{66}} \right) , \quad L_2 = \tilde{\beta } \frac{ S^{**} \eta _1}{\tilde{a}_{55}}, \quad \text {and} \nonumber \\ L_3= & {} \tilde{\beta } \frac{ S^{**} \eta _2}{\tilde{a}_{66}}, \end{aligned}$$
(6.9)

and with Lyapunov derivative,

$$\begin{aligned} \begin{aligned} \mathcal {\dot{Z}}&= \Big ( 1 - \frac{S}{ S^{**}} \Big ) \dot{S} + \Big ( 1 - \frac{E_1}{ E_1^{**}} \Big ) \dot{E_1} + \Big ( 1 - \frac{E_2}{ E_2^{**}} \Big ) \dot{E_2} + \Big ( 1 - \frac{E_3}{ E_3^{**}} \Big ) \dot{E_3} \\&\quad + L_1 \Big ( 1 - \frac{I}{ I^{**}} \Big ) \dot{I} \\&\quad + L_2 \Big ( 1 - \frac{J_1}{ J_1^{**}} \Big ) \dot{J_1} + L_3 \Big ( 1 - \frac{J_2}{ J_2^{**}} \Big ) \dot{J_2}. \end{aligned} \end{aligned}$$
(6.10)

It can be shown from the model (6.1) and (6.4), at steady state, that

$$\begin{aligned} n \sigma= & {} \frac{a_{22} E_2^{**}}{E_1^{**}}, \quad (1-n) \sigma = \frac{a_{33} E_3^{**}}{E_1^{**}}, \quad k = \frac{\tilde{a}_{44} E_3^{**}}{E_1^{**}}, \quad q \kappa = \frac{\tilde{a}_{55} J_1^{**}}{I^{**}}, \nonumber \\ (1 - q) \kappa= & {} \frac{\tilde{a}_{66} J_2^{**}}{I^{**}}. \end{aligned}$$
(6.11)

Substituting the right hand sides of the first to seventh equations of (6.1) into (6.10), and using (6.11) (after several algebraic calculations) gives

$$\begin{aligned} \mathcal {\dot{Z}}&= \mu S^{**} \Big (2 - \frac{S}{S^{**}} - \frac{S^{**}}{S} \Big ) + \tilde{\beta } S^{**} I^{**} \Big (3 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{I S}{S^{**} I^{**}} \Big ) \\&\quad + \tilde{\beta } S^{**} J_1^{**} \eta _1 \Big (4 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{J_1^{**} I}{J_1 I^{**}} - \frac{S J_1}{S^{**} J_1^{**}} \Big ) \\&\quad + \tilde{\beta } S^{**} J_2^{**} \eta _2 \Big (4 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{J_2^{**} I}{J_2 I^{**}} - \frac{S J_2}{S^{**} J_2^{**}} \Big ). \end{aligned}$$

Finally, since the arithmetic mean exceeds the geometric mean, the following inequalities hold:

$$\begin{aligned} \begin{aligned}&2 - \frac{S}{S^{**}} - \frac{S^{**}}{S} \le 0, \qquad 3 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{I S}{S^{**} I^{**}} \le 0 \\&4 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{J_1^{**} I}{J_1 I^{**}} - \frac{S J_1}{S^{**} J_1^{**}} \le 0, \qquad 4 - \frac{S^{**}}{S} - \frac{I^{**}}{I} - \frac{J_2^{**} I}{J_2 I^{**}}\\&\qquad - \frac{S J_2}{S^{**} J_2^{**}} \le 0. \end{aligned} \end{aligned}$$
(6.12)

Thus, \(\mathcal {\dot{Z}} \le 0\) for \(\mathcal {R}_{T2} > 1\). Since the relevant variables in the equations for T are at the endemic steady state, it follows that these can be substituted into the equation for T in (6.1), so that

$$\begin{aligned} T(t) \rightarrow T^{**} \quad \text {as} \quad t \rightarrow \infty . \end{aligned}$$
(6.13)

Hence, \(\mathcal {Z}\) is a Lyapunov function in \(\mathcal {D} {\setminus } \mathcal {D}_0\) [29]. \(\square \)

7 Simulation

In this section, we carry out uncertainty and sensitivity analysis of all the parameters in the treatment model (2.1) using the six infected classes, the reproduction number \((\mathcal {R}_T)\), and TB incidence as response functions. Numerical simulation results of the model (2.1) are also presented.

Table 3 PRCC values for the parameters of the treatment model (2.1) using the total number of new latently-infected individuals \((E_1)\), diagnosed latently-infected individuals \((E_2)\), undiagnosed latently-infected individuals \((E_3)\), undiagnosed actively-infected individuals (I), diagnosed actively-infected individuals with prompt treatment \((J_1)\), diagnosed actively-infected individuals with delay treatment \((J_2)\), and the reproduction number \((\mathcal {R}_T)\) as response functions

7.1 Uncertainty and sensitivity analysis

There are 21 parameters in the treatment model (2.1), and uncertainties are expected to arise in estimates of the values used in the numerical simulations. In order to access the effect of these uncertainties and to determine the parameter(s) that have the greatest impact on the transmission dynamics of tuberculosis, we perform uncertainty and sensitivity analysis [8, 34, 35]. Following [8, 39], we perform Latin Hypercube Sampling (LHS) and Partial Rank Correlation Coefficients (PRCC) on the treatment model (2.1). The analysis carried out in this section is based on demographic data relevant to Nigeria [16, 64]. The range and baseline values of the parameters tabulated in Table 2 are used in the analysis.

Using the population of new latently-infected individuals \((E_1)\) as response function, it is shown in Table 3 that the top three PRCC-ranked parameters of the treatment model are: the rate of diagnosis of latent TB \((\sigma )\), fraction of fast TB progression (p), and the rate of detection of active TB \((\kappa )\). Similarly, using the population of diagnosed latently-infected and undiagnosed latently-infected individuals \((E_2\) and \(E_3)\) as response functions, the top three PRCC-ranked parameters are the fraction of fast TB progression (p), rate of detection of active TB \(( \kappa )\) and the treatment rate for diagnosed latently-infected individuals \((r_0)\), and rate of diagnosis of previously undiagnosed latently-infected persons \((\alpha )\), fraction of fast TB progression (p) and fraction of new latent TB infection who got diagnosed (n), respectively. Also, using the population of undiagnosed actively-infected individuals (I) as response function, the top three PRCC-ranked parameters are: the fraction of fast TB progression (p), the rate of detection of active TB \((\kappa )\), and the transmission rate \((\beta )\)). Furthermore, using the population of diagnosed actively-infected with prompt treatment and actively-infected with delay treatment \((J_1\) and \(J_2)\) a response functions, the top three PRCC-ranked parameters are the fraction of fast TB progression (p), the transmission rate \((\beta )\), and the fraction of detected active TB cases with prompt treatment (q). Finally, using the reproduction number \((\mathcal {R}_T)\) as response function, the top three PRCC-ranked parameters are: the fraction of fast TB progression (p), rate of detection of active TB cases \((\kappa )\), and the transmission rate \((\beta )\). In summary, this study identifies nine parameters that have the most significant influence on the transmission dynamics of the treatment model, albeit, depending on the output response of interest, namely: the transmission rate \((\beta )\), fraction of fast TB progression (p), fraction new latent TB who got diagnosed (n), rate of diagnosis of latent TB \((\sigma )\), fraction of detected active TB cases with prompt treatment (q), rate of detection of active TB cases \((\kappa )\), treatment rate for diagnosed latently-infected individuals \((r_0)\), treatment rate for diagnosed actively-infected individuals with delay treatment \((r_2)\), and the rate at which latently-infected individuals become diagnosed individually \((\alpha )\).

The incidence of a disease is the rate of occurrence of new cases of the disease at a given time (or a period of time). In order to investigate significant parameters that greatly influence the occurrence of new cases of tuberculosis (i.e., TB incidence), we use the TB incidence as a response function. It follows from Table 4 that the top three PRCC-ranked parameters are the transmission rate \(( \beta )\), fraction of fast TB progression (p), and rate of detection of active TB cases \(( \kappa )\).

Table 4 PRCC values for the parameters of the treatment model (2.1) using the total number of new cases (incidence) as response function

It is worth noting that the fraction of fast TB progression (p) is common to all output responses as one of the significant parameters that has an influence on the transmission dynamics in the population. Hence, an increase in the value of p will result in corresponding increases in the values of the output responses considered herein.

7.2 Numerical simulations

The TB model (2.1) is numerically simulated to illustrate the effect of varying some key parameters related to diagnosis of latent and active TB cases. The parameter values listed in Table 2 are used for the simulations, otherwise specific parameter values (especially the transmission rate, \(\beta \)) used for the simulations are stated in the caption of the each figure. For the simulation in this section, demographic parameters relevant to Nigeria were chosen. Since the total population of Nigeria in 2015 is estimated to be 184,635,279 [16], it follows that, at the disease free equilibrium, \(\Lambda /\mu =184{,}635{,}279\). Moreover, the average mortality rate in Nigeria is \(\mu = 0.02041\) per year [56] so that the average recruitment rate is \(\Lambda = 3{,}768{,}400\) per year, and the total TB incidence in Nigeria was estimated to be 600,000 in 2015 [64].

Fig. 5
figure 5

Cumulative number of new TB cases with \(\beta = 9\), and varied n

Considering Fig. 5, we have the cumulative number of new TB cases as we vary the fraction of new latent TB infection who got diagnosed (n) between 0 and 1. The simulation shows that the cumulative number of new TB cases significantly drops as more latent TB are diagnosed (i.e., as \(n \rightarrow 1\)). This suggest that increasing the fraction of latent TB cases who are diagnosed have a positive effect in reducing the number of new TB cases over time.

Fig. 6
figure 6

Cumulative number of new TB cases with \(\beta = 9,\) and varied q

Looking at Fig. 6, we have the cumulative number of new TB cases as we vary the fraction of detected active TB cases (q) between 0 and 1. The simulation shows that as we increase the fraction of detected active TB cases who are promptly treated, the cumulative incidence of TB reduces.

Fig. 7
figure 7

Cumulative number of new TB cases with \(\beta = 9\), and varied \(\sigma \)

The plot in Fig. 7 shows the cumulative number of new TB cases as we vary the rate of diagnosis of latent TB \((\sigma )\) between 0.2 and 3. The simulation shows that as more latent TB infections are diagnosed, the cumulative number of new TB cases also decreases.

Fig. 8
figure 8

Cumulative number of new TB cases for with \(\beta = 4\), and varied \(\kappa \)

Figure 8 depicts the cumulative number of new TB cases as we vary the rate of detection of active TB cases \((\kappa )\) between 0.2 and 4. Just like the case in Fig. 6, a reduction in the cumulative number of new TB cases can only be achieved with very high detection rate for active cases.

8 Discussion

A new deterministic model for investigating the effect of diagnosis and treatment of latent TB infections and active cases on the transmission dynamics of the disease in a population is formulated and analyzed. We summarize the major theoretical and epidemiological findings as follows:

  1. (i)

    The model has a disease-free equilibrium (DFE), which is locally asymptotically stable whenever the associated reproduction number is less than unity. In the absence of exogenous re-infection, the treatment model has a globally asymptotically stable DFE whenever the associated reproduction number is less than unity. For a special case, the treatment model is shown to have a unique endemic equilibrium which is shown to be globally-asymptotically stable.

  2. (ii)

    The model exhibits the backward bifurcation phenomenon when the reproduction number \((\mathcal {R}_{T1})\) is less than unity due to the presence of exogenous re-infection.

  3. (iii)

    Analysis of the effective reproduction number of the model (2.1) show that the fraction of detected active TB cases that are promptly treated as well as the detection rate for active TB can have a population-level impact on the transmission dynamics of TB under certain conditions. Moreover, the impact of the fraction of new latent TB infections (together with the fraction of active TB cases detected and promptly treated) on the disease burden on the population is largely dependent on the treatment rates for TB infected individuals.

Uncertainty and sensitivity analysis of the treatment model (2.1) show the following:

  1. (a)

    Using the population of new latently-infected individuals \((E_1)\) as response function, it is shown that the top three PRCC-ranked parameters are the rate of diagnosis of latent TB \((\sigma )\), fraction of fast TB progression (p), and the rate of detection of active TB \((\kappa )\). In the same vein, using the population of diagnosed latently-infected and undiagnosed latently-infected individuals \((E_2\) and \(E_3)\) as response functions, the top three PRCC-ranked parameters are the fraction of fast TB progression (p), rate of detection of active TB \(( \kappa )\) and the treatment rate for diagnosed latently-infected individuals \((r_0)\), and rate of diagnosis of previously undiagnosed latently-infected persons \((\alpha )\), fraction of fast TB progression (p) and fraction of new latent TB infection who got diagnosed (n), respectively.

  2. (b)

    Furthermore, when the population of undiagnosed actively-infected individuals (I) is used as response function, the top three PRCC-ranked parameters are the fraction of fast TB progression (p), the rate of detection of active TB \((\kappa )\), and the transmission rate \((\beta )\)). Similarly, using the population of diagnosed actively-infected with prompt treatment and actively-infected with delay treatment \((J_1\) and \(J_2)\) a response functions, the top three PRCC-ranked parameters are: the fraction of fast TB progression (p), the transmission rate \((\beta )\), and the fraction of detected active TB cases with prompt treatment (q). It is worth noting that this result also holds when the reproduction number \((\mathcal {R}_T)\) is used as a response function.

  3. (c)

    When the TB incidence is used as the response function, it is shown that the top three PRCC-ranked parameters that influences the incidence of TB in the population are the transmission rate \(( \beta )\), fraction of fast TB progression (p), and rate of detection of active TB cases \(( \kappa )\).

  4. (d)

    The common parameter that significantly influences all output responses is the fraction of fast TB progression (p).

Numerical simulations of the TB model (2.1), using relevant demographic data from Nigeria, show that the cumulative number of new cases is significantly reduced as we increase the fraction of diagnosed new latent TB infections (LTBI) (n), and the rate of diagnosis of latent TB \((\sigma )\). Furthermore, the cumulative number of new cases can be reduced when a very large fraction of detected active TB cases are promptly treated (q), and the rate of detection of active TB cases \((\kappa )\) is high.

This study has shown that, with a relatively high fraction of diagnosed (and treated) latent TB infections (even with a small fraction of detected active TB cases promptly receiving treatment), it is possible to reduce the TB burden in the population, in agreement with some of reports, such as [30] and other reference therein. However, the impact of these diagnosis will be positively felt if there are robust treatment policies that will allow for high treatment rates for most TB infections diagnosed.