Introduction

The IARC Global Cancer Observatory has estimated that in 2018 the global cancer burden have risen to 18.1 million new cases and 9.6 million deaths in 185 countries [1,2,3]. Several factors as population growth, ageing, social and economic development, lifestyles, diet more typical of industrialized countries are associated with the increasing cancer burden [4,5,6,7,8,9].

Lung cancer is caused mainly by smoking [10,11,12], and represents the leading cause of cancer deaths in the world, with about 1.8 million cases (12.9% of all cancers) and 1.59 million deaths (19.4% of all cancer-related deaths) per year [13,14,15]. Among lung cancers, non-small cell lung cancer (NSCLC) is the most common type (85–90% cases of all lung cancers) [16, 17]. Most patients are diagnosed in the advanced stage (79%) [18] with the presentation of symptoms related to the primary tumor (27%), nonspecific systemic symptoms suggestive of metastases (34%) or symptoms specific to a metastatic site (32%) [19]. Lung cancer is difficult to treat because its pathological mechanisms are still poorly understood. This disease is a significant economic burden not only to patients but also to national healthcare systems.

Recent advances in our understanding of the cell signaling pathways that control cell survival have opened up many new therapies that improve the 5-year survival rate for patients [20]. Among these new methods, targeted therapy with the typical drug, erlotinib (Tarceva), has been found to limit tumor development and progression by inhibiting the epidermal growth factor receptor (EGFR) mutations found in 10–15% of cases in North America and Western Europe and up to 51.4% of cases among Asian patient [21,22,23,24,25].

Erlotinib was approved for second-line therapy by the FDA in 2005, and then for first-line treatment of advanced EGFR mutation-positive NSCLC in 2013, because it was proven superior to chemotherapy for progression-free survival in patients whose tumors harbor sensitizing driver mutations in the EGFR gene [26,27,28]. However, for decision makers, the choice of treatment depends not only on effectiveness but also on treatment costs [29]. The aim of the study was to review the available evidence on the cost-effectiveness of erlotinib in the first-line treatment of advanced NSCLC and evaluate the quality of studies.

Materials and methods

Systematic literature review

A systematic literature search was conducted in PubMed, Cochrane, Scopus and ScienceDirect, all well-known and respected databases, in March 2017 to identify articles published from January 2000 onwards that report a pharmacoeconomic evaluation of erlotinib in first-line treatment of advanced NSCLC. The search process was repeated in December 2017 to ensure up to date results. The main key words, formed using the PICO technique (Table 1), were “advanced non-small cell lung cancer”, “erlotinib”, “first-line”, “cost-effectiveness”, “cost-utility”, “economic evaluation”, “cost-minization”, “cost-benefit” and synonyms identified using the Medical Subject Heading (MeSH) tool. Search terms were combined with boolean searching terms (“AND”, “OR”) and adapted as required for each database, for example, #1 “non-small cell lung cancer” OR “non-small cell bronchial cancer” OR “NSCLC” AND #2 “advanced” OR “stage IV” OR “stage IIIB” OR “metastatic” AND #3 “first-line” AND #4 “erlotinib” AND #5 “economic evaluation” OR “economic analysis” OR “cost minimization” OR “cost effective” OR “cost benefit” OR “cost utility”.

Table 1 PICO elements

The titles and abstracts of all the publications that met the search criteria were reviewed independently by two researchers. Full-text articles written in English were captured if they provided an original economic evaluation of erlotinib in first-line treatment of NSCLC patients. Clinical trials, review papers, posters, or studies that did not assess the full pharmacoeconomics, or in other words, that did not compare both costs and consequences of interventions or conditions, and studies without a specific effectiveness evaluation were eliminated. Any disagreement between the two reviewers was solved through discussion.

After reviewing, the following information from these studies was recorded in a table: (1) bibliography as year, authors and location of studies, (2) study design including aim, interventions and alternatives, the study perspective, costs and outcomes, type of model, time horizon and cycle, discount rate and sensitivity analysis, and (3) results of the study.

To compare the results of the studies, all currency values were converted into 2017 international dollars (I$) based on Consumer Price Index (CPI) and Purchasing Power Parity (PPP) [30] by the following formulas:

$${\text{Cos}}{{\text{t}}_{{\text{present}}}}={\text{ Cos}}{{\text{t}}_{{\text{past}}}} \times {\text{ CP}}{{\text{I}}_{{\text{present}}}}/{\text{ CP}}{{\text{I}}_{{\text{past}}}} \times {\text{ PPP,}}$$

in which, Costpresent: cost, converted into 2017 I$. Costpast: cost in the year of the study was conducted, CPIpresent: average consumer price index in 2017. CPIpast: consumer price index of the year when the study was conducted.

Quality evaluation

The quality of the studies was assessed by two reviewers who were not given author or affiliation information, using an internationally accepted quality checklist, the Quality of Health Economic Studies (QHES), which was developed by five experts in the field of health economics and three investigators [31]. This questionnaire consists of 16 questions with the dichotomous criteria scores to which the answers yes (full score) or, no, (zero score). In our study, the scale was modified with three answers: no (zero score), yes (full score) and partial (half of full score). The QHES assigns a weight to each question according to its importance (ranging from 1 to 9 points). The total number of points possible is 100 (adding up the score from 1 to 9 assigned to the ‘yes’ answer for each question), thus 0 represents lowest quality and 100 highest quality. In case of a disagreement between the reviewers on any of the questions, a third reviewer acted as a tie-breaker. Since there is no standard interpretation for the ranking of QHES scores, we decided that a score from 75 to 100 indicated good quality, 50–75 medium quality and below 50 low quality. Moreover, the relationship between the quality of the studies (QHES scores) and the independent variables was also evaluated using t test and Pearson’s correlation coefficient. The independent variables consisted of the number of authors, the locations of authors, the years from publication to 2017, funding sources, study design and location of study. A p value of < 0.05 is considered as statistical significance.

Results

A total of 130 studies met our search criteria: 30 from Pubmeb, 8 from Cochrane, 36 from Scopus and 56 from Sciencedirect. After removing duplicates, 99 studies remained, and once we screened their titles and/or abstracts, 11 studies remained that met the inclusion and exclusion criteria. Thus, 88 studies were eliminated: 4 were not about first-line treatment, 19 did not provide a pharmacoeconomic evaluation, 1 was not a full pharmacoeconomics, 21 did not appraise the cost-effectiveness of erlotinib, 3 did not assess NSCLC disease, 1 was not written in English, 17 were systematic reviews, and 22 were only abstracts or programs from scientific conferences. The 11 articles that met our criteria were read in full and assessed for eligibility, and all were found acceptable (Fig. 1). Their details and content were then summarized in Table 2.

Fig. 1
figure 1

Flow diagram of screening process

Table 2 Characteristics of the studies

Study characteristics

Studies of economic evaluation of erlotinib were carried out in the interval between 2012 and 2017 in many countries all around the world including Europe (UK, Germany, France, Spain, Italy) [32,33,34,35,36,37,38] Asia (Hongkong, China) [39,40,41] and America (USA) [42], but the largest number of studies were in Europe (64%) and in 2012 and 2015 (54%). Among 11 studies, 5 studies compared erlotinib with the best supportive care, 2 studies dealt with reverse strategy, 1 study compared erlotinib in combination with bevacizumab with bevacizumab alone, the others compared with cisplatin plus pemetrexed, carboplatin plus gemcitabine and gefitinib. The target population in most studies comprised elderly patients with advanced stage EGFR-mutated NSCLC.

The main methods used in most studies were cost-effectiveness analysis (CEA) with modeling method and sensitivity analysis for uncertainty of model, including multi-way probabilistic sensitivity analysis in all studies and one-way deterministic analysis in five studies [34,35,36, 40, 41]. Regarding the modeling method, the health-state transition model was used in the majority of studies (8 of 11 studies), the Markov model being the most chosen (75%). In the health-state transition model, the model cycle was 1 month in the majority of the studies and 3 weeks in the studies conducted in Germany [34] and China [40]. The model horizon was different in each of articles, with a lifetime horizon in 2 studies [39, 42], 10 years in 3 studies [34, 40, 41], 4 years in 1 study [32] and with no specification in 2 studies [35, 36]. In 3 remaining studies, 2 studies used partitioned survival model [37, 38] with a 1 month cycle and a lifetime horizon [38] or 5 a year horizon, while 1 study was based on the case report forms collected monthly [33]. The main clinical data sources of these studies were EURTAC (European Clinical Trials Database No. 2006-003568-73), GFPC 0505 (a prospective randomized phase II trial), SATURN (the randomized multicenter placebo controlled phase 3 patient-level data trial) and OPTIMAL trial (a multicentre open-label randomised phase 3 study).

Regarding the perspectives of these publications, 2 studies by Chouiad were based on a third-party perspective [35, 36] and 7 remaing studies (except the Ting study [42] and the Khan study [33]) were based on the perspective of the health care system [32, 34, 37,38,39,40,41], so they calculated only direct costs consisting of chemotherapy, hospitalization, medications, diagnostic imaging, consumables, treating adverse events, procedures, laboratory tests, and support care. Two studies of Ting [42] and Khan [33] did not mention the perspective, but they did not assess indirect costs, so they were not based on society perspective. Main outcomes obtained in the studies were life-years gain (LYG), quality-adjusted life years (QALY) and progression-free life-years gained (PF-LY). A discount rate of 3% in 5 studies [32, 34, 39, 40, 42] and 3.5% in 5 studies [33, 35,36,37,38] were used for both cost and effectiveness. The Fangbing study did not mention a discount rate [41].

Cost-effectiveness

After converting the currencies used in the studies to I$, using the CPI and PPP, we found that erlotinib in the first-line treatment of advanced NSCLC had incremental cost-effectiveness ratios (ICER) in terms of QALY and LYG (ICER/QALY and ICER/LYG) that ranged from dominant, effectively superior and cost saving, to I$305,510.31/QALY (Fig. 2) and from I$31,209.55/LYG to I$66,540.20/LYG (Fig. 3). In the studies conducted in Europe, the cost-effectiveness acceptability curves resulting from probabilistic sensitivity analyses showed that the probability of erlotinib being cost-effective was over 50%, if the range of willingness-to-pay (WTP) threshold was €50.000–€70.000 [34, 37], even 100% in the research of Walleser et al. [38]. It would be 100% in all studies in Europe if WTP was at least €90.000. About studies in Asia, at the threshold of approximately US$100.000, erlotinib had a 100% probability of being cost-effective in Hong Kong [39], but 50% probability in China [40]. One study showed that erlotinib exhibited about 10% chance of being cost-effective in all patients and about 80% chance in a subset of elderly patients with NSCLC who had poor performance, were deemed unfit for chemotherapy and developed first cycle (28 days) rash in the range of WTP of £20,000–£30,000 [33]. Two other studies in France comparing erlotinib with reverse strategy did not find a difference in cost-effectiveness [36, 43] and 1 study comparing a combination of erlotinib and bevacizumab with bevacizumab alone suggested that the strategy with bevacizumab alone was the best in terms of cost-effectiveness [41].

Fig. 2
figure 2

ICER/QALY of studies

Fig. 3
figure 3

ICER/LYG of studies

When comparing the results regarding comparative regimens, a significant discrepancy emerged between the results in comparision erlotinib with best supportive care. Although all these studies were carried out in Europe, the result of Vergnenegre (result in three European countries including Italy (I$42,801.60/LYG), France (I$54,170.50/LYG) and Germany (I$66,540.20/LYG)) [36] was higher than that of Walleser (result in 5 European countries including Italy (I$32,301.04/LYG), France (I$31,209.55/LYG), Germany (I$35,232.67/LYG), UK (I$33,557.67/LYG) and Spain (I$36,688.71/LYG)) [37]. Similarly, studies comparing erlotinib and chemotherapy regimens found significant difference: the results ranged from I$20,434.34/QALY to I$134,696.97/QALY. Regarding the regimen of erlotinib followed by chemotherapy after disease progression compared to the reverse strategy, in the Chouaid study on fit elderly patients, there was no difference in outcomes between the two strategies, but the erlotinib-first strategy was less costly [43]. However, similar research by Chouaid, but on frail elderly patients, found no difference in cost-effectiveness [36].

The quality of the reports

On the results of the quality evaluation through the 16 questions in the QHES quality questionnaire, 7 studies were of high quality, and 4 of medium quality. The mean score of these 11 studies was 75.77 out of 100 (SD 9.38). The more recent studies were among the higher quality ones. The highest score was assigned to the Shcremser article published in 2015 [34] which was assigned a score of 92. The studies by Khan and Ting published in 2015 [33, 42] and Vergnenegre published in 2016 [32] were ranked the second highest with a score of 81–86. Three studies published in 2013, 2014, 2017 were ranked the third highest, with slightly higher scores in 2013 and 2014 (“Appendix 1”).

Regarding our answers to the 16 questions to evaluate the quality of each study, all the studies were assigned a ‘yes’ answer to questions 1,4, 6 and 15, which regard the study objective, pre-specification of the groups at the beginning of the study, performance of incremental analysis between alternatives, and the conclusions/recommendations of the study. In contrast, lowest scores to these studies were given for question 2 (over 50% studies got partial score because of not stating reasons for perspective selection in study), question 7 (over 50% studies got partial score because of lacking of explicit methodological detail on costing/resource use and health/patient measures or above and beyond the source of costs and the name of the scale used to measure health states or other effectiveness outcomes [44]), question 10 and question 12 (about 30% articles did not get full score for these items) and question 14 (all studies did not explicitly discuss direction and magnitude of potential biases) (Fig. 4).

Fig. 4
figure 4

Percentage of answers to each of the QHES questions

When analysing the relationship between the variables and the quality of studies based on the mean QHES scores, articles that the years since publication were under 3 years had a significantly higher mean score than the studies that the years were from 3 years or more (83.0 versus 69.8; p = 0.01), that is, the more recent studies were the higher quality ones. Regarding the study design, there were not enough articles in each category to conduct meaningful statistical comparison, so we evaluated the relationship between the quality of studies and the types of model used in the studies (Markov model or not). The result showed that there was a significantly higher mean quality score in the studies that used Markov model than the studies that used other models (80.83 compared with 65.75; p = 0.002). For locations of study, locations of authors, number of authors and funding source, there was not significantly related to the mean QHES score (Table 3).

Table 3 Relationship between study characteristics and the quality of the studies (QHES scores)

Discussion

This comprehensive review examined studies on the cost-effectiveness of erlotinib in the first-line treatment of advanced NSCLC, and assessed the quality of the studies using the QHES questionnaire, a well-known and trusted tool. Unlike other systematic reviews, this review focused on the first-line treatment of advanced NSCLC instead of later lines of treatment, and on erlotinib regimen rather than other regimens, and drew upon many recent and up-to-date publications, as well as evaluated the quality of studies [35, 45,46,47,48] In doing so, we sought to give healthcare policymakers a sense of the reliability of the research available, to help inform their decisions in the first-line treatment of NSCLC with erlotinib.

An initial keyword search in 4 respected medical databases yielded 130 articles on our topic; after reading the titles and articles to exclude duplicates and publications that did not meet our criteria, 11 studies were identified for this review. The main methods used in most studies were cost-effectiveness analysis (CEA) with modeling method and sensitivity analysis for the uncertainty of model. The use of modeling method in CEA studies regarding treatments for chronic diseases is indispensable because it allows evaluation of costs and clinical outcomes based on a number of types of data from many different sources, and makes it possible to extrapolate efficacy for the general population, especially when there is insufficient data from longer-term or head-to head clinical trials [49]. However, the modeling method is also a key limitation of the erlotinib pharmacoeconomic analyses which leads to a large amount of uncertainty in the analyses. As a result, the authors of most studies built a set of assumptions and sensitivity analysis to solve these problems with multi-way probability sensitive analysis chosen in all studies. In their model, they also set up a series of treatment scenarios that were close to reality and difficult to realize in a clinical trial with sufficient power and an appropriate time horizon.

Through converted results, there was the variability of ICER/QALY (about tenfold). This discrepancy may come from the difference in the comparators and the methods of researches. While other studies used modeling method and chemotherapy or targeted drugs in comparators, Khan et al. [33] chose the real data and the best supportive care in comparison. The choice of the comparator had important implications not only for trial ethics and recruitment but also for results and interpretation and depended mainly on the specific assessment question [50]. Compared to other studies, the study of Khan et at aimed to patients who were unfit for chemotherapy and received only active supportive care due to their poor performance status or presence of comorbidities [33]. This choice was relevant not only for demonstrating the efficacy and cost of erlotinib but also for assessing its place in therapy in comparison with existing therapy [50]. Results of the studies came from the perspective of a health care system and third-party payer were limited by the direct costs, while indirect costs associated with travel, loss of productivity and other factors may accounted for high value. To have comprehensive overview of the economic burden of erlotinib in the treatment of advanced NSCLC, these indirect costs should be assessed as well. Most studies agreed that erlotinib was cost-effective compared with best supportive care and chemotherapy (except the studies by Khan [33] and Fangbing [41]) across the range of possible WTP thresholds, but concluded that it was no more cost-effective than reverse strategy. One of the factors that had significant impacts on the ICER was price of erlotinib, which was shown in the results of the deterministic sensitivity analysis in the studies of Schremser et al. [34] and Wang et al. [40]. The change in the cost of erlotinib caused the second highest change in ICER in the group of selected cost variables. Thus, price improvement was a potential strategy to improve the cost-effectiveness of erlotinib. In China, all eligible Chinese patients (EGFR mutation-positive NSCLC) would be able to use erlotinib free of charge after they have treated with erlotinib continuously for 5 months through the donations of manufacturer [40]. In UK guidance, erlotinib was provided at the discounted price agreed under the patient access scheme [51]. In other countries that lacked this special benefit, erlotinib monotherapy would fail to be cost-effective [40].

Using the QHES instrument, the quality of these 11 studies was assessed. Most of them got a high score (an average of 75.77 out of 100), with the more recent publications earning the higher scores for quality. This could be explained that the authors had reference and followed up-to-date about guidelines for reporting health economic evaluation. This reference is necessary for authors to upgrade the quality of their reports, which make their results valuable and trustworthy for decision-makers. The QHES modification in three answer instead of the dichotomous scores (mid-score was applied if the article only met the half the requirements of the item) made the evaluation closer. For examples in the item 2, most authors stated the research perspective in the studies, which was important to identify the included cost components, but they did not mention to the reasons for its selection that did not deserve full points but did not deserve zero points either. As a result, it was to note the need for improvement in regarding questions 2, 7, 10, 12, 14, on which the studies generally failed to score well.

As usual in pharmacoeconomic evaluations, almost all of the analyses were fully or partially funded by the manufacturer if their medicines were included in the research. The source of funding was made clear in most of the analyses: seven studies specified their source of funding, one study indicated that it received no funding and three studies failed to indicate anything about their support. Among 7 studies funded, 2 received financial support from pharmaceutical companies (28.6%), 2 received funding from government fund (28.6%) and 3 were supported by both pharmaceutical companies and national source (42.8%).

Our study has some limitations. The criterion limiting the articles to those written in English and published from January 2000 onwards meant that we might have excluded many previous and non-English studies. Besides, the choice of only four databases and not including unpublished researches (grey literature) generated some biases. This review evaluated studies’ reporting quality which might not reflect the quality of the study [52]. Moreover, although good inter-rater reliability was noted for scores in QHES scale, other evaluators may score the articles differently [53].

Conclusion

A searching and screening process led to the identification of 11 studies, which were then analysed and assessed for quality. Depending on the frequently proposed willing to pay threshold of each country, most concluded that erlotinib was cost-effective in the first-line treatment of advanced NSCLC patients, two studies showed that erlotinib was cost-effective on specific types of patients with certain conditions, and two studies found no difference in cost-effectiveness between erlotinib and reverse strategy. The quality of the studies was high, with a mean QHES score of 75.77 (SD 9.38) on a scale from 0 to 100.