Introduction

Many individuals who smoke tobacco want to quit but find it difficult to do so since nicotine is addictive. In the past decade or so, e-cigarettes and heated tobacco products (HTPs) have become popular among some consumers of nicotine, especially those trying to quit smoking. Because these electronic nicotine delivery systems (ENDS) mimic the cigarette smoking experience more closely than pharmaceutical cessation aids such as nicotine patches, ENDS may encourage switching from cigarettes and therefore cessation of smoking. Alternatively, ENDS may hinder cessation if they complement the smoking habit. Whether public policy should encourage smokers to use ENDS for cessation, as in the UK (Hampsher 2020), remains an unsettled issue. Many public health officials object to ENDS on the grounds that the use of ENDS and smoking are positively correlated in many surveys, among other reasonsFootnote 1; some officials incorrectly deduce from such evidence that ENDS discourage cessation from tobacco. However, in these surveys, the use of ENDS is a consumer choice, not a randomly assigned treatment, and that choice is likely to be correlated with unobserved confounding factors related to smoking behaviour. Ignoring this issue of sample selection (or “common liabilities,” as they are known in the public health literature) leads to bias in estimates of the causal effect of ENDS use on smoking. This study investigates this correlation, selection bias, and the causal treatment effect of ENDS use on cessation.

Health ministries in many countries discourage the use of ENDS, even for smokers. South Korea, the country studied here, is among them (Prieger and Choi, 2020). The smoking rate in Korea is higher than in most other countries. The cigarette smoking rate of 20% in 2018 is at the 64th percentile among reporting countries (World Health Organization (WHO) 2019b). Korea has among the highest prevalence of male smoking in the world; the cigarette smoking rate for men of 35% is at the 78th percentile among all countries and in the top decile among high-income countries. Despite large declines in smoking since 1980, when the male smoking rate was almost 80% (Prieger and Choi, 2020), further decline in the smoking rate stalled or reversed between 2005 and 2011. After ENDS began to come into common use in Korea around 2011–2013, the male smoking rate resumed its decline. These trends prompt the question of whether ENDS indeed played a role in cessation from smoking.

Many cohort studies of Korean subjects find that ENDS use and smoking are positively correlated, a finding we replicate below. To show that such findings are due to selection effects from the endogeneity of using ENDS, we show that a tax increase on ENDS and negative pronouncements regarding ENDS by the government in 2015 likely shifted the subpopulation of ENDS users toward those smokers for whom cessation is less likely. Evidence for the shift comes from changes in the composition of ENDS users and increases in the association between ENDS use and current smoking from regression and matching estimators. When the most likely users of ENDS are also those most addicted to smoking, it is unsurprising that smoking and ENDS use are positively correlated even if ENDS help achieve cessation.

We then estimate the treatment effects directly. Our econometric analysis of repeated cross-sectional data suggests that, after accounting for unobserved confounding characteristics of individuals, ENDS promote cessation. The novelty of the study lies in demonstrating that—even without randomized treatment assignment, longitudinal data that would allow individual-level fixed effects to control for selection, or policy and price variation that allows quasi-experimental methods such as difference-in-difference estimation—using appropriate econometric methods for endogenous treatment effects shows that the correlation between ENDS use and smoking is due to selection bias. We use parametric models to estimate the effect of the binary treatment, which is ever use of ENDS, on the binary outcome: recent cessation from smoking. Several versions each of many models for treatment effects with binary dependent variables are estimated: bivariate probit, copula models, and a moment-based control function model. The results from the models controlling for selection effects generally agree that such effects exist, that they are in the direction that explain why ENDS use and smoking are correlated, and that the causal effect of ENDS on cessation is positive. Based on the results preferred by formal selection criteria, we estimate the average treatment effect (ATE) of ever using ENDS on the probability of cessation for ever-smokers in Korea to be between 28.9 and 32.2 percentage points. More importantly, the average treatment effect on the treated (ATET), which represents the expected impact of ENDS use among those choosing to use them (the group of smokers who may find cessation most difficult, as we show below), is estimated from the preferred bivariate copula model to be 16.2 percentage points. The control function estimation yields a similar ATET of 17.0 percentage points. The estimates for the ATET are robust to many different estimators, subsamples of the data, varying definitions of cessation, specification of the time trend, and whether an instrument is used.

Our empirical results finding a positive, significant, and sizeable causal effect of ENDS on cessation link real-world use of ENDS to results from randomized controlled trials (RCTs) on ENDS and cessation. Systematic reviews of RCTs indicate on the whole that ENDS can be a better aid to cessation than nicotine replacement therapy (Chan et al. 2021; Hartmann-Boyce et al. 2022; Hébert-Losier et al. 2020). For ENDS to be beneficial for public health, however, smokers must choose to use them. Adoption of ENDS has been relatively low in Korea, especially in contrast to other Asian countries such as Japan. The reason is largely due to government policy. While officials allowed ENDS into the marketplace for nicotine products in 2007,Footnote 2 the health and drug safety ministries have consistently discouraged the use of ends. Highly publicized pronouncements by the government have convinced many Koreans that ENDS are as harmful to health—or even more harmful—than smoking, that e-cigarettes do not help in quitting smoking, and that they cannot be used as a cessation aid. The pronouncements on harm are contrary to much scientific evidence, as will be discussed in the concluding section. Korea was the first country to tax nicotine-containing ENDS (in late 2010), and taxes on ENDS rose 119% in 2015 (along with increased cigarette taxes) and doubled again in 2021. Governmental agencies have issued several negative statements and recommendations regarding ENDS. A prominent example occurred in January 2015, when the Ministry of Health and Welfare announced that e-cigarettes “contain the same carcinogens as in tobacco cigarettes,” with the tacit implication that vaping is just as harmful as smoking (Ministry of Health & Welfare, 2015). The tax increases and widely publicized health warning from the government led to current ENDS usage to fall from its already low level by almost one-half from 2015 to 2016.

In the next section, we review the literature on ENDS use, smoking, and cessation in Korea and elsewhere. Descriptive empirical results on smoking and ENDS use in Korea are covered next, and then, the positive correlation between smoking and ENDS use is demonstrated with descriptive regressions. The subsequent section discusses the potential for selection bias and presents indirect and direct model-based evidence for it. We conclude with a summary and discussion of the results and their policy implications.

Literature on the Association Between the Use of Ends and Smoking

Since ENDS start gaining market prominence in the 2000s, a growing body of literature has examined the association between e-cigarette usage, attempts to stop smoking, successful cessation, and other aspects of dual use of cigarettes and ENDS. Although there is relatively limited evidence on the causal impact of e-cigarette usage on smoking, observational studies that investigate e-cigarette use among smokers provide interesting and suggestive results. Several studies find that ENDS use is associated with the intention to quit smoking; the majority of smokers who use ENDS do so to reduce or quit smoking (Adkison et al., 2013; Maglia et al., 2017). These intentions translate to an increased number of quit attempts among ENDS-using smokers, compared to single-use smokers (Pasquereau et al., 2017; Zhuang et al., 2016). Increasing the number of attempts is important since most attempts to quit smoking fail, regardless of which cessation aids are used.

Are these quit attempts successful? Many studies from around the world find evidence that e-cigarette use is associated with cessation, although the evidence is not unequivocal. Systematic reviews of the literature conclude that most studies find positive relationships between using ENDS and cessation, but also that much of the evidence is correlational (Malas et al. 2016; Villanti et al., 2018). Some meta-analyses of observational studies also find positive and significant effects on cessation from ENDS use, whether in general (Rahman et al., 2015) or only for daily use (Wang et al., 2021). However, some reviews of observational studies find no positive association between vaping and cessation overall (Wang et al., 2021). The mixed findings regarding cessation rates among dual users may be due to differences in the frequency of use; daily e-cigarette users are more likely to quit smoking than non-users, but less-than-daily users are less likely (Wang et al., 2021).

Apart from abstinence, ENDS use can also encourage reducing the intensity of smoking (Pasquereau et al., 2017). On the other hand, other studies find that dual users consume more total nicotine and have higher nicotine dependence than single-use smokers (Kim et al. 2020; Snell et al., 2020). Of course, such findings may say more about the type of smoker who chooses to also use ENDS than about any causal effect of ENDS use on consumption or dependence. This possibility that individuals choosing to be dual users are those less likely to be successful in quitting informs our discussion of selection bias below.

In contrast to the mainly observational cohort studies discussed above, there are far fewer results from research designed to elicit causal effects. Several published studies evaluate the use of ENDS as a cessation aid in randomized controlled trials (RCTs) (Chan et al. 2021; Hébert-Losier et al. 2020). In particular, a recent Cochrane review (a series considered the gold standard of meta-analyses in public health and medicine) covered 22 RCTs and nine uncontrolled intervention studies where every participant received ENDS for cessation. It concluded that ENDS is more effective for smoking cessation than nicotine replacement therapy (with high certainty) and e-cigarettes without nicotine (with moderate certainty) (Hartmann-Boyce et al. 2022).

Other studies use quasi-experimental designs to identify causal effects. Saffer et al. (2020) employ a synthetic control difference-in-difference approach to examine a sharp increase in e-cigarette taxes in Minnesota to find that the adult smoking rate rose and the cessation rate fell in response. Relatedly, Dave et al., (2019a, 2019b) show that television advertising of ENDS has a causal effect on cessation from smoking. These studies are part of a recent set of quasi-experimental studies exploiting variation in taxes and other policies to provide robust evidence that ENDS and cigarettes are economic substitutes instead of complements (e.g., Abouk et al., 2023; Dave et al., 2019a, 2019b; Friedman & Pesko, 2022; Pesko & Warman, 2022), which indirectly supports the cessation hypothesis. Difference-in-difference modeling is unavailable in the present context since taxes and regulation are uniform within Korea.

The studies reviewed above (apart from Kim et al. 2020) were performed outside Korea. Research performed using data from Korean subjects tends to present a negative view of ENDS, mainly because typically no or little attempt is made to address the issue of selection bias. Several studies find that use of e-cigarettes is positively associated with smoking prevalence and intensity, especially among younger people.Footnote 3 Specific findings include that the dual use of ENDS and cigarettes is more prevalent than single use of e-cigarettes, although some studies do not distinguish concurrent dual use from having ever used ENDS and cigarettes concurrently. Those who smoke more frequently are found to be more likely to be current ENDS users. However, such observational studies have little to say about whether e-cigarette use leads to smoking, vice-versa, or whether both behaviors are caused jointly by other factors. The same critique applies to the association found between e-cigarette use and smoking intensity. Studies on actual or attempted cessation in Korea have only recently begun to examine the role of e-cigarettes. Use of e-cigarettes is associated with more cessation attempts, in contrast to use of HTP (Kang & Cho, 2020).

In contrast to the observational studies in Korea, the only RCT in the country involving ENDS came to positive conclusions (Lee et al., 2019). E-cigarettes were found to be about as effective as nicotine gum (a proven cessation aid), and a higher proportion of the e-cigarette users reduced their smoking (i.e., switched for daily to non-daily smoking) at 24 weeks.

Data and Descriptive Results on Smoking and the Use of ENDS in Korea

For our empirical investigation of smoking and vaping, we use data from the 1998 to 2018 Korea National Health and Nutrition Examination Survey (KNHANES), a cross-sectional population-based survey (using multistage stratified cluster sampling). Since questions regarding e-cigarette use are available since 2013, the sample is limited to 2013–2018 when examining ENDS. For 2018, the ENDS usage variable we construct refers to e-cigarette or HTP use; before then, there was no survey question about HTP in particular, although some HTP users may have considered their devices to be e-cigarettes when answering the survey question. The survey was conducted in 1998, 2001, 2005, and annually 2007–2018. The unit of observation is an individual, and all statistics and regressions estimated from the survey data are weighted to reflect the population of non-institutionalized adults of legal smoking age (19 years and older). All inference is based on standard errors computed with the Taylor series linearization method accounting for the complex survey design, including stratification, clustering on the primary sampling units, and the sampling weights.

There were large decreases in the smoking rate in Korea since the 1980s until 2007, when it reached 25.0% (95% CI [0.230, 0.273]) (Fig. 1; Prieger and Choi, 2020). After 2007, the prevalence of smoking declined only modestly to its rate in 2018 of 21.1% [0.195, 0.227] overall and 35.8% [0.335, 0.382] for men. All of the recent decline is accounted for by men. The use of ENDS is fairly low in Korea but was increasing through 2015, with a reversal in response to unfavorable policy after 2015 (Fig. 2). Adults in their 20s and 30s are more likely than older people to use ENDS, although the share of current ENDS use among adults aged 40–49 years increased threefold in 2018, the first full period that HTPs such as PMI’s IQOS were available (Prieger and Choi, 2020), compared to the year before. Out of all adults, 18.9% of males (95% CI [0.171, 0.209]) and 3.4% [0.027, 0.042] of females report ever using e-cigarettes in 2018. The share of e-cigarette users increased sharply in 2015 (likely because of a large cigarette tax increase and indoor smoking ban that year), but fell the next year, possibly in response to ENDS tax increases and the negative government pronouncements about the health effects of ENDS in 2015. About 9.9% [0.085, 0.115] of males and 1.3% [0.009, 0.017] of females reported that they currently use e-cigarettes in 2018 (Fig. 2). Adult users of ENDS in Korea are nearly exclusively current and former smokers (Fig. 3). Use among never smokers is trivial—less than one-half of 1% of never smokers say that they have ever used ENDS during this period (see Table 2). Little has changed in the composition of ENDS users since 2016, despite the introduction of HTP in 2017.

Fig. 1
figure 1

Smoking prevalence among adults. Note: Data are from the KNHANES (analysis by the authors). Only individuals of age 19 + (the legal smoking age) are included

Fig. 2
figure 2

Percentage of adults who ever used and currently use ENDS

Fig. 3
figure 3

Current adult ENDS use by smoking status

Why are smokers and others trying e-cigarettes? In 2015, when cigarette taxes rose, there were more adults using ENDS for reasons of curiosity, harm reduction, and as a cessation aid than in 2016 (Fig. 4). In 2015, 44.8% (95% CI [0.357, 0.542]) of current e-cigarette users stated that cessation was their main reason for use, while in 2016, that figure was 48.6% [0.374, 0.599]. More smokers intend to quit than attempt to do so. Table 1 shows how many adult smokers have plans to quit smoking, and when, broken out by usage of ENDS. Over 2016 to 2018, about 61% of smokers stated that they had plans to quit, although more than half of those had no plans within the next six months. The intentions and timing of the plans do not vary greatly depending on whether e-cigarettes have ever been used, except that current and past users are about three percentage points more likely to have cessation plans. Having plans to quit is most likely among former HTP users (perhaps because they tried using HTP to quit in the past) and least likely among current HTP users (perhaps due to the selection effects discussed below). It is interesting to note, however, that more current HTP users have plans to quit more than six months in the future than any other group.

Fig. 4
figure 4

Reasons for using ENDS among adult current users

Table 1 ENDS use and plans for cessation among current adult smokers

Results: The Positive Association Between ENDS Use and Smoking

Current and past use of ENDS is highly positively correlated with smoking in Korean data, even after controlling for demographic factors. Table 2 shows the cross-tabulation of smoking status and ENDS use in the KNHANES data averaged over 2013–2018. As already seen in Fig. 3, current ENDS users are likely to be current smokers: Out of the 2.9% of adults during the period 2013 to 2018 who were current ENDS users, 84.9% of them reported being current smokers. On the other hand, those who never used ENDS also tended to report that they had never smoked cigarettes. Out of the 90.9% of adults who reported never using ENDS, 65.1% of them never smoked, 18.6% are former smokers, and only 16.4% of them are current smokers.

Table 2 Adult smoking status and ENDS use

The positive association between ENDS use and smoking persists after controlling for observable characteristics of the individuals. Table 3 shows the results from probit regressions of a dependent binary variable for current smoking on current and past use of ENDS using KNHANES data from 2013 to 2018 (summary statistics and complete regression results are in the appendix). The probit model is used to be in accord with the models for endogenous treatment effects explored later. Both marginal effects (more familiar to economists) and adjusted odds ratios (more familiar to health researchers) are shown. The marginal effects and odds ratios are adjusted for the survey year, region of the country, gender, age, urban/suburban/rural location, educational attainment, occupation, household income, home ownership, and insurance status. For each sample, the first regression includes only the fixed effects for year and region, while the second regression includes all the additional control variables. Examples of the unobserved confounders controlled for by the year fixed effects include national tobacco control laws, tobacco and ENDS taxes (which are national in Korea), and the offerings and prices in the national tobacco and ENDS marketplace. The region fixed effects control for systematic differences in products or prices among regional markets, among other unobserved region-specific factors.

Table 3 Probit regression results for current smoking and ENDS use

As in the Korean cohort studies discussed above, current and ever use of ENDS is highly and statistically significantly associated with current smoking. In the first two estimations in Table 3, the sample includes all adults of legal smoking age. The apparent marginal effect of current use of ENDS on smoking is huge: 60.3 percentage points in the estimation only including fixed effects. After controlling for additional demographic and SES controls, the marginal effect is reduced to a still-large 40.2 percentage points. The adjusted odds ratio for current use of ENDS is 17.1 when the additional controls are excluded in the regression and 10.5 when they are included. The marginal effects and adjusted odds ratios for past use of ENDS are even higher. These very high adjusted marginal effects and odds ratios are due mainly to the fact that very few people use ENDS in Korea unless they were smokers at some point. Thus, ENDS use is a very strong predictor of current smoking.

In the next two estimations in Table 3, the sample is restricted to adults who smoked at some point (in the past or currently). The marginal effects and odds ratios for ENDS use are much smaller than in the all-adult sample, but are still very large and statistically significant. Current ENDS use is associated with a higher probability of current smoking: either a 33 or 24 percentage point increase (and odds ratios of 4.5 or 3.1), depending on whether controls are included. Past ENDS use, as before, is a stronger predictor of current smoking. The final estimation in the table restricts the sample to adult males who smoke or formerly smoked, since most smokers and ENDS users are men and the prevalence of smoking among Korean women may be severely underreported (Kang et al., 2013). The results are similar to the similar regression specification with both sexes.

Results: Selection Bias in the Use of ENDS

As discussed above, the weight of the evidence from RCT studies around the world leans toward a positive treatment effect of ENDS use on cessation. No causal conclusions of any kind can be drawn from regression analyses of data from observational cohort studies that do not correct for bias in the estimates due to the endogeneity of ENDS use. Individuals choose whether to use ENDS, and that choice is highly likely to be correlated with unobserved factors related to smoking behavior.

For purposes of exposition only, assume that there are two types of people. A Type I individual is characterized by unobserved factors that make him more likely to initiate smoking, more likely to become addicted (or more heavily addicted) if he does smoke, and more likely to have weak motivation and low self-efficacy to quit smoking. The same factors may also lead to higher use of ENDSFootnote 4 and thus a positive correlation between use of ENDS and cigarettes. This postulate is in accord with findings that dual users have higher tobacco dependence than single-use smokers (Kim et al. 2020; Snell et al., 2020) and higher urinary cotinine levels (a biomarker for nicotine consumption; Kim et al. 2020). If there is reverse causation between ENDS and cigarettes, because smokers adopt ENDS for the convenience of servicing their nicotine habit when smoking is not possible, then the induced positive correlation between ENDS and smoking would be even stronger. In analysis of cohort data populated with Type I individuals, one may find a large positive coefficient on ENDS use in a regression of smoking status on ENDS use and observable control variables, even if the true causal effect is zero or negative.

Consider now a Type II individual, who is at the other end of the spectrum of unobserved individual factors. He is less likely to initiate smoking, less likely to become addicted to nicotine, and more likely to be able to quit and has little proclivity toward ENDS use. For the same reasons as above but in attenuated form, there will still be positive correlation between observed use of ENDS and cigarettes, but it will be smaller. The same outcome regarding positive bias in the ENDS regression coefficient would be observed as for data populated with Type I individuals, except that (due to the smaller positive correlation between ENDS and cigarette use) the upward bias would be smaller. For both types, ignoring sample selection leads to bias in estimates of the causal effect of ENDS use on smoking.

To assess whether there is selection bias in the regression estimates from the previous section, two approaches are explored. In the first approach, plausible changes in the proportion of Type I and Type II individuals using ENDS are exploited to look for evidence of changes in the amount of bias in the regression coefficient. In the other approach, econometric models designed to account for selection bias directly are employed. Both approaches indicate that there is selection bias in the direction of obscuring the negative effect of ENDS use on smoking (or, equivalent, the positive effect of ENDS on cessation) and the latter approach suggests that the use of ENDS causally increases the probability of cessation.

Indirect Evidence of Selection Bias

Recent work in applied health econometrics notes that marked changes in regression coefficients after officials make a health recommendation regarding an activity can constitute evidence for bias from selection effects (Oster, 2020). For example, if health officials recommend that taking a particular vitamin is conducive to good health, and if generally healthier people are more likely to respond to the recommendation, then cohort studies will reveal that the apparent link between the vitamin and good health becomes spuriously stronger after the announcement.

The relevant application to the present context is the announcement by the Korean government in 2015 recommending caution in use of e-cigarettes due to their stated harms to health. The difficulty in direct examination of the impact of the announcement is that in 2015, taxes on cigarettes also rose, which spurred much new use of ENDS as an alternative. Here, therefore, data from 2014 will be compared with data from 2016, after much of the new ENDS use had apparently fallen away (as shown in Fig. 2); the number of adult ENDS users is about the same in 2014 as 2016. Due to the health recommendation and the increased use of ENDS by current smokers in 2015 (reverse causality), it is expected that the composition of ever-ENDS users changed between 2014 and 2016. Because of the health recommendations, many Type I individuals may not have adopted ENDS in 2015 and 2016 who otherwise would have. Because of the tax increase, which affected the purchasing decisions of lower income smokers more than others, there also likely was an influx of new lower-income ENDS users who find cessation more difficult. The negative association between income and smoking cessation is well documented in the literature.Footnote 5 For both these reasons, between 2014 and 2016, the mix of ENDS users likely shifted toward Type I individuals and others for whom cessation is less likely.

Before looking for change in the regression coefficients, we show that some key characteristics of e-cigarette users changed over the years 2014 to 2016. It is important to remember that the selection bias postulated to afflict the regression coefficients is caused by unobserved confounding factors. However, large changes in observed characteristics may signal that unobserved factors changed as well (Oster, 2020). Table 4 shows how the composition of the subpopulation of current ENDS users in the KNHANES changed with respect to gender, income, age, education, binge drinking, self-reported health, and smoking. The 2015 cigarette tax increase and health warning regarding ENDS changed the profile of e-cigarette users, resulting in users with lower incomeFootnote 6 and less education, both of which are associated with greater difficulty in cessation.Footnote 7 ENDS use also shifted toward older individuals, who have been found to have higher nicotine dependence, fewer smoking quit attempts, and lower rates of successful abstinence from cigarettes (see Arancini et al. 2021), although the relationship between age and cessation is less clear for Korean smokers (Kim et al., 2021). The incidence of binge drinking rose among ENDS users in 2016, which is highly associated with smoking and greater difficulty achieving cessation (Kaye et al., 2020). The distribution of self-reported health also changed but in non-monotonic ways; in 2016, there was a lower prevalence of “very good” health and there was more “bad” or “very bad” health combined, but there was also much more “good” health. Although the evidence is not unequivocal in every case, the bulk of these shifts seems to suggest a shift from Type II to Type I ENDS users. Although the timeframe does not exactly match, supplementary evidence for this shift can be found in survey evidence showing that in 2015, 56% of e-cigarette users in Korea did so to stop smoking, while that proportion dropped to 31.7% in 2018 (Mbrain 2018).

Table 4 Characteristics of current ENDS users, 2014 and 2016

Based on the discussion of the selection-into-treatment model above, a shift toward Type I individuals would result in a larger positive coefficient on ENDS use in a regression of current smoking. That is indeed what happens when comparing the results from single-year regressions using data from 2014 to 2016. The first column of results in Table 5 is for probit regressions as in Table 3 performed separately using data from 2014 to 2016. The one change is that instead of including variables for both current and past ENDS use, the single variable for ever use of ENDS is used (to focus attention upon a single measure). The regression results are shown in the form of the predicted effect of ever-ENDS use on the probability of current smoking. The results show that while ever using ENDS is associated with a 24.5 percentage point increase in the probability of current smoking in 2014 (after adjusting for all the control variables), that association rises to 31.4 percentage points in 2016. As expected, the apparent association has risen, as would happen if the selection bias were worse in 2016. The change in the effect between the two years of 7 percentage points is not significant (p = 0.11), however, despite being an increase in the predicted probability of 28%.

Table 5 Evidence for selection effects from changes in estimated effect of ENDS on smoking, 2014 and 2016

Another way to control for confounding observed factors, instead of regression adjustment, is to match on the observed covariates. A propensity-score matching estimator for the treatment effect of ever-ENDS use on smoking may be better than regression to control for bias from mismatch in covariates between ENDS users and other individuals. (However, a matching estimator does not remove bias from unobserved factors such as those at issue here, any more than regression does.) The results from the matching estimation, in the final column of Table 5, are in accord with those from the logit regressions (see the appendix for details of the estimation procedure). Furthermore, the change in the estimated effect of ENDS, 12.9 percentage points, is statistically significant. These results are consistent with the presence of selection bias that worsened in 2016.

In summary, the evidence in this section shows that the composition of ENDS users changed between 2014 and 2016, suggesting that shifts may also have occurred along unobserved dimensions of confounding factors. The regression results show that, as predicted by the selection-into-treatment theory, the positive association between ever using ENDS and current smoking increased after the tax increase and health recommendation against ENDS. All of this evidence is consistent with there being substantial selection bias in regressions that treat ENDS use as exogenous. Of course, evidence consistent with a hypothesis does not rule out alternative hypotheses, such as that ENDS use actually does have a positive causal effect on smoking. To strengthen the evidentiary case for selection bias, therefore, we turn now to direct econometric evidence.

Direct Evidence of Selection Bias

In the absence of an RCT or longitudinal data on individuals, there remain some weapons in the econometrician’s arsenal to detect and correct for bias from endogenous treatment effects on a binary response. There are many approaches to estimating treatment effects on a binary outcome when a binary treatment is endogenously chosen by the individual.Footnote 8 The estimations below include the bivariate probit model, several copula models, and Wooldridge’s (2014) control function estimator. Semiparametric models for treatment effects are unavailable for use in the present application.Footnote 9 Models with discrete endogenous regressors generally lack nonparametric point identification (Shaikh & Vytlacil, 2011), and so, estimation typically involves parametric assumptions of some sort, which is the approach taken here. The first is the bivariate probit model for endogenous treatment effects (Heckman, 1978), which has been widely used in health econometrics (e.g., Bhattacharya et al., 2006; Humphreys et al., 2014, among many others). Other parametric approaches to this estimation problem are based on copulas (Han & Vytlacil, 2017). Examples of copula-based approaches to endogenous treatment or sample selection in health economics, albeit not with binary outcomes, include Dancer et al. (2008), Hasebe and Vijverberg (2012), McGovern et al. (2015), Murteira and Lourenço (2011), Prieger (2002), and Winkelmann (2012). Copulas allow separation of the choice for the marginal distributions and the specification of their means from the specification of the dependence of the error structure for the treatment and outcome equations (Nelsen 2006). The Frank and Clayton copulas explored here are Archimedean copulas (Genest & McKay, 1986), which are popular for applied work. Archimedean copulas have closed-form parametric expressions, leading to straightforward maximum likelihood estimation (MLE), and allow radial asymmetry and tail dependence, resulting in a large variety of correlational structures (Nelsen 2006).

While instrumental variable and traditional control function estimators for treatment effects in linear models are unavailable for nonlinear models for binary outcomes (e.g., probit or logit) with binary endogenous variables, Wooldridge (2014) developed a moment-based control function estimator for the problem at hand. His GMM estimator, while resting on the parametric assumption of the normality of the errors, depends only on certain moments implied by that assumption and thus can be viewed as “less parametric” as opposed to fully parametric, MLE-based approaches.

In the endogenous treatment effect models estimated here, the choice of whether to have ever used ENDS is modeled jointly with the smoking cessation decision, with the choices allowed to be correlated due to unobserved confounding factors. If the assumptions of the models are satisfied (to be discussed below), the causal effect of ENDS use on cessation can be recovered even using cross-sectional (or repeated cross-sectional) cohort data. The details of the parametric and moment-based control function models are in the appendix, but the idea behind each of the models is as follows.

The dependent variable for individual i for the selection equation, D, which is also the treatment variable in the outcome equation, is ever use of ENDS (including HTP in the 2018 data). Since individuals are observed only once, the time subscript for the KNHANES wave is suppressed in the notation. Ever use instead of current use is chosen to admit the possibility that a smoker switched to ENDS and then also ceased using ENDS. The decision ever to have used ENDS is modeled as in a regression model for binary dependent variables:

$${D}_{i}=\left\{\begin{array}{c} \begin{array}{cc}1& \text{if }{\text{z}}_{i}^{\prime}\gamma +{u}_{i}\ge 0\\ 0& \text{else}\end{array}\\ \end{array}\right.$$
(1)

where \({D}_{i}=1\) means that individual i selects into treatment. Vector zi in the selection equation is a set of variables for observed factors related to the treatment (e.g., demographic and socioeconomic variables of the individual and variables related to the price of ENDS, as those used in the descriptive regressions above), and γ is a vector of coefficients to be estimated. Unobserved factors affecting the individual’s choice of treatment are collected into the scalar error term \({u}_{i}\). The outcome, cessation from smoking, is modeled similarly:

$${Y}_{i}=\left\{\begin{array}{c} \begin{array}{cc}1& \text{if }{\alpha }_{0}+\tau {D}_{i}+{x}_{i}^{\prime}\beta +{\varepsilon }_{i}\ge 0\\ 0& \text{else}\end{array}\\ \end{array}\right.$$
(2)

where \({Y}_{i}=1\) means subject i quits smoking. The inclusion of \({D}_{i}\) in Eq. 2 makes this a triangular (or recursive) bivariate system. The linear index in Eq. 2 will be termed the simple specification, to distinguish it from the potential outcome specification. In the latter, a full set of interactions between \({D}_{i}\) and the other regressors is included to allow all β coefficients to take different values depending on treatment status.

The following is known about identification in the triangular bivariate system with binary dependent variables defined by Eqs. 1 and 2 and known joint and marginal distributions of the error terms. Without an excluded instrument (i.e., all elements of z are also in x) but with sufficient variation in the exogenous variables (i.e., full support of at least one regressor in the outcome equation), point identification of all model parameters is obtained (Freedman & Sekhon, 2010). Without enough variation in the exogenous regressors, identification may fail or at best is likely to be weak (Mourifié & Méango, 2014). Thus, contrary to common belief, an excluded instrument is not always necessary, although having one may aid identification and reduce bias (Li et al., 2019). Second, an excluded instrument, even with limited variation (e.g., binary), along with some other conditions, is sufficient for global identification of all parameters (Han & Vytlacil, 2017). Finally, if the exogenous variation is small, the bias in the estimates may be large (as in the usual weak instrument literature for linear models) (Freedman & Sekhon, 2010; Han & Vytlacil, 2017). Nevertheless, the investigation of Denzer (2019) of various IV, parametric, and semiparametric estimation methods for the triangular bivariate probit model in the presence of weak instruments led to a “strong recommendation to primarily rely on the ML estimator.” Furthermore, Li et al. (2019) found that bivariate probit can yield reasonably accurate estimates of treatment effects even when the distributional assumptions are badly violated. In particular, in addition to finding that the bivariate probit model is a “reasonably resilient empirical tool for estimating the effect of an endogenous binary regressor on a binary outcome variable,” Li et al. (2019) also show that when the parametric assumptions are incorrect, the model yields quasi-ML estimates that constitute a “conservative but rational choice for the practitioner.” For this reason, all except the moment-based estimations performed in the present study are MLE and we use instruments.

For the parametric and control function models, the error terms ui and \({\varepsilon }_{i}\) are assumed to be normal. The possibility that the error terms are not independent leads to the endogeneity of the treatment in the outcome equation; in such cases, probit estimation of Eq. 2 alone leads to bias in the estimate of τ and therefore the treatment effect. The various models estimated here differ in the assumed form of the correlation between the error terms and whether estimation is likelihood or moment based (see the appendix for the econometric details). In the parametric models, pseudo-maximum likelihood estimationFootnote 10 returns estimates of the population regression coefficients and the joint distribution of the two error terms. Evidence for nonzero correlation between the error terms constitutes evidence of selection effects, implying that simple regression estimation of the outcome equation (e.g., probit) is biased.

The dependent variable for the outcome, Eq. 2, is recent cessation. This binary variable is defined as a formerly smoking male who quit smoking within the past 8 years and maintained cessation to the time of the survey. The outcome variable thus measures past cessation and non-relapse of the most recent cessation attempt. The sample is limited to current and former male smokers who quit within the past 8 years. The time since quitting smoking is by self-report in the KNHANES. Limiting the sample to exclude those who quit smoking long ago was done because the focus of the study is how ENDS may be linked to cessation, and someone who quit smoking in, say, 1990 neither quit with the help of ENDS nor was likely to begin smoking again during the era of ENDS availability. A threshold of 8 years was chosen to roughly match the ENDS period in Korea. The first year of data on ENDS, and thus the first year of data in this estimation, is 2013; cessation for this wave of respondents has to be after 2005, or about 2 years before ENDS became available.Footnote 11 The final year of data is 2018; cessation for that wave has to be after 2010, the year before ENDS began to be commonly used.

Recent cessation is modeled in the outcome Eq. 2 as a linear function of ENDS use (Di) and the demographic and socioeconomic variables explored above with the region fixed effects (xi). In addition to all the elements of xi, the regressor vector zi also includes the tax on ENDS products (given the discussion on identification above, this variable may be viewed as an instrument but is not needed as an identifying instrument). The tax variable is a composite of e-liquid and HTP heatstick taxesFootnote 12; the two products’ taxes are combined as a weighted average based on relative market shares (see Table A-1b in the appendix). The regressors also include the interaction of the tax with Medical Aid status (a public insurance program roughly similar to Medicaid in the U.S.) and categorical household income. The interactions are included as further instruments under the notion that ENDS prices (which are affected by the taxes) may matter most for lower income consumers. In each estimation reported in Table 6 below, the p-value from the Wald test of the coefficients for the excluded instruments was below 0.015, and in the copula models, it was below 0.005, indicating the relevance of the instruments. Since the ENDS tax does not vary in the cross-section, it is not possible to use year fixed effects in these regressions, and so instead, a linear time trend is included in xi and zi. The analysis sample for the regressions includes male respondents only due to the large differences in ENDS and cigarette use by gender and low self-reported rates of ENDS usage and smoking among females. The econometric results are summarized in Table 6. The average treatment effect on the treated (ATET, defined as \(E\left({Y}_{1}-{Y}_{0}|D=1\right)\)) is the result of primary interest. The average treatment effect (ATE, or \(E\left({Y}_{1}-{Y}_{0}\right)\)) is also shown. While the ATE yields the expected improvement in cessation from using ENDS across the whole subpopulation of smokers and former smokers, the ATET is more germane because it is the average effect on cessation for those smokers who chose to use ENDS. The latter group is the one which some public health officials and researchers claim have cessation hindered by ENDS use.

Table 6 Estimated treatment effects and evidence for the endogeneity of ENDS usage from the bivariate regression models

To summarize, identification of the treatment effects in these models rests not solely on policy-driven variation in the ENDS taxes but also on variation in the other regressors and the parametric assumptions. Without the additional assumptions, little headway can be made toward estimating or bounding the treatment effect. The Manski nonparametric bounds on the ATE under the minimal assumption of no reporting error are estimated (McCarthy et al., 2015) to be [− 0.47, 0.53], so wide as to be virtually uninformative. Under the stronger assumption of monotone negative treatment selection (i.e., that individuals choosing to use ENDS are more likely to not quit smoking, conditional on treatment assignment, which is the direction of the selection bias indicated by the indirect evidence presented above), the nonparametric bounds on the ATE can be narrowed to [− 0.16, 0.53], which still do not pin down the sign, hence the parametric assumptions to point-identify the treatment effects.

Table 6 includes results from three families of models: bivariate probit, bivariate copula models for binary dependent variables, and the moment-based control function model for endogenous treatment effects with binary outcomes (details and complete regression results are in the appendix). Two main specifications are explored: a simple triangular specification in which the treatment indicator appears only by itself in the outcome regression equation, and a potential outcome specification in which all the coefficients in the outcome equation are allowed to differ depending on treatment status (estimated by interacting Di with each element of xi). The latter is more flexible but introduces a large number of extra parameters to be estimated.

The simple specifications in Table 6 include one from the probit model assuming the exogeneity of ENDS use (for comparison), one from the bivariate probit model, and four from the Frank and Clayton Archimedean family of copula models. The probit model is essentially the same estimation as in the final column of Table 3, except for the change in the ENDS variable to be ever use. The coefficients for the outcome equation, the ATET, and the ATE are the same as would be for those from a bivariate probit model with correlation parameter ρ restricted to zero. Given the assumed exogeneity of ENDS use and correlations between the latter and smoking observed above, it is unsurprising that the ATE and ATET are negative, large, and significant.

However, all the models that correct for endogeneity have positive estimates of the ATET and ATE (and most are statistically significant). For each estimation, evidence for the endogeneity of ENDS use can be gleaned from the significance of θ, the correlation-related parameter in the copula models, the Pearson correlation ρ, and Kendall’s rank correlation τ. Kendall’s τ is a measure of similarity of the orderings of two random variables that is commonly used with copulas.

Given that none of the parametric models may reflect the true data generating process, the estimates can be viewed as yielding quasi-ML estimates that minimize the Kullback–Leibler divergence from the true model (Li et al., 2019). To link this notion to model selection criteria in a manner appropriate for survey data, we compute Lumley and Scott’s (2015) design-based Akaike Information Criterion (dAIC), which is a version of the familiar AIC that is bias-corrected for complex survey samples used to estimate population parameters. Lower values of dAIC, shown in the final column of Table 6, indicate less Kullback–Leibler divergence from the unknown true model.

The first model admitting correlation and endogeneity is the bivariate probit model in the second row of Table 6. The results of the bivariate probit model show insignificant treatment effects and only weak evidence for endogeneity. However, the copula models show strong evidence of both. The Frank copula model with the simple specification estimates the ATET to be 12.1 percentage points. For the Clayton copula, the results shown are for the version of the copula with the highest pseudo-likelihood among the standard and two rotated versions. The standard version admits only positive correlation while the 90° and 270° rotated versions of the Clayton copula assume that correlation is negative.Footnote 13 The 270° rotated Clayton model (hereafter, Clayton270) is preferred since it has the highest pseudo-likelihood and the lowest dAIC among the Clayton models. It shows strong negative correlation and a significant ATET of 16.4 percentage points (the other Clayton models are not shown in Table 6 but all results are in the appendix). The tail dependence in the Clayton270 results implies that (above and beyond correlation and observables) a smoker who is highly unlikely to use e-cigs is highly likely to quit smoking. Both copula models are preferred to the bivariate probit models by dAIC.

A more parsimonious version of the simple specification, S2, was also estimated, in which some of the regressors that were generally insignificant in the regressions were dropped. The resulting estimated ATETs were significant: 10.7 from the Frank model and 16.2 percentage points from the Clayton270 model. Per the dAIC, specification S2 outperforms the corresponding Frank and Clayton270 estimations with all the regressors (S1). Of the models for the simple specifications, the Clayton270 model with specification S2 is the preferred estimation per the dAIC. Thus, the best estimate of the ATET is 16.2 percentage points, coupled with a statistically significant estimate of the endogeneity of ENDS use.

To match some of the literature on endogenous treatment effects, the potential outcome specification was also estimated for the various models. The results are in the lower half of Table 6. Specification S3 is the same as S1 but with each coefficient allowed to differ with treatment status. A reduced specification, omitting some insignificant regressors, yields specification S4. The estimated treatment effects are largely similar to those from the simpler specifications and the Clayton270 model is again the best among the fully parametric models. None of these, however, are better (per dAIC) than their counterparts with specifications S1 of S2. Adopting the potential outcome specification allows estimation of Wooldridge’s moment-based control function estimator for endogenous treatment effects with binary treatment and outcomes. This estimation (in the final row of the table), with a specification nearly identical to S3,Footnote 14 yields the highest estimate of ATET: 17.0 percentage points. Since this estimator is based on GMM instead of MLE, there is no way to compare it to the other models using the dAIC. It may thus be added to a set of preferred (i.e., non-dominated) estimates.

Note that with each model correcting for sample selection except the control function model, the ATE is much larger than the ATET. This comparison is further evidence in favor of selection bias in the use of ENDS, for it implies that if treatment were randomly assigned across the entire population of smokers, ENDS would have a larger causal effect on cessation than it does for those choosing to use ENDS under the current set of policy, attitudes, and preferences.

To summarize, the reason why the impact of using ENDS switches from being pro-smoking in the simple regressions in Table 3 to being pro-cessation here is due to the strong selection bias in the former. The correlation in the unobserved factors related to cessation and ENDS use is negative, whether measured by the parametric correlation-related coefficient, the Pearson correlation coefficient, Kendall’s τ, or the tail dependence in the Clayton copula. That is, there are latent factors of individuals that make them more likely to use ENDS and less likely to quit smoking/maintain cessation. This is exactly as predicted by the conceptual discussion of selection bias from the beginning of this section.

Discussion

The preferred estimates for the treatment effect on cessation of ever use of ENDS and HTP (ATET = 0.162 from the Clayton270 model and 17.0 from the control-function model) are sizeable but not unreasonably large. For comparison, one meta-analysis of studies on varenicline (the most effective pharmaceutical for cessation) found a treatment effect of 26 percentage points at 12 weeks after quitting for men. It is to be expected that use of ENDS has a lower estimated treatment effect, since cessation was maintained for a longer period (38 months on average) and in any event, some ENDS users have no intention to quit smoking (refer back to Table 1). The estimated treatment effects are also more modest than the 22.7% cessation rate at 24 weeks found for the ENDS-using group in the Korean RCT of Lee et al. (2019), discussed above, likely for the same two reasons.

There are some limitations to the present study. The outcome studied is cessation maintained to the time of the survey, but some of those individuals may subsequently relapse, which is not captured in the data. On the other hand, relapse to smoking before the survey is counted as “no cessation.” This issue is the same for all studies of cessation, however; see also the robustness test below that varies the length of prolongation of cessation. Perhaps more importantly, the methods employed here depend to an unknown degree on the various modeling assumptions. Due to selection issues, no results from observational, pooled cross-sectional studies can be as convincing as longitudinal studies, RCTs, or quasi-experimental methods using panel data. Given the lack of longitudinal data or sharp subnational policy differentiation and the absence of nonparametric identification in doubly binary treatment effect models, we used parametric assumptions to analyze repeated cross-sectional data to demonstrate the existence of a strong selection effect and a positive causal effect of ENDS use on cessation. To lessen the dependence of the results on any one set of assumptions, a variety of models and regression specifications were presented. Another limitation is the limited variation in the ENDS tax instrument. The relevant identifying variation in the data when the instrument is employed is the change in national ENDS taxes over time (net of the included time trend), “personalized” to each individual only through interactions of this variable with their income. One could certainly wish for a stronger instrument with greater variation in both the cross-section and time dimensions. However, as discussed above, an instrument is not strictly necessary for identification in the parametric models, and as to be discussed shortly, the results change little if the instrument is dropped altogether and a specification test fails to show evidence that the instrument is invalid.

Given their importance, the parametric assumptions were tested in three ways. Details for all of these are in the appendix. First, the specification test of the bivariate probit model of Acerenza et al. (2023) was adapted to our preferred Clayton model. The null hypothesis of correct model specification for this test can be rejected if any of the identifying assumption discussions above for the parametric models fail (correct choice of marginal distributions for the latent errors, correct choice of copula to bind the two equations into the joint density, and instrument validity [when an instrument is used]). The test rests on the conditional probabilities implied by the model, converting them to moment inequalities that can be tested with Chernozhukov et al.’s (2015) intersection bounds procedure. The test result indicated acceptance of the null hypothesis of correct model specification and instrument validity. Second, we checked whether swapping the probit marginals for logistic errors in the treatment and outcome equations affected the results in the copula model. In all combinations (probit-logit, logit-probit, and logit-logit), the ATET remained about the same as in the main estimation.Footnote 15 Third, we explored moving away from fully parametric assumptions with the seminonparametric estimator of Gallant and Nychka (1987) as adapted to the present endogenous treatment effects problem (De Luca, 2008). This estimator uses expansions in Hermite polynomials to flexibly model the joint distribution of the error terms in a manner that yields consistent estimation for a broad class of distributions. The seminonparametric estimator allows for arbitrary skewness and kurtosis to reduce the potentially high sensitivity of the estimates of the treatment effects to the marginal distributional assumptions. The estimate of the ATET produced by this model is 0.103, somewhat lower than our main estimates, yet still significant at the 1% level.Footnote 16

Several additional robustness tests were explored; results are in the appendix. First, the linear time trend in the models reported in Table 6 was replaced with a cubic time trend. The results varied little, the same model was preferred by the dAIC, and the estimated ATET was identical to the main result (0.162). Second, given that instruments may not be needed for identification in these models, the preferred model was re-estimated without instruments (and with the cubic time trend). Again, the results were highly similar, the same model was preferred, and the ATET was estimated to be nearly identical at 0.163. Third, we varied the definition of cessation to require prolongation of abstinence for various lengths (three to 24 months). The ATET for the preferred model remained highly statistically significant, although its magnitude naturally fell as the length of abstinence required for cessation increased (from 0.125 at three months to 0.044 at 24 months). Fourth, we varied the window for recent cessation. Whether cessation was restricted to onset in the past three years, the baseline case of eight years, 10 years, or other years in between, the ATET from the preferred model was still significant at the 1% level, with estimates varying between 0.130 and 0.172. Fifth, we limited the sample to various subsets of years due to the fact that the impact of heated tobacco products and e-cigarettes may not be the same on cessation. Whether we excluded 2018 (the first year that questions regarding use of heated tobacco were added to the survey) or 2017–2018 (the years in which heated tobacco products were available in Korea), the estimated ATET from the preferred Clayton model and Wooldridge’s model changed little. The same was true when using only the data from 2017 to 2018.

Despite the potential limitations of the study, the concordance of the results across models and specifications leads us to draw three general conclusions. First, there is evidence that the same men in Korea who are least likely to quit smoking are those taking up ENDS. This is demonstrated by the consistent finding of negative correlation between the treatment and outcome equations. Second, correcting for the selection bias in using ENDS reveals that they may aid cessation, as shown by the positive and general significant estimates of the ATET and the ATE. Finally, there is therefore potentially severe selection bias in any observational study of the association between ENDS use and smoking.

If ENDS can indeed be an effective aid to cessation, then the Korean government’s discouragement of their use by smokers creates a massive lost opportunity to reduce smoking and improve public health at a time when smoking’s decline has slowed. The implications apply to many other countries as well. Public health officials in most countries take their lead from the World Health Organization (WHO) through the Framework Convention on Tobacco Control (FCTC), which stops short of banning ENDS but encourages parties to “consider prohibiting or otherwise regulating them” (World Health Organization (WHO) 2019b). However, while ENDS are not risk-free, switching completely from cigarettes to e-cigarettes substantially reduces levels of measured carcinogens and toxins in the body (National Academies of Science, Engineering and Medicine 2018; Shahab et al. 2017). Public Health England (2015) encouraged smokers to switch to e-cigarettes with a statement that, based on the evidence, the latter are about 95% safer than cigarettes. Partially due to that reason, there is no excise tax on e-cigarettes in the UK (Hampsher 2020).

Negative governmental attitudes toward ENDS also run the danger of creating a self-fulfilling prophecy. Research shows that the reasons for the use of ENDS greatly affect whether they aid cessation. Smokers who use e-cigarettes for smoking cessation have higher rates of quitting cigarettes in the future, but ENDS users with other reasons can have lower cessation rates than those who do not use ENDS at all (Vickerman et al., 2017). Thus, if public misperception about ENDS causes vapers to use e-cigarettes mainly for reasons other than cessation, then ENDS will in fact be less helpful for cessation. The Korean response to ENDS can be contrasted with Japan’s experience, where ENDS in general (and HTP tobacco in particular) appear to be reducing consumption of cigarettes and perhaps furthering cessation in recent years (Hahn et al. 2020; Stoklosa et al. 2019).