1 Introduction

In many countries, health insurance systems have been designed primarily to protect paid employees, with the implicit assumption that the self-employed are in a position to take care of themselves. Health insurance systems that distinguish between the two modes of work create incentives or disincentives to forego paid employment and become self-employed. Health-care institutions may thus induce an inefficient misallocation of workers away from their most productive use. An emerging body of literature finds that employer-provided health insurance in the USA results in an entrepreneurship lock effect, meaning that workers insured through their employers are reluctant to enter into self-employment because they do not want to lose coverage (e.g., Holtz-Eakin et al. 1996; Fairlie et al. 2011). However, almost nothing is known about the effects of public health insurance systems on entrepreneurial activity. Closing this gap in the literature is an important task. Health-care systems are frequently overhauled in many countries, not least because of aging populations. The Patient Protection and Affordable Care Act (PPACA) is a well-known example of a far-reaching health-care reform that introduced elements of a public health insurance system in the USA.

In this paper, we address the research question if and how much health insurance costs in self-employment in comparison to paid employment affect the decision to switch from paid employment to self-employment within a public health insurance system. In modern knowledge-based economies, entrepreneurial activity, especially opportunity entrepreneurship, is related to innovation, growth, and employment creation (Carree and Thurik 2003; Acs and Audretsch 2005; Van Praag and Versloot 2007). We focus on transitions from paid employment to self-employment because these founders are opportunity entrepreneurs in the sense that they generally have an alternative employment option but see better business opportunities in self-employment (Fossen and Büttner 2013; Fairlie and Fossen 2016).

The German health-care system is particularly well suited to study such effects, because it creates significant variation in incentives for self-employment across workers that changed frequently over time and can be exploited for identification. In the German hybrid system, public and private health insurance co-exist in a highly regulated legal framework. Similar health-care systems can be found across continental Western Europe, Asia, Latin America, and in many of the formerly communist countries in Central and Eastern Europe (Gerlinger and Schmucker 2009).Footnote 1

In Germany, public health insurance is obligatory for most paid employees, while the self-employed can generally choose between private health insurance and voluntary membership in the public health insurance system after leaving paid employment. While public health insurance contributions are a percentage of gross labor income, premiums for private health insurance depend on age and a health assessment at entry, and until 2013 also on gender. Contribution rates into public health insurance varied over providers until 2008. Importantly, membership in public health insurance allows a spouse with income below a certain threshold and children to be covered for free, while private health insurance must be purchased individually. Therefore, when an individual considers switching from paid employment to self-employment, the differential in health insurance costs between the two alternatives depends on the family situation and the age, gender, and health status of all family members. Legal changes in the tax deductibility of health insurance contributions in 2005 and 2010 provide additional variation.

We use the Socio-Economic Panel (SOEP), a representative household panel survey for Germany, which provides information on employment transitions and the determinants of self-employment known from the literature as well as health insurance coverage and the health situation of the household members. This allows us to simulate the individual health insurance cost differential between self-employment and paid employment. Carefully controlling other potentially relevant factors such as income, age, gender, and health of the family members, we estimate the effect of the cost differential on the probability of entry into self-employment for paid employees. A discrete time hazard rate model avoids survivorship bias in the sample of paid employees.

The results indicate that an increase in the health insurance cost differential between self-employment and paid employment by €10 per month decreases the probability of entry into self-employment by 0.015 percentage points, i.e., about 1.7% of the average annual entry rate. This implies an elasticity of −0.47 and shows that entrepreneurial activity is sensitive to changes in the health insurance system that alter the cost differential. Further investigation shows that men respond more strongly to these incentives than women. The results are robust with respect to specification choices and assumptions regarding the health insurance cost differential. Our conclusions are also supported when we estimate reduced form models with the determinants of the health insurance cost differential as explanatory variables instead of the cost differential itself.

Furthermore, for the first time in the literature, our results show that individual poor health has a direct negative effect on the probability of entry into self-employment even after controlling for the health insurance cost differential, which is larger for less healthy persons. In contrast, poor health of a spouse exclusively decreases the entry probability through higher health insurance costs for the family. Parker and Rougier (2007) and Rietveld et al. (2015) report that persons with poor health are less likely to become self-employed but cannot distinguish between direct health effects and the effect of higher health insurance costs in self-employment for less healthy persons.

In the following section, we provide a literature overview and point out our contributions. Section 3 describes Germany’s hybrid system of public and private health insurance. Sect. 4 describes the empirical methodology and Sect. 5 the household panel data. The empirical results are provided in Sect. 6, and Sect. 7 concludes the analysis.

2 Review of the literature

The emerging literature on entrepreneurship lock has its roots in the job lock literature.Footnote 2 This literature describes the barrier that employer-provided health insurance potentially poses on job mobility in the USA, since employees may fear to lose coverage for pre-existing conditions if they switch their employer (e.g., Madrian 1994; Gilleskie and Lutz 2002). Verheul et al. (2002) note that health insurance systems increase the costs of entrepreneurship if the transition to self-employment implies that the current health insurance coverage must be given up. Holtz-Eakin et al. (1996) pioneer the empirical analysis of entrepreneurship lock in the USA using the Survey of Program Participation and the Panel Study of Income Dynamics. They investigate if the negative influence of employer-provided health insurance on the probability of transition from paid employment to self-employment decreases if the switching person can be covered under the spouse’s health policy. The authors cannot draw clear conclusions because of large standard errors, probably due to limited variation in their explanatory variables.

Using a similar strategy and Current Population Survey (CPS) data, Wellington (2001) finds a significant positive effect of having health insurance through one’s partner on the probability of being self-employed. Also based on the cross-sectional CPS, Lombard (2001) similarly reports that married women are more likely to be self-employed if their husbands have health insurance. Estimating probability models of being self-employed implies potential issues of survival bias and reverse causality. We improve on this by estimating hazard rate models of the probability of entry into self-employment based on panel data, where the explanatory variables are observed before a transition is made.

Zissimopoulos and Karoly (2007) focus on self-employment at older ages using the Health and Retirement Study, include indicators of own and spousal health insurance coverage in their empirical transition model, and conclude that their results are only partially reconcilable with job lock. They note that employers who provide health insurance in the USA may also tend to offer quality jobs, which may confound the analysis, as also emphasized by Gilleskie and Lutz (2002). In our German setting, all paid employees with labor income below a fixed threshold are obliged to be covered under statutory public health insurance, so the issue does not arise.

Fairlie et al. (2011) link consecutive CPS surveys to create a short 2-year panel, which allows them to study business creation. Their approach exploits a discontinuity at age 65, when coverage under Medicare begins in the USA. This should make the loss of employer-provided health insurance less relevant and therefore facilitate entry into self-employment. The results support the view that employer-provided health insurance in the USA creates entrepreneurship lock.

None of these articles calculate health insurance costs in the alternatives of paid employment or self-employment. Therefore, the quantitative results of each paper only apply in their immediate institutional contexts. Since we estimate a marginal effect of the individual health insurance cost differential on the probability of entry into self-employment, our estimated effect size can be interpreted more generally.

Four articles analyze the effect of the 1986 Tax Reform Act (TRA86) and its amendments in the USA on self-employment and transition probabilities. This policy reform allowed entrepreneurs to deduct increasing portions of their health insurance premiums from their taxable income. Heim and Lurie (2010) and Gurley-Calvez (2011) use a panel of tax return data, while Velamuri (2012) and Gumus and Regan (2015) employ CPS data. The results from the four articles suggest that a lower after-tax price of health insurance in self-employment increases self-employment and entry rates and decreases the exit rate.Footnote 3

The existing literature is limited to the health insurance system in the USA, which is dominated by employer-provided health insurance. The literature lacks microeconometric investigations of the effects of public health insurance systems on the probability of entry into self-employment. Most countries in the world implement some form of public health insurance, and this is also relevant for the USA since the introduction of the PPACA. Jackson (2010) and Heim and Lurie (2014) analyze effects of Massachusetts’ health insurance mandate policy implemented in 2007. Jackson (2010) compares the likelihood of new firm formation in Massachusetts versus neighboring New Hampshire in the border region, and Heim and Lurie (2014) conduct a difference-in-difference analysis contrasting self-employment in Massachusetts after the reform to other states and before the reform. The authors find negative effects on firm formation and self-employment, respectively, and argue that this could be due to the higher financial burdens for entrepreneurs brought by the policy. In contrast to our analysis, these papers do not quantify the cost increase, which makes it difficult to relate the estimated effect sizes to other contexts.Footnote 4

Like the literature reviewed in this section, this paper focuses on health insurance. Taxes and other components of the social insurance system such as public pension insurance may also affect self-employment. In our empirical estimations, we account for any such effects by including control variables, especially flexible functions of income.Footnote 5

3 Germany’s hybrid health insurance system

Germany’s health-care system is characterized by the co-existence of public, nonprofit statutory health insurance (SHI), and for-profit private health insurance (PHI). Almost 90% of the population is covered under SHI (Federal Ministry of Health 2013), which is regulated by federal law, primarily the Code of Social Law V. Healthcare is universal in the sense that since 2009, health insurance (public or private) is obligatory for all inhabitants of Germany, and even before, only a very small fraction of the population was not covered under any health insurance.Footnote 6 The health insurance system involves various discontinuities and changes over time. These create variation in the health insurance costs, which we simulate in detail, and help to identify the effects on entry into self-employment. We describe the most important institutions for Germany relevant during the period of analysis (2000–2011).

Paid employees are compulsorily insured under SHI if their gross pay does not exceed the obligatory insurance limit (Versicherungspflichtgrenze). This limit is adjusted annually and increased from €3298 per month in 2000 to €4125 in 2011. Employees with higher salary can opt out of the SHI system and choose PHI instead. SHI contributions are not risk-related. Instead, they are a certain percentage of the insured person’s gross salary, but only up to the contribution assessment ceiling (Beitragsbemessungsgrenze). This ceiling is adjusted annually and was increased from €3298 per month in 2000 to €3713 in 2011.Footnote 7 Within the SHI system, there are several nonprofit SHI funds to choose from. Until 2008, contribution rates differed between the SHI funds. Since 2009, there is a unique contribution rate, but SHI funds can charge an extra premium if the contributions are insufficient.Footnote 8 , Footnote 9

The spouse of a married, paying SHI member is covered under so-called family insurance in the SHI for free as long as his or her monthly income is below the threshold of marginal employment. This threshold was €325 until 2002 and €400 since 2003. Children of a SHI member are covered under contribution-free family insurance if they are (i) below 19 years of age, (ii) below 24 years of age and with labor income not exceeding the marginal employment threshold, or (iii) below 26 years of age and in education.

Specific regulations apply for marginal employment. Employees earning less than the threshold of marginal employment are covered under SHI but do not need to pay contributions. The employer paid a contribution rate of 10% until 2002, 11% until 2007, and 13% since 2007; this is only relevant for our analysis in a specification where we assume that employers are able to shift the burden onto their employees. Since 2003, employee’s contributions are gradually faded in within the zone of monthly income between €400 and €800.

In contrast to the SHI, premiums for PHI policies are based on risk. PHI premiums are determined by age and an initial health assessment. In the period of our analysis, women had to pay higher premiums than men; health insurance policies sold since December 21, 2013 may no longer discriminate by gender. In contrast to family insurance in the SHI system, PHI policies have to be bought for each family member.Footnote 10 Grunow and Nuscheler (2014) report that healthier individuals in Germany tend to self-select into PHI due to the risk-rated premiums. This is consistent with self-selection of healthier persons into self-employment, which allows choosing PHI independently of income (see below).

Contributions to SHI are split between employers and employees. Until July 2005, each side paid 50% of the contributions; since then, employees pay 0.9 percentage points of the SHI contribution rate alone.Footnote 11 Correspondingly, for high-income employees with PHI, the employer pays half the PHI premium insofar as this amount does not exceed the employer’s maximum contribution to SHI, which is determined by the contribution assessment ceiling. The statutory incidence of the contributions is not necessarily equal to the economic incidence. Employers may be able to shift some of their statutory burden to employees by paying lower wages. However, the statutory split is very salient. Employees see their contribution to SHI as a payroll deduction on their paycheck, or they directly pay their share of the PHI premium, but they do not see their employer’s contributions. Chetty et al. (2009) document the behavioral importance of salience of taxes. Therefore, in the main analysis, we assume equality of statutory pay burden and economic burden, i.e., the burden is roughly split by half. In a robustness check, we explore the assumption of a full shift of the burden onto the employees.

Self-employed persons are not obliged to be insured in the SHI system.Footnote 12 They can buy PHI, or they can choose to be voluntary members in the SHI system if they were SHI members for at least 12 months immediately prior to becoming self-employed, or if they were SHI members for at least 24 months within the last 5 years. Thus, those who switch from paid employment to self-employment are usually eligible for voluntary SHI membership, except for high-income employees with PHI. The self-employed pay the full SHI contributions or PHI premiums, i.e., both the employer’s and the employee’s shares. If they are covered under voluntary SHI, they have to report their income to their SHI fund; otherwise, the contribution assessment ceiling is applied. To prevent income underreporting, there is a lower limit of income applicable to compute the SHI contributions of the self-employed.Footnote 13

Sickness benefits, which partially replace income from the seventh week of sick leave on, are covered under SHI and usually also under PHI. This also applies to the self-employed who choose PHI coverage or voluntary SHI with the full contribution rate. A difference between paid employees and self-employed persons is that employers are obliged to continue paying wages during the first 6 weeks of a worker’s sick leave; for the self-employed, private insurance is available instead. We explore this issue in a robustness check (Sect. 6.2).

Tax deductibility creates additional variation in the costs of health insurance in paid employment and self-employment across individuals and time. Health insurance expenses—SHI contributions as well as PHI premiums—can partly be deducted from the personal income tax (PIT) base. The rules changed various times during our observation period, and these changes affected individual taxpayers in different ways due to the nonlinearity of the tax schedule and the various thresholds involved. We describe the specifics in Appendix B.

4 Empirical methodology and data

4.1 Hazard rate model of entry into self-employment

We are interested in transitions from paid employment to self-employment. In a random utility model, a person i in paid employment in period t chooses to switch to self-employment in the following period if utility in self-employment (se) exceeds utility in paid employment (pe). Utility U itj in alternative {se, pe} is assumed to be a function of the health insurance costs for the household in this alternative, HIC itj , observed characteristics x it , and an error term ε itj that includes any relevant unobservables:

$$ {U}_{it j}\left({HIC}_{it j},{x}_{it}\right)\kern0.5em =\kern0.5em \alpha\ {HIC}_{it j}+{\beta}_j^{\prime }\ {x}_{it}+{\varepsilon}_{it j} $$
(1)

The parameter α of the health insurance costs is expected to be negative and assumed to be the same in paid employment and self-employment, because €1 disposable for consumption should yield the same utility in both employment states.Footnote 14 The parameters β j of the characteristics x it may vary across alternatives, because these characteristics may shift tastes for one or the other alternative.

The probability of entry into self-employment conditional on the HIC itj and x it equals the probability that utility from self-employment U it,se is higher than utility from paid employment U it,pe . Let entry it denote a binary variable that equals one if a person switches from paid employment to self-employment between observation times t and t + 1, and zero otherwise. With the standard assumption of type I extreme value disturbed error terms ε itj (McFadden 1974),Footnote 15 we obtain

$$ \begin{array}{l}\mathrm{Prob}\left({\mathrm{entry}}_{it}=1\Big|{HIC}_{it, se},{HIC}_{it, pe},{x}_{it}\operatorname{}\right)=\\ {}\mathrm{Prob}\left({U}_{it, se}\left({HIC}_{it, se},{x}_{it}\right)>{U}_{it, pe}\right({HIC}_{it, pe},{x}_{it}\left)\right)=\\ {}\Lambda \left(\alpha \left({HIC}_{it, se}-{HIC}_{it, pe}\right)+\right({\beta}_{se}-{\beta}_{pe}\left){}^{\prime }{x}_{it}\right)=\Lambda \left(\alpha\ {\mathrm{HICD}}_{it}+{\beta}^{\prime }{x}_{it}\right)\end{array} $$
(2)

where the health insurance cost differential HICD it  = HIC it,se  − HIC it,pe , β = β se  − β pe , and Λ is the cumulative logistic distribution function. All the explanatory variables in x it as well as the HICD it are measured before potential entries into self-employment, which alleviates potential endogeneity concerns.

The probability of switching to self-employment may change with tenure in paid employment. It may decrease due to habituation, or it may increase due to experience and networks. To account for such potential effects, we include 12 dummy variables indicating the duration of the current spell in paid employment in x it . By conditioning on this very flexible specification of the baseline hazard, the logit model of the transition probability in Eq. (2), estimated on the data in person-period format, can equivalently be written as a general survival model (cf. Jenkins 1995; Sueyoshi 1995; Caliendo et al. 2010; García-Gómez et al. 2010). We use annual data because the covariates are not available at a higher frequency. By applying the discrete time hazard rate model, we take into account state dependence and avoid survivorship bias. The model consistently accounts for both, right-censored spells and left-censored spells, because retrospective employment history information in our data allow us to recover the correct spell duration even in case employment already started before the first survey interview of a person. In a robustness check, we also control for nonrandom selection into the sample of paid employees (see Sect. 6.2).

4.2 Simulation of the health insurance cost differential

The explanatory variable of central interest in this analysis is the differential in expected costs for health insurance between the counter-factual alternative of self-employment and the current situation in paid employment. The relevant cost differential takes into account the deductibility of health insurance costs from the personal income tax base.

We define the health insurance cost differential (HICD) as the minimum total increase in SHI contributions and/or PHI premiums (including long-term care insurance) after taxes for the total household per month if the paid employee under consideration switches to self-employment. It may be positive or negative, depending on the individual and household situation.Footnote 16

The health insurance costs in the current situation are calculated from the information about the health insurance situation of the household members contained in our data, including the membership in one of the SHI funds (listed in Table A 1 in Appendix A) with the corresponding contribution rates. For the counter-factual situation of self-employment of one of the household members, we simulate the total household health insurance costs for the person’s two options—if available—voluntary SHI membershipFootnote 17 or PHI. To predict PHI premiums, we first run a regression of observed private health insurance premiums of actually self-employed persons on characteristics determining PHI premiums. These include gender, age, and health when signing the PHI contract, co-payments, duration of the contract, and eligibility for government aid for civil servants and their families. Then, we use the estimated coefficients to predict individual PHI premiums. This way, individual health shocks enter the predicted HICD and provide exogenous variation. Appendix C describes the estimation of PHI premiums in detail, and Sect. 6.2 covers a robustness check with respect to the underlying estimation sample.

The income tax deduction is simulated using an adaption of the tax-benefit microsimulation model “STSM” for Germany (Steiner et al. 2012). The simulation accounts for the details of the progressive personal income tax, such as joint taxation of married couples and tax benefits for families with children, and incorporates changes in the tax code over time. These include two reforms of the tax deduction of health insurance expenses and changes in the tax schedule (see Appendix B). This is the second source of exogenous variation.

To calculate the HICD, we assume that the person considering to switch to self-employment expects to earn the same amount in self-employment as he or she currently earns in paid employment. This allows us to focus on the effect of health insurance costs without mixing it with the effect of income expectations that may differ between paid employment and self-employment for other reasons. The rich set of independent variables we include, particularly education and industry, and the flexible functions of current income and age control for differences in expectations.

For the calculation of the HICD, we assume that the person entering self-employment chooses the alternative of voluntary SHI or PHI that is cheaper for the household. High-income paid employees who are already insured under PHI cannot choose to switch back to voluntary SHI, so for them, we calculate the HICD acknowledging that they have to stay in PHI.

PHI policies usually offer somewhat more and better services covered than SHI funds, and physicians tend to treat PHI patients preferentially, because they are often able to charge higher prices to PHI providers. Per se, this may put into question that consumers compare the two options on a cost basis only. We adopt three strategies to account for this.

First, we control for the possession of supplementary PHI policies that any SHI members can buy to supplement their services covered. Most prominently, these additional services typically include eligibility for a hospital room with no more than two beds and treatment by the head physician in a hospital, which full PHI policies usually also cover. This control variable should capture the individual preference for premium-quality healthcare.Footnote 18 Second, in a robustness check, we recalculate the HICD by adding the average cost of additional private health insurance policies to the contributions of all SHI members in order to compare costs between SHI and PHI plans with more similar services covered. Third, in a radical robustness check, we assume that when contemplating entry into self-employment, everybody only considers PHI because of the better services covered and nobody chooses voluntary SHI, and recalculate the HICD accordingly. We report the results of these robustness checks in Sect. 6.2.Footnote 19

4.3 Control variables

The current wage and salary income as an employee may have an effect on the probability of transition to self-employment and is also correlated with the HICD because SHI contributions are dependent on income. Therefore, it is important to control for income effects through a flexible functional form; we consider higher order polynomials and spline functions. Similarly, health, age and its square, gender, and the number of children are important control variables because they may have direct effects on the entry probability and are correlated with the HICD through estimated PHI premiums.Footnote 20 With similar reasoning, we also include the marital status and the age, health, and income of a spouse, if present. As additional control variables, we include determinants of entrepreneurship known from the literature (e.g., Blanchflower 2000; Parker 2009): education, prior work experience and prior unemployment experience, non-German nationality, intergenerational background (a dummy variable indicating whether the father was self-employed when the respondent was 15 years old), and risk tolerance (the self-reported willingness to take risks on a scale from 0 to 10).Footnote 21 We also include 11 industry and 4 regional dummies, as well as a full set of year dummies to control for the business cycle.Footnote 22

4.4 Identification of the effect of the health insurance cost differential

In our econometric estimation, we identify the causal effect of the HICD on the probability of entry into self-employment by exploiting variation over individuals that is exogenous after controlling for possibly nonlinear effects of income, own and spousal health and age, gender, marital status, the number of children in the household, and the other control variables. The remaining partial variation in the HICD stems from (i) the discontinuities created by the various discrete thresholds relevant for the health insurance system as well as their changes over time as described in Sect. 3, along with the variation in contribution rates over SHI funds and time; (ii) the discontinuities in the personal income tax system that affect the amount saved through the tax deduction of the health costs, and the changes in these tax rules over time, as described in Appendix B; and (iii) the effects of specifics in each family’s situation, e.g., own and spousal health and the coverage of children under free family insurance, on counter-factual health insurance costs in self-employment through the impact of these specifics on PHI premiums. Thus, for the identification of the causal effect of the HICD, we use the variation created by the details of the health insurance legislation and tax code and the changes of these institutions over time, which are exogenous to the individual, as well as the health insurance cost consequences of own and spousal health shocks, which can reasonably be assumed to be exogenous as well after controlling for income and education.

5 Data

5.1 Representative household panel data with health information

The data requirements for this analysis are fulfilled by the German Socio-Economic Panel (SOEP), a representative annual household survey collecting detailed information about the socio-economic situation of private households in Germany (cf. Wagner et al. 2007). We use the waves 2000, when the SOEP was significantly enlarged, through 2012.Footnote 23 During this time, the SOEP covered about 20,000 persons in 11,000 households per year. The data allow us to observe employment transitions and provide all details necessary to simulate the HICD, including the tax deduction, as well as a rich set of control variables.

We require health information of the household members for an adequate prediction of counter-factual PHI premiums in case of self-employment and as control variables in the transition equation. In our main specifications, we use a self-assessed health measure provided annually in the SOEP. Respondents are asked how they would describe their current state of health on a five-point scale (1 = very good, 2 = good, 3 = satisfactory, 4 = poor, 5 = very poor). Such subjective measures are commonly used in the literature (e.g., Benitez-Silva and Ni 2008; Haan and Myck 2009) and are generally viewed as good overall reflections of individual physical as well as mental health, at least if the analysis is confined to a single country. Kalwij and Vermeulen (2008) report that objective health measures add little on top of self-reported health information in an analysis of employment patterns in Germany. Nevertheless, in a robustness check, we alternatively use a more objective measure, the official degree of disability. This is a percentage asserted by a public medical officer and printed in an official document that grants access to certain benefits. The advantage of the subjective measure is that smaller and temporary differences in a generally good medical condition are captured, which would not lead to the issuance of a disability certificate.

The classification of individuals as self-employed is based on a survey question about the occupational status of each respondent.Footnote 24 If respondents are employed or self-employed in more than one position, they are asked to report their status in their primary activity. The sample is restricted to working-age individuals between 19 and 59 years of age to avoid distortions through early retirement decisions.Footnote 25 It excludes civil servants, farmers, pensioners, and those currently in education, vocational training, or military service; occupational choices in these groups are likely to follow different patterns.Footnote 26 We also exclude family members working for a self-employed relative from the sample, because these individuals are not entrepreneurs in the sense of running their own business.

5.2 Descriptive statistics by employment state

Table 1 shows the means of relevant variables from the survey by employment state. We divide the paid employees further into those who do and those who do not enter into self-employment between the year of observation and the subsequent year and report the difference between these two means and its significance level. The paid employees form the estimation sample at risk of entry into self-employment in the hazard rate model; the others are only included the robustness check controlling for selection into being a paid employee. Table A 2 in Appendix A provides the definitions of the variables.

Table 1 Descriptive statistics by employment status

Almost 90% of the paid employees who do not enter into self-employment within the following year are paying members in SHI, 4.3% are in marginal employment and covered under free family insurance through the spouse’s SHI, and 6.5% have PHI. Among those making the transition, 22.6% are already covered under PHI as a paid employee, reflecting the higher mean before-tax labor income of this group (real €3054 versus €2471 per month). The income difference seems to be partly explained by the higher education of those about to venture out on their own: 41% of them have a university degree, but only 23% of the other paid employees. The soon-to-be entrepreneurs as well as their spouses are healthier when looking at the self-assessed measure, but there are no significant differences with regard to the official disability degree. The other variables confirm the known picture. Those who enter into self-employment are more often male, have had a self-employed father, and are more willing to take risks than the other paid employees. They are more often active in the business services and construction industries and less often in manufacturing and “public and personal services” (the latter category includes employed teachers, for example).

Table 2 shows means of simulated variables needed to calculate the HICD for the estimation sample at risk of making the transition from paid to self-employment. We discuss these variables going from top to bottom. The mean current health insurance costs are €188 per month for employees who stay employees and €9 more for those who switch to self-employment in the following year, again reflecting the higher average income of the latter group. In the counter-factual case of self-employment, the persons in the sample would pay an average premium of €363–365 per month if they chose PHI. This variable is predicted using the estimated PHI premium equation and includes premiums for a spouse and children, if these are present and not otherwise covered, as explained in Appendix C. If the person hypothetically entering into self-employment chose voluntary SHI membership, he or she would pay about €16–22 more on average. For 53% of the paid employees who do not enter into self-employment, a PHI policy would be the cheaper choice for the household in case of self-employment; this share is 59% for those who actually enter.

Table 2 Mean simulated health insurance costs

The health insurance costs reported so far are before tax. Since these expenses can partly be deducted from the personal income tax base, the higher costs in case of self-employment lead to tax savings of €21–26 on average per month, both with the PHI and with the SHI. Taking into account the tax savings, each person chooses the lower cost option for the household, PHI or SHI, in case of self-employment. This leads to the hypothetical minimum health insurance costs in case of self-employment after taxes of €277–280. The mean is lower than the mean of PHI or SHI costs because of the individual choice of the cheaper option. Finally, we subtract the actual current health insurance costs for the household from the hypothetical costs in case of self-employment, both after taxes, and arrive at the average nominal HICD. It is €92 per month for paid employees who do not enter into self-employment and €80 per month for those who enter. The difference is significant and may indicate that a higher HICD deters persons from entry into self-employment. The next section explores this much more rigorously using the econometric model which controls for important covariates and state dependence. For the estimation, we deflate the HICD amount using the consumer price index and divide by 100.Footnote 27

Figure 1 depicts the distribution of the real HICD in a histogram for all paid employees in the estimation sample. Clearly, the HICD is positive for the most part of the distribution, i.e., health insurance costs for the household increase when a member switches to self-employment because of the loss of the employer’s contributions. Insofar, the German health insurance system may constitute a barrier against entry into self-employment. However, there is also an important negative part of the HICD distribution. For example, for a young, healthy and unmarried male, the full premium to PHI in case of self-employment can be cheaper than the employee’s share to SHI contributions. Thus, the health insurance system may provide incentives to enter self-employment for persons whose income is not high enough to opt out of the SHI as a paid employee. For them, self-employment lifts the barrier to PHI.

Fig. 1
figure 1

Histogram of the simulated health insurance cost differential. The histogram shows the distribution of the simulated differential in the monthly health insurance costs between the alternatives of self-employment and paid employment in real euro of 2005. Here, the incidence assumption is that the burden of the health insurance costs is shared between employers and employees according to the statutory division, i.e., roughly by half. Source: Authors’ calculations based on SOEPv29, 2000–2012

The histogram further shows that there is a bunching at zero, which indicates that for almost 15% of the individuals in the sample, health insurance costs for the household would not change at all if they switched to self-employment. For example, this is the case for persons with income below the marginal employment threshold who are insured under free family insurance through their spouse’s SHI membership, because they can keep their family insurance in self-employment. Another bunching occurs at the right side of the distribution with a high HICD. Again, marginal employment plays a role here, because a marginally employed person who cannot be covered under family insurance through a spouse or parent is covered under SHI with zero employee’s contributions, but not as a low-income self-employed person. In the latter situation, the person has to pay the full income-independent PHI premium, or as a voluntary SHI member, the lower limit of income for the assessment of SHI contributions for the self-employed applies. In both cases, costs increase drastically in comparison to free SHI coverage, and the health insurance system constitutes a significant barrier against self-employment.

6 Econometric results

6.1 Main results: the effect of the health insurance cost differential

Table 3 provides the main results from estimating the hazard rate model of transition from paid employment into self-employment in Eq. (2). The table shows logit coefficients and the average marginal effect of the key explanatory variable, the health insurance cost differential (HICD) between counter-factual self-employment and paid employment. The standard errors are robust to heteroscedasticity and clustering at the individual level across yearsFootnote 28; those of the marginal effects are obtained using the delta method.

Table 3 Self-employment entry probability conditional on health insurance cost differential

Column (1) provides the main specification for the full sample. The HICD has a negative effect on the probability of entry into self-employment, which is significant at the 1% level. As the HICD is coded in units of €100, increasing the additional costs in case of self-employment by €10 per month decreases the entry probability by 0.015 percentage points on average. This is 1.7% of the annual transition rate from paid employment to self-employment of 0.9%, as indicated at the bottom of the table. The result shows that the decision of paid employees to switch to self-employment is significantly influenced by the additional costs (or cost savings) for health insurance for the household.

Column (2) shows that the effect of health insurance costs is even stronger for men, where an increase in the HICD by €10 per month increases the transition probability by 1.9% of the entry rate. For women, the effect has the same sign but is smaller and insignificant (column 3). Nonpecuniary aspects of self-employment like flexibility of time and location of work may be more important for women (Georgellis and Wall 2005).

To compare the effect size of the health insurance costs with other determinants of entrepreneurship that are well-known from the literature, we calculate average marginal effects of some of our control variables as well (see Table A 3 in Appendix A). Using the full sample, we find that an increase of health insurance costs in entrepreneurship by €100 per month decreases the probability of switching to self-employment by about as much as a decrease in the willingness to take risks by one point on the 11-point scale. The standard deviation of the HICD is €120 and that of the willingness to take risks about 2 points. Thus, a one standard deviation change in health insurance costs has more than half the effect of a one standard deviation change in risk tolerance, which is known to be an important determinant of entrepreneurship (Caliendo et al. 2009; Ahn 2010; Skriabikova et al. 2014). Moreover, the effect of an increase in the HICD by €330 per month has the same effect as not having a self-employed father, so an increase in the HICD by one standard deviation has more than a third of the effect size of an intergenerational link, which is known to be highly relevant (Dunn and Holtz-Eakin 2000).

The elasticity of the probability of entry into self-employment with respect to the health insurance costs in self-employment implied by our estimate is about −0.47 for the whole sample and −0.65 for men (keeping the health insurance costs in paid employment constant). We compare this to the elasticities implied by the literature that analyses the effects of the TRA86 tax reform in the USA. Gumus and Regan (2015) only provide estimates for men, and their point estimate for all men implies an elasticity of −0.56, which is similar to ours. However, the coefficient estimated by Gumus and Regan (2015) is not significantly different from zero due to a large standard error. The results of Heim and Lurie (2010) imply a larger point estimate of the elasticity of the probability of starting to earn some income from self-employment of −2.93.Footnote 29 They acknowledge that their estimated effects are “somewhat large” (p. 1006) and explain this by the strong increase in the self-employment rate in the USA during the time period under their analysis.

Persons with poorer health are less likely to enter into self-employment, even after controlling for the HICD, which accounts for the higher costs of PHI implied by the poorer health. In the main estimation based on the full sample, a deterioration of health by one point on the five-point scale of self-assessed general health decreases the entry probability by 0.11 percentage points (average marginal effect in Table A 3 in Appendix A), i.e., 12% of the annual transition rate; the standard deviation of the health measure among all employees is 0.82.Footnote 30 Similarly, Parker and Rougier (2007) as well as Rietveld et al. (2015) report that older workers with poor health are less likely to move into self-employment in Britain and the USA, respectively. Self-employment is associated with longer work hours than paid employment and considerable strains (Lewin-Epstein and Yuchtman-Yaar 1991), which may deter less healthy persons from this choice, in addition to the implications for health insurance costs discussed. Rietveld et al. (2015) argue that higher costs of health insurance may explain why less healthy individuals less often become self-employed, but empirically, they cannot distinguish this channel from direct health effects. Thus, our separation between the direct health effect and the effect through health insurance costs is a novel contribution to the literature.

The partial effect of poor spousal health is not significant when controlling for the HICD in our estimations. In Sect. 6.3, we show that omission of the HICD leads to a significant negative coefficient of poor spousal health. Thus, poor spousal health affects the entry decision, but only indirectly through the higher PHI costs implied.

6.2 Robustness checks

In this section, we assess the robustness of the estimated effect of the HICD with respect to specification choices and assumptions taken.Footnote 31

Income splines

As mentioned before, it is important to control for possibly nonlinear effects of wage income of paid employees. In column (4) of Table 3, we replace the fourth-order polynomial of income of the main specification by a spline function with five splines of equal width within the interval from zero to €20,000 per month, where the highest spline also includes observations with higher income. We observe that the coefficient and marginal effect of the HICD remains almost unchanged, indicating that the estimates are not sensitive to the functional form with regard to income.

Health measure

In column (5), we explore the official degree of disability as an alternative, more objective health measure (see Sect. 5.1) and again find a very similar effect of the HICD as well as a negative partial direct effect of poorer health. A higher disability degree by 10 percentage points decreases the entry probability by 0.076 percentage points (Table A 3 in Appendix A) or 8% of the entry rate.Footnote 32

Unobserved heterogeneity

In further specification checks, we allow for unobserved heterogeneity between individuals. In column (6) of Table 3, we consider an individual-specific random effect with a normal distribution. The estimated variance of the individual random effect is significantly different from zero. The logit coefficient of the HICD and the average marginal effect (which contains a draw from the estimated random effect distribution for each individual) become larger in absolute terms, but they are not significantly different from the point estimates in column (1). We also estimate complementary log-log models with an arbitrary discrete distribution of the individual random effect (not reported in the table). We achieve convergence with two mass points based on a quasi-Hessian convergence criterion. The estimated probability of being latent type 1 is 83%, and the average marginal effect of the HICD for this type is −0.0011 (std. err. 0.0004), similar to our baseline estimate. The marginal effect for type 2 (17% probability) is −0.0003 (std. err. 0.0001), but the distinction between the two mass points and the probability of being type 2 are statistically insignificant. We conclude that the main results are robust to the modeling of unobserved heterogeneity.

Selection correction

The sample at risk of transition from paid employment to self-employment is the subsample of person-year observations currently working in paid employment. The results from our main estimations can be interpreted as conditional on being a paid employee. To test whether the results change if we account for potential selection effects, we employ a selection correction that allows us to make unconditional statements as well. We use the selection model for binary dependent variables suggested by Van de Ven and Van Praag (1981) with probit equations both for the main equation of entry into self-employment and a selection equation for being in paid employment. The error terms are modeled to follow the bivariate normal distribution with correlation. In both equations, among other variables, we control for the number of children under the age of 17 in the household, gender, and the interaction of these variables. For better identification, in the selection equation, we additionally include the number of children below three and below 6 years of age and their interaction terms with the gender dummy. The presence of young children in the household is likely to influence labor force participation, especially for women, but not the choice between paid employment and self-employment after controlling for the general family situation.

The estimated probit coefficients from this model are reported in Table 4. The coefficients from the selection equation show that most of the variables with exclusion restriction are highly significant. The correlation between the error terms of the selection and the transition equations ρ turns out to be significantly different from zero only among men. The estimated marginal effects of the HICD are very similar to those from the baseline models in Table 3 (note that the probit coefficients cannot be directly compared to the logit coefficients from the main model). This shows that the estimated effects are not driven by selection and that they are also insensitive to the choice of a logit or a probit specification.

Table 4 Robustness check: probit specification with selection correction

Assumptions about the health insurance cost differential

Next, we explore the sensitivity of our results with respect to the assumptions taken in order to calculate the HICD. In the main estimations, we assume that the incidence of the burden of the health insurance costs, as perceived by the paid employees when they consider switching to self-employment, equals the statutory incidence, i.e., employers and employees bear roughly half of the contributions. However, it is possible that employers are able to shift the burden onto employees by paying lower wages. Using data from Eurostat on labor costs, Ooghe et al. (2003) estimate that more than half of the burden of social security contributions is borne by the employees in Europe. Gruber (2000) concludes from a literature review that the costs of health insurance are even fully shifted onto the employees.

In column (1) of Table 5, we therefore assume that the total SHI contributions and PHI premiums, i.e., both the statutory employee’s and employer’s shares, are effectively borne by the employees. This reduces the average real HICD between the alternatives of self-employment and paid employment from €91.6 to −€73.0 for the paid employees who do not enter into self-employment and from €78.4 to −€78.9 for those who make the transition (see Table 2). In the baseline assumption, the step into self-employment on average implies higher health insurance costs, because the nascent entrepreneur loses the employer’s contributions. If the employer’s contributions are fully shifted onto the employee anyway, the situation becomes relatively cheaper in self-employment on average, because self-employment opens up the PHI option, total costs of which are often lower. In this estimation, the point estimate of the average marginal effect of the HICD becomes larger in absolute terms in comparison to the main estimation (column (1) of Table 3), but the confidence intervals of the two estimates overlap, so the result is robust.

Table 5 Self-employment entry probability: further robustness checks

In column (2) of Table 5, we return to the baseline assumption regarding incidence but re-calculate the HICD by adding the cost of supplementary PHI to the health insurance costs associated with SHI. We use the mean of the costs of those employees in the sample who actually have supplementary PHI report paying for it. As discussed in Sect. 4.2 in the paper, this makes the services covered by SHI and PHI more comparable. Again, this decreases the average HICD and makes self-employment look more attractive (Table 2). However, the estimated effect of the HICD is very similar to the main estimation.

Finally, we assess the robustness with respect to the assumption that the choice between voluntary SHI and PHI in case of self-employment is determined by the lowest total cost for the household (if both options are available). Instead, in column (3) of Table 5, we assume that everybody considers the HICD associated with PHI in case of self-employment. The estimated effect of the HICD becomes somewhat larger in absolute terms than in the main estimation, but not significantly different. In summary, we conclude that the results are robust to the assumptions taken for the calculation of the HICD.

Civil servants

We return to our original simulation of the HICD for the remaining robustness checks. Thus far, we considered transitions from dependent employment to self-employment only. In column (4) of Table 5, we additionally include civil servants in the sample and also add a corresponding dummy variable. We calculate their HICD taking into account that civil servants have lower PHI costs because they and their families (if certain conditions are met) are eligible for financial support from the government in case of sickness. The coefficient of the civil servant dummy variable is negative and significant, indicating that civil servants are less likely to switch to self-employment than employees, presumably because of the high job security and comparably generous pension entitlements of civil servants in Germany. The coefficient and marginal effect of the HICD for the combined sample of employees and civil servants is very similar to the baseline estimate in column (1) of Table 1, which confirms robustness.

Estimation of PHI premiums

In our main specification, we predict individual PHI premiums in case of self-employment based on a regression of PHI premiums using the sample of self-employed persons with PHI policies (see Appendix C). The idea behind using self-employed persons only in this auxiliary regression is that the determination of PHI premiums might differ somewhat between self-employed and dependently employed customers, but the disadvantage is that the estimation sample may be selective. In order to use a larger and less selective sample, in this robustness check, we include all self-employed workers as well as paid employees with PHI policies in the regression of PHI premiums (1442 observations).Footnote 33 We obtain similar coefficients for the PHI premium equation as in Table C 1 in Appendix C. After recalculating the HICD, column (5) of Table 5 shows that the results from estimating the probability of entry into self-employment remain very robust.Footnote 34

Foreseeable health predisposition

For our identification, we treat changes in individual health as exogenous shocks (after controlling for income and education). However, since various diseases, such as certain forms of cancer, are partly hereditary and determined by genes and their interaction with the environment, future health issues may be partly foreseeable by individuals, and this may influence their occupational choices even before health issues arise. While it is impossible to completely control individual expectations about future health developments using our data, we test the sensitivity of our results when we use both the ages of death of the respondent’s father and mother as proxies for individual genetic predisposition. Because these variables are not observed for a parent who is still alive, we interact them with dummy variables indicating whether the father or mother is deceased. The coefficients of the additional control variables turn out to be insignificant in the regression of the probability of entry into self-employment, and the estimates presented in column (6) are similar to the baseline results.

Sick leave

As mentioned in Sect. 3, a difference between paid employment and self-employment is that employees receive wage continuation from their employer during the first 6 weeks of sick leave from work. To explore whether this influences the decision to become self-employed, in a robustness check, we include the number of sick leave days a worker reports for the previous year and an interaction with wage income in our model of entry into self-employment. These variables capture the value of wage continuation to an individual. However, these additional variables are insignificant, and their inclusion does not change the size or significance of the estimated effects of the HICD or the health status variable. Thus, differences in sick pay do not drive our results.

Lagged HICD determinants

Even though we control for the variables that determine the HICD, especially income and age, one may still be concerned that the estimated coefficient of the HICD may partly reflect nonlinear effects of these individual characteristics on self-employment (spurious correlation). Therefore, in another check of robustness, we use variables lagged by 4 years, but within the contemporaneous legislative environment, to simulate the HICD. This should largely purge the HICD coefficient of any uncontrolled effects of individual characteristics but leave identification of the coefficient through changes in the health insurance system and tax system intact. Due to the lagged variables, our sample size shrinks to 25,819 observations in this test. The marginal effect of the HICD becomes −0.0016 (std. err. 0.0004), very similar to the baseline estimate in Table 3. This increases confidence that our results are not driven by spurious correlation.

6.3 The effects of health and the family situation

We have discussed which characteristics of household members influence the HICD, and we have estimated that the HICD has a sizable and significant effect on the probability of transition from paid employment to self-employment. In this section, we take a step back and re-estimate the same hazard rate model but exclude the HICD and directly include factors that determine the HICD amount instead. This approach is comparable to the extant literature for the USA, which does not calculate a HICD (Holtz-Eakin et al. 1996; Wellington 2001; Zissimopoulos and Karoly 2007; Fairlie et al. 2011). Table 6 shows the logit coefficients for the full sample and separately for men and women; average marginal effects of the most interesting variables appear in the rightmost three columns of Table A 3 in Appendix A.

Table 6 Self-empl. entry probability conditional on determinants of health insurance costs

Poorer health significantly decreases the probability of entry into self-employment in the pooled sample and for men. Poorer health of the spouse also significantly decreases the entry probability in the pooled sample and for men in these specifications omitting the HICD. The finding that spousal health becomes insignificant when including the HICD in Table 3 indicates that the deterring effect of poor spousal health works through increasing health insurance costs in case of self-employment.

A paid employee currently covered under and paying for SHI, i.e., not in marginal employment, is less likely to enter into self-employment than somebody who already has PHI. While the latter can keep the current PHI contract without a new health assessment when stepping into self-employment, the former has to give up SHI in order to switch to PHI, which requires a new health assessment. This constitutes a barrier comparable to entrepreneurship lock in the USA, where employees are reluctant to lose their employer-provided health insurance. The alternative of remaining in voluntary SHI may be too expensive especially for individuals with higher income, because the employer’s contributions are lost. Paid employees with marginal employment who are covered by free family insurance through their spouse or parent are even more hesitant to venture out on their own. Although they may keep their family insurance if their income remains below the marginal employment threshold, they may fear to lose family insurance should their income exceed it. We control for gender, the number of children, and an interaction term of these variables, so any effects of being occupied with childcare should be captured by these variables.

A higher number of children covered under free family insurance decreases the entry probability for both genders, as expected, although this is statistically insignificant, presumably due to the high correlation with the number of children in the household. In contrast, the number of children in the household, independent of family insurance coverage, has a positive partial effect for men. Thus, we distinguish between two effects: on one hand, an effect of the family situation that is unrelated to health insurance costs and may be related to the wish to have flexibility in the timing and location of work, and on the other hand, the deterring effect of the additional health insurance costs.

We expect that a spouse covered under free family insurance locks in the partner who is an employee and a paying SHI member, because a switch to self-employment and purchase of PHI would imply that the spouse loses family insurance. We anticipate this barrier to be stronger if the spouse covered under family insurance is in poor health or older, because this would make PHI more expensive. Therefore, we include interaction terms. These turn out to be significant for women only; the interaction term with poor health has the expected sign, but not the one with age. The average marginal effects in Table A 3 in Appendix A, which take into account the interaction terms, indicate that a spouse covered under family insurance on average decreases the likelihood of entry for women, significant at the 1% level. For men, there is no significant effect.

In sum, the results from the reduced form estimations are consistent with our main finding that health insurance costs affect the decision to become self-employed. In comparison to the approach pursued here, our main estimations have the advantage that, by directly including the HICD, we are able to separate the effects of health insurance costs from effects of the family situation that are unrelated to health insurance costs.

7 Conclusion

We show that a differential treatment of paid employees and self-employed persons in the health insurance system has significant effects on the entry rate into self-employment, based on representative household panel data for Germany. If the additional costs of health insurance for a self-employed person in comparison to a paid employee increase by €10 per month, the probability of entry decreases by about 1.7% of the annual entry rate. This is a considerable effect in comparison to well-known determinants of entrepreneurship: An increase in the health insurance costs in self-employment by one standard deviation has more than half the effect of a one standard deviation decrease in the willingness to take risks and more than a third of the effect of an intergenerational link. We show that these effects occur in a system with public and universal healthcare. The effect size is at the lower end of the range of point estimates reported by the extant literature considering the system of employer-provided health insurance in the USA.

Furthermore, we provide evidence that individual poor health has a direct negative effect on the probability of entry into self-employment even after controlling for the health insurance cost differential, which is larger for less healthy persons. This indicates that the greater strains associated with self-employment may be a barrier for less healthy persons, in addition to the pecuniary disincentives. In contrast, a spouse’s poor health decreases the entry probability through the higher average health insurance costs for the family when switching to self-employment but has no significant direct effect. In the context of taking care of a spouse with poor health, the disadvantages of self-employment may be offset by advantages such as flexibility in location and timing of work.

The results concerning the health insurance costs show that monetary incentives within the health insurance system influence entrepreneurial activity. For example, more generous tax deductions of health insurance costs for the self-employed would increase the entry rate into self-employment. Of course, it is not the primary intention of the health insurance system to stimulate entrepreneurship. However, given the significant effects, future health-care reforms, which may be crafted with intentions unrelated to entrepreneurship, should be analyzed with respect to the incentives and disincentives they create for self-employment in order to avoid unexpected effects on firm formation.

Our approach of estimating a coefficient of the health insurance cost differential has the advantage that the effect size is comparable across countries with different health insurance systems. Future research should exploit the variety of health-care systems in the world, including tax-financed public health services as found in the UK, for example, to explore if monetary incentives with regard to health insurance have similar effects on self-employment choice in different systems. This avenue will lead to a better understanding of the effects of health-care institutions on entrepreneurial activity. In the process of health-care reform that many countries are undergoing, this will allow to take into account the important effects on entrepreneurship, which is becoming increasingly important in innovation-based economies.