Introduction

The World Health Organization (WHO) estimates that alcohol use disorders affect 76.3 million people worldwide (WHO, 2004). Results from the National Comorbidity Study indicate that over 14% of adults in the United States have a lifetime history of alcohol dependence, making it one of the most prevalent adult psychiatric disorders (Kessler et al., 1994). A number of studies have demonstrated that alcohol dependence is under a significant degree of genetic influence, with heritability estimates in the range of 50–70% (Cloninger et al., 1981; Heath et al., 1997; Kaprio, et al., 1987; McGue, 1993, 1999; Prescott et al., 1994b). The frequency and amount of alcohol used by adults is also influenced by genetic factors, accounting for 40–56% of the variance, and these genetic influences remain fairly consistent across adulthood (Carmelli et al., 1990; Kaprio et al., 1987, 1992; Prescott et al., 1994a). Carmelli et al. (1993) estimated that 82–86% of the variance in longitudinal stability of alcohol consumption among a group of WWII veterans was accounted for by genetic factors.

The majority of individuals begin experimenting with alcohol in adolescence. Researchers from the Monitoring the Future study estimated that 80% of American adolescents have used alcohol by the end of 12th grade (Johnston et al., 2001). In data from 1412 Virginia twin pairs, 60% of 16 year-old boys and 48% of girls reported having used alcohol (Maes et al., 1999). Accordingly, the transition from adolescence to young adulthood is a particularly critical period in the development of patterns of substance use, as it is during this period that most individuals initiate alcohol use and transition from initial experimentation to more established, potentially problematic, patterns of use (Schulenberg et al., 1997). Several studies have found that adolescents who begin to use alcohol very early, compared to their peers, are at a greater risk for the later development of alcohol abuse or dependence (Chou and Pickering, 1992; Clapper et al., 1995; Grant and Dawson, 1997; Kandel et al., 1992; Pitkanen et al., 2005; Riala et al.,2004). Results from the National Longitudinal Alcohol Epidemiological Survey (NLAES; Hingson et al., 2000) demonstrated that those who report drinking before the age of 14 vs. those who remained abstinent until age 21 were three times more likely to also report frequent high-density drinking and seven times more likely to drink to intoxication on a weekly basis. These early initiators were also significantly more likely to have placed themselves in a dangerous or injury-prone situation after drinking, suggesting this may be an area to target to decrease adolescent risk behavior.

Twin studies have found that the initiation of alcohol use is largely influenced by environmental risk factors (Heath and Martin, 1988; Heath et al., 1991; Kaprio et al., 1987; Koopmans and Boomsma, 1996; Rose et al., 2001b; Stallings et al., 1999). For example, Rose and colleagues (2001b) found that the shared environment accounted for 76% of the variance in whether or not 14-year-old twin boys and girls had initiated alcohol use, while additive genetic effects were nonexistent for boys and accounted for only 18% of the variance in initiation for girls. In a review of 18 studies examining initiation of substance use, Hopfer et al., (2003) found that alcohol initiation is under a significant degree of shared environmental influence, accounting for 55–80% of the variance in initiation. Once initiation has occurred, genetic factors explain a large amount of the variation in frequency of alcohol use (34–72%) especially as adolescents get older (Heath et al., 1991; Hopfer et al., 2003; Koopsmans and Boomsma, 1996; Maes et al., 1999; Rose et al., 2001b; Viken et al., 1999). Additionally, the amount of time taken to transition from first alcohol use to regular use is largely influenced by genetic factors (Stallings et al., 1999).

Thus, while the substance abuse literature has addressed multiple aspects of use and misuse, most of this literature has accumulated from separate analyses conducted across multiple datasets. In this paper, we apply multiple-stage genetic models to progressive stages of alcohol use and misuse in two longitudinal samples of twins. These multiple-stage models allow one to more accurately assess the importance of genetic and environmental risk factors on patterns of use and misuse by making allowance for partial overlap with risk factors for initiation. In addition, we have further defined initiation, by characterizing individuals as to whether they initiated alcohol use in early or late adolescence. Previous research has typically classified adolescents into two groups, those who have not initiated use and those who have. Heath et al. (2002) demonstrated that, when using a binary initiation variable (e.g., never initiated, initiation), “serious bias may arise for estimates of correlations between genetic (or environmental) effects on Initiation versus Outcome” (p. 121). Accordingly, newer multiple-stage models suggest that initiation should be defined using a multiple-category trait as opposed to a binary construct (Heath et al., 2002). By exploring multiple stages of alcohol use within a multivariate model, we can examine the extent to which risk factors overlap between various stages of alcohol use and misuse. It is possible that some risk factors influence multiple aspects of alcohol use and abuse, while other genetic and environmental factors influence frequency of use and/or the development of problems but have no impact on whether or not an adolescent chooses to try alcohol in the first place (Heath et al., 2002). In order to study such common vs. specific influences on patterns of use, one must account for risk factors impacting the decision to initiate use.

We applied these multiple stage genetic models to data from two population-based, longitudinal twin-family studies, Finntwin16 (FT16) and Finntwin12 (FT12). The use of parallel assessment procedures and the overlap in ages in these two independent studies of Finnish twins permitted us to examine the replicability of results across samples. We first fit two-stage models to the initiation of alcohol use and frequency of use at age 17 or younger in the FT16 and FT12 datasets. Subsequently, we expanded this model in the FT16 dataset, to incorporate data on drinking frequency and drinking problems in young adulthood. Accordingly, our study allowed us to examine (1) the degree to which genetic and environmental risk factors impact initiation of alcohol use, taking into account both whether and when the adolescent initiated, (2) the degree to which genetic and environmental influences impact drinking frequency in late adolescence/early adulthood, a time by which most adolescents have moved to more experienced and established patterns of use (Rose et al., 2001a, b), (3) the degree to which genetic and environmental influences impact drinking problems, as measured by the Rutgers Alcohol Problem Index (RAPI; White and Labouvie, 1989) at age 25, and (4) the degree to which genetic and environmental factors overlap between these various stages in the progression of substance use.

Methods

Samples

FinnTwin16 (FT16) and FinnTwin12 (FT12) are two independent, population-based studies, each consisting of five consecutive birth cohorts of Finnish twins. All twins were identified through Finland’s Central Population Registry, permitting exhaustive and unbiased ascertainment. Zygosity was determined in both twin studies using a well-validated questionnaire completed by both co-twins at the baseline, as described elsewhere (Kaprio et al., 1995). Because these twins were younger in FT12, classification was supplemented by parental response to items developed for zygosity classification of twin children (Goldsmith, 1991). FT16 consists of twins born 1975–1979 (Rose et al., 1999). In 1991, the first cohort of twins was mailed baseline questionnaires within 2 months of their 16th birthday. Most responded within a few weeks. This procedure was repeated with the four subsequent cohorts, with the last cohort receiving the baseline questionnaire during 1995. Baseline questionnaires contained items on health habits, including items assessing the initiation and use of alcohol. The staggered administration, rapid completion, and quick return of the baseline questionnaire yielded age-standardized baseline data. The five birth cohorts contained 3065 families of twins in which both twins were living and residing in Finland at the age of 16. With mail and telephone prompts, 5561 of the 6130 twins in these families (90.7%) returned the baseline questionnaire.

All respondent twins were sent follow-up questionnaires when they turned 17 and 18.5 years of age. Retention rates were approximately 90% across the first three waves of data collection for both male and female twins. Starting in the autumn of 2000, a fourth wave of data collection was conducted over a 30-month period when the twins were between the ages of 22 and 27. The last of these questionnaires were returned in 2003. Of those who participated at baseline, we could reach in 2000–2002 a total of 5594 participants (2689 men, 2905 women), and questionnaires were returned by 4929 (2239 men, 2690 women), yielding a response rate of 88.1% (83% for men, 93% for women). The average age for the respondent twins was 24.4 years, but, for ease of presentation, the age of the fourth assessment wave is referred to as 25 throughout this paper. Follow-up questionnaires included the same items assessing the initiation, use, and abuse of alcohol, as well as items assessing problematic behavior related to alcohol use or abuse in the age 18 and 25 follow-ups. For twin analyses, data were available for 1712 same-sex twin pairs of confirmed zygosity: 789 pairs of twin brothers, classified into 342 monozygotic (MZ) and 447 dizygotic (DZ) pairs, and 923 pairs of twin sisters, 496 MZ and 427 DZ female twin pairs.

FT12 consists of 2724 Finnish families with twins born in 1983 through 1987 (Kaprio et al., 2002; Rose, et al., 2001a). The assessment procedure for FT12 was similar to that of FT16 described above, except that baseline assessment was initiated earlier, when the twins were ages 11–12. The first cohort of twins, born in 1983, was sent baseline questionnaires near the end of 1994. Most of these twins returned the questionnaire very early in 1995, the year in which these twins reached the age of 12. This procedure was repeated for subsequent birth cohorts. Mean age at return of the baseline questionnaire was 11.45 years, but, for ease of presentation, baseline age is referred to as age 12. Of the 2548 families with twins that were sent the baseline questionnaire, 2216 (87%) of these families returned the baseline questionnaire and gave consent to participate in this longitudinal study. Follow-up questionnaires were sent out to all twins whose families agreed to participate at baseline within 2 months of the twins’ 14th birthday. Retention rates exceeded 90% at this follow-up. A second follow-up questionnaire, at age 171/2 years, was initiated in autumn of 2000 and is to be completed in the spring of 2005. Results presented here with FT12 data represent 90% of the expected final sample. So far, 3890 questionnaires have been returned out of 4212 mailed, a response rate of 92.4% for those already participating in earlier questionnaires. For twin analyses, data were available for 1297 same-sex twin pairs of confirmed zygosity: 633 pairs of twin brothers, 284 MZ and 349 DZ pairs, and 664 pairs of twin sisters, 347 MZ and 317 DZ female twin pairs.

Alcohol Initiation and Use Measures

In FT16, age of initiation and frequency of use were measured with two items. The first item, assessed at the age of 16 and at all follow-ups, asked how often the twins “drink alcohol at all” and was followed by nine response options: (1) daily; (2) couple of times a week; (3) once a week; (4) a couple times a month; (5) about once a month; (6) about once every 2 months; (7) 3–4 times a year; (8) once a year or less; or (9) I don’t drink at all. The second item, asked only at age 16, asked the age at which the individual first drank a glass of beer, wine, long drink (a drink of beer alcohol content with a soft-drink flavor), and/or hard liquor. In FT12, twins were first asked about substance use at the age 14 follow-up assessment where they were asked how often they “drink alcohol at all” followed by four response options: (1) once a week or more; (2) about 1–2 times a month; (3) less often than once a month; and (4) never; I don’t drink at all. This drinking item was collapsed to four levels from the nine-item measure used in FT16 because of the expected lower frequency of alcohol use at age 14. At age 17 frequency of use was assessed using the same nine level question as in FT16.

Using these items, we created a three-level age of initiation variable for use in genetic models that was parallel across the two datasets. As Heath and his colleagues (2002) suggested, we began by assuming a bivariate probit model for factors that affect alcohol initiation where initiation was defined as having three categories: never initiated, late adolescent initiation, and early adolescent initiation. Those twins who self-reported never drinking alcohol at age 17 or younger were given a zero on the initiation variable, those who reported initiating alcohol use at ages 15, 16, or 17, were coded as 1 and considered “late adolescent initiators”, and those who initiated at age 14 or younger were coded as 2 and considered “early adolescent initiators.” The outcome measure for these bivariate models was the nine-level frequency of use question asked when the twins were 17 in both FT16 and FT12. Those twins who reported never initiating alcohol use cannot be characterized on the frequency of use dimension and were given a missing value for this outcome measure. Only a very small number of twins reported daily drinking (0.17% in FT16 and 0.13% in FT12), so this level was collapsed with “drinking a couple of times per week”, such that the highest category represents drinking more than once per week. Thus, the resulting frequency of use variable ranged from zero (once a year or less) to six (a few times per week or daily), which preserved much of the variance in use and maximized statistical power.

In addition to the alcohol use items described above, the age 25 questionnaire in FT16 contained 22 items from the Rutgers Alcohol Problem Index (RAPI; White and Labouvie, 1989), a scale with good reliability that was designed to assess problematic drinking. The RAPI contains items assessing dependence, withdrawal, blackouts, neglect of responsibilities in several domains, shame and/or embarrassment to self or others, and inappropriate behaviors such as fighting. Individuals indicated how often each consequence of alcohol use had happened in the past twelve months using the following four response options: (1) never, (2) rarely, (3) sometimes, or (4) quite often. For those twins who answered at least 18 of the 22 items, we calculated a RAPI severity score by taking the average response (1–4) across the number of items answered. This method, as opposed to summing across all 22 items, permitted us to retain participants who completed the majority of the items but who may have neglected to answer a few of them had a minimal amount of missing data. Because the initiation and frequency of use dimensions were on ordinal scales, RAPI scores were collapsed for model fitting analyses into five levels using the SAS System’s univariate quintiles procedure, where the first level contains those individuals lowest on problem drinking and the fifth level contains those highest on problem drinking (SAS, 2002–2003). RAPI scores were highly skewed, reflecting the large proportion of individuals reporting no alcohol problems, so the first drinking problem level contained the largest proportion of individuals. For analyses incorporating the age 25 problem drinking outcome variable, initiation was categorized using four levels, extended from the three level initiation variable to include information obtained at age 25. The four levels were: never initiated by age 25, very late initiation (between the ages of 18 and 25), late adolescent initiation (at the ages of 15, 16, or 17), and early initiation (by the age of 14). The frequency of use at age 25 variable was created in the same manner as described above for use at age 17.

Model Fitting Analyses

We applied multiple-stage Cholesky models to the data, as described in Heath et al., 2002. These models are similar to classic multivariate models, however, the outcome, frequency of alcohol use in this case, has structural missing data for those twins with liability values of zero (never used alcohol) on the initiation dimension. Like a standard multivariate model, variance is partitioned into additive genetic (A), common environmental (C), and unique environmental (E) components. Additionally, multiple-stage models permit us to estimate the degree to which shared A, C, and E factors contribute to the covariance between different stages of use. The multiple-stage Cholesky models are also similar to the Causal–Common–Contingent (CCC) model presented in Kendler et al. (1999). Unlike the multiple-stage Cholesky models, however, the CCC model utilizes a single regression coefficient (pathway) to estimate the covariance between different stages and does not allow for the partitioning of the cross-stage covariation into A, C, and E components. We first fit two-stage Cholesky models to the initiation of alcohol use (as defined above) and frequency of use at age 17, separately to data from FT16 and FT12 (see Fig. 1). Subsequently, we expanded the bivariate model to a trivariate model in the FT16 dataset, studying an expanded initiation dimension (as detailed above), frequency of use at age 25, and incorporating a third dimension of drinking problems at age 25 (see Fig. 2).

Fig. 1.
figure 1

Full Bivariate Cholesky Model of Alcohol Initiation and Frequency of Alcohol Use measured at age 14 and 17.

Fig. 2.
figure 2

Full Trivariate Cholesky Model of Alcohol initiation, Frequency of Alcohol Use, and Drinking Problems Measured at Ages 14, 17, and 25.

All modeling was conducted using the raw ordinal data option in Mx (Neale et al., 1999). The significance of each of the parameters in the model can be tested by dropping a parameter and evaluating the change in −2 log likelihood between the initial model and the nested submodel. This difference is evaluated using a chi square distribution. A significant change in fit between the models (\(p< 0.05\)) for the difference in degrees of freedom indicates that dropping the parameter caused a significant decrease in fit of the model, indicating that parameter significantly contributes to the outcome trait and should be retained in the model.

Results

Prevalence of Alcohol Initiation, Use, and Problematic Drinking

The frequency distributions of male and female MZ and DZ twins on alcohol initiation and frequency of use are reported in Table I for FT16 and FT12. Of the 3424 same-sex twins in FT16, 14% reported never drinking at age 17 or younger, 30% reported first drinking at age 15, 16, or 17, and 56% reported early drinking at the age of 14 or younger. Of the 2594 same-sex twins in FT12, 11% reported never drinking at age 17 or younger, 51% reported first drinking at age 15, 16, or 17, and 38% reported early drinking at the age of 14 or younger. Overall, 86% of twins in FT16 and 89% of twins in FT12 self-reported the initiation of alcohol use at the age of 17 or younger, but as the proportions of early drinkers above show, more twins initiated drinking early in FT16 than FT12.

Table I. Percentage of Twins Self-Reporting the Initiation of Alcohol Use by Age Groups and the Frequency of Use among those Twins who have Initiated Split by Sex and Zygosity

Similar proportions of boys and girls initiated alcohol use at 17 or younger or at 14 or younger in both FT16 and FT12. Although the number of twins who reported initiating alcohol use did not vary substantially by gender, boys reported more frequent alcohol use at age 17 than girls in FT16 (t=2.86, p=0.004, d=0.22) and FT12 (t=2.15, p=0.03, d=0.11). In both samples this difference is most evident at the extreme with more boys, 24% in FT16 and 30% in FT12, than girls, 15% and 21%, reporting regular drinking, once per week or more. To assess whether those who initiated alcohol use early were more likely to report more frequent alcohol use at age 17, we submitted sex (as a covariate) and the three-level initiation variable into a multiple regression procedure predicting the six-level frequency of use variable. After accounting for sex, early initiators in both samples reported significantly higher frequency of alcohol use at age 17 than those who reported initiation after the age of 14 (t=13.69, \(p< 0.0001\), pr 2=0.12 for FT16; and t=11.63, \(p< 0.0001\), pr 2=0.07 for FT12).

The frequency distributions of the 3344 FT16 male and female twins who reported on alcohol initiation, frequency of use, and problematic drinking at the age of 25 are presented in Table II. Approximately 2/3 of the twins who had not initiated alcohol use at the age of 17 or younger reported the initiation of drinking by 25 years of age. The proportion of twins drinking a couple of times a week or daily (17% of females, 32% of males) also increased dramatically, particularly for males, while those drinking once a year or less dropped dramatically for both males (1%) and females (3%). The largest proportion of female (43%) and male (46%) twins were placed in the first level of their respective problem drinking distributions based on their responses to RAPI items, which suggests that most twins reported very low levels of alcohol problems. Fourteen percent of females and 15% of males reported problematic drinking in the highest, fifth level. The average RAPI severity score for women in this highest level was 1.69 (possible range of 1–4) and the average score for men was 1.93. On average, females in the highest quintile were reporting the presence 10.39 (SD = 3.32) drinking problems out of a possible 22, and males, on average, reported 13.18 (SD = 3.43) drinking problems. Thus, both females and males in the highest quintile reported experiencing a substantial number of problems associated with alcohol use at age 25.

Table II. Percentage of Twins in FT16 Self-Reporting the Frequency of Use and Quintile Level of Problematic Drinking at Age 25 Among those Twins who Split by Sex and Zygosity

Bivariate Model Fitting

An example of a two-stage (bivariate Cholesky) model is shown in Fig. 1. Table III presents goodness-of-fit statistics for the two-stage modeling procedure applied to FT16 and FT12 data, with the best-fitting model in bold (model 9g). For both samples, constraining the thresholds to be equal across males and females caused a significant decrease in fit for FT16 (model 2) and FT12 (model 5), reflecting the small differences in frequency of use distributions mentioned above. Constraining standardized estimates of genetic (A), common environmental (C) and unique environmental (E) influences on both traits to be equal across sex did not cause a significant reduction in fit of the model in either FT16 (model 3) or FT12 (model 6) data. We next fit a model jointly analyzing FT16 and FT12 data to test whether the estimates could be constrained equal across samples. As model 9 in Table III shows, constraining A, C, and E estimates on initiation and frequency of use to be equal across samples did not cause a significant decrease in fit of the model. However, constraining thresholds to be equal across samples did cause a significant decrease in fit of the model (model 8). Thus, for all subsequent models, threshold estimates were allowed to vary across sex and sample, while A, C, and E standardized estimates were constrained equal across sex and sample. Dropping genetic effects on the initiation of alcohol use resulted in a significant decrease in fit (model 9a). Similarly, dropping the genetic influence on the frequency of use variable in the combined samples model (model 9b) also resulted in a significantly poorer model fit. Setting the influences of shared environment to zero for the initiation dimension (model 9c) resulted in a significant decrease in model fit. Finally, a substantially poorer fit resulted from dropping the C effects on the frequency of alcohol use (model 9d).

Table III. Bivariate model fitting results for alcohol initiation and frequency of use at age 17 in FT16 and FT12

Next, we tested for the presence of common A, C, and E factors influencing both the age of alcohol initiation and frequency of use. Dropping the common A estimate (model 9e) and, in a separate model, the common C estimate (model 9f) both resulted in significant reductions in fit, Dropping the common E estimate (model 9g), however, resulted in a very small, nonsignificant change in the chi-square fit statistic. To test whether the genetic variance in initiation completely overlapped with the genetic variance in frequency of use, a model with the specific genetic influences on frequency of use set to zero (which results in a correlation of 1.0 among genetic factors) was fit to the data (model 9h). Setting the genetic correlation to 1.0 resulted in a significant decrease in fit, indicating that while there were significant shared genetic influences on initiation and frequency of use, they were not completely overlapping. Fitting a model with the shared environmental correlation set to 1.0 (model 9i) also resulted in significantly poorer model fit, suggesting that the common environmental influences did not completely overlap. Consequently, the bivariate genetic model that best fit the combined FT16 and FT12 alcohol initiation and frequency of use data was one in which A, C, and E estimates, but not thresholds, were constrained equal across gender with no shared unique environmental influences on the two stages of alcohol use (model 9g).

Standardized estimates of A, C, and E (and 95% Confidence Intervals; CI) for the initiation and frequency of use variables from the full model with A, C, and E constrained across gender are presented in Table IV for the combined model as well as for the models fit separately to FT16 and FT12 data. We presented estimates from the full models in order to show the broader CIs yielded by the full model. Based on results from the full combined model, shared environmental influences accounted for the largest proportion of variance in the age of alcohol initiation, explaining 59% of the variance. Additive genetic factors accounted for 29% of the variance in age of initiation, while unique environmental factors accounted for 12%. For frequency of use at age 17, shared environmental factors accounted for 34%, while the proportion of variance explained by additive genetic factors was 39%. Unique environmental influences on the frequency of alcohol use accounted for 27% of the variance in the combined model.

Table IV. Standardized Estimates (and 95% CIs) for A, C, and E Influences on Drinking Initiation and Frequency of Alcohol Use at Age 17 for the Full Models Constrained Across Gender in the Combined Model and for FinnTwin16 and FinnTwin12 Separately

The correlations (and 95% CIs) between A, C, and E influences on the age of alcohol use initiation and frequency of alcohol use for the full combined model constrained across gender and FT16 and FT12 data are presented in Table V. Based on the combined model fitting results, the estimated genetic correlation between initiation and frequency of use dimensions was 0.51 (0.21–0.81), indicating that genetic factors impacting the decision to initiate drinking accounted for 26% of the variance in frequency of use once initiation has occurred. Results also suggested a large overlap of shared environmental influences impacting both initiation and frequency of alcohol use; a correlation of 0.81 (0.62–1.00) was found across both stages. This correlation suggested that the shared environmental influences on initiation accounted for approximately 66% of the variance in the frequency of alcohol use. The correlation between unique environmental influences across stages was nonsignificant and estimated at 0.02 (0.00–0.19), which suggested that there was almost no overlap of unique environmental factors impacting the age of initiation and the frequency of alcohol use.

Table V. Correlations (and 95% CIs) among the Standardized A, C, and E Parameter Estimates for the Alcohol Initiation and Frequency of Use Variables for the Full Model Constrained Across Gender and FinnTwin16 and FinnTwin12 Samples

Trivariate Model Fitting

Table VI presents goodness-of-fit statistics for the three-stage (trivariate) Cholesky model (see Fig. 2) applied to FT16 data through the age of 25, with the best-fitting model in bold (model 17 for females and 31 for males). Constraining the thresholds to be equal across males and females caused a significant decrease in fit (model 2), reflecting the sizable differences in frequency of regular use at age 25. Constraining standardized A, C, and E estimates on the three stages to be equal across sex also caused a significant reduction in fit of the model (model 3). For all subsequent models, thresholds and A, C, and E estimates at all three stages were allowed to vary across sex.

Table VI. Trivariate model fitting results for alcohol initiation (Init), frequency of use at age 25 (Freq), and alcohol problems at age 25 (Prob) in FT16

For both sexes, dropping the genetic pathway on initiation did result in a significant reduction in model fit (models 4 and 18). Dropping the common genetic pathways between initiation and frequency of use and problem drinking did not result in a significant decrease in fit (models 5 and 19), indicating that the significant genetic influences on initiation did not overlap with genetic influences on frequency of use at age 25 or with alcohol problems at age 25, for either sex. However, dropping the common genetic pathway between frequency of use and problematic drinking did significantly reduce model fit (models 6 and 20), indicating that there were common genes influencing frequency of use at age 25 and alcohol problems at 25. Specifying a model that only allowed for one set of shared genetic factors influencing alcohol initiation, use, and problems caused a significant decrease in fit (models 7 and 21), as would be expected based on the results reported above. Finally, a model in which the genetic influences at each stage were independent of other stages (all common genetic pathways set to zero) resulted in a poorer model fit (models 8 and 22). In summary, these results suggest that, for both females and males, genetic factors influencing initiation of alcohol use do not significantly overlap with genetic influences on frequency of use and problem drinking at the age of 25; however, there is significant overlap between the genetic factors influencing frequency of use and alcohol problems at age 25.

The same series of submodels described above for genetic pathways were also applied to common and unique environmental pathways. For both sexes, dropping common C pathways between initiation and frequency of use and problem drinking resulted in a significant decrease in fit (model 9 for females and model 23 for males). Dropping the common C pathway between frequency of use and problem drinking significantly decreased the fit of the model to data for females (model 10) but not for males (model 24). Similarly, stipulating a model that only allowed for one set of shared environmental factors influencing alcohol initiation, use, and problems caused a significant decrease in fit for females (model 11) but not for males (model 25). Lastly, a model fit to the data in which the C influences at each stage were independent of other stages resulted in a moderately significant poorer model fit for females (model 12) and a significantly poorer model fit for males (model 26). These results suggest that, for females, there is a substantial amount of common and specific shared environmental influences across all stages of alcohol use, while, for males, the same shared environmental factors appear to be influencing initiation, frequency of use, and problem drinking at the age of 25. However, we note that for males, the magnitude of influence of shared environmental factors on frequency of use and problematic drinking is rather small. For females, there is considerably greater influence of common environment on frequency of use at age 25, although problematic drinking shows little evidence of common environment, similar to in males.

For unique environmental influences, only the common E pathways from initiation to frequency of use and problem drinking could be dropped without a significant decrease in model fit for both sexes (models 13 and 27). Dropping the common E pathway from use to problem drinking (models 14 and 28), allowing for only one set of E factors influencing all three stages (models 15 and 29), and dropping all common E pathways (models 16 and 30) all caused significant decreases in model fit. In summary, for both females and males, unique environments influencing initiation of alcohol use do not significantly overlap with unique environmental influences on frequency of use and problem drinking at the age of 25; however, there is significant overlap between the unique environmental factors influencing frequency of use and alcohol problems.

Consequently, the trivariate genetic model that best fit the alcohol initiation, frequency of use, and alcohol problem data in FT16 was one in which thresholds and A, C, and E estimates varied across gender. For females, a model in which common A and E pathways between initiation and both frequency of use and alcohol problems were dropped fit the data best (model 17). The best-fitting model for males was one in which common A and E pathways between initiation and both frequency of use and alcohol problems were dropped as well as all C pathways from the frequency of use and problem drinking stages (model 31).

Standardized estimates of A, C, and E (and 95% Confidence Intervals; CI) from the full model for FT16 age 25 initiation, frequency of use, and problem drinking variables are presented in Table VII. As with the bivariate models, C influences accounted for the largest proportion of variance in the age of alcohol initiation, explaining 47% of the variance among females and 61% of the variance among males. Additive genetic factors accounted for 44% and 22% of the variance in age of initiation for females and males, respectively, while E factors accounted for 9% and 17%, respectively. E factors had the largest influence on the frequency of alcohol use at age 25 among women, accounting for 50% of the variance. Additive genetic influences accounted for 19% of the variance in frequency of use, which is substantially less than the proportion of variance accounted for by genetic factors on initiation. The influence of C factors also decreased at this stage, accounting for 31% of the variance in use at age 25 among women. Contrary to the small genetic influences on use for women, A factors accounted for 48% of the variance in frequency of use among men, while C accounted for only 8%. The unique environment accounted for 44% of the variance in use for men. As expected, genetic factors accounted for the largest proportion of variance in problem drinking for both women and men, explaining 47% and 55%, respectively. Unique environmental factors explained another 38% and 36% of the variance in problem drinking, while C accounted for only 15% of the variance for women and 8% of the variance for men. The CIs for the C estimates of problem drinking overlapped zero in both females and males, suggesting that they are nonsignificant.

Table VII. Standardized Estimates (and 95% CIs) for A, C, and E Influences on Drinking Initiation, Frequency of Alcohol Use, and Problem Drinking for the Full Model in FinnTwin16-Age 25

Table VIII presents the correlations (and 95% CIs) between A, C, and E influences on the three stages of alcohol use for the full model. The estimated genetic correlation between initiation and frequency of use dimensions was 0.23 (0.00–0.89 for females; 0.00–0.74 for males) for both sexes. The wide confidence interval includes zero, and these correlations were not significant. The genetic correlations between initiation and problem drinking were also nonsignificant: 0.15 (−0.17–0.47) for females and 0.29 (−0.18–0.84) for males. The genetic correlations between frequency of use and problem drinking were much higher, with estimates of 0.78 (0.29–1.00) for females and 0.63 (0.39–0.96) for males, suggesting that 61% and 40%, respectively, of the genetic variance in problem drinking behavior could be accounted for by genetic factors important in frequency of use.

Table VIII. Correlations (and 95% CIs) among the Standardized A, C, and E Parameter Estimates for the Alcohol Initiation (Init), Frequency of Use (Freq), and Alcohol Problem (Prob) Variables for the Full Models in FinnTwin16-age 25

Overall, the correlations between shared environmental factors across stages were larger than those found for genetic factors. The estimated C correlations between initiation and frequency of use were 0.44 (0.08–0.86) and 1.00 (0.43–1.00) for females and males, respectively (approximately 19% and 100% of the C variance in use could be accounted for by C factors at initiation). For initiation and problem drinking, the correlations were 0.55 (0.23–1.00) for females and 0.61 (−1.00–1.00) for males, suggesting a moderate overlap of 30% of the C variance in these two stages for women. The frequency of use and problem drinking dimensions correlated 0.23 (−0.78–0.92) for females and 0.59 (−0.62–1.00) for males, respectively, which are both nonsignificant. The very large confidence intervals suggesting nonsignificant findings around the correlations between C influences on these stages reflect the very small relative contributions that shared environments made at these stages. The estimated E correlations between initiation and frequency of use, 0.06 (0.00–0.23) for females and 0.16 (0.00–0.37) for males, and the correlation between E factors on initiation and problem drinking, 0.08 (−0.02–0.25) and 0.01 (−0.18–0.22), were small and nonsignificant. The correlations between E influences on frequency of use and problem drinking behavior were estimated at 0.46 (0.35–0.55) for females and 0.30 (0.15–0.44) for males, indicating that E factors important for frequency of use accounted for 21% and 9%, respectively, of the variance in problem drinking.

Discussion

In the present study, we applied multiple-stage genetic models to progressive stages of alcohol use and misuse in two longitudinal samples of twins. Utilizing multiple-stage modeling procedures permitted more accurate assessments of the importance of genetic and environmental risk factors on patterns of use and misuse by making allowance for partial overlap with risk factors for initiation, and by taking into account censoring on the frequency/problem dimensions for individuals who had not yet initiated. Furthermore, we expanded on the previous literature addressing substance use initiation by characterizing individuals as to whether they initiated alcohol use in early or late adolescence. Consistent with previous epidemiological findings (Johnston et al., 2001; Maes, et al., 1999), reported initiation of alcohol use at the age of 17 or younger was high among Finnish twins (86–89%). These numbers are somewhat higher than the estimated 80% of American adolescents who report drinking alcohol (Johnston et al., 2001), which may reflect small cultural differences in attitudes about alcohol use, as well as the availability of alcohol in these two countries. Substantially more twins in FT16 indicated drinking at the age of 14 or younger compared to twin reports of drinking in FT12. The difference in the proportion of twins reporting early initiation is likely due to differences in how early initiation was computed for each study. In FT16, twins were asked at the age of 16 to retrospectively report when they began drinking alcohol, and anyone who reported age 14 or younger was coded as early. In FT12, reports of current drinking behavior were made by the twins at age 14. A second possible explanation for the difference is that drinking patterns, cultural expectations, and the availability of alcohol may have been different across these two time-periods.

In both samples, the age at which twins initiated alcohol use significantly predicted how often the twins reported drinking at age 17 as well as the level of drinking problems the FT16 twins reported, even after accounting for the sex of the twins. For example, 8% and 15% of early initiators in FT16 and FT12, respectively, reported daily or weekly drinking, while only 2% and 5% of late initiators reported daily or weekly drinking. Of those 25 year olds who initiated at age 14 or before, 20% reported drinking problems on the RAPI that placed them in the highest quintile of drinking problems, while 7% of those who initiated after the age of 17 reported drinking problems on the RAPI that placed them in the highest quintile. These results replicated previous findings that earlier alcohol use initiation is associated with heavier and more problematic drinking in later adolescence and young adulthood (Chou and Pickering, 1992; Clapper et al., 1995; Grant and Dawson, 1997; Hingson et al., 2000; Kandel et al., 1992; Pitkanen et al., 2005; Riala et al., 2004).

Bivariate model findings reported here were consistent with reports of large shared environmental influences and small additive genetic factors influencing the decision to initiate use (Heath and Martin, 1988; Heath et al., 1991; Kaprio et al., 1987; Koopmans and Boomsma, 1996; Rose et al., 2001b; Stallings et al., 1999). Across the two Finnish twin samples, the shared environmental estimate on initiation was on the low end of the range of those reported by Hopfer and colleagues (2003), who concluded in their review that shared environmental influences accounted for 55–80% of the variance in alcohol initiation. The somewhat lower estimate of shared environmental influences, and the corresponding higher estimate of genetic influences, in our samples may reflect the incorporation of information about the timing of initiation. Age at first drink has been shown in previous research to be under a degree of genetic influence (Prescott and Kendler, 1999). When initiation was expanded to incorporate those who began drinking after the age of 17 in the FT16 trivariate models, additive genetic and the shared environmental estimates for females were nearly equal, while estimates for males were largely unchanged from the bivariate model. The parameter estimates for genetic influences on the age of alcohol initiation in the FT16 bivariate model that permitted estimates to vary across gender were also higher for females (A = 0.37) than males (A = 0.28). These results suggest that genetic factors may play a more important role in the age of initiation for women.

Results presented here also confirmed the consistent finding that genetic factors increased in importance for drinking behavior once initiation had occurred, while the influence of shared environmental factors dropped appreciably across stages. Unique environmental factors were also more important in impacting frequency of use than the initiation of alcohol use. For males, drinking frequency at age 25 was under similar genetic influence as drinking frequency at age 17. However, genetic factors accounted for a relatively small amount of the variation in drinking frequency at age 25 for females, while unique environmental influences played the largest role in drinking frequency. This finding suggests that there may be unique processes influencing women’s drinking patterns in their early to mid-20s. On the other hand, estimates of unique environmental influences do include error, so it is possible that this difference for women was a result of measurement error. We are not aware of other studies that have reported this effect, and the finding warrants further exploration in future studies. With the exception of small genetic influence on drinking frequency for women in the trivariate modeling results, estimates of genetic and environmental influences on the frequency of alcohol use were all within the range of previous studies (Heath and Martin, 1988; Heath et al., 1991; Hopfer, et al., 2003; Kaprio, et al., 1987; Maes et al., 1999; Rose, et al., 2001b; Stallings et al., 1999).

Our finding of sizeable genetic influences on drinking problems were consistent with previous research that has found large genetic influences on alcohol dependence in adulthood in the range of 50–70% (Cloninger et al, 1981; Heath et al., 1997; Kaprio, et al., 1987; McGue, 1993; Prescott et al., 1994b). It is important to note that the analysis of problem drinking is more complex than initiation or frequency of use in our sample and should be interpreted cautiously, as many drinkers who may eventually develop problems may not have yet manifested them in their early to mid-20s.

In addition to examining the genetic and environmental influences on initiation and frequency of use, the multiple stage genetic modeling procedures used here permitted an assessment of the extent to which common (vs. specific) genetic and environmental factors account for the various stages of alcohol use. Results for the model fitting to the combined FT16 and FT12 data suggested that some of the genetic and much of the common environmental influences that are important for the initiation of alcohol use also impact the frequency of alcohol use once initiated. Thus, although common environmental influences played a more important role in impacting initiation, it was largely the same shared environmental influences that impacted frequency of use once initiated. Estimates of common unique environmental influences across stages of use were nonsignificant and near zero.

Finally, we extended these multiple stage models to examine common and/or specific factors important in the transition from regular use to problem drinking in young adulthood using FT16 data. Genetic factors influencing initiation did not overlap substantially with genetic influences on frequency of use and problem drinking in the twins’ 20s. However, the genetic factors influencing frequency of use did overlap substantially with those impacting alcohol problems. This overlap is consistent with results found by Prescott and Kendler (1999), who found an association between genetic factors influencing early drinking and alcohol dependence. The shared environmental factors important at initiation were largely the same factors influencing frequency of use; however, the importance of common environment was considerably reduced for drinking frequency at age 25, and nonsignificant for alcohol problems. Unique environmental factors, which included measurement error, influencing frequency of use did account for some of the variance in alcohol problems.

Several methodological limitations to the multiple stage genetic modeling procedures used in the current study should be noted. Multiple-stage models assume that the initiation and outcome variables have a bivariate normal liability distribution. When this assumption is met and the initiation variable has at least three levels, two of which are characterized in individuals who have data on the outcome dimension, the bivariate genetic model is fully identified with twin data (Heath et al., 2002). Our multiple-level initiation dimension used in these analyses did not meet the assumption of normality. This suggests that our initiation dimension did not represent a single liability, which was likely caused by the incorporation of information about the timing of initiation into the initiation variable. This probably led to the lower estimates of common environmental influences/higher estimates of genetic influence on initiation, as compared to previous studies, and may have also contributed to an inflated estimate of shared genetic influences between initiation and frequency of use at age 17. Another concern is that multiple-stage genetic models are fairly complex and require large sample sizes to accurately estimate parameters and correlations among factors, especially those that are small. The difficulty in estimating some of the small common pathways in the present study may have been due to limited power.

One interesting extension of these models will be to incorporate information about mediators and moderators of genetic and environmental influences on the stages of alcohol use and abuse within the context of multiple stage genetic models (Heath et al., 2002). Several other factors, including age (Koopmans et al., 1997), socioregional factors (Rose et al., 2001a), parental alcohol use (Koopmans and Boomsma, 1996), religiousness (Koopmans et al., 1999), sibling drinking behaviors (McGue et al., 1996), and parental monitoring (Rose et al., 2001b) have all been reported to moderate influences on substance use behavior of adolescents. In future work, we plan to assess the impact of these specific environmental variables on each stage of alcohol use, as well as any impact they may have across stages. Another interesting direction for future research will be to apply these multiple stage genetic modeling procedures to adolescent cigarette smoking and/or drug initiation and use, as several studies have found that adolescents who experiment with one substance are more likely to try others, and, as the use of alcohol increases, so does the use of tobacco products (Maes et al., 1999; Collins, 1990–1991). Finally, the large genetic influences on drinking problems as well as the substantial overlap between genetic factors influencing frequency of alcohol use and alcohol problems suggests that these may be important areas to target for gene identification studies in addition to phenotypes based on substance use disorder diagnoses or dependence.

In conclusion, multiple-stage genetic modeling results from two independent Finnish twin samples suggested the presence strong shared environmental influences, and correspondingly small additive genetic influences, on the decision/timing of alcohol initiation. For the most part, the same shared environmental factors important for initiation also impacted the frequency of alcohol use in later adolescence. However, shared environmental influences were less important for frequency of use, while the influence of additive genetic factors and unique environmental factors were more influential contributors to the frequency of alcohol use. Genetic factors important at initiation overlapped to a small degree with the genetic factors influencing the frequency of use, but we found no overlap of unique environmental factors across stages of use. Genetic factors played the largest role in problematic drinking at age 25 in both men and women, whereas common environmental influences were nonsignificant in both sexes. For both sexes, genetic factors influencing alcohol problems overlapped substantially with those influencing frequency of use at age 25, and shared environmental influences on initiation overlapped moderately with the relatively small shared environmental influences on frequency of use at 25. Unique environmental factors were not very important at initiation and did not overlap much with the moderately important unique environmental factors influencing frequency of use and problem drinking at 25, which did show some overlap with each other. The application of multiple stage genetic models to longitudinal data provided a novel approach to estimating, within the same individuals across time, the impact of genetic and environmental factors at each stage of alcohol use while taking into account those factors important at earlier stages.