Introduction

Social origin has been linked to adolescent delinquency and criminal behavior (Paternoster and Brame 1997; Sampson et al. 2005; Sampson and Laub 2005). One general principle guiding empirical research is that an individual is more likely to engage in delinquency or crime when social control is weakened (Hirschi 1969; Gottfredson and Hirschi 1990; Sampson and Laub 1993). Parenting styles, parent–child emotional bond, school attachment, peer influence in adolescence, marital stability, military service, and employment are examples of social control. In spite of the wide recognition that individuals may differ in the propensity to commit serious delinquent and criminal acts (e.g., Moffitt 1993; Gottfredson and Hirschi 1990; Caspi et al. 1994; Paternoster and Brame 1997), few studies have investigated the potential connections between genetic variants and serious and violent delinquency in humans. A notable exception is recent work by Caspi et al. (2002) who showed that maltreated children in New Zealand with a genotype associated with low levels of MAOA activity were more likely to engage in violent behavior than maltreated children with a normal level of MAOA activity.

The objective of this study is to investigate the association between the self-reported serious and violent delinquency and the 40-bp variable number of tandem repeats (VNTR) in the DAT1 gene and the TaqI polymorphism in the DRD2 gene. The study was based on a cohort of more than 2,500 adolescents and young adults in the United States in the National Longitudinal Study of Adolescent Health (Add Health).

A genetic contribution to delinquency has been reported by a number of twin and sibling studies (Rodgers et al. 2001; Christiansen 1977; Gottesman et al. 1983; Malone et al. 2004). A number of biometrical studies also identified shared environmental influences as important contributors to delinquency (Lyons et al. 1995; Cloninger and Gottesman 1987; Rowe 1992). Many of the studies on genetic variants and aggression have focused on the role of dopamine and its receptors and transport sites (de Almeida et al. 2005; Miczek et al. 2002). The most common pharmacotherapeutic interventions of human aggression use dopaminergic antagonists (de Almeida et al. 2005). For instance, the dopamine D2 receptor (DRD2) antagonist haloperidol has long been used to treat aggressive behavior in psychotic patients.

Civelli and colleagues were the first to clone and sequence a dopamine D2 receptor (Bunzow et al. 1988) and the first to describe the 3′ TaqI polymorphism in this gene (Grandy et al. 1989). The TaqI polymorphism is a T to C transition in the 3′ non-coding region of the gene, 10.5 kb downstream of the stop codon and 9.4 kb downstream of the polyA signal. There have been mixed findings regarding the functional significance of this polymorphism. Noble and colleagues reported an association between the DRD2*A1 allele and a lower B max for dopamine receptors in the brain (Nobel et al. 1991) and the activity differences between DRD2*A1 respondents and those homozygous for the DRD2*A2 genotype (Noble et al. 1997). The DRD2*A1 allele was shown to be associated with decreased receptor density in one study (Pohjalainen et al. 1998), but not in others (e.g., Laurelle et al. 1998).

The soluble carrier family 6 dopamine transporter member three gene (DAT, locus symbol: SLC6A3) codes for a dopamine transporter protein (DAT), which limits the level and duration of dopamine receptor activation (Bannon et al. 1995). A study using mice with the dopamine-transporter gene knockouted established not only the central importance of dopamine transporter in controlling synaptic dopamine levels, but also its role as an obligatory target for the behavioral and biochemical action of amphetamine and cocaine (Giros et al. 1996). Vandenbergh et al. (1992) identified several polymorphisms in DAT1, including a polymorphic 40 bp repeat in the transcribed portion of the gene which is most commonly repeat 9 (DAT1*9R) to 10 times (DAT1*10R).

One study found that human subjects homozygous for the 10R allele exhibited significantly lower dopamine transporter binding than carriers of the 9R allele (Jacobson et al. 2000) although the findings from another study are inconsistent (Heiz et al. 2000). A number of studies have demonstrated an association between the 10R allele in the DAT1 gene and attention deficit hyperactivity disorder (ADHD) (Cook et al. 1995; Daly et al. 1999; Gill et al. 1997; Waldman et al. 1998; Cornish et al. 2005). The DAT1*9R allele was reported to be associated with both a lower score in novelty seeking and a greater success in smoking cessation (Sabol et al. 1999). A study of a community sample of 790 children reported an association between the VNTR polymorphism in the DAT1 gene and parent-reported externalizing behavior assessed at ages 4 and 7, but not at age 9 years (Young et al. 2002).

Results

Contingency table analysis

Figure 1 plots the empirical trajectory or mean score of serious delinquency (Panel 1) and violent delinquency (Panel 2) by age and gender. Table 1 compares the mean score of serious delinquency and violent delinquency across genotypes within each gender and each of the three Add Health Waves. The declining trend of both serious delinquency and violent delinquency over the Waves is a reflection of the age pattern described in Fig. 1. For the DAT1 gene, the most interesting result is the sharply reduced serious delinquency scores and violent delinquency scores at each of the three Waves among the males with the DAT1*9R/9R genotype as compared with the DAT1*10R/10R genotype and the DAT1*10R/9R genotype. For example, the males with the DAT1*9R/9R genotype on average scored 1.5 on the delinquency scale at Add Health Wave I much lower than 2.38 and 2.34 scored by those with the DAT1*10R/10R genotype or the DAT1*10R/9R genotype, respectively. The females with the 9R/9R genotype, however, do not seem to score differently from the 10R/10R or 10R/9R genotype.

Fig. 1
figure 1

Serious and Violent delinquency scale score by age and gender

Table 1 Mean score of serious delinquency and violent delinquency by genotype, gender, and Add Health Waves

The results with regards to the DRD2 gene are more complicated, but equally interesting. The male respondents heterozygous for the A1 and A2 alleles seem to consistently score higher on both the serious delinquency and the violent delinquency scale across all three Add Health Waves than the DRD2*A1/A1 or the DRD2*A2/A2 genotype. This pattern of much higher or lower value for the heterozygotes relative to both types of homozygotes is sometimes described as heterosis and reviewed recently by Comings and MacMurray (2000). These results apply again only to males. The initial findings of the association concerning both DAT1 and DRD2 do not seem to vary by Add Health Waves or age, suggesting that the genotype effects may be relatively constant or the trajectories of delinquency across genotypes are likely to be parallel over adolescence and young adulthood. Because of the sibling clustering in the data, standard significance tests are not valid for these comparisons. The next section presents significance tests for the genotype effects obtained from the mixed regression-models that take the correlations into consideration.

Regression analysis

The findings from the regression analysis (Table 2) are highly consistent with those from the contingency table analysis (Table 1). Table 2 presents the estimated associations of the genetic variants in DAT1 and DRD2 with serious delinquency and violent delinquency among male adolescents and young adults. We have estimated models based only on the female sample; the models run parallel to those in Table 2. Neither DAT1 nor DRD2 variants are associated with number of partners in these models (data not shown). Table 2 reports six models with three for serious delinquency and three for violent delinquency. After considering one polymorphism at a time in the first two of the three models, the third model considers the two polymorphisms simultaneously. Each of the six models also reports the number of respondents as well as the number of observations used in the model. A respondent does not have to contribute an observation at all three Waves to be included in the analysis. The random parameters in the mixed model described by Eq. 1 are not shown in Table 2.

Table 2 Estimated association of the genetic variants with serious and violent delinquency among male adolescents and young adults

In order to facilitate likelihood ratio tests across different models to test the overall genotype effects, we used exactly the same number of observations (3,121) in the models on serious delinquency and violent delinquency. Using samples with the maximum number of observations (about 3,243) yielded almost identical results.

The models of DAT1 included only respondents with the DAT1*10R/10R, DAT1*10R/9R, and DAT1*9R/9R genotypes. The three genotypes amount to about 98% of the sample. The DAT1 model shows that the male respondents with the DAT1*10R/10R and DAT1*10R/9R genotypes scored 1.03 and 0.84 points higher on the delinquency scale, respectively, than those with the DAT1*9R/9R genotype; the corresponding P values are 0.005 and 0.026, respectively, each with 1 df. A likelihood ratio test for the two categories of DAT1*10R/10R and DAT1*10R/9R against DAT1*9R/9R yielded a P value of 0.018 with 2 df. Consistent with the findings in the contingency-table analysis (Table 1), the A1/A1 or the A2/A2 genotype in the DRD2 model of serious delinquency scored lower than the heterozygotes (A1/A2) (−0.96 and −0.34 lower with p values of 0.002 and 0.05, respectively). A likelihood ratio test of the two categories of the homozygotes against the heterozyotes yielded a P value of 0.005 with 2 df.

Figures 2 and 3 graph the effects of genotype (based on results in Table 2) for serious delinquency for the White males. The relative differences across genotypes within each of the other ethnicities are the same though the absolute levels of delinquency need to be adjusted for the effects of ethnicity. In Fig. 2, the trajectory of serious delinquency for the DAT1*10R/10R genotype or the DAT1*10R/9R genotype is about twice as high as the trajectory for the DAT1*9R/9R genotype at most ages in adolescence and young adulthood. In Fig. 3, the trajectory of serious delinquency for the heterozygotes (DRD2*A1/A2) is the highest, with the trajectory for the DRD2*A1/A1 genotype located about half as high as the heterozygotes and the DRD2*A2/A2 genotype about 20% lower than the heterozygotes. The graphs for violent delinquency are similar and not shown.

Fig. 2
figure 2

Genotype effects of DAT1 on trajectory of serious delinquency

Fig. 3
figure 3

Genotype effects of DRD2 on trajectory of serious delinquency

The findings in the model of serious delinquency that considers DAT1 and DRD2 jointly are very similar to those from the two models that consider one gene at a time. The parameter estimates in the joint model are slightly lower and the P values are slightly larger than those in the single-gene models, suggesting the absence of major correlations among the two polymorphisms. The likelihood ratio test of the joint model against the model (not shown here) without the two genetic polymorphisms produced a P value of 0.0016 with 4 df.

The three models of violent delinquency closely resemble the three models of serious delinquency. DAT1*10R/9R and DAT1*10R/10R on average scored 0.70 (P = 0.006) and 0.54 (P = 0.032) points higher than DAT1*9R/9R, respectively; the likelihood ratio test between the model with the two genotype parameters against the model without the parameters produced a P value of 0.021 with 2 df. The two homozygotes of DRD2*A1/A1 and DRD2*A2/A2 scored lower (−0.68 and −0.32 with P values of 0.002 and 0.008, respectively) than DRD2*A1/A2, the likelihood ratio test with 2 df yielding a P value of 0.0016. The findings in the two-gene model for violent delinquency are essentially the same as those in the single-gene models. The likelihood ratio test of the model with four parameters for the two polymorphisms against the model without the genetic variants gives a P value of 0.0006.

To address potential bias from population stratification, we controlled for self-reported race/ethnicity in all regression models estimated in Table 2. In addition, we performed Allison’s procedure (1999, Eq. 2) for all four single-gene models. The model includes one random effect (β j ) at the sibling-cluster level, a second random effect at the individual level (e ijk ), and the key interaction term [(αβ) ij ] between genotype and the random effect at the sibling-cluster level. In all four models, the random interaction term is highly significant with a P value smaller than 0.0001. More importantly, the genotype effects in these Allison models have remained statistically significant at a similar level as those in the models in Table 2. For example, in the model of serious delinquency, the P value for the effect of DAT1*Any10R in the Allison model is estimated to be 0.023 as compared with 0.005 and 0.020, respectively for the effects of DAT1*10R/9R and DAT1*10R/10R in Table 2. Similarly, in the model of violent delinquency, the P value for DAT1*Any10R is 0.030 in the Allison model as compared with 0.006 and 0.032, respectively for DAT1*10R/9R and DAT1*10R/10R in Table 2. The results from these additional testing suggests that our findings may not be sensitive to population stratification.

Discussion

Evidence that links dopamine to aggressive behavior primarily comes from two sources: (1) neurobiological studies of animals, mostly rodents and cats, and (2) mechanistic studies of pharmacotheapeutic interventions in the aggressive acts of patients (Miczek and Fish 2005). Animal models have clearly implicated the dopaminergic system in aggressive behavior even though it is less clear how dopamine and its receptor families in neural circuits specifically mediate aggressive behavior (Miczek and Fish 2005). Neuroleptic drugs that target the D2 receptor family have proved effective reducing violent behavior in both humans and laboratory rodents pointing to the D2 receptor as a critical cite of action (Brizer 1988; Krsiak et al. 1981).

Related human studies are mostly on ADHD, externalizing behavior, and hyperactivity among young children. The 10R allele in the DAT1 gene has been repeatedly shown to be associated with attention deficit hyperactivity disorder (ADHD) (Cook et al. 1995; Daly et al. 1999; Gill et al. 1997; Waldman et al. 1998; Cornish et al. 2005). The VNTR polymorphism in the DAT1 gene has been linked to externalizing behavior at ages 4 and 7 (Young et al. 2002). However, a similar study examining the links between the same VNTR in the DAT1 gene and hyperactivity, other externalizing behavior problems, and related temperament traits in a general population sample did not find the associations (Jorm et al. 2001).

Although measures such as ADHD, externalizing behavior, and hyperactivity are related to our measures of serious delinquency and violent delinquency, at least two important differences exist between the two. First, externalizing behavior is typically used to measure behavior among young children. Second, externalizing behavior represents a less severe and more inclusive disruptive and destructive behavior in childhood (Shaw and Winslow 1997, p. 148, 149). Externalizing disorders consist of disruptive, hyperactive, and aggressive behaviors (Hinshaw 1987), but some hyperactive children are not antisocial. The construct of externalizing behavior also includes the DSM-IV oppositional defiant disorder (American Psychiatric Association; APA 1994), which involves oppositional behavior (negative, hostile, and defiant behavior) especially shown by young children to their parents and teachers. These early behavior problems are generally less severe than and may be viewed as the forerunner of the serious and violent delinquency among adolescents and young adults measured in our study.

Genetic studies of serious delinquent behavior and criminal behavior in non-patients or general human populations seem rare. Caspi et al. (2002) reported an interaction between maltreatment in childhood and level of MAOA activity for violent behavioral problems; they failed to find a main effect of the MAOA gene. Chen et al. (2005) hypothesized the association of aggressive behavior in adolescents with both the dopamine D2 receptor gene and the dopamine transporter gene. They provided suggestive evidence from a small study of 11 adolescents diagnosed to have impulsive-aggressive violent behavior.

The findings regarding the DRD2 gene are somewhat unusual in the sense that the heterozygotes scored higher on the serious and violent delinquency scales than both the homozygotes for DRD2*A1 and the aggressive violent behavior. As far as we know, ours is the first national study that reported significant main effects homozygotes for DRD2*A2. This type of results are sometimes described as heterosis defined more generally for cases in which individuals heterozygous for a genetic variant exhibit a greater or lesser effect for a trait than homozygotes (Comings and MacMurray 1997a, b, 2000). Heterosis is systematically studied and taken advantage of in agriculture. For example, hybrid corn consistently demonstrates higher yields than either parental strain (Strickberger 1968; Gardner and Snustad 1981). In human genetics, the sickle cell anemia represents one of the most well-known examples of heterozygote advantage. Individuals with the heterozygous state possess a resistance to malarial infection—a resistance absent in the homozygous normal or the wild type, while the homozygous carriers will typically die from the genetic disorder. A particularly relevant study of α-CaMKII knockout mice demonstrated a strong effect of heterosis for aggressive behavior using three types of mice: mice in which the gene was heterozygously disrupted, mice in which the gene was homozygously disrupted, and the wild type (Chen et al. 1994). Those with a heterozygous deletion of the α-CaMKII gene scored 80 on the defensive aggression scale while the homozygous wild type and those with a homozygous deletion scored 22 and 0, respectively.

From the evolutionary point of view, aggressive behavior is likely to be adaptive. Aggression could aid securing food and safe resting places, protecting the young, and obtaining advantage in a reproductive context. However, not all levels of aggression are adaptive or equally adaptive. As Edward O. Wilson (1975, p. 254) pointed out, “in order to be adaptive it is enough that aggressive patterns be evoked only under certain conditions of stress such as those that might arise during food shortages and periodic high population densities.” A level of aggression adaptive to a certain set of conditions may not be adaptive to another. Aggressive behavior adaptive to circumstances one million years ago, 200 years ago in an agrarian society, and contemporarily in a digital society is likely to differ considerably. It is thus possible that some genetic variants associated with serious and violent delinquency today could have once had a selective advantage.

Investigating one polymorphism in each of the two well-studied genes has limitations. Although it is plausible that these variants are causally related to serious and violent delinquency, an alternative explanation is that other functional variants within the DAT1 gene and the DRD2 gene or in an adjacent region are in linkage disequilibrium with the variants. These other variants could be the real delinquency-predisposing polymorphisms. Additional work is needed to replicate our findings that uses additional genetic variants within and near the DRD4 and DRD2 genes.

We examined both serious and violent delinquency in adolescence and young adulthood measured at ages from 12 to 23. The age range is available because of considerable age range within the Add Health sample (12–18 at Wave I) and also because of repeated measures obtained at Waves I–III. The wide age range has advantages and disadvantages. It allows for an examination of the age patterns of delinquency (Fig. 1); but, because the same set of questions were asked of the Add Health respondents at all Waves, the serious and violent delinquency scales may not be equally valid at all ages. Petty crimes such as stealing something worth less than $50 may be committed more commonly by adolescents than young adults.

We are planning on a major replication of the reported results, which are based on about 2,500 respondents out of the entire Add Health cohort of about 20,000. Add Health Wave IV is funded and scheduled to collect DNA samples from the entire cohort. When the DNA samples become available, a replication study will be performed using the Add Health respondents whose DNA is not available for the current study.

Materials and methods

Subject

The data source for our analysis is the sibling sub-sample of more than 2,500 participants in the National Longitudinal Study of Adolescent Health (Add Health), which started as a nationally representative sample of more than 20,000 adolescents in grades 7–12 in 1994–1995 (Wave I) in the USA (Harris et al. 2003). Add Health is longitudinal; the respondents have been followed by two additional in-home interviews in 1995–1996 (Wave II) and 2001–2002 (Wave III). Add Health is school-based and the adolescents were from 134 schools. The school sample was stratified by region, ethnic mix, size, urbanicity (urban/suburban/rural), and school type (public/private/parochial). Our analysis uses the sibling sample of Add Health because DNA measures collected at Wave III in 2002 are available only for this subset of the Add Health respondents. The subset consists of about 2,500 MZ twins (16.5%), DZ twins (23.6%), and full biological siblings (59.8%). We estimated the heritability of serious delinquency (0.51) and violent delinquency (0.60) using only the paired male MZ twins, DZ twins, and full biological siblings (333 pairs).

Measures

We constructed a serious delinquency scale and a violent delinquency scale using the 12 questions asked of all the Add Health respondents at Waves I–III. The questions and scaling weights used to create the scales are given in Appendix. These two scales are variations of a widely-used type of scales in contemporary research on delinquency and criminal behavior (Thornberry and Krohn 2000). Our scales are closely related to the scales used by, for example, Hagan and Foster (2003) and Haynie (2001, 2003) in analysis of Add Health data and by Hannon (2003) in analysis of data from the National Longitudinal Study of Youth.

Following the delinquency literature (Hagan and Foster 2003; Haynie 2001, 2003), we divide the 12 questions/items into the non-violent and violent types. The non-violent delinquency includes stealing amounts larger or smaller than $50 with or without breaking and entering, and drug selling. Violent delinquency includes serious physical fighting that resulted in injuries needing medical treatment, use of weapons to get something from someone, involvement of physical fighting between groups, shooting or stabbing someone, deliberately damaging property, and pulling a knife or gun on someone. The serious delinquency scale (non-violent and violent) is based on the entire 12 items and the violent scale is based on a subset (eight) of the 12 items. The empirical distributions of the two delinquency scales are skewed to the right ranging 0–36 and 0–24, respectively.

The Cronbach’s alpha for the serious delinquency scale for Waves I, II, and III, respectively, are 0.81, 0.79, and 0.73. Our serious delinquency scale overlaps with Hagan and Foster’s (2003) delinquency scale to a substantial extent. Our serious delinquency scale is designed to capture a wide range of serious delinquent behavior that could result in state sanction of arrest, conviction, and incarceration. Hagan and Foster (2003) utilized a 15-item scale that included most of the 12 items used for our scale as well as a number of items on acts more typically viewed as common adolescent deviance such as lying to parents/guardians about where had been, minor vandalism, being loud in a public place, and driving a car without its owner’s permission. As the name suggests, our violent delinquency scale focuses on an array of violent delinquent behavior that could potentially be classified as violent offenses by the criminal justice system. For Waves I, II, and III, the Cronbach’s alpha values for the violent delinquency scale are 0.75, 0.74, and 0.66, respectively.

Measuring delinquency and crime is challenging. Official measures based on the police reports and the prison and court system have been long known to substantially underestimate delinquency and crime (Robison 1936; Murphy et al. 1946; Thornberry and Krohn 2000; Hood and Sparks 1970) because official measures reflect not only the behavior of offenders, but also the decisions made by the justice system. For these reasons, many criminologists have turned to self reports in recent decades (Hindelang et al. 1979, 1981). Self reports are now a fundamental method of measuring criminality and seem capable of yielding reliable and valid data (Thornberry and Krohn 2000).

As with any survey of sensitive private information, reporting accuracy is a concern. To protect confidentiality, the entire Add Health questionnaire was completed anonymously without requiring names and other identifying information. Anonymous surveys on sensitive behavior have been shown to be more likely to yield accurate responses than non-anonymous surveys (Alexander and Fisher 2003). To further protect confidentiality, reduce non-responses, and increase reporting accuracy, this section of the interview in Add Health was self-administered by audio-computer assisted self interview (CASI). The sensitive question was read to respondents by means of audio headphones. Respondents were given instructions by the computer on how to complete their answers. Self-reported rates of illegal and embarrassing behavior are higher when computer-assisted techniques, particularly self-administered techniques, are used (Tourangeau and Smith 1996; Wright et al. 1998).

The percent of the US adult population that has ever been incarcerated in a state or federal prison increases sharply in the age group of 25–34 over 18–24 (Bonczar 2003), pointing to a likely heavier sample attrition among more chronic offenders because of incarceration at Wave III than at Waves I and II. Add Health Wave III recorded the specific causes of why some of the Waves I and II respondents were not interviewed at Wave III; approximately one dozen individuals from the sibling sample were not interviewed due to incarceration. Chantala et al. (2004) estimated the extent of underreporting at Wave III relative to Wave I, using the respondents and the reports at Wave I and taking advantage of the observation that some of the respondents at Wave I were non-responders at Wave III. These estimates indicate that most of the delinquent and violently delinquent activities could be underrepresented by 1–2.5% in the Wave III data relative to the Wave I population and that selling drugs, carrying a weapon, and shooting or stabbing someone could be underrepresented by about 5%. To reduce the potential impact of disproportional sample attrition at Wave III, we have removed observations of serious and violent delinquency measured at ages 24 or older. These observations account for fewer than 6% of the total number of observations obtained at Wave III. The genotypic distribution and the mean serious and violent delinquency scores by genotype among these observations (data not shown) are very similar to those in the main sample described by Table 1. We chose age 24 because, in our data, the proportion of having ever being arrested shows a marked decline among those who are 24 or older indicating a more severe disproportional attrition in this age group.

DNA preparation and genotyping

At Wave III, in collaboration with the Institute for Behavioral Genetics in Boulder, Colorado, Add Health collected, extracted, and quantified DNA samples from the sibling sub-sample. Genomic DNA was isolated from buccal cells using a modification of published methods (Lench et al. 1988; Meulenbelt et al. 1995; Spitz et al. 1996; Freeman et al. 1997). All of the methods employed Applied Biosystems instruments and reagents. Microsatellite and VNTR polymorphisms were done using fluorescent primers that were analyzed on an ABI capillary electrophoresis instrument. Single nucleotide polymorphisms were analyzed using an ABI Sequence Detection System and 5′-nuclease (Taqman®) methodology. To reduce errors, two individuals independently scored all genotyping. The additional details on DNA collection and genotyping can be found at Add Health website (Smolen and Hewitt, http://www.cpc.unc.edu/projects/addhealth/).

A 40 bp Variable Number Tandem Repeat (VNTR) polymorphism in the 3′ untranslated region of the DAT1 gene has been genotyped with a modified method of Vandenbergh et al. (1992). The primer sequences were: forward, 5′-TGTGGTGTAGGGAACGGCCTGAG-3′ (fluorescently labeled), and reverse: 5′-CTTCCTGGAGGTCACGGCTCAAGG-3′. This VNTR ranges from 3 to 11 copies with the 9-repeat (9R or 440 bp) and 10-repeat (10R or 480 bp) polymorphisms being the two most common alleles in Caucasian, Hispanic and African American populations (Doucette-Stamm et al. 1995). In our analysis sample, the 9R and 10R account for about 21 and 76% of all alleles, respectively; 35, 59, and 0.048% of the respondents possess one 10R, two 10Rs, and two 9Rs, respectively. The variation across ethnic groups appears to be moderate with the 10R allele accounts for 80, 86, 80, and 90% of all alleles in Whites, Blacks, Hispanics, Asians, respectively. The apparent outlying case of Asians could be due to its small sample size (n = 186 or about 7% of the sample).

The DRD2 gene has a polymorphic TaqIA restriction endonuclease site about 2,500 bp downstream (3′ untranslated region) from the coding region of the gene. The A1 allele of this polymorphism has a point mutation C → T (TCGA to TTGA). The DRD2 TaqIA genotyping was performed using the fluorogenic 5′nuclease (Taqman®, Applied Biosystems, Foster City, CA) method with reagents (VIC™ and 48 FAM™ labeled probes and TaqMan® Universal PCR Master Mix without AMPerase® UNG) obtained from Applied Biosystems (ABI) (Haberstick and Smolen 2004). In our analysis sample, the proportion of DRD2*A1/A2 accounts for 25.9, 36.7, 39.1, and 20.0% of all genotype in Whites, African, Hispanic, and Asian Americans, respectively. A series of χ 2 tests for each polymorphism and for each self-reported ethnic group (European, African American, Hispanic, and Asian) reveals no deviation from the Hardy–Weinberg equilibrium.

Analytical strategies

To test the associations of the polymorphisms with serious and violent delinquency, we adopted a three-step analytical strategy. The first step is a contingency-table analysis in which the mean scores of serious delinquency and violent delinquency across genotypes were compared within each Add Health Wave and gender. The second step is a mixed regression model (Searle 1971; Searle et al. 1992) that describes the trajectory of serious and violent delinquency over age. The regression also adjusts for race/ethnicity and for correlation among siblings and repeated measures in the data. The third step addresses potential bias from population stratification.

In the rest of this section, we elaborate on the second and third steps. Our sample consists of twins and siblings as well as the repeated observations of the same individual over different Add Health Waves; these observations are not independent. The mixed model has long been established in the statistical literature for analysis of data that are not independent (Searle 1971; Searle et al. 1992). The following equation describes the basic structure of the mixed models used in our analysis

$$ {\text{Delinquency}}_{{jit(s)}} = \beta _{0} + \beta _{1} {\text{genotype}}_{{ji}} + \beta _{2} {\text{age}}_{{jit}} + \beta _{3} {\text{age}}^{2}_{{jit}} + \beta _{4} {\text{gender}}_{{ji}} + \beta _{5} {\text{ethnicity}}_{{ji}} + u_{{j0(s)}} + v_{{ji}} + e_{{jit(s)}} , $$
(1)

where j, i, and t index sibling pair or cluster, individual, and Add Health Waves, respectively; s = m, d, or f indicates whether the sibling cluster or pair are MZ twins, DZ twins, or full biological siblings. The basic trajectory of serious and violent delinquency is described by age and age2 and their parameters. The model allows the random effect at the sibling cluster level and the level of observations to vary by type of sibling cluster because the strength of the correlation in these types of sibling clusters varies considerably. Conditional on the three random intercepts at the level of sibling clusters and one random intercept at the individual level, the siblings and repeated measures are assumed to be independent.

Equation 1 does not account for the potential source of clustering due to the Add Health sampling design at the school and/or neighborhood level. Incorporating school or neighborhood random effects would make Eq. 1, which is a three-level model, a four or five-level model. More importantly, school and/or neighborhood effects tend to be quite small in most cases (Guo and Zhao 2000).

We used two strategies to address the potential impact of population structure. First, we adjusted for self-reported race/ethnicity in all regression analysis so that the comparisons across genotypes are made after adjusting for the effects of race/ethnicity. Tang et al. (2005) showed a near-perfect correspondence between the four self-reported ethnic categories (European Americans, African Americans, East Asians, and Hispanics) and the categories determined by 326 microsatellite markers.

As a second strategy, we applied Allison et al.’s (1999) procedure to test for possible population stratification. Following the idea used in the development of sibship tests of linkage and association (Curtis 1997; Boehnke and Langefeld 1998; Spielman and Ewens 1998), Allison et al. (1999) reasoned that the probabilities of genotypes of siblings depended entirely on parental genotypes and that controlling for the effects of sibship would be equivalent to controlling for parental genotypes. Indexing sibships by j, individuals by k, and genotypes by i, they proposed a procedure that can be written as a mixed model

$$ Y_{{ijk}} = \mu + \alpha _{i} + \beta _{j} + (\alpha \beta )_{{ij}} + e_{{ijk,}} $$
(2)

where α i , or the effect of genotype i, is assumed to be fixed; β i , or the effect of sibship j, is assumed to be random; and (αβ) ij is an interaction term specifying the dependence of the random effect of sibship on genotype. The conditioning on sibship in the model eliminates the possible confounding of population stratification. This model is a special case of the mixed model (Searle 1971; Searle et al. 1992).