Gender nonconforming behavior (GNB) refers to behaviors that are perceived by a culture to be incongruent with expected gender norms (e.g., boys who play with dolls and girls who play with cars). GNB tends to emerge around age 2 or 3 years when gender identity emerges and may continue to become more pronounced across childhood (Lewis 1987; Li et al. 2017; Rieger et al. 2008). Although estimates of GNB are variable, 4% of boys and 10% of girls in the United States were described by their parents as behaving “like the opposite sex” (Achenbach 1991). Of particular concern, children and adolescents with GNB experience poorer psychological adjustment than their peers without GNB (Baams et al. 2013; Rieger and Savin-Williams 2012; Yunger et al. 2004).

GNB and Depressive Symptoms

Research has rarely examined GNB as a predictor of depressive symptoms. Depression is relatively uncommon before puberty (1–2%), but increases to approximately 5% of U.S. youth during mid-adolescence, with higher rates found among female adolescents (Maughan et al. 2013). Depression is an important mental health outcome to consider during adolescence due to both its increasing prevalence during this developmental period and its ability to predict a host of negative outcomes in adulthood, including poor health, lack of social support, and recurrent depression (Avenevoli et al. 2015; Naicker et al. 2013; Thapar et al. 2012). Although few studies have focused on GNB and depressive symptoms in childhood, one exception found retrospective report of children’s GNB at age 11 to predict depressive symptoms in adolescence and early adulthood, particularly among men and people identifying as heterosexual (Roberts et al. 2013). Cross-sectional research of school-aged children 6 to 12 years, as well as with college students, also finds gender nonconformity to be associated with relatively high rates of depressive or internalizing symptoms (Borgogna et al. 2018; Kuvalanka et al. 2017).

The minority stress model highlights the negative impact that members of a minority group, such as children who exhibit GNB, may experience due to stigma, prejudice, and discrimination (Meyer 2003). Children who exhibit GNB are at increased risk of being bullied and ostracized by both peers and family members, factors that can lead to shame and to the development of depressive symptoms (Aspenlieder et al. 2009; Bennett et al. 2010; Roberts et al. 2013; Toomey et al. 2010; Wallace and Russell 2013). Similarly, stigmatization of adolescents and young adults who exhibit GNB has been shown to increase depression risk (Gordon et al. 2017). Developmentally, GNB has been associated with victimization or exclusion by peers as early as age 7 years-old (Zosuls et al. 2016). The age at which the relationship between GNB and later depressive symptoms begins to emerge, however, remains unclear.

Currently, it is unknown whether a child as young as 4–5 years-old who displays GNB is at risk for future depressive symptoms. By age 18–24 months, most children accurately label gender and use gender in their speech (Martin and Ruble 2010); by age 2–4 years most children recognize gender differences and use gendered pronouns, and by age 5 most children express their gender identity (Ruble et al. 2006). During these early years, gender-typed behavior tends to increase. Moreover, gender identity becomes stable by the end of the preschool years, while children’s rigidity in gender stereotypes and their own gender-typed behavior peak around age 4 or 5 years (Halim et al. 2013; Signorella et al. 1993). Accordingly, the late preschool-age period is an important one in which to examine the possible relationship between GNB and later depressive symptoms.

Several gaps exist in our current understanding of GNB as a risk factor for future depressive symptoms. First, as we noted, the age at which GNB emerges as a risk factor for future depressive symptoms is unknown because prior research has typically examined GNB among adolescents and adults rather than among young children. Second, studies examining GNB often select adolescent and adult participants based on sexual minority status, whereas recent research suggests that GNB, and not sexual minority status per se, is a better predictor of depressive symptoms in early adulthood (Martin-Storey and August 2016; Toomey et al. 2010). Third, past research has generally relied on cross-sectional or retrospective data, limiting our knowledge of the extent to which GNB predicts future depressive symptoms.

Purpose of the Current Study

The current study seeks to examine whether children who exhibit GNB in early childhood report greater depressive symptoms during late adolescence. Source bias is minimized by using mothers as the rater of GNB at age 4 and 5 and adolescents as the reporter of their own depressive symptoms at age 16 and 17. We hypothesize that GNB during early childhood will predict depressive symptoms during late adolescence.

Method

Participants

Participants were 125 adolescents (65, 52%, male) and their mothers from a longitudinal study examining developmental risk factors of child adjustment, including prenatal cocaine exposure. Pregnant women attending prenatal clinics in Philadelphia, Pennsylvania and Trenton, New Jersey were approached to participate between February 1993 and December 1995. Children who were born before 32 weeks of gestation, required special care or oxygen therapy for more than 24 h, exhibited congenital anomalies, or were exposed to opiates or PCP in utero were excluded. Mothers were predominantly African American (112, 90%, with the other 23, 10%, European American) and ranged in age from 13.7–38.6 years-old (M = 26.0, SD = 5.9) at the time of their child’s birth. In addition, the sample was predominantly low SES because mothers had a mean education level of 11.6 years (SD = 1.4, range = 8–15), with 61% (n = 76) of families receiving Aid to Families with Dependent Children. Mean adolescent age was 16.9 (SD = .4) at the 16-year visit and 17.8 (SD = .4) at the 17-year visit.

Procedure and Measures

Of the 258 children who participated in the first laboratory visit at 4 months, mothers of 229 children provided assessments of child behavior on the Child Behavior Checklist (Achenbach 1991) at age 4 years (n = 218) and/or age 5 years (n = 187). From these families, 125 adolescents completed the Beck Depression Inventory (BDI-II) (Beck et al. 1996) at 16 years (n = 104) and/or 17 years (n = 88). No significant differences were observed in perinatal variables (cocaine, alcohol, cigarette, or marijuana exposure; neonatal medical problems), child gender, maternal age, or ethnicity between participants in the current sample and those from the original sample who did not participate. Incentives were provided to families for their participation in the form of gift cards ($25 at age 4 & 5 visits; $60 at age 16 & 17 visits) for use at local stores at each visit. Informed consent was obtained from mothers and assent from adolescents for all study procedures, which were approved by the Institutional Review Boards of Drexel University and the University of Medicine and Dentistry of New Jersey (now Rutgers University).

The data we report here fall into three categories: (a) Perinatal factors included child gender (based on parent report at the time of the participating child’s birth), neonatal medical problems at birth, and perinatal substance exposure. (b) Psychosocial risk factors involved environmental risk across 4 months to 13 years-old and gender nonconforming behavior assessed at 4–5 years-old. (c) Our outcome measure was depressive symptoms assessed at age 16–17.

Neonatal Medical Problems (Birth)

Neonatal complications were abstracted by nurses from hospital records at birth using the Hobel Scale, a neonatal medical risk scale based on 35 possible complications (Hobel et al. 1973). Complications included general factors (e.g., low birth weight, fetal anomalies, and feeding problems), respiratory problems (e.g., congenital pneumonia, apnea, and meconium aspiration syndrome), metabolic disorders (e.g., failure to gain weight and hypoglycemia), cardiac problems (e.g., murmur and cardiac anomalies), and CNS problems (e.g., CNS depression and seizures). Items were coded as absent (0) or present (1) and summed such that higher scores indicated greater neonatal medical problems; the total score was log transformed to correct for positive skew.

Prenatal Substance Exposure (Birth)

Prenatal substance exposure was assessed using a semi-structured interview administered to the mother within 2 weeks of her child’s birth. The interview included questions assessing the frequency and amount of the mother’s use of cocaine (n = 48, 38%, used), alcohol (n = 48, 38%), cigarettes (n = 56, 45%), and marijuana (n = 15, 12%) throughout pregnancy. Substance use was categorized as follows: alcohol (0 = 0 drinks/day, 1 = from .01 to 1.00/day, 2 = from 1.01 to 2.00/day, 3 = from 2.01 to 3.00/day, 4 = more than 3.00/day); cigarettes (0 = 0 cigarettes/day, 1 = from .01 to 1.00/day, 2 = from 1.01 to 5.00/day, 3 = from 5.01 to 10.00/day, 4 = more than 10.00/day); and marijuana (0 = 0 joints/day, 1 = from .01 to .50/day, 2 = from .51 to 1.00/day, 3 = more than 1.00/day; however, given the presence of positive skew this variable was recoded such that 0 = no use and 1 = some use). Prenatal cocaine exposure (PCE) was confirmed by analysis of the newborn’s meconium for the presence of benzoylecgonine (cocaine metabolite) using radioimmunoassay followed by confirmatory gas chromatography/ mass spectrometry. PCE was dichotomized (0 = no use and 1 = some use) in all analyses as prior reports from this sample have found the dichotomous measure to best predict outcomes.

Environmental Risk (4 Months to 13 Years)

A composite environmental risk score was created by compositing the following risks, assessed from maternal interview at nine time points from birth to age 13 (at the 4 months, and 1.5, 2.5, 4.5, 7.0, 8.5, 10.0, 11.5, and 13.0 year lab visits). The score included family life stress during the past 6 months based on the Social Environment Inventory (Orr et al. 1992), maternal social support network size based on the Norbeck Social Support Questionnaire (the number of important or significant people in the mother’s network, reverse scored; Norbeck et al. 1981), the number of regular caregivers (greater number being associated with higher risk), the irregularity of the child’s schedule (assessing variability as to when the child woke up, ate, went to sleep, etc., during the past week), and the instability of the child’s surroundings (assessing the number of changes in the room in which the child slept, who lived in the house, etc., during the past 6 months) from the Family Chaos Scale (R. Seifer, personal communication, 1993), single parenthood (single parent associated with higher risk), maternal education (number of years of education, reverse scored), maternal race (African American associated with higher risk given that minority status may serve as a proxy for potential exposure to racism and structural disadvantage and has been found to be a risk factor for depressive symptoms among youth, consistent with the minority stress model; Stirling et al. 2015), and public assistance status (whether the family received Aid to Families with Dependent Children [AFDC] or, later, Temporary Assistance to Needy Families [TANF] funding).

Each variable was standardized and summed. Standardized environmental risk scores were created by averaging standardized z-scores for each risk measure across each of the nine time points. Such aggregate scores are more stable than individual measures, and there is increased power to detect effects of the environment because errors of measurement decrease as scores are summed and degrees of freedom are preserved. This and similar cumulative environmental risk measures have been found to explain more variance in child and adolescent outcomes than single factor scores (Atzaba-Poria et al. 2004; Bendersky and Lewis 1994; Deater-Deckard et al. 1998; Sameroff et al. 1993).

Gender Nonconforming Behavior (4 and 5 Years)

GNB was assessed by mother report on the Child Behavior Checklist (Achenbach 1991) using the item “acts like opposite sex” (0 = not true, 1 = somewhat or sometimes, 2 = very true or often true). We dichotomized the score such that a 1 or 2 point response was recoded as 1, and subsequently we computed the mean of this score across the 4 and 5 year visits. This item has been used previously as an indicator of GNB (Steensma et al. 2014; van Beijsterveldt et al. 2006).

Depressive Symptoms (16 and 17 Years)

Depressive symptoms were assessed using the 21-item Beck Depression Inventory-II (BDI-II; (Beck et al. 1996). The BDI-II and its predecessor, the BDI, are well-validated self-reports of depressive symptoms for adolescents (Bennett et al. 1997; Krefetz et al. 2002). BDI-II items assess symptoms during the past 2 weeks using a 4-point scale (0 to 3, with higher scores indicating more frequent depressive symptoms; e.g., “Sadness: 0 = I do not feel sad; 1 = I feel sad much of the time; 2 = I am sad all the time; 3 = I am so sad or unhappy that I can’t stand it.”). The BDI-II was completed by 104 adolescents at 16.5 years and 88 at 17.5 years, with mean scores computed across the two visits. Scores of 0 to 13 suggest the presence of minimal depression, scores of 14 to 19 suggest mild depression, scores of 20 to 28 suggest moderate depression, and scores of 29 to 63 suggest severe depression (Beck et al. 1996). Internal consistency for the BDI-II was satisfactory in the current sample (Cronbach’s alpha = .88 at 16 years and .89 at 17 years).

Results

Data Analysis

Missing data (1% of all data) were managed using multiple imputation of 100 datasets containing all study variables in SPSS version 24 (IBM 2016). Descriptive statistics and correlations among all study variables were first examined. Statistical power to detect an effect (p < .05, two-tailed) in correlational analyses was 94% for a medium effect size and 62% for a small effect size. Hierarchical regressions were also conducted to examine whether GNB predicted depressive symptoms after controlling for child gender, prenatal marijuana exposure, neonatal medical problems, and environmental risk.

Bivariate Relationships

Ten children (8.0% of the sample; 3 boys, 7 girls) were rated by their mother as exhibiting GNB in early childhood (i.e., a response of “sometimes” or “often” on the GNB item). As shown in Table 1, children who exhibited GNB in early childhood reported more depressive symptoms in late adolescence (r = .288, p = .001). Female gender (Spearman r = −.285, p = .001) and greater environmental risk (r = .213, p = .017) were also associated with higher rates of depressive symptoms.

Table 1 Descriptive statistics and correlations among predictor variables

GNB and Depressive Symptoms

Does GNB predict depressive symptoms after controlling for covariates? Child gender, environmental risk, prenatal substance exposure (to each of the following substances: alcohol, cocaine, marijuana, and tobacco), and neonatal medical problems were considered as covariates. Of these, only female gender, environmental risk, prenatal marijuana exposure, and neonatal medical problems were correlated with depressive symptoms at p < .20 and were thus included in Step 1 of a hierarchical regression predicting depressive symptoms. GNB was entered as a predictor in Step 2.

As shown in Table 2, GNB predicted greater depressive symptoms after controlling for covariates (p = .004). Gender also was a significant predictor because female adolescents reported more depressive symptoms than did male adolescents (p = .005). Environmental risk, prenatal marijuana exposure, and neonatal medical problems did not predict depressive symptoms.

Table 2 Hierarchical regression predicting depressive symptoms as a function of gender nonconforming behavior

Discussion

The current findings suggest that GNB in early childhood may predict an increased risk of depressive symptoms during adolescence in the United States. A small prior literature also has found GNB to be associated with depressive symptoms (Alanko et al. 2009; Martin-Storey and August 2016; Roberts et al. 2013; Toomey et al. 2010). Thus, a consistent relationship between GNB and depressive symptoms is beginning to emerge across different reporters of GNB (self vs. parent), samples (LGBTQ identified vs. no gender or sexual identity information collected), types of data collection (i.e., retrospective vs. longitudinal), and developmental periods.

The mechanisms by which GNB may place children at increased risk for depressive symptoms in adolescence remain unclear, although family and peer rejection or abuse may partially explain this association. Physical and verbal abuse, for example, have been found to mediate the relationship between GNB and depressive symptoms (Martin-Storey and August 2016; Toomey et al. 2010). Similarly, Toomey and colleagues (Toomey et al. 2010) found that harassment or abuse mediates the relationship between GNB and depressive symptoms among participants older than those in the current study. Parental coldness and rejection also have been associated with childhood GNB and as such may create increased risk for depressive symptoms (Alanko et al. 2009; Landolt et al. 2004).

More broadly, societal pressure to conform among youth exhibiting GNB in childhood could lead to depressive symptoms in adolescence. Egan and Perry (2001), who found children’s perceived gender typicality or contentment with their biological sex to be positively associated with adjustment, also found both feelings of pressure to conform to gender-typical behavior (from sources including parents, peers, and one’s self) and intergroup bias (i.e., feelings of superiority of one’s own gender) to predict poorer adjustment. These findings were supported by research finding that gender atypicality, gender discontent, and greater perceived pressure to conform predicted poorer adjustment among preadolescents (Yunger et al. 2004). Importantly, there is overlap between GNB and sexual minority status, including identifying as a gay, lesbian, and transgender individual (Li et al. 2017; Rieger and Savin-Williams 2012). However, whereas gender nonconformity is negatively associated with well-being among adolescents, sexual orientation appears to be unrelated to well-being (Rieger and Savin-Williams 2012). Likewise, harassment related to GNB, rather than one’s sexual minority status per se, increases risk for depressive symptoms (Martin-Storey and August 2016). Collectively, these findings suggest that GNB may be more relevant for adjustment than sexual minority status, perhaps due to the stigmatization and societal rejection that can elicit depressive symptoms among youth with GNB (Baams et al. 2013; Turban et al. 2017). Nonetheless, future research should consider the potential combined effects of gender and sexual minority status given that college students who are both gender and sexual minorities are at particularly elevated risk for poor outcomes (Borgogna et al. 2018).

Such stigmatization can elicit shame, which in turn is associated with increased risk for depressive symptoms (Kim et al. 2011). Shame is a painful emotion in which one perceives the self to be defective (Lewis 1995). Factors such as whether one perceives a violation of societal standards for their behavior and, importantly, whether one blames him or herself for such behavior increases the likelihood of experiencing shame (Lewis 1995). Children and adolescents who exhibit GNB may be at increased risk of experiencing shame (Wallace and Russell 2013). Future studies should examine shame as a potential mediator of the relationship between early GNB and future depressive symptoms given that shame has been found to mediate the relationship between stressors and negative outcomes such as PTSD symptoms and suicide risk among adults experiencing minority stress (Mereish et al. 2018; Straub et al. 2018).

Limitations

Several limitations of the current study deserve mention. First, our modest sample of children exhibiting GNB highlights the need for the findings to be replicated in a larger sample. In addition, our modest sample prevented the testing of potential moderators such as biological sex or gender and the examination of potential mediators such as parental rejection. Second, our sample consisted of predominantly urban, low SES, and African American adolescents and as such our findings may not generalize to other samples. Third, a single item, maternal report was used to assess GNB, raising the possibility that a more comprehensive assessment of GNB in early childhood might yield stronger associations with outcomes during adolescence. Finally, the current study did not assess the extent to which GNB is associated with sexual orientation or gender identity later in adolescence or adulthood. Recent research has found that childhood GNB is a significant predictor of later sexual orientation and minority gender identity (i.e., LGBT; Li et al. 2017; Olson et al. 2015), which are associated with increased risk for depressive symptoms and suicidality (Haas et al. 2011; Lucassen et al. 2017). Similarly, we did not assess GNB during adolescence and thus do not know the extent to which youth with GNB during early childhood in our sample continued to exhibit GNB during adolescence.

Future Research Directions

Longitudinal research with larger community samples is needed to replicate the current findings and, as we suggested, to identify potential moderators and mediators of the relationship between GNB and depressive symptoms. Although our modest sample of children displaying GNB precluded examining gender as a moderator, prior research suggests that a child’s gender can affect parents’ acceptance of GNB. For example, parents of preschool children may perceive GNB to be normal for girls, but not so for boys depending on factors such as race, class, and a parent’s adherence to traditional gender roles (Kane 2006). These differences may impact the extent to which boys versus girls with GNB experience low levels of parental acceptance, increased risk for shame, and ultimately increased depressive symptoms. Finally, intervention research identifying effective ways for professionals and parents to support children with GNB in early childhood to prevent the onset of future depressive symptoms is needed.

Practice Implications

The growing literature on GNB and depressive symptoms highlights the need for pediatricians, mental health practitioners, and other child healthcare professionals to be aware of the potential relationship between early GNB and depressive symptoms. More broadly, healthcare professionals need to have an understanding of gender nonconformity and to have a knowledge base from which to provide competent assessment and treatment (Hendricks and Testa 2012). It may be helpful for clinicians to ask parents and children about the extent to which their child exhibits GNB (Forcier and Johnson 2013), as well as to assess the attitudes and feelings of those close to the child about such behaviors. Interventions at the school level aimed at making the school environment more welcoming (Meyer 2009) and at the family level (Pyne 2014; Wallace and Russell 2013) to increase acceptance and decrease shame could help to prevent the increased risk of depressive symptoms among children exhibiting GNB. In addition, the encouragement of friendships with the other gender can prove helpful because young children who perceived themselves as similar to the other gender and who also had other-gender friends were less likely to be victimized or excluded by their peers (Zosuls et al. 2016).

Conclusions

Our study is the first to our knowledge to document the prospective relationship between GNB assessed in the preschool years and depressive symptoms in adolescence. Although our findings are preliminary, they suggest that adolescents who exhibit GNB may be at elevated risk for depressive symptoms during adolescence. Future research should seek to use longitudinal designs to examine the relationship among GNB, potential mediators and moderators, and outcomes such as depressive symptoms. A better understanding of the processes by which GNB can lead to elevated depressive symptoms will guide practitioners in developing more effective interventions and to improve outcomes among gender nonconforming youth.