Introduction and literature review

Mild traumatic brain injuries (mTBI) are common, as over 75% of the 1.7 million traumatic brain injuries in the United States each year are classified as mild using a Glasgow Coma Scale score between 13 and 15 (CDC 2010; Faul et al. 2010). The symptoms of an mTBI are clinically heterogeneous, typically manifesting as a variety of cognitive (e.g. reduced attention and memory), somatic (e.g. headaches, dizziness, nausea), affective (e.g. sad, emotionally labile), and sleep-related difficulties (CDC 2010; McCrory et al. 2009; Arciniegas et al. 2000; Hall et al. 2005). However, accurate diagnosis of an mTBI can be difficult due to the lack of radiological evidence of a brain injury, as the brain is frequently deemed to be normal on traditional brain imaging modalities such as computerized tomography (CT) and magnetic resonance imaging (MRI) (e.g., Bazarian et al. 2007; Hughes et al. 2004; Iverson et al. 2000). Thus, the diagnosis of an mTBI is typically based on self-reported clinical and cognitive symptoms following head trauma. This can be problematic as the symptoms of an mTBI are nonspecific and can occur in a variety of other medical and psychological conditions (Stein and McAllister 2009).

There may be ways to improve the sensitivity of neuroimaging methods to detect brain abnormalities sustained related to an mTBI. There is evidence that there are, in fact, pathophysiological changes associated with an mTBI, as autopsy studies have suggested that some who have experienced an mTBI with negative radiological findings have microscopic axonal injuries (Bigler 2004; Blumbergs et al. 1994). These changes are likely the result of the mechanical impact of the head injury. Rapid acceleration and deceleration forces related to the concussive event are thought to stretch, twist, and shear axons or cause alterations in cell function that manifest as the acute symptoms of an mTBI (Blennow et al. 2012). Magnetic resonance imaging (MRI) scans show promise in detecting these abnormalities, as they are more sensitive in detecting white matter shearing, small subacute hemorrhages, small contusions, and small foci of axonal injury as compared to CT (see Niogi and Mukherjee 2010). MRI can also better differentiate brain and CSF as well as edema as compared to CT scans (Johnston et al. 2001), which are important considerations when evaluating mTBI using imaging modalities.

Several studies have suggested chronic (i.e. greater than six months post injury) differences in brain volume following a concussion, such as decreased overall brain volume (Gale et al. 1995; MacKenzie et al. 2002). Further, these studies suggested that loss of consciousness and worse neurocognitive functioning was associated with smaller brain volumes in the patient groups. Symptomatic patients with an mTBI at an average of two years post-injury had a higher rate of atrophy in forebrain parenchyma, cerebral white matter, and cerebellar volume compared to controls at two different time points (Ross et al. 2012). Cortical thinning in the left superior temporal and left superior frontal gyri was shown in veterans with blast-related mTBI within 18-months of injury (Tate et al. 2014), which was associated with abnormal language and audition scores on behavioral measures.

Longitudinal studies of neuroanatomical changes in mTBI have produced mixed findings. A group of people with a documented mTBI in an emergency department setting exhibited global brain atrophy compared to a control group on longitudinal scans taken acutely after the mTBI and again at 1-year post-injury (Zhou et al. 2013). In collegiate hockey players, global volume reductions were noted at 2-weeks and 2-months post- concussion (Jarrett et al. 2016). Region of interest analyses indicated reduced gray matter volume in the precuneus and white matter volume in the anterior cingulate, which correlated with memory and attention performance (Zhou et al. 2013). Hofman et al. (2001) showed that when scanned again at six months post-injury, patients with abnormal MRIs had a greater rate of brain atrophy than those without MRI abnormalities. In another study, the patient group had abnormally rapid atrophy in whole brain parenchyma and cerebral white matter between the time of the injury and an average of 1.7 years later, while the cerebellum and brainstem enlarged rapidly over the first year and a half and then diminished over the following year, suggesting that different brain regions are affected by a TBI in different ways (Ross et al. 2014). However, in a study examining ventricular volume, gray matter volume, and cortical thickness during the acute (i.e. within three weeks) and semi-acute phase of recovery (i.e. four months later), there were no significant differences between the mTBI group and a matched control group (Ling et al. 2013). Examining specific bran regions has also showed differences associated with mTBI. Monti et al. (2013) found that those who experienced a medically diagnosed mTBI before the age of 25 years old had smaller bilateral hippocampi compared to well-matched control participants at an average of 30 years post-injury. They also found that this sample did worse on a face-scene relational memory task and had less functional MRI activity in memory-related regions in the parietal and prefrontal cortices. However, these differences were not evident in younger samples after an average of four years post-injury (Monti et al. 2013), suggesting that these subcortical changes may take place over a longer amount of time.

Many of the studies presented thus far have been conducted in medical populations, where a concussion may occur at one discrete time point. However, athletes that play contact sports may have different brain changes in response to a concussion due to the accumulation of chronic impact to the head. In a sample of current college football players, participants that experienced a clinician-diagnosed mTBI had smaller bilateral hippocampi compared to a sample of non-concussed teammates as well as a sample of non-concussed non-athletes at a post-injury period ranging from 1 day to 4.5 years (Singh et al. 2014). Additionally, the non-concussed football players had smaller bilateral hippocampi compared to the control group, suggesting that the sub-concussive impact associated with football may have an impact on brain structure. The time players reported engaging in football was negatively correlated with hippocampal volume suggestive of a dose-response relationship (Singh et al. 2014).

More chronic changes have also been documented in sports-related mTBI. Cortical thinning of the right temporal lobe and left insula was seen in a sample of young adult athletes with multiple concussions who had not had a concussion in the past 6-months as compared to non-concussed controls (List et al. 2015). Former college athletes ages 50–75 who experienced between one and five concussions (average = 2.08) before their mid-20s showed cortical thinning in various areas, lateral ventricular enlargement, and worse memory performance when compared to their non-concussed teammates (Tremblay et al. 2013). Similar cohorts of middle-aged former athletes with remote concussions have also been showed to have differences in event-related potentials (Broglio et al. 2012; De Beaumont et al. 2009) and white matter integrity (Hart et al. 2013; Tremblay et al. 2014; see Henry et al. 2016 for review). These findings resemble patterns of abnormal aging reflected in neurodegenerative conditions often seen later in life.

We extended the current literature with our examination of former high school football players who had at least two mild TBIs related to their high school football experiences. However, these former athletes have not received a concussion since that time, a period of at least fifteen years. We compared the concussed group to a matched group of control participants who also played high school football but do not have a history of concussive injuries. We also examined how these injuries may be related to worse cognitive performance. We used high-resolution FSPGR MRI scans from a 3-Tesla imaging system to elucidate volumetric differences between the past-concussed group and the control group with a priori hypotheses that the past-concussion group would have less total brain volume (i.e. intracranial volume, ICV), total white matter volume, total gray matter volume, hippocampal volume, anterior cingulate volume, and lateral ventricular volume compared to the control group. Further, we explored two brain-behavior correlations: the relationship between hippocampal volume with memory functioning based on previous studies implicating medial temporal lobe structures are related to delayed recall (Tremblay et al. 2013), as well as between gray matter volume and overall cognitive functioning based on previous aging literature showing this broad relationship (Kramer et al. 2007).

Method

Participants

Data collection for this study began in 2012 as part of a larger dataset. Preliminary functional MRI data were previously published on a subset of these participants (Terry et al. 2015). Recruitment took place through newspaper and online advertisements, news articles about the study, or public records and football alumni listservs. Participants were included if they were right-handed, male, and aged 40–65 years. This age range was selected to maximize the number of participants who may have sustained a concussion in their remote history, but limit the frequency of people who by virtue of their age may be experiencing symptoms of a Minor or Major Neurocognitive disorder, as this could be a confounding factor. Participants were excluded if they reported: being incompatible with the MRI environment, being illiterate, learning English as a second language, a history of alcohol or drug abuse/dependency within the past five years, a history of a neurological disorder (e.g., seizures, epilepsy), a history of a developmental learning disorder (e.g. learning disability, ADD, ADHD), current use of any psychotropic medications, or a diagnosis of bipolar disorder or schizophrenia. One hundred and forty-five potential participants were screened, from which 45 were successfully recruited and consented by signing an Institutional Review Board approved consent form. Of these participants, imaging data is only available for 40 participants (i.e. n = 20 for each group) due to claustrophobia and late-identified MRI incompatibility. Participants were given a small stipend ($50) for their participation.

Participants were divided into two groups: one with a history of two or greater concussions in the context of their high school football experiences but no other lifetime concussions (total n = 25; imaging n = 20); and one without any concussive history (total n = 20; imaging n = 20). Concussions were identified through two ways. First, participants completed an in-house self-report questionnaire regarding concussive history aimed at validating the presence of at least two concussions in the context of high school football based on criteria set by the American Congress of Rehabilitation Medicine (Medicine 1993), where a mTBI was suspected when at least one of the following criteria was met after an injury involving the head: (1) any period of loss of consciousness; (2) any loss of memory for events immediately before or after the accident; (3) any alteration in mental state at the time of the accident (e.g. feeling dazed, disoriented or confused) and (4) focal neurological deficit(s) that may or may not be transient (Medicine 1993; Cassidy et al., 2004). Participants were then administered the Acute Concussion Evaluation (ACE; Gioia and Collins 2006), a systematic evidence-based clinical protocol designed to assess 1) the specific characteristic of the injury including the details of the blow to the head, 2) a full array of 22 symptoms and 5 signs associated with mTBI, and 3) risk factors that might predict a prolonged recovery. The ACE was shown to have moderate to high internal consistency (α = 0.82) and adequate content, predictive, and convergent validity when compared to other concussion assessments (Gioia et al. 2008). Previous studies using retrospective reporting showed adequate recall of remote historical events with acceptable one-month test-retest reliability (Kendler et al. 2008; Moffitt et al. 2007). Participants were excluded if they reported exactly one concussion or enduring a concussion outside of high school football. The non-concussed group denied a lifetime history of concussions.

All participants completed a concussion symptom checklist to index current post-concussive symptoms, if any, as this could have been a potential confound in the neuroimaging analysis. The Symptom Assessment Scale (SAS) is a 22-item symptom list where the participant first answers “yes” or “no” as to whether they have experienced any of the symptoms over the past 24 h. For those symptoms positively endorsed, they rank both the duration and the severity of each system over the past 24 h using a Likert scale (0 to 6); the maximum total score is 132 on each of the two sub-scales (Broglio et al. 2007). The severity sub-scale is anchored with not severe at all and as severe as possible, and the duration scale is anchored with briefly and always.

The Repeatable Battery for the Assessment of Neuropsychological Status (RBANS) was used to assess cognitive functioning (Randolph et al. 1998). A total of 12 subtests are used to generate 5 index scores (i.e., Visuospatial/Constructional, Attention, Language, Immediate Memory, and Delayed Memory) and one Overall Scaled Score for the whole test. Raw scores are converted to standard scores (i.e., mean = 100, standard deviation = 15) based on age (Fig. 1).

Fig. 1
figure 1

Bivariate Pearson correlations between overall gray matter volume and standard score on the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS)

Neuroimaging

All images were acquired using a General Electric (GE; Waukesha, WI) Signa HDx 3 T MRI. This scanner is equipped with 16 RF receiver channels with TQ Engine gradients (amplitude, 45 mT/m [z-axis], 40 mT/m [x, y-axes]; slew rate, 200 T/m/s) and an 8-channel head coil. 3D structural scans were acquired using a fast spoiled gradient recalled echo (FSPGR) protocol; TE ≤ = 2 ms, TR = 7.5 ms, flip angle =20°, 154 axial slices, slice thickness = 1 mm, and FOV = 256 × 256 mm. These images cover from the top of the head to the brainstem. Acquisition time for this sequence was 6 min 20 s. Total scanning time was approximately 55 min.

Volumetric MRI analysis

Automated segmentation of cortical and subcortical regions was conducted using Freesurfer (version 5.1). Details of these segmentation processes are available in Fischl et al. (2002, 2004a). This automated method has been shown to be statistically indistinguishable from the manual tracing of cortical and subcortical structures (Fischl et al. 2002) and has been shown to be sensitive to volumetric differences between groups (Morey et al. 2009). Additionally, it is relatively insensitive to changes in acquisition parameters (Fischl et al. 2004b), making this procedure robust and generalizable across different research settings. While studies show differences in Freesurfer analyses across scanner platforms (Wilde et al. 2016), this is likely not problematic in the present study given we only used one scanner.

An automated measure of total intracranial volume (ICV) was also used in conjunction with the Freesurfer package (Buckner et al. 2004). Extracting ICV using this method has also been shown to be comparable to manual tracing. This volume served as both a dependent measure as well as a way to correct ROIs for overall head size. Total gray matter volume and white matter volume were extracted directly from the Freesurfer output. The anterior cingulate cortex comprised the volumes labeled the caudal anterior cingulate and rostral anterior cingulate. Our hypotheses were not hemisphere specific, so the left and right volumes for the hippocampus, lateral ventricles, and anterior cingulate (i.e., caudal + rostral) were summed to represent their respective bilateral structures. To control for ROIs, the slope (b) of the ROI volume was regressed onto the ICV. That variable was entered into the following equation to normalize each ROI for head size:

$$ \left[\mathrm{normalized}\ \mathrm{ROI}\ \mathrm{volume}=\mathrm{raw}\ \mathrm{ROI}\ \mathrm{volume}\hbox{--} b\ \left(\mathrm{ICV}\hbox{--} \mathrm{mean}\ \mathrm{ICV}\right)\right] $$

This correction procedure has been used in multiple studies that have extracted subcortical volumes and is less affected by systematic and random error compared to other normalization methods (e.g. Erickson et al. 2009; Head et al. 2008; Sanfilipo et al. 2004)

Behavioral data analysis

Data were analyzed using the Statistical Package for Social Sciences (SPSS version 21.0). Between-groups independent samples t-test were conducted at a significance threshold of p < .01, which is corrected to control for multiple comparisons. The following t-tests were run between the concussed group and control groups: 1) whole brain volume, 2) normalized white matter volume, 3) normalized gray matter volume, 4) normalized bilateral hippocampal volume, 5) normalized bilateral anterior cingulate volume, 6) normalized bilateral lateral ventricle volume. We also calculated Pearson correlation coefficients for two hypothesized relationships: between normalized bilateral hippocampal volume and RBANS Delayed Memory, and between normalized gray matter volume and RBANS Total Score, for each group.

Results

When comparing the previously concussed group with the non-concussed group, there were no statistically significant differences with respect to age [t(38) = 1.31, p = 0.20], years of formal education [t(38) = 0.59, p = 0.56], and pre-morbid IQ level as measured by the Wechsler Test of Adult Reading [t(38) = 0.49, p = 0.63] based on independent samples t-test analyses (Table 1). On the SAS, both participant groups reported experiencing a modest number of symptoms over the previous 24 h (previously concussed group M = 4.6; non-concussed group M = 3.6) similar to rates of baseline symptom reporting in other studies (Piland et al. 2010). However, there were no group differences in the total number of symptoms reported [t(38) = 0.85, p = 0.40], the duration of those symptoms over the past 24 h [t(38) = 1.17, p = 0.25], or the symptom severity [t(38) = 1.49, p = 0.14]. There were no between-group differences on cognitive variables from the RBANS (see Table 1). Time since the most recent injury was M = 36.7 years (SD = 7.2, range = 25–44).

Table 1 Group demographics

Independent t-test analysis comparing whole brain volume between the concussed and the control groups was not significant [t(38) = 0.029, p = .98] (Table 2). The average ICV for the sample was 1,405,323 mm3, which was used to calculate the normalized volumes for each of the ROIs. There were no statistically significant differences in volumes between the two groups when comparing overall gray matter volume [t(38) = 0.451, p = .65], overall white matter volume [t(38) = 0.432, p = .66], bilateral lateral ventricles [t(38) = −0.538, p = .59], bilateral anterior cingulate volume [t(38) = 0.509, p = .61], or bilateral hippocampal volume [t(38) = 0.033, p = .74]. Given the potential influence of other variables, we employed an analysis of covariance (ANCOVA) to ensure that age did not affect the relationship between group status and volumes. Covarying for age, ANCOVAs did not reveal any statistically significant group differences (all p-values > .05).

Table 2 Between-group independent t-test comparisons of brain volumes

In exploratory post-hoc analyses, we sought to examine if there was a differential relationship between cognition and brain volumes. Using bivariate Pearson correlations, bilateral hippocampal volumes were not correlated with the Delayed Memory index score on the RBANS for the concussed group [r = −.03, p = .90] or the control group [r = .12, p = .62]. However, better overall cognitive functioning was related to more adjusted gray matter volume in both groups [concussed: r = .53, p = .01; control: r = .44, p = .06]. There was not a statistically significant difference when comparing this relationship between groups [z = 0.032, p = .37].

Discussion

Results of our study suggest a lack of volume differences in the brains of middle-aged men who experienced multiple mTBIs compared to those without mTBIs in the context of their high school football careers. This lack of volumetric differences was seen in both overall intracranial volume as well as in specific regions that have previously been associated with smaller volumes such as overall gray matter, white matter, lateral ventricles, the anterior cingulate, and the hippocampus.

It important to understand the sample recruited for this study in the context of its null results. The individuals in this study were community dwelling and actively recruited to take place in this research study based on local advertisements and historical records. This sample may be different than previous studies that recruited participants who sought professional medical attention or experienced loss of consciousness (e.g., Monti et al. 2013). Further, the mTBI group participants cumulatively reported only receiving medical attention after 31% of their concussions. Thus, our sample may represent more mild cases on the spectrum of what could be classified as an mTBI. Also unlike some previous studies, the mTBI group was not experiencing a statistically different number of concussion-like symptoms as compared to the non-mTBI control group, nor were they labeled as having abnormal MRI findings from a clinical brain scan at the time of the injury (e.g., Hofman et al. 2001). Further, this sample assessed participants more remotely than many of the previous studies that reported positive findings (e.g., Singh et al. 2014). Thus, it is possible that frank neuronal loss may only occur in the context of chronically symptomatic individuals or in the presence of positive imaging findings.

Statistical power to detect differences is important to consider in a study reporting a lack of statistically significant findings. An a priori power analysis calculated that a sample size of 38 (i.e. 19 per group) would achieve a power of .80 based on the effect sizes of previous empirical studies that examined gross anatomic volumes (e.g. whole brain parenchyma, cortical gray matter, cortical white matter; Ross et al. 2012) and hippocampal volume (Monti et al. 2013; Singh et al. 2014). This value is consistent with the typical sample sizes of the literature in this field, as similar studies with group differences had individual group sizes of n = 10 (Monti et al. 2013) and n = 15 (Tremblay et al. 2013) examining group differences. There also do not appear to be any meaningful effect sizes when examining the means and standard deviations of our volumetric data. This suggests that the current results are likely not an artifact of low power to detect differences.

Lack of volumetric differences as suggested by these data does not mean this sample is free of neural-related differences. In a subsample (n = 36) of these participants, we previously showed that the mTBI group exhibited less blood-oxygen-level dependent (BOLD) signal in left hemisphere brain regions associated with memory encoding and recognition during a verbal-paired associates task despite comparable task performance and memory scores on traditional neuropsychological tasks (Terry et al. 2015). Additionally, preliminary analyses of diffusion tensor imaging data on this sample indicate reduced fractional anisotropy in the mTBI group in the right genu/body of the corpus callosum, bilateral internal capsule, bilateral superior longitudinal fasciculus, and bilateral infero-frontal longitudinal fasciculus (Terry and Miller 2016). Other groups examining the long-term effects of sports-related concussions have also showed differences in white matter (Hart et al. 2013; Tremblay et al. 2014) as well as electrophysiology, potentially in the fronto-parietal networks (Broglio et al. 2012; De Beaumont et al. 2009; for review, see Henry et al. 2016). Taken together, there may be some neuroanatomical abnormalities in these participants, but they have not yet reached the level of gross brain atrophy or functional impairments in everyday life.

Exploratory post-hoc analyses did not show a relationship between hippocampal volume and memory functioning as would have been expected in this community-based sample. However, they did show a positive relationship between overall cognitive functioning and gray matter volume as other studies have also shown between brain metrics and cognition (Tate et al. 2014; Tremblay et al. 2013). This relationship did not differ between the concussed and control groups.

There are several limitations to this study. The heterogeneity of types of mTBI (e.g. severity, location of impact, loss of consciousness, acute symptom presentation, duration of symptoms, etc.) limits the specificity of our results. Differences in injury characteristics likely influence the specific pathophysiological processes associated with the injury and may also influence long-term anatomic changes. Further, the retrospective nature of reporting about these concussive events and the lack of medical records associated with the injury limits our ability to ensure a truly accurate diagnosis of a concussion, its severity, and statistically model other factors contributing to injury heterogeneity. However, given the under-diagnosis of mTBIs due to historical views about mTBI, desire to continue to play, and other factors, we found that including subjects without professional medical attention would increase the external validity of the sample. The sample size common in imaging studies of this nature limits the ability to model the known injury characteristics (e.g., loss of consciousness). We did not compare anatomical volumes by hemispheres because we lacked specific lateralized hypotheses and wanted to limit the number of statistical comparisons. Another limitation is the cross-sectional nature of this study, which would limit our ability to conclude that group differences are the product of the mTBI events. In the wake of null findings, there is always the potential that other variables that were not accounted for modified or masked the volumetric differences between groups. Lastly, the current null findings may be an artifact of the time since the injury. It is possible that functional MRI and white matter differences may predict later volumetric differences and cognitive decline in the future as is seen in neurodegenerative disorders (Canu et al. 2010; Wierenga and Bondi 2007).

It is also worth commenting on the effect size analysis on the demographic data between the two groups. These analyses showed small-to-medium effect sizes, such that the concussed group was older and reported more concussive-like symptoms compared to the control group. We do not believe that these potential group differences influenced the analysis or interpretation of our data, given that both increased age and concussion-related pathology would likely lead to smaller volumes (Du et al. 2006; Ross et al. 2014; Terribilli et al. 2011). Including age as a covariate did not influence our findings. We did not deem it necessary to examine other covariates.

This study adds to the overall body of literature related to potential neuroanatomic changes post-mTBI. This is one of few studies to examine potential brain volume differences in former high school football players with multiple concussions. Data suggests there are no statistically significant differences in intracranial volume, white matter volume, gray matter volume, and lateral ventricle volume. Further, there was a lack of group differences in more specific regions previously associated with mTBI (i.e. hippocampus, ACC). Future research is needed to assess whether volumetric differences manifest in these individuals later in life given the function imaging and diffusion tensor imaging findings.