Introduction

The purpose of this study was to conduct a comprehensive evaluation of 2010 helmets in linear impacts to the Hybrid III head. The test conditions were derived from studies of NFL game impacts, some causing concussion. For example, video from 182 NFL game collisions was analyzed and 31 impacts were reconstructed in laboratory tests with helmeted Hybrid III dummies.11,12 That data provided helmet impact conditions for 25 on-field collisions involving concussion. The analysis also provided an average and standard deviation in the impact velocity associated with concussion of NFL players. Other analyses of the NFL video determined impacts to the facemask.2 Eight impact sites were selected on the facemask and helmet shell.2,13,21 The linear impact tests reported here are complementary to the drop-weight testing used to certify football helmets to NOCSAE standards.68 The tests also compliment a recent evaluation of football helmets.15

During the 2008–2009 NFL season, 81% of players wore Riddell helmets with more than half using the VSR4 helmet first introduced in the 1990s (Riddell unpublished data). Only 25% of players were using the newer Riddell Revolution, Schutt DNA and other “modern” helmets. Even though the VSR4 received an upgrade in the padding with vinyl nitrile foam in 2000, there is a lack of objective, independent data on its change in performance compared to the 1990s design. The data reported here is intended to provide information on the performance of “modern” helmets in attenuating the forces of impacts to the helmet. Twenty-three separate tests were conducted, involving a linear impact at eight sites on the facemask and shell with four different impact velocities and two helmet temperatures. The testing is comprehensive and provides data on the translational and rotational accelerations of the head. The data establishes a benchmark for performance of helmets available in 2010 as well as what was used in the 1990s.

Materials and Methods

Linear impact tests were conducted by two independent laboratories with experience in sport safety research and in evaluating football helmets. The testing involved 17 models of “modern” football helmets from five manufacturers, who voluntarily supplied helmets for the evaluation. The scope of the testing was intentionally broad to cover a range of conditions in which concussion may occur in the NFL and to include lower severity impacts to ensure evaluation of impacts in normal practice and play. An additional series of tests was conducted on three models of 1990s helmets to establish a “baseline” performance of helmets from that era. Four of the twelve 1990s helmets were new and unused from that era; they were VSR4 helmets from Riddell archives. The others came from a local high school and were in good condition. The helmet performance data provided by these tests is intended to be an unbiased and fair evaluation of the performance of helmets used in football.

Test Conditions and Velocities

Figure 1 shows the eight impact locations on the front, side, and rear of the helmet. The sites were chosen based on video analysis of 182 NFL game impacts of helmet-to-helmet, helmet-to-ground, helmet-to-shoulder pad type collisions and the laboratory reconstruction of 31 impacts.11,12 The linear impact tests included four impacts on the helmet shell at sites F, C, D, and R. There were two impacts on the facemask at A and A′ and two on the facemask interface with the shell at B and UT.2,12,21 Using a reconstruction of NFL game concussions, these sites and the test conditions were chosen to cover the type of impact conditions associated with severe impacts and concussion in NFL players2,1113,21 and lower speed impacts associated with normal practice and play.3,14,15

Figure 1
figure 1

Locations for linear impacts to the shell (F, C, D, and R) and facemask (A, A′, B, and UT) of football helmets (based on game video analysis of concussion impacts in the NFL6,9,11,12,16,18,20)

Each helmet was placed on the Hybrid III head using the standard NOCSAE position in accordance with manufacturer’s supplied fitting instructions.68 The Hybrid III dummy head and neck were used because of their specification in US Government standards for automotive crash testing, their known repeatability and humanlike impact responses, and their use in previous helmet impact research on helmet impacts in professional football players.1113,1921 For final adjustment, the brow of the helmet was aligned with the middle positioning indicator on the NOCSAE nose gauge guide and in accordance with supplied fitting instructions. Manufacturers were asked to supply helmets with chinstraps pre-adjusted to fit the Hybrid III, but not all complied. Where necessary, chinstraps were adjusted for a snug fit with the chinstraps buckled.

Table 1 lists the impact sites, velocities and temperatures for the helmet testing. Twenty-three (23) separate impacts were conducted with four impact velocities of 5.5, 7.4, 9.3, and 11.2 m/s (12.3, 16.6, 20.8, and 25.1 mph). Testing at the average impact velocity of 9.3 m/s (20.8 mph) was emphasized and the design and layout of the impactor, sliding table, and installation of the Hybrid III head, neck, and helmet followed methods previously described.1113,1921 Studies from automotive safety have shown that testing at the average condition for an injury balances the design for the more and less severe impacts and also balances the design for those with the strongest and weakest tolerances.4 Ten impacts were conducted using the average condition for concussion of 9.3 m/s (20.8 mph), including all shell and facemask impacts except the A′ condition. Only two of the facemask impact conditions found by Craig2 were used in this testing. They were A and A′. The A″ condition is similar to the centered NOCSAE facemask impact. The A″ condition was not tested. The 9.3 m/s severity of impact was also used after the helmet was warmed to 37.8 °C (100°F). Two shell impacts were conducted at the highest impact severity of 11.2 m/s (25.1 mph). This condition was associated with NFL players out 7+ days with concussion and represents the condition for the most seriously injured players.10

Table 1 Test matrix with 23 different impact sites, speeds and temperatures

Figure 2 shows the four impact velocities in this study and the nominal delta Vs overlaid on the 25 reconstructed impacts causing concussion in NFL players. Eight impacts were conducted at the threshold condition for concussion at 7.4 m/s (16.6 mph). This condition represents the transition severity for concussion in the NFL.1113 Three impacts were conducted at 5.5 m/s (12.1 mph). This severity is below recorded concussions in the NFL11 and is associated with normal practice and play. The 5.5 m/s (12.1 mph) impacts are below the severity of any known concussion in NFL players. HITS analysis of college and high school play has recorded head impact accelerations of 20–40 g during normal practice and play, which falls within the range of the lowest severity impacts in this test series.3,14 These tests were added to ensure an assessment of helmet performance in conditions of normal play through the most severe impacts in the NFL.

Figure 2
figure 2

Impact velocity and delta V for 23 reconstructed game impacts of NFL player concussions and 5 severe impacts without injury. The linear impact conditions that were used for the evaluation of the force attenuation of helmets are superimposed on the reconstructed game impacts6,9

For the 37.8 °C (100°F) test condition, helmets were conditioned for 4–24 h in a convection oven. The helmet was impacted not less than 2 min and not more than 3 min after removal from the oven. The helmet was then returned to the conditioning oven if further elevated temperature tests were to be conducted on it. The rationale for selecting the 37.8 °C (100°F) test condition was based on field and laboratory studies that demonstrated maximum on field helmet temperatures to be not more than 40.6 °C (105°F) based on Southern Impact Research Center (SIRC) unpublished data. Most data points indicate that helmets attain temperatures of 37.8 °C (100°F) in warm climates. Data were available from actual on field game conditions involving collegiate players in Arizona and New Hampshire, as well as laboratory studies tracking temperature change both in heated environments and with varying workloads on athletes.

Linear Impactor

The linear impactor was designed to simulate the striking head of a football player and photos of the equipment have been previously published.11,13,21 An air accumulator was charged to the set pressure and was released by a solenoid-activated dump valve onto an air piston. The air piston rapidly accelerated a sliding ram to the desired speed, at which point the ram disengaged from the air piston and was constrained by linear bearings into impact with the helmeted Hybrid III head. The thrust piston was stopped before the sliding ram made head contact, so only the sliding ram was involved in the impact. The velocity of impact was determined by the set pressure in the accumulator.

At the end of the sliding ram was a 152 mm diameter flat aluminum backing plate. Attached to the plate was a 35 mm layer of vinyl nitrile foam covered by a hard plastic cap simulating the shell and liner of a helmeted, striking player. The cap was machined of Nylon and had a domed spherical surface with a 127 ± 5 mm radius. It weighed 0.238 kg. A sample of 14 foam pads weighed 0.0622 ± 0.0019 kg. The cap, foam pad, and backing plate were connected using hook and loop fastening strips so that the foam could be easily interchanged.

The mechanical properties of the impactor face were based on Der-Tex VN600 foam. The published specifications were greater in range for critical impact measures than the test laboratories were comfortable with. Der-Tex was approached and asked to supply sheets that were as identical as possible for use. A lot of VN600 foam was produced and screened for density, durometer, thickness, and 25% compression defection (CD) by the manufacturer. Mechanical testing determined the pressure causing 25% CD of a 6.5 cm2 (1 sq″) cylindrical sample of foam using a loading rate of 2.54 cm/min (1″/min). The thickness and density were also determined. Twenty, two-pad sets were selected for a total of 40 pads that were mechanically similar. SIRC determined that the average density of the pads was 97.7 kg/m3 (6.09 lb/ft3), the average pressure for 25% CD was 94 kPa (13.6 psi), and the average thickness was 35 mm (1.38″).

The mass of the impactor was 14 kg (31 lb).11 The impactor mass simulated that of the striking player in a game impact. The struck player is typically hit from the front or side of the helmet with little support provided by the torso.11,12 There was typically more mass of the striking player involved in the impact because they lined up their head, neck and torso in the collision.19,20 During the impact, more momentum was transferred to the struck player because of the greater effective mass of the striking player. The biomechanical data from game reconstructions showed that the head acceleration and change in velocity of the striking player were lower than that of the struck player.19,20 The linear impact provides a momentum exchange to the struck helmet. The change in velocity of the struck helmets is about 60% of the impact velocity so the energy transfer to the struck head is much lower than based on the impact velocity. The linear impacts are different from the NOCSAE drop tests, where all of the impact energy is transferred to the struck helmet. The alignment of the head–neck and orientation of the impact was setup to provide maximum energy transfer to the head.

Helmets Evaluated and Procedures Followed

A meeting was held with the football helmet manufacturers on August 17, 2009 announcing the test program. The manufacturers were asked to voluntarily supply helmets for testing that they intended to make available to the NFL for the 2010 season. Eleven helmets were supplied in the initial round of testing from September to November of 2009. Six additional helmets were tested during the second round from January to February of 2010. Four helmets of each model were supplied with replacement chinstraps and faceguards. Half of the helmets and parts were sent to Biokinetics and Associates Ltd. in Ottawa and the other half to SIRC in Tennessee. The testing was done at the two independent laboratories following the identical protocol.

Each helmet was assigned an impactor face cap. Each laboratory received 20 pads and randomly assigned a pad to a particular helmet. All testing on that helmet model was performed with that pad. The test matrix was established so there would be a consistent “cool-down” time between impacts to the same pad so its properties were restored. Bare head testing with the end caps used in the testing showed the padding lots provided repeatable head responses (<3% coefficient of variation in 5 m/s impacts to the head with all of the pads used in the study). After each test, issues related to helmet retention (i.e., chinstrap unlatching, the cup sliding off the chin, loss of the helmet from the head) were noted and the helmet and components were inspected for damage to the chinstrap, facemask, shell, and padding. Twenty different issue types were identified in all of the testing and each issue was coded by number after a test. For this study, the total number of issues for each helmet was considered as important as each of the head responses.

Instrumentation, Filtering, and Data Analysis

The helmeted Hybrid III head was instrumented to measure translational and rotational acceleration, and calculate Severity Index (SI) and Head Injury Criterion (HIC).11,16,19 Head translational and rotational acceleration were measured using the established sign convention in standard J1733.17 The positive x-axis is forward, the positive y-axis is through the right ear, and the positive z-axis is down the neck.

The Hybrid III head was machined to provide mounting locations for nine accelerometers in a 3-2-2-2 configuration (from www.radenton.com, reference drawing B-3623). The accelerometers were piezo-resistive and capable of withstanding 2000 g (Endevco models 7264A, B-2000). All data were collected at 10 kHz following the requirements of standard J211-11.8 The data channels were filtered with CFC 1000 anti-aliasing hardware. For the calculation of rotational acceleration, the data from the nine accelerometers were digitally post-processed to CFC 1805 for analysis by the method of Padgoankar et al. 9 The reported linear and rotational accelerations involve CFC 180 filtering.

Both laboratories agreed on a testing protocol prior to the start of helmet evaluations. The protocol described a common order in which the helmets were tested. Testing was conducted on all of the shell impact sites first, followed by all of the facemask impacts. The 5.5 m/s tests were first, followed by the 7.4, 9.3, and 11.2 m/s impacts. All tests for all sites at a particular impact speed were completed before continuing with the next impact speed. The shell impact site test order was: F, C, D, and R. The facemask site order was UT, B, A, and A′. For each site and speed combination, the helmet model sequence remained the same.

The protocol for facemask and chinstrap replacement was agreed upon between the laboratories. For the shell impacts, the facemask was replaced after all tests were completed at a particular impact speed. For the facemask impacts, the facemask was replaced after every A and A′ impact. An exception was that the facemask from the 5.5 m/s impacts was reused for the 7.4 m/s impacts for both the shell and facemask impacts because there was no damage. The precise sequence of testing and changing of components was followed. The chinstrap was replaced whenever the facemask was replaced, or when significant damage was observed.

The Hybrid III vinyl skin has friction against the lining of some football helmets. This makes donning the helmets difficult and causes unrealistic head–helmet interactions. Nylon stockings were tightly fit on the head to reduce friction and provide a reasonable simulation of the skin-to-helmet interface. Two layers of nylon stockings were stretched over the dummy head prior to installation of the helmet. This allowed relative motion between the helmet and head. The stockings remained tightly fit to the head skin. The internal neck cable was tightened to the standard torque of 1.13 Nm (10 in-lbs) prior to testing. A six-axis upper neck load cell (model 1716A, R.A. Denton Inc., www.radenton.com) was installed for the testing only at Biokinetics. That data are not reported in this article.

A twin beam velocity trap was used to trigger the data acquisition system and to provide a measurement of the impact velocity. The velocity trap was positioned so the impactor broke the second beam just before contact with the helmet. High speed video was recorded at 1000 fps for one test of each helmet model, site, and speed combination at each laboratory.

1990s Baseline Helmets

An important aspect of the work was to establish baseline responses for helmets of the 1990s. This was done to determine the level of helmet performance in the 1990s and to establish a benchmark for comparison with the modern helmets of 2010. Twelve helmets were obtained of three different types of 1990s helmets. Riddell provided four old style VSR4 helmets with the original padding as used in the 1990s. These were new and unused helmets. SIRC obtained four old style VSR4 helmets and Pro Air II’s made by Schutt from a local high school that was in good condition. These helmets were subjected to the identical protocol as the 17 modern helmets. The VSR4 helmet has a dual density liner and was used in the NFL game reconstructions.1113,1921

Weights and Dimensions

The helmet shell plus padding, facemask, and chinstrap were weighed and summed to give the overall weight of each helmet. In addition, a series of standardized measurements were taken of the height, width, and length of the helmet shell and complete helmet with faceguard. These dimensions were taken with the helmet on the Hybrid III head and included distances from the head center of gravity (cg). The Hybrid III headform was positioned so the Frankfort plane was horizontal. The helmet was positioned on the headform using the same techniques as for the impact testing. A plumb gauge was used to transfer the measurement points down to the work surface. This provided measurements of the helmets in a plane parallel to the Frankfort plane.

Statistical Analyses

The average, standard deviation (sd), and 95% confidence interval for responses were determined for each test condition and helmet. The significance of differences from the 1990s baseline was determined using the Student t test assuming unequal variance and a two-sided distribution using a p < 0.05 as significant. A t test was also performed using the normalized average response using the baseline condition for normalization. The normalized response was compared to one for the determination of statistical differences for the 23 test conditions for each helmet.1 This approach was favored over others because it assumed the average response was the best available estimate of the helmet’s performance for that condition. It also treated each helmet alone in comparison to the baseline.

The significance of differences in the number of issues during testing was addressed in two ways.1 Issues included chinstrap unlatching, component tearing or breakage, and loss of helmet retention. Each individual issue was recorded. The total number of issues in the tests was compared to that of the baseline helmets, where zero was no issue in a test, 1 was one issue in a test, 2 was two issues in a test, and so on. The average number of issues in all tests of a helmet was compared to the average for the baseline helmets using the Student t test assuming unequal variance, two-sided distribution and p < 0.05 as significant. Second, the 95% confidence interval was determined for the proportion of tests with an issue, where: p = n/N, n was number tests with an issue and N was the total number of tests on the helmet. The 95% confidence interval is \( p = p \pm 1. 9 6 { }\sqrt {\left( {p\left( { 1 - p} \right)/N} \right)} \). Non-overlapping confidence intervals indicated a significant difference in the proportion of tests with an issue compared to the baseline.

Results

Weights and Dimensions

Table 2 shows the weight of the helmet components and size of the helmet shell and shell plus facemask for the 17 helmets and three 1990s baseline models. The average weight for the 1990s baseline was 1.87 ± 0.04 kg (4.11 ± 0.08 lb), where the shell and padding weighed 1.26 ± 0.01 kg (2.77 ± 0.02 lb) and the facemask 0.52 ± 0.01 kg (1.15 ± 0.02 lb). The average weight for the 17 helmets was 1.95 ± 0.20 kg (4.30 ± 0.44 lb), which was 4.7% greater than the 1990s baseline. The helmets ranged in weight from 1.50 to 2.27 kg (3.29–5.00 lb). The greatest increase in weight was for the chinstraps at 41.4%, but this involved only an increase from 0.05 to 0.08 kg (0.12–0.17 lb). There was an 8.3% increase in the facemask weight and only a 1.7% increase in shell weight.

Table 2 Weights of the shell plus padding, facemask and chinstrap, and dimensions of the shell and shell plus facemask for each helmet tested compared to the 1990s baseline helmets

Measurements were made of the helmet size while positioned on the Hybrid III head following NOCSAE and the manufacturer’s recommended procedures. The 1990s baseline helmets were 32.4 ± 0.5 cm (12.76″ ± 0.20″) in length, 24.2 ± 0.2 cm (9.54″ ± 0.06″) in width, and 28.1 ± 0.3 cm (11.05″ ± 0.13″) in height. The average of the 17 helmets was only slightly larger than the 1990s baseline. They were 34.2 ± 1.3 cm (13.46″ ± 0.51″ in length, 25.8 ± 0.9 cm (10.14″ ± 0.37″) in width, and 28.9 ± 1.5 cm (11.37″ ± 0.61″) in height. On average, the 2010 helmets were 5.5% longer, 6.3% wider, and 2.9% taller. The largest helmets were the DNA 2022 and the Adams 3000. The smallest were the Adams A4 and A2000.

1990s Baseline Responses

Figure 3 shows the average and ±1sd in HIC, peak translational acceleration and peak rotational acceleration for the 14 shell impacts at four sites (F, C, D, and R) on the 1990s baseline helmets. The test site and speed are listed at the top and the “H” refers to the high temperature testing. The majority of the responses were in the range of 300–800 HIC, 60–150 g, and 4,000–9,000 r/s2. For HIC, there was a step increase in value with increasing impact velocity for the four sites on the helmet; however, the front (F) site had the lowest HICs of the four sites. For peak translational and rotational acceleration, the frontal response was again lower than the side and rear responses. Figure 4 shows the average and ±1sd in HIC, peak translational acceleration, and peak rotational acceleration for the 9 impacts on the facemask (A, A′) and attachment to the shell (B, UT) for the 1990s baseline helmets. Loading on the attachment of the facemask to the shell had higher responses for the 7.4 and 9.3 m/s test conditions than directly on the facemask. Tests were not conducted at 11.3 m/s.

Figure 3
figure 3

Average and ±1sd in 14 shell impacts for HIC, peak head acceleration (g), and peak rotational acceleration (α) for the 1990s baseline helmets at impact velocities of 5.5–11.2 m/s

Figure 4
figure 4

Average and ±1sd in 9 facemask impacts for HIC, peak head acceleration (g), and peak rotational acceleration (α) for the 1990s baseline helmets at impact velocities of 5.5–9.3 m/s

Helmet Response Compared to the 1990s Baseline

Figure 5 shows the percent difference in HIC for each of the helmets tested in the 23 different impacts compared to the 1990s baseline. A positive difference represents a lower HIC than the 1990s baseline. Cells of data shown in green represent statistically significant improvements (lower responses) on average from the 1990s baseline at the p < 5.0% level, and those in red are significantly poorer responses. The data are organized by impact site with increasing impact velocity. The helmets were organized by manufacturer and the helmet number (#) is listed along with the specific helmet model. The average HIC average, ±1sd and 95% confidence interval for the 1990s baseline helmets are shown at the bottom for each test condition.

Figure 5
figure 5

Percent difference in HIC from baseline for the 17 helmets tested

Also shown in Fig. 5 are four overall measures of the differences for each helmet from the 1990s baseline. They are the overall average difference for the 23 means, the median difference for the 23 conditions, the number of conditions where the average response was more than ±10% different from the 1990s baseline, and the number of differences that were statistically significant at the p < 0.05 level. The number of ±10% and significantly different responses was the sum of the number of positive and negative conditions meeting the criterion.

Figure 5 summarizes the number of issues related to chinstrap unlatching, component tearing or breakage and loss of helmet retention for the 17 helmets. The heading shows the average number of issues for the 1990s baseline helmets. There were 22 per four 1990s helmets tested in the 23 conditions (22 issues in 92 tests). Since some of the tests involved multiple issues, there were only 16 tests for the 1990s baseline helmets with an issue (16 tests with issues in 92 impacts). In some cases, the helmet dislodged from the head representing a loss of retention due to chinstrap unlatching or the chinstrap sliding off the chin. On average, there was only one test with the 1990s baseline where the helmet dislodged and came off the head (1 test out of 92). The proportion of issues for each helmet was statistically compared to that of the 1990s baseline helmets. Issue numbers with red background are significantly greater than the 1990s baseline at p < 0.05.

The proportion of issues for each helmet was statistically compared to that of the 1990s baseline helmets. Issue numbers with red background are significantly greater than the 1990s baseline at p < 0.05. The proportion of issues and 95% confidence interval for the baseline helmets was 19.9 ± 4.7% (55 in 276 tests). For example, the proportion of issues for helmet #1 was 34.8 ± 9.7% (32 in 92 tests). Since the confidence intervals do not overlap, helmet #1 has a significantly greater number of issues than the 1990s baseline. The total number of issues in the baseline tests was 68 (range up to 3) in 276 tests. Again for example, helmet #1 had 44 issues (range up to 5) in 92 tests. The average number of issues in tests with helmet #1 was significantly greater than for the baseline (0.478 vs. 0.246, t = 2.53, p < 0.05).

For the 17 helmets, the average difference from the 1990s baseline varied from improvements in HIC with up to 22% (lower) values to poorer performance with up to −21% (higher) HICs. The median difference varied from a 22% improvement to a −18% (poorer) performance. Five helmets had more than 10 impact conditions where the average HIC was more than ±10% improved over the 1990s baseline. These five helmets also had 10 or more conditions where the difference was statistically significant from the 1990s baseline at p < 0.05. One helmet had 11 conditions where the average response was significantly poorer than the baseline at p < 0.05.

Six of the helmets had a significantly greater number of issues and one had a significantly higher number of tests with the helmet coming off the head. The chinstrap on helmet #8 consistently unlatched in the tests. The design had the straps fixed to the shell and a central latching at the chin cup. Once the chin cup unlatched, the helmet came off the head.

Figure 6 shows the difference in peak head acceleration for the 17 helmets from the 1990s baseline following the same presentation scheme as in Fig. 5. Again, a number of the helmets showed significant improvements in head accelerations (lower responses) compared to the 1990s baseline. The median improvements were up to 13% with two helmets having 12 conditions out of the 23 with statistically significant improvements in average response. Figure 7 shows the difference in peak rotational acceleration for the 17 helmets from the 1990s baseline. Interestingly, a number of helmets showed significant improvements compared to the baseline with median improvements up to 21% and as many as 14 test conditions with statistically significant improvements in rotational acceleration.

Figure 6
figure 6

Percent difference in peak acceleration from baseline for the 17 helmets tested

Figure 7
figure 7

Percent difference in peak rotational acceleration from baseline for the 17 helmets tested

Grouping of Helmet Performance

Table 3 summarizes the statistical analysis of the normalized average responses compared to one for the 1990s baseline. The 23 normalized responses were evaluated using the t test where the t value and p value are given. The level of p < 0.05 was used to determine significance, such that no significant difference (NS) means that the 23 different tests were not statistically different from the 1990s baseline helmets. The t statistic is summarized for HIC, peak translational acceleration, and peak rotational acceleration. The last column lists the average number of issues and identifies those helmets with a significantly (S) greater number of issues than the 1990s baseline at the p < 0.05 level. The table also lists the average difference for the 23 impact tests compared to the 1990s baseline.

Table 3 Significance of differences in helmet performance compared to the 1990s baseline and grouping of helmets in terms of head responses and issues in the helmet impacts

The t statistics for the three responses and issues compared to the 1990s baseline allowed the helmets to be organized into four separate groups. The first group (best performing) had all three responses significantly improved compared to the baseline for the 23 test conditions and a statistically similar number of issues as the 1990s baseline helmets. The best performing helmets were the Schutt DNA PRO and the Riddell Revolution, IQ, and Revolution Speed. These helmets were significantly better in attenuating head responses in the impacts. This group was also significantly better than the other 13 helmets tested.

The second group (good performing) involved helmets with at least one response significantly better than the 1990s baseline and a statistically similar number of issues. There were four helmets in this group, two from Schutt (ION 4D and ION 4D 2030) and two from Riddell (VSR4 and Speed Classic). Overall, these helmets were better than the baseline in at least one response but less than the first group.

The third group (poor performing) had either no biomechanical response significantly different for the 23 conditions from the 1990s baseline or had a significantly greater number of issues. In some cases these helmets had significant improvements in many conditions but the improvements were associated with unlatching or component damage. Six of the helmets in the third group showed no significant difference from the 1990s baseline for the 23 test conditions. Four of these had a significantly higher number of issues. There was one helmet in a fourth group (bad performing). All three biomechanical responses were significantly worse than the 1990s baseline and had a negative average difference for the three responses. It was significantly worse than the 1990s baseline and the other three groups.

Discussion

The announcement of the helmet testing with the linear impactor did not offer enough time for the helmet manufactures to make changes in products that were tested. The specifics of the 23 different test conditions were not known by the helmet manufacturers, except for the general understanding of the impact sites and severities for concussion in NFL players.2,1113,1921 The testing reported here did not allow time for refinement or optimization of the football helmets tested. The testing provides a broad array of information and insight on the control of head accelerations in football helmet impacts. However, the full implications of the testing will not be realized for several years as further improvements in helmet safety are made. Four 2010 helmets performed best when compared to the 1990s baseline helmets.

The four helmets had significantly improved performance compared to the 1990s baseline helmets. The Schutt DNA PRO (#5) and the Riddell Revolution (#4), IQ (#9), and Revolution Speed (#7) had statistically lower biomechanical responses and better force attenuation than the 1990s baseline helmets. Their overall performance was the best for the 23 impacts on the Hybrid III head associated with concussion in NFL players. These helmets included energy absorbing padding of the shell and construction that offered an incremental improvement in safety to protect the head of football players.

The best performing helmets weighed 2.00 kg (4.4 lb) on average. This was only 6.8% greater than the 1.86 kg (4.1 lb) average for the 1990s baseline helmets. These helmets were 7.5% longer than the baseline (34.8 vs. 32.5 cm), 5.4% wider (25.7 vs. 24.1 cm), and only 2.7% higher (28.7 vs. 27.9 cm). These rather modest changes in weight and size indicate that significant improvements in protection can be made without drastically modifying the overall size and weight of the helmet. The sophistication is in the padding system and other components.

Four other helmets provided improved performance on some biomechanical measures compared to the 1990s baseline, but not consistently across all responses. The Riddell VSR4 (#3) and Revolution Speed Classic (#14) and the Schutt ION 4D (#6) and ION 4D 2030 (#17) were better performers than the 1990s baseline helmets. The better performing helmets were 7.4% heavier than the 1990s baseline. These helmets were 5.7% longer than the baseline (34.3 vs. 32.5 cm), 6.1% wider (25.7 vs. 24.1 cm), and 3.7% higher (29.2 vs. 27.9 cm). Again, these are modest differences from the 1990s baseline helmets.

The laboratory testing shows that some 2010 football helmets do not provide consistently improved performance over what was available in the 1990s. Six helmets had a significantly greater number of issues with chinstrap unlatching, component damage or loss of helmet retention. The helmet and its latching system are an integral part of the safety system. The loss of a helmet or unlatching in a play increases the risk of injury to a player.

Limitations

This study evaluated the protective capacity of helmets planned for use by NFL players. The impact conditions approximate head impacts from studies on NFL player concussions.2,1013,1921 The impact sites and speeds are based on the observations and analysis of NFL game video and laboratory reconstruction of 31 impacts. There may be other concussion inducing events that occur on the field that are not investigated in this testing program. However, it is presumed that helmet performance levels that are improved would not adversely affect other modalities of injury given the modest changes in mass and size. Further, there has been no effort to determine how typical the NFL collision severities are for other levels of football play, although lower and higher impact velocities were used than the average collision associated with concussion in the NFL.1113 Many of the tests were at the average velocity for concussion in the NFL. Fewer tests were conducted at lower speeds, which may be more typical of impacts during lower levels of practice and play, such as amateur play. The helmet rankings described in this work may not apply to amateur play, although one cannot assume that they do not. A follow-on study more thoroughly looks at lower velocities of helmet impact.

Many configurations of helmet and faceguard combinations are possible. No effort was made to understand how the various configurations may affect outcomes in this testing. It is anticipated that basic helmet energy attenuation levels would remain constant to a large degree for a given helmet model. Finally, care should be taken to note that a reported change in performance from baseline is not to be construed as a known change in injury risk. For example, a helmet model with responses 20% better than the 1990s baseline does not mean the risk of injury is 20% less. It may be less or more; however, improved force attenuation means lower head accelerations, which are associated with lower risks for injury. This study involved laboratory tests using a linear impactor and instrumented Hybrid III dummy. There are issues of some player impacts and injuries that it may not address.