Introduction

For more than a century, scientists tried many ingenious ways to measure the distribution of pressure in the foot. The very first attempts with plaster-filled sacks3 or spaded soil36 at the turn of nineteenth century or the more sophisticated approaches using video recording18 of the 1930s, had been as smart as difficult to apply and process. Since then, technology has made great strides leading, since the late 1970s, to a wide range of devices for the measurement of plantar pressure and ground reaction forces.12,13

The popularity of distance running has greatly increased over the last three decades.11 An average runner usually strikes the ground around three times per second.9 The description of how the foot touches the ground during running, called foot strike pattern (FSP), depends on the location of the first contact area with the ground23 and rearfoot (RS), midfoot (MS) and forefoot (FS) strike are the common classifications. If the FS is not followed by heel contact (as it is, instead, in the toe-heel-toe pattern), it is called toe strike (TS).30 FSPs during running have already been linked to injury risk15,22,35,39,46 and performance.17,21,23,27,30,31,34,37,45 Even if the conclusions are often contradictory and retrospective, it is clear that the study of FSPs is becoming increasingly connected not only to élite, but also to recreational sports.

The most used method to examine FSPs is the video analysis of the recorded landing.5,16,23,25,27 This approach, however, is time-consuming and observer-dependent, since there is a need of trained people to manually scan the video recordings. Moreover, as it has been recently reported,5 the reliability of the observations decreases when the analyzed foot is not the one in the foreground. Another possible approach would be to make a kinematic analysis using a video system.2 This solution can be a good asset when force plates are not available, but it certainly involves some preparation time to place the markers and calibrate the system. In addition, the compliance of the foot itself could produce some non-systematic errors.2 Furthermore, the setup should include at least two cameras in order to include both feet in the analysis, thus adding some complexity to the experimental setup. These video-based methods, though, always rely on some level of manual preparation or elaboration and cannot be easily automated. Recently, also the use of inertial sensors has been proposed.20 Given the inexpensive and lightweight nature of these devices, the approach is certainly appealing for in situ applications. A validation with a kinematic method, though, showed a low reliability for TS cases.20 Moreover, the strict requirements for sensors’ supports stiffness and the quite complex post-processing (i.e., filtering conditions), would require some experience and tuning before the proper application.

Other kinetic approaches, like the location of the center of pressure at impact,13 are widely used by pressure plate companies in their built-in software. This analysis alternative seems to be a good candidate for automating the evaluation process and the reason is twofold. First, it allows not only for a categorical classification of the FSP, but also for a quantification of it through the foot strike index (distance from the heel to the center of pressure at impact relative to total foot length). Second, it appears to be an appropriate metric when searching for correlation with injury risks.7,35 Nevertheless, the analysis’ boundary conditions must be clearly unfolded to the user, in order to: (a) correctly interpret and report the outcomes and (b) avoid inconsistencies like the presence of incomplete footprints when dealing with TS cases. The second point could be avoided by using pressure-sensitive insoles.33 These devices, though, lack in reliability in presence of lightweight participants and need a refined algorithm in order to detect the toe-off.33

The existing methods are either time-consuming, observer-dependent or lacking in accuracy when trying to give a real-time feedback to the participant during treadmill running. Therefore, it is clear that a substantial standardization of the methods employed to evaluate the FSPs is currently missing.

The development of an automatic numerical algorithm able to classify accurately FSPs during running would provide several benefits: first, it would avoid any observer dependence by automating the evaluation process. This characteristic can considerably enrich the analysis, potentially including in a single evaluation step a big number of step cycles at once. Another important advantage is the ease of use: the researcher does not need to be trained to identify the FSP. Finally yet importantly, the automatic nature of this evaluation model can speed up the throughput of the outcomes, thus giving the chance to give online-feedback information about the foot strike.

The objective of the current investigation was to validate an original FSP assessment technique based on the numerical analysis of foot pressure distribution. Further, we aimed to make the method completely reproducible and therefore the boundary conditions and the calculation process are explained step-by-step.

We validated our custom algorithm against one of the most used and accepted methods, the video analysis,5,15,23,27,37,38 using a large sample size. The investigation was conducted across a wide spectrum of submaximal speeds and two different conditions (barefoot and shod) in order to prevent the analysis of only very specific conditions. Finally, the treadmill approach ensures a consistent number of gait cycles to analyze and average. This significantly improves the impact of the method compared to others considering only one or two steps by using fixed cameras while running over ground or force plates.2,5,16,2325,27,28,30,31,38,41

Materials and Methods

Experimental Design

We recruited people of both genders and various ages (n = 145; 85 male, 60 female, see Table 1 for details). The criteria for assessing their running experience were defined as follows. Inexperienced (n = 57; 29 male, 28 female): people that were inactive, doing other sports29 or that just took up running5 (for a period ≤1 year). Recreational (n = 67; 41 male, 26 female): people running more than 20 km/week30,32 and averaging three or more sessions/week in the past 5 years.10 Competitive (n = 21; 15 male, 6 female): athletes running more than 40 km/week,29,34 registered in athletics or triathlon clubs and competing at the regional, national or international level in any event, except throwing and walking.4

Table 1 Participants’ anthropometric characteristics and running velocities.

All of them gave informed consent for the experimental procedure, according to the rules of the local scientific board. None of the participants showed or reported any history of neuromuscular or musculoskeletal impairments at the time of the measurements. Moreover, in the 6 months prior to the measurements day, none of them has suffered any injury to the lower limbs and they all reported to be habitually shod during daily life and, if applicable, when running. No participant was an experienced barefoot runner.

The FSP assessment method was double: a video analysis using a high-speed camera and an automated evaluation using an algorithm to interpret pressure distribution data. After checking its reliability by comparing the outcomes of eight different observers, the video analysis was considered as a reference technique for validating the custom software.

Material

The foot strike videos were recorded using a high-speed video camera (Flare 4M180-CCL, IO Industries Inc., London, Canada) operating at 550 Hz, with dedicated recording software (Simi Grab 2.1.1, Simi Reality Motion Systems GmbH, Unterschleissheim, Germany). The resolution was set to 984 × 400 pixels. Pressure distribution patterns were recorded at 120 Hz through a pressure plate (FDM-THM-S, zebris Medical GmbH, Isny im Allgäu, Germany) integrated in a treadmill (mercury, H-p-cosmos Sports & Medical GmbH, Nussdorf, Germany). The pressure plate data were acquired using the proprietary software (WinFDM-T v2.5.1, zebris Medical GmbH, Isny im Allgäu, Germany) and then extracted in a raw format for autonomous post-processing (R version 3.1.2, R Foundation for Statistical Computing, R Core Team, Vienna, Austria). Both the camera and the plate were synchronized using an analogue signal. The camera was set up 350 cm laterally to the left side of the treadmill, mounted on a tripod at a height of 29.5 cm and angled perpendicular to the sagittal plane of the subjects.

Protocol

On the treadmill, the participants conducted a self-selected warm-up,19,31 in order to choose a comfortable running pace. The procedure to find the comfortable speed was implemented using the method of limits.43 The speed was randomly increased with steps of 0.02 to 0.05 m/s at varying time intervals (around 5 to 10 s) until the participant was comfortable with a specific pace. Then the operation was repeated starting from a faster speed (around 0.5 to 1 m/s higher) and randomly decreasing it as previously done. If the comfortable value was not differing more than 10% from the previous, the average of the two values was taken as the preferred. Otherwise, the whole procedure was reiterated. The warm-up protocol typically lasted between 5 and 10 min, with an average of 6.8 ± 1.2 min.

After being instructed about the protocol, the participants ran for 90 s in four different conditions: preferred speed (shod, 2.8 ± 0.4 m/s), faster speed (shod, 3.5 ± 0.6 m/s) slower speed (shod, 2.3 ± 0.3 m/s), preferred speed (barefoot). For competitive and recreational runners, the faster speed was then determined calculating the average speed maintained during the personal best time effort over a 10 km race (mean 125 ± 7% of preferred speed). If no information was available (e.g. for inactive participants), the faster speed was set as the 125% of the preferred. The slower speed was set as the 75 to 85% of the preferred speed (mean 80 ± 4%), accordingly to each participant’s preference. As a guideline to identify the slower speed, the participants were asked to report a maximum value of 2 (i.e., weak, light effort) on the modified Borg Rating of Perceived Exertion scale.6 Each condition was repeated, thus giving eight datasets for each subject. The order of the eight trials was completely randomized. Before every trial, each participant performed 60 s of familiarization running on the treadmill, in order to allow for accommodation.44 Following, a dataset of 30 s was recorded. There were no stops between the trials, except the one or two necessary brakes to take the shoes off before running in the barefoot condition.

Analysis

Two main parts formed the FSP analysis: the video analysis as a well-accepted reference technique and the pressure plate data elaboration through a new numerical computation algorithm.

For the video analysis, eight observers were trained to identify the three FSPs (i.e., RS, MS and FS) by showing them ten trivial (clearly identifiable) and ten non-trivial videos taken outside the present study. A trivial video is namely a representation of an unambiguous FSP (e.g. a strong RS, where the rearfoot unequivocally touches the ground before any other part of the foot). On the contrary, a non-trivial video is a recording where the FSP is not clearly identifiable (e.g. a MS or a light FS or RS, where the gap between the outsole and the ground is not evenly assessed among different observers because of lighting conditions, superposition of similar colors, etc.). After becoming able to classify all the subjected data, the observers were asked to look at the study’s videos and to recognize the RS, MS and FS cases. As reported from other studies,5 there is indeed a decline in reliability of around 10% when making a video analysis of the foot in the background. To avoid this potential source of measurement uncertainty, only the data related to the left foot (foreground image) are presented.

A typical analysis scenario is shown in Fig. 1. We report here only the data related to observations of the foreground foot (left), in order to avoid any additional measurement uncertainty5 due to the difficult interpretation of a background image.

Figure 1
figure 1

A typical video analysis scenario and the correspondent pressure distributions at impact. Pictures (a) and (b) represent the rearfoot strike, while (c) and (d) show the forefoot strike. Pictures (e) and (f) show the difficulty of assessing the FSP using the video analysis.

The pressure plate-data elaboration revolves around the concept of strike index (SI). The SI, as originally defined by Cavanagh and Lafortune13 and then adopted from several other authors,1,2,8,14,20,21,26,28,30,42 is the distance from the heel to the center of pressure at impact relative to total foot length. However, the most important assumption made from this method is to know the foot (or shoe) length. The plate itself cannot measure this quantity accurately, since in cases like the TS the entire foot does not touch the treadmill. Therefore, for a clear identification of the FSPs from the pressure distribution data, the first step is to determine for each participant the footprint lengths. The measured lengths with and without shoes (for shod and barefoot trials, respectively) have been used as a reference to carry on the analyses. A custom-made caliper has been used for the measurements. The bare foot length has been considered as the distance from the pternion point to the most anterior point of the longest toe, measured parallel to the foot axis. In a similar way, the shoe length was measured as the distance between the perpendicular projection to the ground of the most posterior and the most anterior points of the outsole. These values constitute the foot and shoe measured lengths. The information regarding the footprint length is necessary especially when dealing with incomplete footprints during running (like in the case of TS, where the heel never touches the plate), since the pressure plate does not give any information about how long the original footprint should be.

To identify the forefoot, the midfoot and the rearfoot, the footprint is divided into three geometrically equal parts, each representing one third of the total length. Using the pressure values of the individual foot recorded from the plate, the code evaluates the footprint length (calculated length) along the treadmill’s anterior–posterior axis. If the calculated value differs more than 5% from the measured one (e.g. for TS cases), the footprint is corrected with the “real” value, like shown in Fig. 2. The most important assumption underlying this step is that, during the toe-off phase, the tip of the shoe or of the foot always touches the ground. The width of the foot is considered as the widest footprint recorded. The footprint is thus localized within its real boundaries: this is done by expanding each pressure matrix in length and width with the appropriate number of zero elements in order to reach the correct length and width.

Figure 2
figure 2

Graphical representation of the footprint’s length correction. In this strong toe strike case, the participant never touched the ground with the rearfoot, as shown in the pressure distribution (a) and in the ground reaction forces (b) graphs. The identification of fore-, mid- and rearfoot is possible only after the footprint correction via software (a).

The calculation of the SI, then, automatically provides one of the three FSPs (RS, MS or FS, being the TS case included in the FS). To temporally locate the impact, the first recorded data after the swing phase has been taken as a reference, thus defining “impact” as “initial contact”.13 In the algorithm this is considered as the first non-zero pressure matrix after the last toe-off. In Fig. 3, a flowchart shows the logic of the FSP determination algorithm, emphasizing the individual steps and their interconnections.

Figure 3
figure 3

Flow chart showing the logic of the FSP determination algorithm. Every fundamental step and interconnection are reported for reproducibility purposes.

Statistics

We calculated a two-way Intraclass Correlation Coefficient (ICC) for single measurements to assess the agreement between eight different observers conducting the video analysis.

The 95% confidence interval of our sample’s margin of error was estimated through a bootstrapping procedure, in order to have an indication about the uncertainty of our FSP estimate. The original data set was resampled 10,000 times with replacement, considering the FSP as the main parameter. Starting from a sample size of 41 in order to consider the sampling distribution to be normal (central limit theorem), the procedure was repeated until the total sample size was reached, with increasing steps of 10. The minimum sample size was chosen since the central limit theorem states that the sampling distribution of the statistic is normal for sample sizes greater than 40. After elaborating the data, the agreement between the video analysis and the numerical approach was calculated, thus comparing our algorithm with a reference technique.

Results

As shown in Table 2, the agreement between the eight observers is usually higher for RS cases in all conditions (ICC values from 0.83 to 0.96, confidence level 0.99). The FS pattern assessment suffers a decrease in ICC value for the shod condition at faster and slower speeds (ICC values 0.65 and 0.73, respectively), while it produces higher agreement in the other conditions (ICC values 0.86 for the shod condition, preferred speed and 0.92 for the barefoot case). In evaluating MS cases, the observers never reach high agreement (ICC values 0.51 to 0.64). When joining the MS and FS strikes into a single pattern (MFS), thus identifying only two types of FSP, the ICC inevitably increases for all conditions (0.83 to 0.96). For this reason, the video analysis can be a proper reference method only when considering the RS and the joint MFS patterns.

Table 2 Intraclass correlation coefficient (ICC) calculated between the outcomes of eight observers’ video analyses.

As reported in Table 3, the video analysis, averaged among eight observers, found 76.7% RS cases. The joint MFS pattern, accordingly, constitutes the 23.3% of the total 1160 observations (145 participants, five conditions, two trials for each condition). Being the measurement uncertainty of the numerical analysis independent on the foot considered, the results are presented for both the left and the right feet.

Table 3 Comparison of the video analysis’ outcomes with the numerical results.

Combining all the results (preferred, faster and slower speed shod and preferred speed barefoot), the numerical analysis found 78.3% left and 77.8% right RS cases (see Table 3). Joining again the MS and FS patterns into a single one (MFS), makes the new pattern to contribute for 21.6% (left) and 22.2% (right) of the total observations. The computation times are 0.45 s for every second of recorded data on Intel® Core™ i5-5250U @ 2.70 GHz with 8 GB RAM on Windows 7 64-bit and 0.15 s on Intel® Xeon® X5650 @ 2.66 GHz with 48 GB RAM on Windows 7 64-bit. To assess the validity of the numerical approach, the outcomes of the RS and MFS patterns were compared with the video analysis. The investigation of RS and MFS has been chosen because in these two FSP cases the reliability of the video analysis was very high. The ICCs, related to the left foot results, are calculated on the number of observations for each FSP. The RS and MFS cases return high values (0.93). All agreements are significant (p < 0.01).

The 95% confidence interval estimation of our dataset’s standard errors gave the values reported in Fig. 4. The margin of error was calculated in order to estimate the likelihood of obtaining results close to the whole population’s. The following lines specify the values used to create the histogram in Fig. 4. The four conditions’ video analysis presented the following FSPs: shod, preferred speed, 88.3% RS and 11.7% MFS; shod, faster speed, 87.6% RS and 12.4% MFS; shod, slower speed, 84.8% RS and 15.2% MFS; barefoot, preferred speed, 46.2% RS and 53.8% MFS. The numerical analysis of the left footprints produced: shod, preferred speed, 89.0% RS and 11.0% MFS; shod, faster speed, 89.0% RS and 11.0% MFS; shod, slower speed, 86.9% RS and 13.1% MFS; barefoot, preferred speed, 49.0% RS and 51.0% MFS. The numerical analysis of the right footprints produced: shod, preferred speed, 88.3% RS and 11.7% MFS; shod, faster speed, 87.6% RS and 12.4% MFS; shod, slower speed, 86.2% RS and 13.8% MFS; barefoot, preferred speed, 49.0% RS and 51.0% MFS. The related margins of errors were 5.2, 5.4, 5.8 and 8.1% for the video analysis; 5.0, 5.0, 5.5 and 8.1% for the numerical analysis of the left footprints; 5.0, 5.1, 5.4 and 8.1% for the numerical analysis of the right footprints. The relative standard errors values are 2.6, 2.7, 3.0 and 4.1% (video analysis); 2.6, 2.6, 2.8 and 4.2% (numerical left); 2.6, 2.6, 2.8 and 4.2% (numerical right).

Figure 4
figure 4

Percentage of rearfoot strikes (RS) detected with the video analysis (averaged among eight observers) and by the numerical method (for both left and right foot). The error bars represent ± se (standard error) values, calculated through a bootstrapping procedure.

Table 4 shows the FSP distribution during shod and barefoot running at a comfortable speed (here called “preferred speed”).

Table 4 Foot strike patterns occurrences during shod and barefoot running at preferred speed.

Discussion

This study aimed to create and validate an automatic method to evaluate FSPs during running. Using the plantar pressure distribution recorded by a pressure plate integrated in a treadmill, we created an algorithm able to detect the FSP during running. This provides the basis for the development of an online-feedback system. To validate the method, its agreement with a solid and often used reference technique—namely the analysis of the sagittal plane video of the striking foot—was checked. Among eight independent observers, the reference technique provided a very reliable RS and MFS patterns recognition. Therefore, it was considered an adequate gold standard for the evaluation of the new algorithm. The algorithm is able to quickly and accurately recognize the FSP and can therefore be easily used as a fast feedback tool. On a standard machine, the computation time is 45% of the recorded time (see results for details).

FSP assessment through video analysis showed a very high reliability in identifying RS cases. The rearfoot is most of the times easily located by the observers and the common strong dorsiflexion associated with a RS pattern30 helps in clearly defining the case. For similar reasons, also the FS case is often clearly isolated, especially when the plantar flexion right before the strike is substantial. There is, however, a low conformity among observers in determining MS cases. This is because the midfoot is often difficult to locate and often leads to the misinterpretation of the FSP. In addition, it is not always trivial to identify a gap or a contact between the outsole (or the bare foot) and the treadmill. These factors add an amount of uncertainty that translates in a lowered agreement between observers (low ICC). Joining, as we propose, the MS and FS cases into one single pattern called MFS solves most disagreeing cases. The decrease in ICC values for the shod condition at faster and slower speeds also suggests a dependence of the inter-rater reliability on the running speed. Since the foreground foot was the left, the image of the right foot was often difficult to interpret. A previously-reported decline in reliability when analyzing the foot in the background,5 convinced us to only present the data related to the image in the foreground, namely the left foot. Additionally, the video analysis is not adequate for giving online-feedback information about the FSP and proved to be extremely time-consuming, especially when the number of trials is not small. The video analysis, one of the most popular methods for FSP assessment,5,15,23,27,37,38 is anyhow solid enough to be considered as a reference technique for validating our algorithm. The uncommonly big sample size of 145 participants was chosen in order to reduce the chance of biases in our measurements. In these kinds of studies, it is very common to use small sample sizes.2,20,28,31,37,38,41,42 This is partly due to the complex structure of some experimental setups. The chosen sample size can be evaluated by estimating the margin of error. This quantity contains the information about the uncertainty with which one predicts to describe the whole population. This means that our sample estimate will not differ from the true population’s by more than the margin of error values 95% of the time (the chosen confidence level). This is obviously a gross estimation that does not take into account all the underlying biases that might be present, but it is a starting point for evaluating the sample size.

In addition to what was previously done in other studies,5,23,25,27,41 we decided to widen the set of conditions in order to test our automatic foot strike detection across various circumstances. Therefore, shod running data were recorded at three endurance running speeds. Further, barefoot running data (at preferred speed) were acquired. Moreover, using a treadmill allowed us to have a big number of gait cycles to analyze. This aspect is crucial for a task like running during which a certain amount of adaptation, albeit small, is needed before reaching the cyclic-repetitive state. For this reason, 30 s after at least 60 s accommodation44 for each condition were recorded, excluding since the beginning the possibility of acquiring only single steps, thus minimizing the effects of artefacts in the assessment of the FSP for each participant and condition.

Since the video analysis (reference technique) is associated with a lack of reliability when dealing with MS cases, only the RS and the joint MFS patterns were used to validate our numerical approach against the reference. The agreement investigation between the two methods produced significantly high values. This will allow us, in the future, to conduct any treadmill-based study by using only our numerical approach for all FSPs (RS, MS, FS and TS). The numerical analysis, supported by a sample size of 145 participants, gave results that are consistent with previous findings.23,24 There is a clear dominance of RS patterns in the shod condition and MFS patterns in the barefoot condition. These numbers should be interpreted as referred to a sample of habitually shod runners that had no experience of barefoot running at the moment of the study. The fully automated process avoids any observer influence, thus producing objective results. The high-throughput nature of the numerical analysis helps to dramatically reduce the computation time and increase the efficiency of the FSP assessment.

With our approach, we introduced a foot length correction, which is key for producing accurate results. Peculiar cases can cause difficulties in analyzing data. The TS, for instance, produces a much shorter footprint than the original shoe (or foot). This would lead, without any additional information about the real length of the shoe or foot, to a wrong output. The algorithm would consider the footprint as complete even if only the forefoot and a portion of the midfoot (typical TS case) formed it. To avoid these singularities, the automatic algorithm needs the shoe (or foot) length as an initial input. A quick and easy measurement of the shoe length (for the shod condition analysis) and of the foot (for the barefoot cases) allows our algorithm to correctly locate the pressure information inside the real footprint. Therefore, every possible special case can be automatically analyzed. Furthermore, this method allows for within-person analysis, taking into account any possible asymmetries. This feature would permit a higher level of online-feedback, increasing the amount of available information for both the researcher and the participant.

A potential limitation of this validation might be in its specificity to treadmill running. Even if there is evidence of similarity between overground and treadmill running,40 most of the participants in this study run predominantly outdoors rather than on a treadmill. Also, the participants chose their own footwear and speed, thus not allowing for generalized conclusions on these parameters. Another possible limitation is undoubtedly given by the measurement system. The big size of the sensors (8.47 × 8.47 mm) and the relatively low sampling rate (120 Hz) of the pressure plate, may lead to accuracy issues when the requirements on the measurement uncertainty are particularly stringent.