1 Introduction

Motion capture is an important tool in various research areas and frequently used to collect kinematic input data for musculoskeletal models to estimate the muscle, joint and ligament forces [13]. One of the most commonly used motion capture methods is a combination of infrared cameras and skin markers [4]. Unfortunately, this method has limitations: 1) it is time consuming [5], 2) markers can become occluded [6], 3) marker-based systems (MBS) are complex and spacious [7] and 4) markers can move relative to the underlying bone, a phenomenon known as soft tissue artefact [8, 9].

In recent years, the Microsoft Kinect Sensor (MKS) (Microsoft Corp., Redmond, WA, USA) has attracted the interest of researchers due to its potential application for motion analysis [5, 7, 1015]. Originally developed to control gaming devices through gestures and voice commands, the MKS is a portable, easy-to-use, commercially available and significantly cheaper 3D motion capture system, compared to MBSs. For gesture recognition, the MKS combines an infrared laser projector and video camera to project a speckle pattern onto objects in its field-of-view, and creates a 3D map based on the recorded deformations in this pattern [7, 16]. Previous investigations of the MKS have shown encouraging results for the tracking of 3D marker coordinates [7], anatomical landmark positions and angular displacements [11], and shoulder abduction range-of-motion (ROM) [5]. However, the MKS only detects body segments directly in its field-of-view, which limits the sensor’s ability to track full-body movements, where body segments can obstruct each other. Recently developed software called iPi Motion Capture (iPi Soft, LLC, Moscow, Russia) is able to support two MKSs, which enables simultaneous tracking of all body segments. Additionally, a previous investigation has demonstrated that the iPi software provides higher accuracy when tracking upper-body movements compared to freely available software [13]. If this system can provide accuracy comparable to a MBS, it would result in a compact and cheap motion capture system. However, two issues need to be addressed if the MKS is to be used in musculoskeletal modeling: firstly, the methodology for applying motion capture data obtained using the MKS, or a similar device, to drive musculoskeletal models does not currently exist and secondly, it is not possible to combine MKS and force plate data.

Therefore, the purpose of this investigation was to 1) develop a musculoskeletal model driven by marker-less motion capture data, obtained using two MKSs and iPi Motion Capture software, without the use of force plates and 2) to evaluate the model’s kinematic and kinetic outputs against those obtained when simultaneously recorded skin-marker trajectories with measured (MGRF) and predicted ground reaction forces (PGRF) were used as input to the model. This approach aimed at providing a framework for applying marker-less and force plate-less motion capture data as input to musculoskeletal models, in general, while validating the model outputs associated with the Kinect-based marker-less system (MLS) data.

2 Materials and methods

2.1 Experimental data

Ten healthy males (age \(23.50 \pm 1.27\) years, height \(181.60 \pm 4.40\mbox{ cm}\), weight \(76.12 \pm 5.26\mbox{ kg}\)) volunteered to participate in the investigation and provided written informed consent.

During data collection, participants only wore tight fitting underwear or shorts, which enabled the placement of markers on the body as well as allowing the MLS to distinguish body segments. The following movements were executed: 1) gait at a self-selected pace, 2) unloaded shoulder abduction (SA) and 3) loaded shoulder abduction (LSA). These movements were chosen in order to determine the MLS’s ability to track both full-body and isolated upper-body movements. Furthermore, the inclusion of both loaded and unloaded shoulder abduction enabled evaluation of the model’s kinetic output in response to different loading conditions. Participants were instructed to walk at their self-selected pace and completed five gait trials. Likewise, five trials were completed for each of the two shoulder abduction tasks, each consisting of three consecutive repetitions. For the SA and LSA trials, participants were instructed to raise their dominant arm to an approximately horizontal position with and without a 3 kg dumbbell held in their hand, respectively. One repetition was completed when the participants returned their arm to its starting position along the torso. Four successful trials of each movement were selected for further analysis, as data from single trials for a number of participants were incomplete due to either marker occlusion more than 10% or incomplete movement reconstruction in the iPi software. For test subjects where all trials were successful, we excluded the first collected trial.

Data were simultaneously collected using the MLS and MBS. The MLS consisted of two MKSs, sampling at 30 Hz, and the recordings were processed using iPi Recorder v. 2.2.2.27 (iPi Soft LLC, Moscow, Russia). The MKSs were positioned 4.7 m from each other, elevated 1 m from the ground with an angle of 86.5 degrees between the sensors’ field-of-views, which resulted in a distance of approximately 3.4 m from the sensors to the center of the measurement volume. Thirty-three passive reflective markers were placed on the participants using a full-body protocol and their trajectories tracked using a MBS consisting of eight infrared high-speed cameras (Oqus 300 series), sampling at 100 Hz, combined with Qualisys Track Manager v. 2.7 (Qualisys, Sweden). Ground reaction forces (GRF) were obtained at 2000 Hz using three force plates (AMTI, MA, USA).

2.2 Computational methods

2.2.1 Full-body model

The musculoskeletal models were developed using the Anybody Modeling System (AMS) v. 6.0.2 (AnyBody Technology A/S, Aalborg, Denmark) based on the GaitFullBody template from the AnyBody Managed Model Repository v. 1.5, in which the lower extremity model is based on the cadaver data set collected by Horsman et al. [17], the lumbar spine model based on the work of de Zee et al. [18] and the shoulder and arm model based on the work of the Delft Shoulder Group [1921]. For each trial, two musculoskeletal models were created: one driven by the marker-less motion capture data and one by the marker-based data. The steps included in the two modeling procedures are illustrated in Fig. 1.

Fig. 1
figure 1

Top: marker-less data analysis workflow, where the data is 1) collected from two Microsoft Kinect Sensors using iPi Recorder, 2) tracked in iPi Mocap Studio, 3) the resulting stick figure is imported in AMS and tracked by a scaled musculoskeletal model, and 4) inverse dynamics, including prediction of ground reaction forces, is performed. Bottom: marker-based data analysis workflow, where the data is 1) collected with infrared cameras and force plates using Qualysis Track Manager, 2) imported into AMS for inverse kinematic analysis based on the marker trajectories and 3) kinetic analysis is performed with predicted and measured ground reaction forces

2.2.2 Geometric and inertial parameter scaling

In order to scale the cadaver-based model to the different sizes of the subjects, a length-mass-scaling law [22] was applied, which utilize the segment lengths as predictor. These segment lengths were estimated differently between the two models as will be explained later.

The total body mass was distributed to the individual segments using the regression equations of Winter et al. [23]. Geometric scaling of each segment was accomplished by introducing a linear diagonal scaling matrix that was applied to each point on the segment, including the geometric center-of-masses. The entry of the scaling matrix for the longitudinal direction was computed as the ratio between the unscaled and scaled segment lengths. In the two other orthogonal directions, the scaling was computed as the square root of the mass ratios divided by the length ratios between the scaled and unscaled models [22].

For estimation of the mass moments of inertia, the segments were assumed cylindrical with a uniform density, with the length and mass equal to the segment length and mass.

2.2.3 Muscle recruitment problem

The muscle recruitment problem was solved by formulating a polynomial optimization problem that minimizes a scalar objective function, G, subject to the dynamic equilibrium equations and non-negativity constraints, ensuring that the muscles can only pull and that the muscle forces do not exceed the strength of the muscles:

$$\begin{aligned} &\min_{\mathbf{f}} G \bigl( \mathbf{f}^{\mathrm{M}} \bigr) = \sum_{i = 1}^{n^{(\mathrm{M})}} A_{i} \biggl( \frac{f_{i}^{( \mathrm{M})}}{N_{i}}\biggr) ^{3} \\ & \mathbf{Cf} = \mathbf{d} \\ & 0 \leq f_{i}^{( \mathrm{M})} \leq N_{i}, \quad i =1, \ldots , n^{(\mathrm{M})}. \end{aligned}$$
(1)

\(\mathrm{M}\) indicates the muscle forces, \(f_{i}^{({\mathrm{M}})}\) is the \(i\)th muscle force, \(n^{({\mathrm{M}})}\) is the number of muscles and \(N_{i}\) is the strength of the muscle. \(\mathbf{C}\) is the coefficient matrix for the dynamic equilibrium equations, \(\mathbf{f}\) is a vector of unknown muscle and joint reaction forces, and \(\mathbf{d}\) contains all external loads and inertia forces. Finally, \(A_{i}\) is the physiological cross-sectional area (PCSA) of each muscle unit and, for split muscles, each unit was assigned the corresponding fraction of the total muscle PCSA, resulting from a uniform subdivision by the number of units. Further details as regards muscle recruitment can be found in Damsgaard et al. [24] and Marra et al. [25].

The strengths of the muscles were assumed to be constant, independent of the muscle length and contraction velocity, and set to the values reported in the data sets for the different body parts. Furthermore, the muscle strengths were adjusted using a strength scaling factor based on fat percentage [22]. The fat percentage was estimated from each subject’s Body-Mass-Index, which was determined using the regression equation for men reported by Frankenfield et al. [26].

In both models, muscles were added to the lower extremities. For the shoulder abduction trials, additional muscles were added to the torso, shoulder and arm. The dumbbell weight, associated with the LSA, was modeled as a downward vertical vector applying a force of 29.4 N at the palm of the hand.

2.2.4 Musculoskeletal model driven by the marker-less data

Data obtained using the MLS were processed in iPi Mocap Studio v. 2.5.1.159 (iPi Soft, LLC, Moscow, Russia), which fitted a 19-segment stick figure to each frame in the depth data generated by the MKSs. The stick figure was exported from iPi Mocap Studio and imported into AMS together with the GaitFullBody model.

The GaitFullBody model was set up to automatically scale the segment lengths according to the joint-to-joint distances of the stick figure. The segment lengths in the musculoskeletal model that could be directly computed from the stick figure were scaled based on joint-to-joint distances, as for instance the pelvis width (hip-to-hip joint center distance). For the trunk, hands and feet, however, the segment lengths were not directly obtainable from the stick figure. Therefore, an additional step was introduced in order to scale these segments in the model. First, additional nodes were added to the musculoskeletal model at locations corresponding to points identifiable on the stick figure. Second, the unscaled model was placed in a neutral position and the distance between the added nodes were computed and saved together with the unscaled segment lengths. Subsequently, the ratio between the unscaled segment lengths and the nodes was multiplied onto the segment length measurements on the stick figure before being used to scale the respective segments in the musculoskeletal model.

To obtain tracking of the stick figure by the musculoskeletal model, virtual markers were introduced on both the stick figure and the musculoskeletal model. On the stick figure, virtual markers were located around anatomical landmarks that were possible to define on both the stick figure and the musculoskeletal model (further details are provided as supplementary material). Based on these virtual markers, a nonlinear least-square optimization problem was defined that minimized the least-square difference between the virtual markers on the stick figure and those on the musculoskeletal model. This optimization problem was solved using the method of Andersen et al. [27]. This tracking ensured that the musculoskeletal model optimally tracked the stick figure even though the two models differed in segment and joint definitions. Due to the lack of measurements of subtalar eversion and neck flexion, these were fixed in neutral positions.

Since GRFs were not measured by the MLS, these were predicted by the model based on measured full-body kinematic data only. This was enabled by introducing artificial muscle-like actuators at 12 contact nodes defined under each foot. To overcome the underdeterminacy issue during double support, the computation of the GRFs was made part of the muscle recruitment algorithm. Five unidirectional actuators were added to each contact node, which combined were able to generate a normal force perpendicular to the laboratory floor and static friction forces (with a friction coefficient of 0.5) in the medio-lateral and anterior-posterior directions. Ground contact was defined as established when the node was within 50 mm of the ground plane and the velocity of the node relative to the ground was below 1.3 m/s. The transition from no contact to full contact condition was smoothed similarly to the procedures of Skals et al. [28] using the velocity of the nodes. Further details and validation of the method for an array of activities of daily living and sports-related movements were provided in Fluit et al. [29] and Skals et al. [28], respectively.

2.2.5 Musculoskeletal model driven by the marker-based data

For the models driven by the marker-based data, the model scaling and kinematic analysis were performed using the optimization methods of Andersen et al. [8, 27]. Firstly, for each subject, the model segment lengths and model marker positions were estimated by minimizing the least-square difference between model and experimental markers using the method of Andersen et al. [8] for a selected gait trial. These segment lengths and marker positions were subsequently saved and used for the analysis of all other trials. Secondly, the optimized segment lengths and marker positions were loaded and the least-square difference between model and experimental markers minimized to obtain the model kinematics [27]. Finally, two different versions of kinetic analysis were performed: one where the MGRFs were applied under the feet, and the muscle and joint reaction forces (JRF) computed using muscle recruitment [24], and one where the ground reaction forces were predicted similarly to the Kinect-based model (PGRF). Further details regarding the marker protocol and marker optimization procedure is provided as supplementary material.

2.3 Data analysis

A complete gait cycle, i.e. from heel strike to heel strike, was analyzed for the gait trials. Shoulder abduction trials were analyzed from when the arm began its migration away from the torso until it returned to the initial position. For the gait trials, the following data were selected for analysis: vertical GRF, joint angles and moments for ankle plantar/dorsi flexion, knee flexion/extension, hip flexion/extension, abduction/adduction and internal/external rotation, resultant JRFs for the ankle, knee and hip, and muscle forces for the gastrocnemius, vasti and glutei. To account for the fact that the muscles were split into multiple branches in the models, the average force across all muscle branches were used in the analysis. In addition, peak resultant JRFs, peak vertical GRF, peak muscle forces and joint ROMs were computed. For the shoulder abduction trials, the following data were selected for analysis: shoulder flexion/extension, abduction/adduction and internal/external rotation angles and moments, resultant glenohumeral JRF, muscle force for the deltoideus, peak resultant glenohumeral JRF, deltoideus peak force and joint ROMs.

To compare the variables, Pearson’s correlation coefficient (\(r\)) and root-mean-square deviation (RMSD) were computed for each trial separately and presented as the mean \({\pm} 1\) SD. The absolute values of \(r\) were categorized as weak, moderate, strong and excellent for \(r \leq 0.35\), \(0.35 < r \leq 0.67\), \(0.67 < r \leq 0.90\) and \(0.90 < r\), respectively [30]. The Friedman test was applied to statistically compare the peak vertical GRFs, peak resultant JRFs, peak muscle forces and ROMs between the three methods, and Wilcoxon paired-sample tests were used for post-hoc analysis if significant differences were found between the three groups. To account for multiple testing, a Bonferroni correction was implemented for the Friedman test and significant differences are only reported for \({p} < 0.05/11 = 0.0045\). Joint moments and all forces are expressed as percentage of bodyweight times height (% BW × BH) and percentage of bodyweight (% BW), respectively.

3 Results

The time-histories of the selected variables for the gait and shoulder abduction trials are depicted in Figs. 2, 3, 4 and 5, 6, 7, respectively. Correlation coefficients and RMSDs are listed in Tables 1a, 1b, 1c for gait and Table 2 for the shoulder abduction trials. The results of the Wilcoxon signed-rank tests are summarized in Tables 3a, 3b, 3c. The Friedman test showed significant differences between the three methods for all peak forces (\(p \leq 0.0045\)) with the exception of the SA glenohumeral peak resultant JRF (\(p =0.6077\)), and the Wilcoxon signed-rank test was, therefore, applied as post-hoc analysis for the remaining variables.

Fig. 2
figure 2

Results for gait, illustrating the ankle plantar/dorsi flexion angle (top left), knee (top center) and hip flexion/extension angle (top right), ankle plantar/dorsi flexion moment (middle left), knee (middle center) and hip flexion/extension moment (middle right), ankle (bottom left), knee (bottom center) and hip resultant JRF (bottom right). The results for the marker-based system with MGRFs (red) and PGRFs (black), and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Fig. 3
figure 3

Results for gait, illustrating the hip abduction/adduction and internal/external rotation angles (top) and moments (bottom). The results for the marker-based system with MGRFs (red) and PGRFs (black), and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Fig. 4
figure 4

Results for gait, illustrating the vertical GRF (top left), gastrocnemius (top right), vasti (bottom left) and glutei force (bottom right). The results for the marker-based system with MGRFs (red) and PGRFs (black), and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Fig. 5
figure 5

Results for SA, illustrating the shoulder flexion/extension, abduction/adduction and internal/external rotation angles (top) and moments (bottom). The results for the marker-based (red) and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Fig. 6
figure 6

Results for LSA, illustrating the shoulder flexion/extension, abduction/adduction and internal/external rotation angles (top) and moments (bottom). The results for the marker-based (red) and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Fig. 7
figure 7

Results for SA (left) and LSA (right), illustrating the glenohumeral resultant JRF (top) and deltoideus force (bottom). The results for the marker-based (red) and marker-less system (blue) are illustrated as the mean \({\pm} 1\) SD (shaded area) over the subjects and trials (Color figure online)

Table 1a Correlation and RMSD between the marker-based system with MGRFs and MLS for the selected variables during gait. The results are presented as the mean and standard deviation over the subjects
Table 1b Correlation and RMSD between the marker-based system with MGRFs and PGRFs for the selected variables during gait. The results are presented as the mean and standard deviation over the subjects
Table 1c Correlation and RMSD between the marker-based system with PGRFs and MLS for the selected variables during gait. The results are presented as the mean and standard deviation over the subjects
Table 2 Correlation and RMSD between the marker-based system with MGRFs and MLS for the selected variables during SA and LSA. The results are presented as the mean and standard deviation over the subjects
Table 3a Results of the Wilcoxon signed-rank test between the marked-based system with MGRFs and MLS, listing the \(p\)-value, mean difference and standard deviation for joint ROMs and peak forces. Significant differences are indicated with a for \({p} < 0.0045\)
Table 3b Results of the Wilcoxon signed-rank test between the marked-based system with MGRFs and PGRFs, listing the \(p\)-value, mean difference and standard deviation for joint ROMs and peak forces. Significant differences are indicated with a for \(p < 0.0045\)
Table 3c Results of the Wilcoxon signed-rank test between the marked-based system with PGRFs and MLS, listing the \(p\)-value, mean difference and standard deviation for joint ROMs and peak forces. Significant differences are indicated with a for \(p < 0.0045\)

Comparable results were found between the MBS with MGRFs and PGRFs for all analyzed variables (see Table 1b) with correlations ranging from 0.77 (hip internal/external rotation moment) to 0.94 (knee and hip resultant JRF), and RMSDs ranging from \(0.50 \pm 0.11\) (hip internal/external rotation moment) to \(56.60 \pm 12.57\) (ankle resultant JRF). Furthermore, the shape and magnitude differences between the two methods and the MLS were almost identical, so in the following, only the comparisons between the MBS with MGRF and the MLS are summarized. It should be noted, however, that the Wilcoxon signed-rank test showed significant differences between the two methods for all peak forces (see Table 3b) with the exception of the peak vertical GRF (\(\mbox{mean diff.} = -0.72 \pm 3.72\), \({p} = 0.2477\)).

When comparing the MBS and MLS, similar results were found for the shoulder abduction/adduction angles, glenohumeral resultant JRFs and deltoideus forces during both shoulder abduction tasks, the shoulder flexion/extension and abduction/adduction moments during LSA, and the vertical GRF during gait. However, the MLS’s tracking of the lower body during gait showed discrepancies compared to the MBS and was, in general, inconsistent with correlations ranging from −0.63 (hip internal/external rotation angle) to 0.82 (hip flexion/extension angle) (see Table 1a). The Wilcoxon paired-sample tests (see Table 3a) showed significant differences for the ankle plantar/dorsi flexion ROM (\(p <0.0001\)), knee (\(p <0.0001\)) and hip flexion/extension ROM (\(p <0.0001\)), peak vertical GRF (\(p <0.0001\)), knee (\(p <0.0001\)) and hip peak resultant JRF (\(p <0.0001\)), and glutei peak force (\(p <0.0001\)) during gait. For the shoulder abduction tasks, significant differences were found for the SA shoulder internal/external rotation ROM (\(p =0.0044\)), LSA glenohumeral peak resultant JRF (\(p <0.0001\)) and deltoideus peak force (\(p <0.0001\)).

3.1 Gait

Strong correlations were found between the systems for the vertical GRF (0.85), knee flexion/extension angle (0.81), hip flexion/extension angle (0.82) and abduction/adduction angle (0.81), ankle plantar/dorsi flexion (0.71) and hip abduction/adduction moment (0.77), ankle (0.80), knee (0.78) and hip (0.71) resultant JRFs, and glutei force (0.77). However, the other variables showed correlations ranging from weak to moderate with the hip internal/external rotation angle (−0.63) and moment (0.57) showing the outermost values. No significant differences were found for the hip abduction/adduction ROM (\(p =0.1704\)) and internal/external rotation ROM (\(p =0.0983\)), ankle peak resultant JRF (\(p =0.4356\)), gastrocnemius (\(p =0.1222\)) and vasti peak force (\(p =0.5633\)).

3.2 Shoulder abduction

For the SA trials, strong correlations were established for the shoulder abduction/adduction angle (0.89) and internal/external rotation angle (0.68), and no significant differences were found for the shoulder flexion/extension ROM (\(p =0.4046\)), abduction/adduction ROM (\(p =0.7572\)), glenohumeral peak resultant JRF and deltoideus peak force (\(p =0.0099\)). LSA showed excellent correlations for the shoulder abduction/adduction angle (0.96) and moment (0.90), while strong correlations were found for the shoulder internal/external rotation angle (0.72), glenohumeral resultant JRF (0.87) and deltoideus force (0.88). No significant differences were found for the shoulder flexion/extension (\(p =0.1322\)), abduction/adduction (\(p =0.5364\)) and internal/external rotation ROM (\(p =0.4436\)).

4 Discussion

We developed a musculoskeletal model driven by motion capture data obtained using a MLS, consisting of dual MKSs and iPi Motion Capture software, and evaluated the model outputs against those obtained from musculoskeletal models driven by simultaneously recorded skin marker trajectories. Furthermore, to enable kinetic analysis with the MLS, the GRF&Ms were predicted by incorporating the method of Fluit et al. [29] and Skals et al. [28]. The developed methodology enabled, for the first time, estimation of kinetic variables based on a MLS. In general, the motion variables compared between the systems revealed different results for the studied movements. The vertical GRF data showed a strong correlation during gait, but the peak values were significantly different and the time of occurrence in the gait cycles deviated slightly. Noticeable discrepancies were observed for the remaining variables during gait, which, however, were inconsistent. Shoulder abduction/adduction angles showed strong to excellent correlations and the ROMs were not significantly different during both SA and LSA. For the LSA, strong to excellent correlations were also found for the shoulder abduction/adduction moment, glenohumeral resultant JRF and deltoideus force, but the peak forces were, however, significantly different. Although the results for the MLS were generally associated with larger standard deviations than the MBS, the shoulder abduction measurements showed considerably lower standard deviations compared to the results for the lower body.

Joint angles and ROM showed poor agreement during gait, most noticeably in the tracking of the ankle plantar/dorsi flexion and hip internal/external rotation angles. The large discrepancies associated with the ankle angles could also have contributed to the discrepancies for the knee and hip angles between the systems, due to the joint constraints enforced by the model. The MLS’s tracking of ankle plantar/dorsi flexion was associated with large errors and a significant difference between the systems was observed for the ankle plantar/dorsi flexion ROM. We assessed that these tracking errors were mainly associated with three issues. Firstly, the tracking of the ankle could have been compromised as a result of light reflected on the ground during data collection. Light sensitivity has been proposed as a limitation of the MKS [7], which could substantially reduce the applicability of the MLS outside a controlled environment. Secondly, Dutta et al. [7] reported that tracking errors of the MKS were considerably larger near the edges of the sensor’s field-of-view, which could have caused the poor tracking of the participants’ feet in the current investigation. This is very likely since the MKS was developed to control gaming devices mainly through hand gestures, which makes the depth information near the edges of the sensor’s field-of-view less important for its primary application. Thirdly, due to the close proximity of the feet and the ground during the stance phase of gait, a clear distinction between the feet and ground does not exist in the depth map recorded by the MKSs, which may cause inaccuracies when fitting the stick figure to the depth data.

The differences between the results for the upper-body and lower-body variables could be partially explained by the argumentation above. In addition, the higher tracking accuracy for the shoulder abduction tasks could be the result of a combination of two factors: 1) the movement occurs in the center of the sensor’s field-of-view, where the tracking error is presumably lowest, and 2) the clear visibility of the arm movement during this task compared to e.g. the legs during gait. The importance of the visibility of the movements is clear when comparing the shoulder abduction/adduction angles to the flexion/extension and internal/external rotation angles, which also showed poor agreement. Bonnechére et al. [5] reported comparable accuracy between a MLS, consisting of a single MKS and the proprietary software, and a MBS for tracking shoulder abduction, but they found poor to no agreement between the systems for hip abduction and knee flexion. The application of two MKS and a more complex stick figure model, associated with the iPi software, was not able to address this limitation and could not provide tracking accuracy of the lower body comparable to the MBS, hereby supporting these results.

In general, the kinetic variables for the shoulder abduction tasks showed encouraging results, particularly for the glenohumeral resultant JRFs and deltoideus forces, whereas the results for the lower extremities were less accurate. When the external load was applied to the shoulder abduction task, however, the variation in the kinetic variables increased noticeably, which indicates that the Kinect-based model does not respond well to increases in loading conditions. Strong correlations were observed for the ankle, knee and hip resultant JRFs as well as the glutei force during gait, while the gastrocnemius and vasti forces differed considerably. The ankle and knee resultant JRFs showed similarities in magnitude between the systems, but the timing of the movement differed slightly. Conversely, the hip resultant JRF differed considerably in magnitude, and the peak forces were overestimated compared to the results of the marker-based models. For the kinetic variables, the most encouraging results were found for the vertical GRF, which showed a strong correlation between the systems. Although the peak forces differed slightly in magnitude and timing, the overall similarity between the predicted and measured GRFs supports the results of previous validation studies [28, 29], showing that predicted GRFs are comparable to those measured using force plates.

With regards to the practical usability of the MLS, limitations were identified. Firstly, the volume in which movement could be tracked was restricted, which required careful positioning of the two MKSs to enable tracking of a single gait cycle. Secondly, any alteration in the background required recalibration of the system, which additionally complicates data collection. Thirdly, since the automatic tracking by the iPi software utilizes the solution of the previous frame as the initial guess for fitting the stick figure to the current frame, a large movement from one frame to the next could cause the process to fail. To overcome this, manual improvements of the initial guess for these frames were required. Although this did not affect the estimated stick figure movements, it was a time-consuming task that must be overcome before the system can be used on a larger scale. Finally, as regards the MLS’s use in musculoskeletal modeling, the inability of the MLS for determining the relative position of the force plates to the MKSs required the implementation of GRF prediction. Although the PGRFs were overall similar to the MGRFs, this limitation could have implications due to the MLS’s poor tracking of the feet, since it can potentially be challenging to accurately determine when foot-ground contact is established.

During data collection, additional functional trials were collected, but these were later excluded due to largely inaccurate movement reconstruction in iPi; the excluded movements were counter-movement jump and forward lunge. For the counter-movement jump trials, large errors were observed during movement reconstruction, particularly for the lower body, which caused the procedure to fail completely. It was simply not possible to obtain accurate kinematics for this high-velocity movement due to the relatively low sampling frequency of the MLS (30 Hz). The lunge trials were successfully reconstructed, but the results were associated with large errors, mainly caused by a very poor tracking of the ankle flexion/extension angle. Because of the joint constraints imposed by the musculoskeletal model, this error additionally caused poor reconstruction of the knee flexion/extension angle. Hence, large discrepancies were observed between the two systems for all variables associated with the ankle and knee. As similar tendencies were observed during gait, we assessed that the forward lunge trials did not provide additional meaningful information to the investigation.

This study contains a number of limitations. First, we used a MBS for validation. MBSs are associated with limitations regarding their accuracy, especially due to soft tissue artefacts, and do not possess the accuracy of a golden standard such as bone pins [31] or 3D fluoroscopy [32]. However, the differences observed for instance in knee flexion/extension angles in the present investigation exceed the associated differences between MBSs and bone pins reported by Benoit et al. [31]. In addition, the application of bone pins or 3D fluoroscopy would have been unsuitable, as the investigation aimed at assessing the MLS’s ability to track full-body motion. Although the MBS is associated with limitations, it would be a valuable result to establish comparable accuracy between the MLS and MBS, especially due to the portability and the low price of the MLS.

Second, retro-reflective skin markers were attached to the subjects during the data acquisition with the MKSs. Since these markers reflect infrared light, they affect the light measured by the MKSs and, consequently, the estimated depth map. Although the influence of the markers was not specifically investigated, we did not observe any noticeable effect in the depth maps and we anticipate that the effect is either minor or negligible. It would, however, be worth investigating this effect in a future study.

Third, since the stick figure and musculoskeletal models have slightly different joint constraint definitions, the applied tracking approach results in an imperfect tracking of the stick figure and hence, the resulting movement of the musculoskeletal model may deviate from the depth measurements. To overcome this, future research should explore direct tracking of the depth map by the musculoskeletal model applying for instance a similar approach as proposed by Sandau et al. [33].

In summary, the results for the vertical GRF, shoulder abduction/adduction angles and ROMs, glenohumeral resultant JRFs and deltoideus forces were encouraging, but the MLS revealed limitations, particularly for tracking the lower body. Considering these results, the MLS could be applied to track simple upper-body movements, and further development of this technology towards its application in motion analysis seems warranted, as the MLS provides a portable and significantly cheaper (<1000 USD) solution compared to MBSs. Therefore, future work should focus on assessing the ability of the system to track other simple upper-body movements relevant for diagnostics and ergonomics.