Introduction

Speed-accuracy trade-offs (SATs), such as that between movement speed and spatial accuracy, are one of the most common phenomena in human movement. SATs have been actively studied since Woodworth1, who measured the accuracy of voluntary movement. Fitts’ law, the most famous SAT model, has been widely accepted in the field of Ergonomics and Human-Computer Interaction. Fitts2 applied Shannon’s information theory to the human motor system and proposed an empirical model that relates movement duration to movement distance and target width, called Fitts’ law, based on the following equation:

$$D=a+b\,{\log }_{2}(\frac{2A}{W})=a+b\,{I}_{d},$$
(1)

where D and A are the movement duration and distance, respectively, W is the target width, Id is a logarithmic term in W and A called the index of difficulty, and a and b are intercept and slope in the linear equation, respectively, which are obtained as regression coefficients using D as an objective variable and Id as an explanatory variable. Equation (1) can be rewritten as follows:

$$D=a^{\prime} +b\,{\log }_{2}(\frac{1}{W}),\,a^{\prime} =a+b({\log }_{2}A+1),$$
(2)

to emphasise that increasing the spatial accuracy required (i.e., decreasing W) at a given distance increases movement duration, and vice versa. Thus, Fitts’ law expresses a linear SAT function represented by features a and b, i.e., intercept and slope. Previous studies have already shown that Fitts’ law can be applied to behavioural data in a variety of contexts (reviewed in3,4,5). There is also a considerable body of research supporting the applicability of Fitts’ law to arm movements such as reciprocal tapping2, reaching movements6, and one-dimensional linear and two-dimensional planar arm movements7,8,9,10.

As mentioned above, Fitts’ law is a model that has been supported by many researchers for more than half a century, but the question as to how Fitts’ law can explain human movement under any condition, despite being an empirical model, remains unanswered. From a physical perspective, human movements are modelled by the fundamental law of motion, that is, using dynamics (including kinematics). Therefore, we hypothesised that Fitts’ law in reaching movement could be determined in terms of human arm dynamics. It is important to clarify the relationship between speed-accuracy and arm dynamics because it enables us to understand human motor performance at a more profound level.

There have been interesting findings regarding the slope of Fitts’ law, including demonstrations that the slope increases as the size of effectors11, the slope in elderly subjects is greater than that in young adults12,13,14, and the slope can be reduced with practice15. From these experimental facts, it is quite natural to consider that the coefficients of Fitts’ law are related to the physical dynamics factors. Recent studies have further shown that it is essential to investigate dynamics factors. For example, Hoffmann and Hui16 investigated the duration taken to move a given distance using different arm components such as fingers, wrists, forearms and the full arm and concluded that the moments of inertia and muscle torque strength of these components affect movement duration. Bertucco et al.17 showed that movements over a long distance were associated with Coriolis forces and had a larger intercept of Fitts’ law than those at short distances. Although these studies showed that dynamics factors such as arm segmentation and torque are involved in the SAT profiles of Fitts’ law, no sufficient theoretical explanation was provided. According to dynamical optimisation models such as the minimum (commanded) torque change criterion18,19, increasing movement duration changes the proportions of the respective torque components (inertial, Coriolis, centrifugal and viscous forces) along with the movement trajectory. To better understand the SAT mechanism in human arm movement, it is therefore important to construct a model that considers the contribution of these arm dynamics factors to movement.

Here, we derive the relationship between movement duration D and spatial endpoint error W—the factors of Fitts’ law—based on multi-joint forward- and inverse-dynamics models. As some researchers have pointed out that modelling the SAT requires consideration of the signal-dependent biological noise in the nervous system20,21,22, we consider modelling in the presence of signal-dependent noise. The D − W relationship that we derive nearly covers Fitts’ law. Furthermore, we theoretically demonstrate that arm dynamics and noise factors can affect both the slope and intercept of Fitts’ law.

Explanation of Fitts’ Law Based on Human Arm Dynamics

To formulate the relationship between movement duration D and hand endpoint error W based on human arm dynamics, we assumed the feedforward control model shown in Fig. 1, and modelled the relationship between D and W using the following three steps: first, we derived the relationship between movement duration D and joint torque τ based on an inverse dynamics model; next, we derived the relationship between τ and torque noise τnoise by defining a signal-dependent noise; finally, we derived the relationship between τnoise and hand endpoint error W using a forward dynamics model. We then used the resulting D − W relation to represent Fitts’ law.

Figure 1
figure 1

Schematic of modelling, involving feedforward control of rapid arm-reaching movement with movement planning and execution performed sequentially. The movement duration D is determined in the task selection stage. The desired trajectory is planned in the planning stage, in which the desired torque τ(t) at time t, which corresponds to the command signal, is generated by the feedforward controller implementing an inverse dynamics model. Finally, the torque noise τnoise(t) is added at the movement execution stage. The hand trajectory is generated by the controlled object (i.e., the arm), which implements a forward dynamics model using the noise-added torque τ(t) + τnoise(t). The result of converting the torque noise into the task space is the spatial error of the movement, and its endpoint t = D is the hand endpoint error.

Step 1: Determining the relationship between movement duration and joint torque

The general form of multi-joint (N-link) human arm dynamics on the horizontal plane can be modelled as a second-order nonlinear differential equation18:

$${\boldsymbol{\tau }}(t)={\boldsymbol{M}}({\boldsymbol{\theta }}(t))\mathop{{\boldsymbol{\theta }}}\limits^{{\boldsymbol{..}}}(t)+{{\boldsymbol{h}}}_{1}({\boldsymbol{\theta }}(t))[\dot{{\boldsymbol{\theta }}}(t)\dot{{\boldsymbol{\theta }}}(t)]+{{\boldsymbol{h}}}_{2}({\boldsymbol{\theta }}(t))[\dot{{\boldsymbol{\theta }}}{(t)}^{2}]+{\boldsymbol{B}}\dot{{\boldsymbol{\theta }}}(t),$$
(3)

where t represents time, \({\boldsymbol{\tau }}(t)={[{\tau }_{1}(t),{\tau }_{2}(t),\cdots ,{\tau }_{i}(t),\cdots ,{\tau }_{N}(t)]}^{{\rm{T}}}\) is the joint torque, \({\boldsymbol{\theta }}(t)=[{\theta }_{1}(t),{\theta }_{2}(t),\cdots ,\)\({{\theta }_{i}(t),\cdots ,{\theta }_{N}(t)]}^{{\rm{T}}}\) is the joint angular position, \(\dot{{\boldsymbol{\theta }}}(t)\) and \(\dot{{\boldsymbol{\theta }}}(t)\) are the joint angular velocity and acceleration, respectively, the superscript T denotes the transpose of a vector or matrix, and the subscript i denotes the joint number. The first term is the inertial force given in terms of the inertia matrix \({\boldsymbol{M}}({\boldsymbol{\theta }}(t))\in {{\mathbb{R}}}^{N\times N}\). The second term is the Coriolis force, which is given in terms of a Coriolis force coefficient matrix \({{\boldsymbol{h}}}_{1}({\boldsymbol{\theta }}(t))\in {{\mathbb{R}}}^{N\times N(N-1)/2}\). The third and fourth terms are the centrifugal and viscous forces in terms of the coefficient matrices of centrifugal and viscous forces, \({{\boldsymbol{h}}}_{2}({\boldsymbol{\theta }}(t))\in {{\mathbb{R}}}^{N\times N}\) and \({\boldsymbol{B}}\in {{\mathbb{R}}}^{N\times N}\), respectively. Further, \([\dot{{\boldsymbol{\theta }}}(t)\dot{{\boldsymbol{\theta }}}(t)]\) and \([\dot{{\boldsymbol{\theta }}}{(t)}^{2}]\) are the vectors in Coriolis and centrifugal forces, \({[{\dot{\theta }}_{1}(t){\dot{\theta }}_{2}(t),{\dot{\theta }}_{1}(t){\dot{\theta }}_{3}(t),\cdots ,{\dot{\theta }}_{N-1}(t){\dot{\theta }}_{N}(t)]}^{{\rm{T}}}\) and \({[{\dot{\theta }}_{1}{(t)}^{2},{\dot{\theta }}_{2}{(t)}^{2},\cdots ,{\dot{\theta }}_{N}{(t)}^{2}]}^{{\rm{T}}}\), respectively.

As the hand path observed in human arm movement is nearly invariant within a specific range of movement duration23, the following time-normalisation can be assumed24 (for a validation of the time-normalisation, see Discussion and Supplementary Fig. S1):

$$t=Ds,\,0\le t\le D\,(0\le s\le 1),\,{\theta }_{i}(t)={\theta }_{i}(Ds)={\tilde{\theta }}_{i}(s),$$
(4)

where s is the normalised time. Equation (3) can be further expressed in terms of the movement duration D as

$$\begin{array}{rcl}{\boldsymbol{\tau }}(t) & = & {\boldsymbol{\tau }}(Ds)=\frac{1}{{D}^{2}}\{{\boldsymbol{M}}(\tilde{{\boldsymbol{\theta }}}(s))\mathop{\tilde{{\boldsymbol{\theta }}}}\limits^{{\boldsymbol{..}}}(s)+{{\boldsymbol{h}}}_{1}(\tilde{{\boldsymbol{\theta }}}(s))[\dot{\tilde{{\boldsymbol{\theta }}}}(s)\dot{\tilde{{\boldsymbol{\theta }}}}(s)]\\ & & +\,{{\boldsymbol{h}}}_{2}(\tilde{{\boldsymbol{\theta }}}(s))[\dot{\tilde{{\boldsymbol{\theta }}}}{(s)}^{2}]\}+\frac{1}{D}{\boldsymbol{B}}\dot{\tilde{{\boldsymbol{\theta }}}}(s).\end{array}$$
(5)

We can further assume that the approximation \(1/D\simeq \lambda /{D}^{2}\) is possible over the general movement duration range, where λ is the coefficient of approximation (see Supplementary Fig. S2), giving:

$$\begin{array}{c}{\boldsymbol{\tau }}(Ds)\simeq \frac{1}{{D}^{2}}\{{\boldsymbol{M}}(\tilde{{\boldsymbol{\theta }}}(s))\mathop{\tilde{{\boldsymbol{\theta }}}}\limits^{{\boldsymbol{..}}}(s)+{{\boldsymbol{h}}}_{1}(\tilde{{\boldsymbol{\theta }}}(s))[\dot{\tilde{{\boldsymbol{\theta }}}}(s)\dot{\tilde{{\boldsymbol{\theta }}}}(s)]+{{\boldsymbol{h}}}_{2}(\tilde{{\boldsymbol{\theta }}}(s))[\dot{\tilde{{\boldsymbol{\theta }}}}{(s)}^{2}]+\lambda {\boldsymbol{B}}\dot{\tilde{{\boldsymbol{\theta }}}}(s)\},\\ \,\,\simeq \frac{1}{{D}^{2}}\tilde{{\boldsymbol{\tau }}}(s),\end{array}$$
(6)

where \(\tilde{{\boldsymbol{\tau }}}(s)\) is a time-normalised torque not related to the movement duration. Expressions have been developed using the general notation (N-joint); the two-joint case for shoulder and elbow is described in detail in Supplementary Equation 1 and Supplementary Fig. S3.

Step 2: Determining the relationship between desired torque and torque noise

The actual neural process of human movement is affected by biological noise. Therefore, in the movement execution stage, we assume the presence of a signal-dependent noise represented by Gaussian noise with a mean of zero and a standard deviation proportional to the control signal that additively acts on the control signal20. Here, we refer to the signal-dependent noise at the torque level as the torque noise, \({\tau }_{i}^{{\rm{noise}}}\), which is represented as follows:

$${\tau }_{i}^{{\rm{noise}}}(t)={k}_{i}|{\tau }_{i}(t)|{z}_{i}(t),$$
(7)

where the noise parameter ki indicates the degree to which the noise depends on the desired torque and zi(t) is a pseudo-random variable describing the standard normal distribution. Time-normalisation of Eq. (7) gives

$${\tau }_{i}^{{\rm{noise}}}(Ds)=\frac{1}{{D}^{2}}{k}_{i}|{\tilde{\tau }}_{i}(s)|{z}_{i}(s).$$
(8)

Step 3: Determining the relationship between torque noise and hand endpoint error

The forward model is used to express a motor process in which endpoint error occurs as a result of the execution of a movement with noise-added torque. For instance, the forward model for a case of x-coordinate movement in a horizontal plane, \({{\boldsymbol{f}}}^{{\rm{FM}}}\in {{\mathbb{R}}}^{2\times 1}\), can be used to obtain the state variable of the hand, including position and velocity, \({\boldsymbol{X}}(t)={[x(t),\dot{x}(t)]}^{{\rm{T}}}\in {{\mathbb{R}}}^{2\times 1}\) from the joint torque \({\boldsymbol{\tau }}(t)={[{\tau }_{1}(t),{\tau }_{2}(t),\cdots ,{\tau }_{N}]}^{{\rm{T}}}\in {{\mathbb{R}}}^{N\times 1}\) as follows:

$$\frac{{\rm{d}}}{{\rm{d}}t}{\boldsymbol{X}}(t)={{\boldsymbol{f}}}^{{\rm{FM}}}({\boldsymbol{X}}(t),{\boldsymbol{\tau }}(t)).$$
(9)

Applying time integration from the beginning to the end of the movement (0 ≤ t ≤ D) to Eq. (9) and taking the difference between the equations with and without noise yields (see Supplementary Equation 2 for details):

$$\Delta {\boldsymbol{X}}(D)\simeq {\int }_{0}^{D}\frac{\partial {{\boldsymbol{f}}}^{{\rm{FM}}}({\boldsymbol{X}}(t),{\boldsymbol{\tau }}(t))}{\partial {\boldsymbol{\tau }}{(t)}^{{\rm{T}}}}\cdot {{\boldsymbol{\tau }}}^{{\rm{noise}}}(t){\rm{d}}t.$$
(10)

The partial derivative of the forward model fFM with respect to the torque τ is then approximated as follows (see Supplementary Equation 3 for details):

$$\frac{\partial {{\boldsymbol{f}}}^{{\rm{FM}}}({\boldsymbol{X}}(t),{\boldsymbol{\tau }}(t))}{\partial {\boldsymbol{\tau }}{(t)}^{{\rm{T}}}}\simeq [\begin{array}{c}{{\boldsymbol{J}}}^{x}({\boldsymbol{\theta }}(t)){{\boldsymbol{B}}}^{-1}\\ {{\boldsymbol{J}}}^{x}({\boldsymbol{\theta }}(t)){\boldsymbol{M}}{({\boldsymbol{\theta }}(t))}^{-1}\end{array}],$$
(11)

where \({{\boldsymbol{J}}}^{x}({\boldsymbol{\theta }}(t))\) is a Jacobian matrix from the joint space to the x-coordinate space. From Eq. (10), the positional endpoint error of hand state variable can be expressed in terms of the torque noise as follows:

$$\Delta x(D)\simeq \mathop{\sum }\limits_{i=1}^{N}\,{\int }_{0}^{D}{({{\boldsymbol{J}}}^{x}({\boldsymbol{\theta }}(t)){{\boldsymbol{B}}}^{-1})}_{i}{\tau }_{i}^{{\rm{noise}}}(t){\rm{d}}t,$$
(12)

where \({(\cdot )}_{i}\) is the vector in the i-th column (i-th joint) in the matrix shown in parentheses. By time-normalising Eq. (12) and substituting Eq. (8), the following relation is obtained:

$$\Delta x(D)\simeq \frac{1}{D}\mathop{\sum }\limits_{i=1}^{N}\,{k}_{i}{\alpha }_{i}^{x},\,{\alpha }_{i}^{x}={\int }_{0}^{1}{({{\boldsymbol{J}}}^{x}(\tilde{{\boldsymbol{\theta }}}(s)){{\boldsymbol{B}}}^{-1})}_{i}|{\tilde{\tau }}_{i}(s)|{z}_{i}(s){\rm{d}}s.$$
(13)

Similar derivations to those in Eqs. (913) can be applied to the y-coordinate:

$$\Delta y(D)\simeq \frac{1}{D}\mathop{\sum }\limits_{i=1}^{N}\,{k}_{i}{\alpha }_{i}^{y},\,{\alpha }_{i}^{y}={\int }_{0}^{1}{({{\boldsymbol{J}}}^{y}(\tilde{{\boldsymbol{\theta }}}(s)){{\boldsymbol{B}}}^{-1})}_{i}|{\tilde{\tau }}_{i}(s)|{z}_{i}(s){\rm{d}}s.$$
(14)

where \({\alpha }_{i}^{x}\) and \({\alpha }_{i}^{y}\) are constants in terms of D and are the values calculated by the dynamics parameters of the arm.

The model endpoint error Wmodel in the horizontal plane is then expressed as the Euclidean distance between Δx(D) and Δy(D):

$$\begin{array}{rcl}{W}^{{\rm{model}}} & \equiv & \sqrt{\Delta x{(D)}^{2}+\Delta y{(D)}^{2}},\\ & = & \frac{\gamma }{D},\,\gamma =\sqrt{{(\mathop{\sum }\limits_{i=1}^{N}{k}_{i}{\alpha }_{i}^{x})}^{2}+{(\mathop{\sum }\limits_{i=1}^{N}{k}_{i}{\alpha }_{i}^{y})}^{2}},\end{array}$$
(15)

where γ is calculated using the \({\alpha }_{i}^{x}\), \({\alpha }_{i}^{y}\), and ki. ki is a signal-dependent noise parameter for each joint. \({\alpha }_{i}^{x}\) and \({\alpha }_{i}^{y}\) are calculated using the following dynamics parameters:

  • Kinematic parameters: the time-normalised joint angular position \({\tilde{\theta }}_{i}(s)\), velocity \({\dot{\tilde{\theta }}}_{i}(s)\), and acceleration \({\ddot{\tilde{\theta }}}_{i}(s)\) for each joint.

  • Physical parameters: the i-th link length Li, distance from the joint to the centre of mass Si, mass mi, moment of inertia Ii, and joint viscosity coefficient Bij.

Physical meanings of Fitts’ law coefficients

As a further consideration, we consider the physical meanings of the Fitts’ law coefficients, a and b. As these coefficients represent the relationship between speed and accuracy, they have been used as an index of human motor performance. Therefore, examining the coefficients of Fitts’ law is an important step to gaining a deeper understanding of human motor performance.

Solving Eq. (15) for D gives

$$D=\frac{\gamma }{{W}^{{\rm{model}}}}.$$
(16)

To bring this equation into the Fitts’ law form, we first consider a first-order Taylor expansion of ln (1 + x) around x = 0; \(\mathrm{ln}\,(1+x)\simeq x\), for −1 < x ≤ 1. By setting \(x=\frac{c}{{W}^{{\rm{model}}}}-1\), we obtain

$$\mathrm{ln}(\frac{c}{{W}^{{\rm{model}}}})\simeq \frac{c}{{W}^{{\rm{model}}}}-1,\,{\rm{for}}\,0 < \frac{c}{{W}^{{\rm{model}}}}\le 2.$$
(17)

where c is a constant. From the change-of-base formula, the following linear approximation is obtained:

$$\begin{array}{ll}\frac{1}{{W}^{{\rm{model}}}}\simeq {p}_{1}+{p}_{2}\,{\log }_{2}(\frac{1}{{W}^{{\rm{model}}}}), & {\rm{for}}\,\frac{c}{2}\le {W}^{{\rm{model}}},\\ {p}_{1}=\frac{1}{c}(1+\,\mathrm{ln}\,c), & \\ {p}_{2}=\frac{1}{c\,{\log }_{2}e}, & {\rm{where}}\,e\,{\rm{is}}\,{\rm{Napier}}\mbox{'}{\rm{s}}\,{\rm{constant}}\,(e\simeq 2.71828\cdots ).\end{array}$$
(18)

If c = 0.0060, the range of Wmodel is 0.003 m ≤ Wmodel and p1 and p2 are determined as −686.0 and 115.5, respectively. Once the value of c is determined, p1 and p2 are irrelevant to the subject or task. Finally, the approximation of the model to Fitts’ law is represented by

$$D={a}^{{\rm{model}}}+{b}^{{\rm{model}}}\,{\log }_{2}(\frac{1}{{W}^{{\rm{model}}}}),$$
(19)
$${a}^{{\rm{model}}}={p}_{1}\gamma ,$$
(20)
$${b}^{{\rm{model}}}={p}_{2}\gamma .$$
(21)

The above equations show that γ is equally involved in both amodel and bmodel, which correspond to the intercept and slope of Fitts’ law, respectively. Therefore, the slope and intercept of Fitts’ law can be expressed by arm dynamics and signal-dependent noise parameters. Previous studies have observed that the slope and intercept of Fitts’ law are related to, for example, arm components (such as fingers, wrists, forearms and full arm) and arm physical parameters (such as length, mass, the moment of inertia)11,12,13,14,15,16,17 and, therefore, such empirical findings might be revealed theoretically by further study.

Because c is always positive, p2 is always positive; thus, increasing γ increases the Fitts’ law slope, and vice versa. We therefore consider the case in which γ increases. From Eq. (15), γ increases when \(|{\sum }_{i=1}^{N}\,{k}_{i}{\alpha }_{i}^{x}|\) or \(|{\sum }_{i=1}^{N}\,{k}_{i}{\alpha }_{i}^{y}|\) increases. Both terms contain the signal-dependent noise parameter ki, which is always positive, so that γ increases as ki increases; that is, the slope of Fitts’ law increases as the noise level increases. Focusing on the \({\alpha }_{i}^{x}\) and \({\alpha }_{i}^{y}\) calculated by the arm dynamics parameters, γ can also be reduced by reducing \({\alpha }_{i}^{x}\) and \({\alpha }_{i}^{y}\), even if ki has a large value. There are two ways to reduce \({\alpha }_{i}^{x}\) and \({\alpha }_{i}^{y}\)—reduce the absolute value of the time-normalised torque or reduce the product of the Jacobian matrix and the inverse viscosity matrix—but it is difficult to intuitively understand how to reduce these owing to the nonlinearity of the dynamics. To understand these factors, it is preferable to rely on numerical calculations such as sensitivity analysis.

Results

The relationship between movement duration and hand endpoint error could be modelled using Wmodel = γ/D. We first investigated how the degree of hand endpoint error calculated by the model could represent the hand endpoint error observed in actual movement. We then examined how close the model function is to Fitts’ law when it was applied to the Fitts’ law axis, log2 (1/W) − D.

The behavioural experiment involved reaching the target at four points on the horizontal plane (front, back, left and right, as viewed from the starting point) within a given movement duration range. Eleven subjects participated, and the distance between the starting point and target point was fixed at 15 cm. The shoulder, elbow and hand positions were measured using a three-dimensional optical position measurement digitizer. The experiment was continued until a total of 320 successful trials (i.e., 80 trials in each direction) had been conducted (see Methods for details of the experiment).

How well the degree of model endpoint error can represent the actual endpoint error

We examined how the degree of endpoint error calculated by the model can explain the endpoint error observed in actual movement. Figure 2 shows a comparison between the observed and model endpoint errors in a representative subject (see Supplementary Figs. S4 to S13 for other subjects). Table 1 lists the root mean square errors (RMSEs) between the model and observed endpoint error for each subject. The model represented the trial-averaged observed endpoint error well, with a small average error of 0.0043 m. Subjects A, E and K had small RMSEs in all directions (average RMSE less than 0.0030 m), while C, F and H had high RMSEs in all directions (average RMSE greater than 0.0060 m). Overall, the model endpoint error closely matched the observed endpoint errors of actual movement. The observed and model endpoint errors for single-trial are not in one-to-one correspondence (the observed endpoint errors were measured for 20 trials per movement duration condition, while the model endpoint errors were generated over 100 trials using different pseudorandom numbers). The distributions of endpoint errors in the observed and model results were roughly consistent.

Figure 2
figure 2

Comparison between observed and model endpoint errors for a representative subject (A). Tiny grey circles show observed endpoint error for each observed movement duration. Open circles show averaged values of observed endpoint errors at the mean observed movement duration. Tiny light blue crosses show the model endpoint errors at the mean observed movement duration using different pseudorandom numbers. When calculating this, the average of 20 measured trajectories in each movement duration was used as the time-normalised trajectory. Blue triangles show the trial-averaged values of the model endpoint error at the mean observed movement duration. The blue line shows D − W function of the model, which is obtained by using the mean value of γ over the entire movement duration (i.e., 0.30–0.70 s). In calculating this, the average of 80 measured trajectories over the entire movement duration was used as the time-normalised trajectory. Histograms hatched in translucent grey and blue are the distributions after fitting the observed and model endpoint error data with a Rayleigh distribution. As the x- and y-axis endpoint errors follow a Gaussian distribution, their Euclidean distance (i.e., W) follows a Rayleigh distribution.

Table 1 RMSE between model endpoint error and observed endpoint error for each subject.

Among the four directions, leftward endpoint variability was large while backward endpoint variability was small; overall, the model captured these direction-dependent features of hand endpoint variability. The direction dependency of the differences could be attributed to variations in the viscosity coefficient Bij and time-normalised trajectories \(\tilde{\theta }(s)\), velocities \(\dot{\tilde{\theta }}(s)\), and accelerations \(\ddot{\tilde{\theta }}(s)\), as all other physical parameters (Li, Si, mi, Ii) and signal-dependent noise parameters (ki) were the same in all directions.

How close the model function is to Fitts’ law

We investigated how close the model, D = γ/Wmodel, is to Fitts’ law, D = a′ + blog2 (1/W) along the log2 (1/W) − D axis. Figure 3 shows a simple conversion of the D − W axes shown in Fig. 2 to log2 (1/W) − D axes. Fitts’ law (black-dashed line) is indicated by a straight line, while the model line (blue solid line) is somewhat concave along the axis of Fitts’ law, log2 (1/W) − D. This concavity has been observed in many previous studies3,4,11,14,25. The functions are quite close to each other. Similar results were obtained for several other subjects (see Supplementary Figs. S4 to S13 for other subjects’ results).

Figure 3
figure 3

Comparison between Fitts’ law and the model in Id − D axis for a representative subject (A). The tiny grey and open circles, light blue crosses and blue triangles represent the same factors shown in Fig. 2. The black dashed and blue solid lines indicate the Fitts’ law results (regression line for the trial-averaged Wobserved) and the model line along the Id − D axis, respectively.

A comparison of the parameters describing Fitts’ law (a′ and b) with the corresponding parameters for the model (amodel and bmodel) revealed average slope values over the subjects and directions of b = 0.340 and bmodel = 0.431, respectively, and average intercept values over the subjects and directions of a′ = −1.752 and amodel = −2.558, respectively. Thus, in terms of subject-averaged values, both amodel and bmodel were close to the Fitts’ law values of a′ and b, respectively. These results indicate that Fitts’ law can be explained by the W − D relation derived based on human arm dynamics.

Discussion

The relationship between movement duration D and hand endpoint error W—the factors used in Fitts’ law—were modelled based on human arm dynamics and found to follow an inversely proportional form: W = γ/D (where γ is a coefficient based on arm dynamics and signal-dependent noise parameters). The model was found to nearly coincided with Fitts’ law when plotted on the axes log2 (1/W) and D. Furthermore, it was theoretically shown that the slope and intercept of Fitts’ law include both arm dynamics and noise parameters based on transformation of the model into the Fitts’ law form. In the field of Human-Computer Interaction and Ergonomics, the slope and intercept of Fitts’ law have been used as an index of human motor performance because they represent the relationship between speed and accuracy. Our research results are significant in providing a first step to determining a specific relationship between the motor performance and arm dynamics.

Harris and Wolpert20 and Tanaka et al.21 showed that the SAT profile drawn by Fitts’ law can be reproduced using their respectively proposed optimisation models. These were important results showing that Fitts’ law for one-dimensional reaching movements can be reproduced based on single-joint arm dynamics. However, the multi-joint arm dynamics used in the more general arm movements performed by humans on a daily basis can cause nonlinear interference torques such as Coriolis and centrifugal forces between the upper arm and the forearm. As such the nonlinear interference torques do not occur under single-joint arm dynamics, it is important to investigate their effect on the SAT profile based on multi-joint arm dynamics. Our model, which is based on multi-joint arm dynamics, considers both Coriolis and centrifugal forces and therefore should be able to reproduce actual human arm movement more accurately than a model that does not consider them. To verify this, we compared applications of the model with and without Coriolis and centrifugal forces. An initial comparison using the data measured in the experiment described in Methods revealed an insignificant amount of difference between the two cases (Supplementary Fig. S14). This occurred because the inertial and viscous forces dominated the Coriolis and centrifugal forces during this task and because the Coriolis and centrifugal forces in the shoulder joint cancelled each other in the Front and Back directions (Supplementary Fig. S15). We then conducted an experiment with different starting and target postures to analyse the effects of moving rapidly over a longer distance. Supplementary Fig. S16 shows the results of the comparison for each subject. For five of the six subjects, the trial-averaged values of the actual endpoint error were better represented using the model with Coriolis and centrifugal forces, which also produced results closer the actual distributions of endpoint errors. These results demonstrate the importance of considering Coriolis and centrifugal forces in modelling very rapid reaching movements over long distances and highlight one of the advantages of our model, namely, that it considers the nonlinearity of arm movement.

Our model is based on the following assumptions and approximations: (1) feedforward control is assumed; (2) it is assumed that movement of the upper arm and forearm in the horizontal plane can be modelled by the dynamics of a two-link planar manipulator; (3) trajectory invariance in which the trajectory does not change within a certain range of movement duration is assumed; (4) the viscous force \(\frac{1}{D}B\dot{\tilde{\theta }}(s)\) is approximated by \(\frac{1}{{D}^{2}}\lambda B\dot{\tilde{\theta }}(s)\) within a certain range of movement duration (Eq. (6); (5) the presence of signal-dependent noise is assumed (Eq. (7)); (6) a first-order Taylor approximation of the forward model around \({\tau }_{i}^{{\rm{n}}oise}\mathrm{=0}\) is applied (Supplementary Equation 2); (7) it is assumed that the effect of the partial derivatives of Coriolis and centrifugal forces are negligible (Supplementary Equation 3); and (8) a linear approximation is applied to formulate Fitts’ law (Eq. (18)). These assumptions and approximations give several limitations to the model. For example, the effects of feedback cannot be considered because of assumption (1). Moreover, (3), (4), (6), (7) and (8) lead to modelling errors. It is also necessary to discuss the validity of the trajectory invariance assumption in (3). Whether or not time-normalisation is a valid must be confirmed in terms of how much the actually measured trajectory changes over the duration of movement. Supplementary Fig. S1 shows the measured hand paths of all subjects when moving a distance of 15 cm from 0.30 to 0.70 s. Although this provides a qualitative description, it can be considered to indicate that there is no problem with the changes even if trajectory invariance is assumed. This assumption is approximately valid to within a certain range of movement speeds that are neither too slow nor too fast. It should be noted that the assumption does not hold for very rapid or slow movement and, therefore, that finding a method that can be formulated without using time-normalisation should be undertaken as a future task. Another issue to be addressed in the future is the lack of a sophisticated understanding of motor impedance sources such as viscosity and stiffness. In this study, we did not consider joint stiffness because doing so would complicate the model. Given the numerous studies showing that stiffness is related to the SAT26,27,28, it will be important to incorporate stiffness into future versions of the model. Despite the limitations noted above, the error between the model and the experimental results was small, and we can conclude that the model can successfully explain the endpoint error occurring in actual movement.

The use of signal-dependent noise made it possible to represent the SATs—a breakthrough in the study of motor control. This breakthrough should be supplemented by further findings obtained, for example, by considering the constant and temporal noises in the model, as van Beers et al.22 did; and considering how to estimate the noise parameter k using data other than hand endpoint error (e.g., a method that can estimate using EMG signals). We are also interested in studying human motor control strategies that achieve high spatial accuracy even with very rapid movement. It is speculated that nonlinear interference forces such as Coriolis and centrifugal forces are effectively used in such strategies, but this needs to be studied in detail.

Finally, in this study we investigated only arm-reaching movement. It will be possible to investigate other types of movement, such as eye and feet movement, have not, but changing the approach, for example, from arm to foot dynamics. The relationships derived in this paper can therefore be generally applied to any type of movement, which will be an important approach in future research.

Methods

We conducted behavioural experiments to examine to what degree the model endpoint error can represent the actual endpoint error and how similar the model is to Fitts’ law when it is applied to the Fitts’ law axis.

Subjects

Eleven healthy young adults (11 males, age range 21–24 y) participated. All subjects were right-handed according to the Edinburgh handedness test score (score range 64.7–100%). Informed consent agreements were obtained from all subjects, and the study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Ethics Committee of Nagaoka University of Technology.

Apparatus

The experimental setup is illustrated in Fig. 4A. The subjects sat on the chair located in front of the table and display. Each subject placed his right arm on the air-sled on the table to reduce the friction between their arm and the table, with the hand position set to match the tip of the handle of the air-sled. The air-sled and forearm were then fixed with stretch tape and the chair height was adjusted so that the table and subject’s arm were oriented in a parallel manner. Each subject’s shoulder was restrained by seat belts, restricting movement of the arm to the horizontal plane. The chair was then moved toward the table until the subject’s chest just touched the table. Infrared markers were affixed to the shoulder and elbow joints and the hand, and the three marker positions were measured at a sampling frequency of 500 Hz using a three-dimensional optical position measurement digitiser (Optotrak Certus, Northern Digital Inc., Waterloo, Canada). The measured current hand positions were projected onto a display (PDP-504P, Pioneer, Tokyo, Japan) installed in front of the subject. A starting point (a circle with a radius of 10 mm) and a target (a circle with a radius of 2 mm) were also shown on the screen, and the subjects performed the experimental task while watching the display.

Figure 4
figure 4

Experimental design. (A) Top view of experimental setup, showing definitions of joint angles θ1 and θ2 and x- and y-coordinates. (B) Display positions of all four targets with respect to central circle. (C) Examples of screen patterns of starting and target positions and experimental protocols. (D) Process for estimating signal-dependent noise parameters k1 and k2. To evaluate the degree of coincidence between pairs of ellipses, we defined the Ellipse Difference Index (EDI), which calculates the distances between four corresponding points (AD) on the respective ellipses. The sum of the EDI over all conditions of movement duration and direction was set as the total EDI, with the values of k1 and k2 at the minimum total EDI selected as the signal-dependent noises.

Task

Experimental tasks used in the study of SAT primarily fall into two categories: spatially constrained movement tasks, and temporally constrained movement tasks (details are given in4).

In this study, we used a temporally constrained movement task based on our assumption of pre-programmed and rapid movements. The task involved setting a target movement duration D and movement distance A and then measuring the hand endpoint error or variability W of the corresponding movement. The subjects were required to reach the target as accurately as possible within the instructed range of movement duration.

Specifically, as shown in Fig. 4A, the task involved using the shoulder and elbow joints to engage in arm reaching movements in the horizontal plane toward the target, a circle with a radius of 2 mm, located 15 cm away from a 10-mm starting circle displayed in the centre of the screen. The initial shoulder position was defined as the origin of a coordinate system in which the positive x- and y-axes were to the right and in front of the subject, respectively. The starting position was defined as a shoulder angle (θ1) of 45° and an elbow angle (θ2) of 100°, and therefore the starting position relative to the right shoulder differed by subject: on average, it was −0.072 m toward the x-axis and 0.390 m toward the y-axis with respect to the initial shoulder position.

The targets were located at four directions—front (90°), back (270°), left (180°), and right (0°)—relative to the starting position (Fig. 4B). Movement was measured over the target movement duration intervals 0.30–0.40, 0.40–0.50, 0.50–0.60 and 0.60–0.70 s, respectively. As the distance from the starting point to the target was fixed, the movement speed was controlled. After the end of each trial, it was determined whether the measured movement duration was within the set duration range and whether the movement was executed ballistically. If both conditions were satisfied, the trial was regarded as a success; if neither was satisfied, it was regarded as a failure. Corresponding feedback was displayed to the subject on the upper right of the screen as follows: failure (too fast), failure (too slow), or success (Fig. 4A).

Figure 4C shows examples of the screen for each target pattern; the four target patterns were presented pseudorandomly. In the example shown, the pattern started from the front and, if the trial was a success, it continued backward, leftward and then rightward. The conditions of target movement duration were also applied pseudorandomly; in the example, the first interval was 0.60–0.70 s and was repeated five times until success was achieved in all four directions. If success was achieved over 4 directions × 5 rounds = 20 trials, the next movement duration was applied. If success was achieved at all movement duration conditions, the session was concluded. In all, four sessions were performed for each subject. As it was necessary for each subject to succeed at 80 trials per session, the experiment was continued until a total of 320 successful trials had been conducted.

Each subject participated in a preparatory experiment involving the same conditions as the main experiment but with half the number of sessions, with the goal of familiarising the subjects with the experimental setup and imparting a sense of target movement duration. The data measured in the preparatory experiment were not used in the subsequent analysis; in any event, no apparent learning effects were observed in the main experiments as a result of the preparatory sessions.

Data analysis

Observed endpoint error and movement duration

Our methods for pre-processing the positional data and determining the observed endpoint error and movement duration were carried out as followed. The acquired positional data were low-pass filtered using a third-order, zero phase-lag Butterworth filter with a cut-off frequency of 10 Hz. The start and end of each movement were determined based on the tangential velocity, which was calculated using numerical differentiation. The start of a movement was defined as the point following the start cue at which the tangential velocity first exceeded the threshold velocity, defined as 5% of the peak value of the tangential velocity. The end of a movement was defined as the point prior to the end cue at which the tangential velocity fell below the threshold. The time from movement start to end was then defined as the observed movement duration, D. The required conditions of a successful trial were therefore (1) that the observed movement duration fell within the target range and (2) that there was only one peak in the tangential velocity between the start and end of the movement. The successful trials were then analysed to determine the corresponding Euclidean distances from the centre of the target to the endpoint, which was defined as the observed endpoint error, Wobserved.

Model endpoint error

To obtain the model endpoint error, Wmodel, the following model parameters were used: the kinematic parameters such as the time-normalised desired angular position, velocity, and acceleration for each joint (\({\tilde{\theta }}_{i}(s)\), \({\dot{\tilde{\theta }}}_{i}(s)\), and \({\ddot{\tilde{\theta }}}_{i}(s)\)), the physical parameters for each link or joint (Li, Si, mi, Ii, and Bij), and the noise parameter for each joint (ki). We describe below how we estimated them.

Desired trajectories. We used the joint angular trajectories averaged over trials as the theoretically desired trajectory. The joint angular trajectory was first calculated using the measured positional data and subject’s arm length by inverse kinematics and resampled temporally at the mean movement duration for each trial. The trial-averaged trajectories were then calculated for each subject and direction (for each target movement duration was also calculated as well). The trial-averaged joint angular velocities and acceleration were also calculated and time-normalised in terms of D to \(\dot{\tilde{\theta }}(s)\) and \(\ddot{\tilde{\theta }}(s)\), respectively.

Physical parameters. The upper arm and forearm lengths, L1 and L2, respectively, were obtained from the distances between the markers on the shoulder, elbow, and hand, which were measured using the position digitiser before starting the experiment. The distance from the joint to the centre of mass, Si, mass, mi, and moment of inertia around the joint, Ii, were estimated based on their respective proportional relationships to arm length19,24,29. An adjusted forearm mass, m2, was obtained by adding the 0.740-kg mass of the air-sled and stretch tapes to the forearm mass estimated using the above method. The viscosity coefficient, Bij, was estimated using the method applied in19 based on the approximate relationship between joint torque and viscosity coefficient during static force control, as measured by30. The resulting physical parameters of the subject’s arm are listed in Table 2. Note that the viscosity coefficients shown in the table were obtained using the trial-averaged joint angular trajectory for each direction. For further details on how to calculate these physical parameters, refer to Supplementary Equation 4.

Table 2 Physical parameters of the arm and signal-dependent noise parameters.

Noise parameters. The parameter determining the magnitude of signal-dependent noise, ki, was estimated using the following simulation procedure:

  1. 1.

    The joint torques were calculated using the mean trajectories and the physical parameters of the arm.

  2. 2.

    Signal-dependent noise was generated artificially using an arbitrary value of ki and added to the joint torque. The calculation applied Eq. (7) with zi(t) set as a random variable with a standard normal distribution.

  3. 3.

    The endpoint errors were obtained by transforming the noise-added torque from the joint to the task space using the forward dynamics and kinematics models.

  4. 4.

    Step 1–3 were repeated 100 times for each pairing of k1 and k2 to obtain 100 simulated endpoints. We then used a grid search algorithm to find the value of ki corresponding to the smallest difference between the confidence ellipses of the simulated and observed endpoints in terms of the Ellipse Difference Index (EDI), the sum of distances between four corresponding points on two ellipses (see Fig. 4D). The total EDI was defined as the summed value of the EDI over all movement duration and directions.

The noise parameters of each subject obtained using this method are listed at the bottom of Table 2.