1 Introduction

Knowledge of patterns of intervertebral movement of the lumbar spine is clinically useful in the assessment and rehabilitation of spinal disorders [4, 10]. Biomechanical modeling also requires accurate kinematic information as model input data. Nevertheless, measurement of intervertebral movements is technically challenging as the spine is relatively inaccessible and the nature of spinal movement is complex.

Radiographic, electro-optical, electromagnetic and miniature inertial sensing systems have been used in the measurement of lumbar spine motion with some success [2, 3, 6, 18, 1921, 28, 29, 35]. Surface measurements using optical markers or sensing devices are subjected to considerable error due to interposition of tissue between the spine and the surface markers. Recently, Morl and Blickhan [24] had examined the correlation between the motions of external skin markers and the underlying vertebrae using open magnetic resonance imaging. Although moderate to strong correlation was observed, the measurement errors were found to be high. Surface measurement techniques are only accurate for determining the movement of spine regions as opposed to discriminating the contribution of individual intervertebral joints. Surface measurements that yield accurate intervertebral movement data would require the insertion of pins into the spinous processes [10, 11], but this method has low acceptability for patients.

Only radiographic techniques are able to determine movement of the vertebrae with acceptable accuracy in vivo [28]. Lee and Evans [20] reported that the mean error in determining intervertebral joint motion in the sagittal plane was about 1.0°. However, radiographic techniques such as the videofluoroscopy [27, 38] and cineradiography [17, 35], are complicated, and do involve ionizing radiation.

Sun et al. [34] developed an inverse kinematic model for determining intervertebral motion of the lumbar spine during flexion. The lumbar spine was modeled as an open-ended, kinematic chain of five links, which represented the five vertebrae (L1–L5). An optimization equation with physiological constraints was employed to determine intervertebral joint configuration. The method was found to be very accurate with a high degree of agreement between the predicted movement and the actual movement as determined by radiographic images (r = 0.83–0.97). The mean error of prediction (mean difference between radiographic measurement and predicted value) was less than 1.6°. Although this was considered to be accurate, the technique of Sun et al. was able to predict intervertebral displacement in a static neutral or flexed position only. Therefore, we still need a noninvasive and reliable method to study time history of intervertebral joint motions.

A Bayesian belief network is a mathematical model that represents a set of variables and their probabilistic dependencies. It is a graphical model where variables are represented by nodes on a graph, and the arcs joining these nodes encode conditional dependencies between the variables [7, 9, 16]. Such models have recently been employed to solve problems related to the tracking of motions of objects [31, 37], where the target variables (motions of objects) were unknown and might be probabilistically associated with observations or variables that could be measured experimentally. Bayesian network is used to update the state of a set of variables when observations are made. This process of computing the posterior distribution of variable-given evidence is called probabilistic inference, which is implemented by belief propagation algorithms using the Bayes’ rule [7, 8, 23, 39]. In this study, we attempted to estimate the intervertebral motions of lumbar spine using observations or cues obtained from motion sensors attached to the skin. Bayesian belief network was employed to describe the probabilistic dependencies between vertebral motions and skin-mounted sensor information and to estimate the intervertebral movements through probabilistic inference.

The purpose of this study was to investigate the feasibility of using a Bayesian belief network model to determine the intervertebral motion of the lumbar spine and to study the motion sequence patterns of the intervertebral joints using the data estimated by the model.

2 Methods

This study was carried out on healthy subjects who performed flexion and extension motions of the spine. Radiographic images were obtained statically in three spine postures—flexion, extension and neutral. The flexion and extension images were used as input data for modeling, together with movement information derived from the skin surface using an electromagnetic motion tracking system. The intervertebral joint motions of the spine were then computed. The validity of the model was tested by comparing the predicted position of the vertebrae in the neutral position with those obtained from the radiographic image in the neutral posture. In addition, the timing and sequence of movements of the various intervertebral joints was assessed based on the data estimated by the proposed model.

2.1 Subject

Seventeen healthy volunteers (4 males, 17 females, age 71.8 ± 3.3 years, body height = 156.1 ± 8.9 cm, body weight = 59.0 ± 11.3 kg, body mass index = 24.1 ± 3.6) were recruited. All subjects had no history of back pain or symptoms attributable to the spine that required medical attention or treatment. Subjects were excluded if there was vertebral body collapse, metastases, spondylolisthesis, spondylolysis, rheumatological disease, or surgery to the lumbar spine. Measurements obtained were part of a health assessment at the Jockey Club Centre for Osteoporosis Care and Control, The Chinese University of Hong Kong [14, 15]. Dual X-ray densiometer (Hologic, Waltham, MA, USA) was used to measure the bone mineral density (BMD) at the lumbar spine. All subjects had normal bone density.

2.2 Instrument

An electromagnetic motion tracking system (Fastrak®, Polhemus, 40 Hercules Dr, Colchester, Vermont, USA) was used to obtain skin displacement information related to movement of the lumbar spine. This consisted of a source of electromagnetic waves and four miniature motion sensors (Polhemus RX1-D, 0.9 mm × 0.9 mm × 0.9 mm; 20 g). The source was placed in fixed position within one metre of the subject. The four sensors were attached to the skin. The tips of the spinous processes of the first lumbar (L1) and first sacral (S1) vertebrae were identified by palpation. Two sensors were placed over these spinous processes. The other two sensors were evenly distributed between L1 and S1. A previous study demonstrated that the electromagnetic sensors had an angular accuracy of ±0.2° [29]. A software program [17], able to perform fast serial communication with 120 Hz data update rate, i.e., 30 Hz per sensor sampling rate, was used to acquire and display the data in real time.

2.3 Radiograph measurements

Lateral radiographs (Philips M50 CP-H X-ray machine) were obtained for each subject with the sensors attached to the back. Each subject was asked to stand upright with the pelvis rigidly fixed to a frame and the left side of the body facing the film. Lateral images of the lumbar spine (L1–S1) were acquired by a conventional radiographic system in three positions: neutral upright, full flexion, and full extension. In order to accurately define the sensors’ location on the radiographic image, two radio-opaque lead markers were fixed on each sensor as shown in Fig. 1. A radio-opaque ruler was also attached to the subject’s back in the sagittal plane so that magnification of the vertebrae could be determined.

Fig. 1
figure 1

Radiographic image showing the reference points for the vertebrae (the white nodes, N1–N4 are illustrated for L4) and the four sensors of the Fastrak system (D1–D4)

The coordinates of the vertebral bodies, the tips of the spinous processes, and the lead markers of the sensors were recorded. The L1–L5 vertebral bodies were fitted with quadrangles on each image, as shown in Fig. 1, the corners of the quadrangle representing landmarks for defining the position and orientation of the vertebra. In the case of the sacrum, the posterosuperior and anterosuperior corners of the sacrum were used as landmarks.

In order to validate the prediction of the model, intervertebral joint angles were determined based on the radiographic images of the vertebral bodies, and compared with those predicted by the model. The sagittal angles of the intervertebral joints were computed from the coordinates of the corners of the two adjoining intervertebral bodies, and the mean error of this measurement was found to be 1.0° [20].

2.4 The Bayesian belief network model

In this study, the purpose of modeling was to estimate the intervertebral motions of the lumbar spine using knowledge of the sensor trajectory and the spatial relationship between sensors and the vertebrae as obtained in the flexion and extension radiographic images. Bayesian network was employed to describe the probabilistic dependencies between element nodes and the observations, which represented the variables of interest and the measurable data, respectively. Specifically, the element nodes were the coordinates of two corners (N2 and N3, in Fig. 1) of the vertebral bodies, and the observations were the positions of the skin-mounted Fastrak sensors. Arcs within the networks represented associations among the variables and observations, as shown in Fig. 2. For the model construction, the initial and final spatial relationships between the vertebrae and sensors were defined by the flexion and extension radiographic images. The positions of the vertebrae or the element nodes during the movement from flexion to extension were estimated by the Bayesian network model based on the observations that is, the information obtained from the skin-mounted sensors. Belief propagation algorithms were employed to implement Bayes’ probabilistic inference for estimation task.

Fig. 2
figure 2

The Bayesian network model illustrating the dynamic movement from the kth time frame to the k + 1th time frame. The open circles and open squares represent the element nodes (vertebral position, the x nodes) and observations (sensor position, the y nodes), respectively. Arcs represent the association among element nodes and observations. For simplicity, not all the associations are shown. Thus, the model represents the dynamic process with a Bayesian network, which contains l time frames (only two time frames are shown here) with the same basic local sub-structure. In total, there are l × 12x nodes and l × 8y nodes in the model

As a graphical model, Bayesian belief network is a static structure carrying no timing information. In order to model the kinematics of the flexion–extension movement, the Fastrak data in the time domain were projected onto the space domain by transforming the time sequence into space sequence. Therefore, the graphical model included all time frames each of which had a basic structure as illustrated in Fig. 1. The network structure for any two successive time frames is shown in Fig. 2, where x and y represent the element node (“circle” in Fig. 2) and observation (“square” in Fig. 2), respectively. The nodes in the network obey the following principles:

  1. 1.

    Among different time frames—each x node of the kth time frame is associated with the k−1th and k + 1th time frame.

  2. 2.

    Within the same time frame—there were probabilistic dependencies between the vertebrae L1–S1 and sensors D1–D4.

  3. 3.

    Associations among x nodes within the same time frame.

    1. (1)

      x i was associated with x i−1 and x i+1. It was considered that the position of a vertebra was directly influenced by its neighbors.

    2. (2)

      x 12 was associated with any other nodes (x 1x 11). The local coordinate system was fixed to the sacrum so that it the lumbar vertebrae move with respect to the sacrum.

In Fig. 2, arcs are used to illustrate the main associations among the various nodes and observations. The sensor information is obtained by actual measurement, and therefore probabilistic inference is made in single direction from observation to element node. However, the probabilistic associations between the element nodes are dual directional.

Belief propagation algorithm was employed to predict the values of x-nodes (coordinates of the vertebral body corners) based on the topology of the network model. In brief, the algorithm was implemented as follows. Bel(n)(x i ) and \( m_{{j \to i}} ^{{{\left( n \right)}}} {\left( {x_{i} } \right)} \) represent the Belief of the ith node and the Message from the jth node to the ith node at the nth state, respectively.

  1. (1)

    Initialization: the initial values of Bel(0)(x i ) and \( m_{{j \to i}} ^{{{\left( 0 \right)}}} {\left( {x_{i} } \right)} \) are set as zero mean Gaussian distribution with large covariance. Previous research showed that the outline or shape of the vertebrae followed a Gaussian distribution [32].

  2. (2)

    Updating for each x node by calculating the Belief

    $$ {\text{Bel}}^{{{\left( n \right)}}} {\left( {x_{i} } \right)}\; = \;\phi _{i} {\left( {x_{i} ,y_{i} } \right)}{\prod\limits_{j \in \Gamma {\left( i \right)}} {m^{{{\left( n \right)}}}_{{j \to i}} {\left( {x_{i} } \right)}} } $$
    (1)

    where ϕ j (x j , y j ) is the likelihood between x j node and its observation represented by y j node; \( {\prod\nolimits_{j \in \Gamma {\left( i \right)}} {m^{{{\left( n \right)}}}_{{j \to i}} {\left( {x_{i} } \right)}} } \) is the prior of x i at the nth state; Γ(i) is the neighborhood of node i; and \( m^{{{\left( n \right)}}}_{{j \to i}} {\left( {x_{i} } \right)} \) is the message from node j to node i at the nth state, where \( j \in \Gamma {\left( i \right)}. \)

The association between the variable (x node) and its observation (y node) is modeled using coordinate transformation

$$ Y\; = R*X + T $$
(2)

where X and Y are vectors representing coordinates of any two associated nodes, R is the rotational matrix, and T is the translation. Accordingly, the compatibility function ϕ j (x j , y j ) in Eq. (1) was defined by a Gaussian distribution of variable y i R i *x i t i with zero mean.

The calculation of \( m^{{{\left( n \right)}}}_{{j \to i}} {\left( {x_{i} } \right)} \) in Eq. (1) is achieved by two steps:

  1. (i)

    Message product at the nth iteration: multiply incoming messages with the local observation to form a statistical distribution over x j .

    $$ M^{{{\left( n \right)}}}_{j} {\left( {x_{j} } \right)}\; = \;\phi _{j} {\left( {x_{j} ,y_{j} } \right)}{\prod\limits_{k \in \Gamma {\left( j \right)}} {m^{{^{{{\left( {n - 1} \right)}}} }}_{{k \to j}} {\left( {x_{j} } \right)}} } $$
    (3)

    where ϕ j (x j , y j ) is the likelihood between x j node and its observation y j ; Γ(j) is the neighborhood of node j; and \( m^{{{\left( n \right)}}}_{{k \to j}} {\left( {x_{j} } \right)} \) is the message from node k to node j at the nth state, where \( k \in \Gamma {\left( j \right)}. \)

  2. (ii)

    Message propagation at the nth state: transform distribution from x j node to x i node using the pairwise interaction compatibility function ψ ji (x j , x i ).

    $$ m^{{{\left( n \right)}}}_{{j \to i}} {\left( {x_{i} } \right)}\; = \;{\int {\psi _{{ji}} {\left( {x_{j} ,x_{i} } \right)}M^{{{\left( n \right)}}}_{j} {\left( {x_{j} } \right)}dx_{j} } } $$
    (4)

At the nth state, by integrating over x j , all the jth node prior knowledge about ith node is summarized and transformed to the ith node. Similar to ϕ j (x j , y j ), the pairwise interaction compatibility function ψ ji (x j , x i ) in Eq. (4) describes the association between x i and x j which is a coordinate transformation as shown in Eq. (2). Therefore, ψ ji (x j , x i ) is defined in the same manner as ϕ j (x j , y j ), i.e., x i R ji *x j t ji , where R ji and t ji are the rotation matrix and displacement from the jth node to the ith node. The initial and final values of rotation matrix (R) and the displacement (T) in ψ ji (x j , x i ) are set, respectively, by the position information of x i and x j extracted from the two boundary frames in the model.

Finally, Eq. (4) is substituted into Eq. (1), updating the distribution of x i at the nth state, i.e., Bel(n)(x i ).

  1. (3)

    Iteratively calculating the Belief for all the variable nodes until the termination condition (Bel(n)(x i )–Bel(n1)(x i )<ε, ε = 10−5) was satisfied, allowing the convergence of the ultimate state value.

  2. (4)

    The mean values of Bel(n)(x i ) in the last state were taken as the ultimate estimation of the ith node, i.e., the coordinates of vertebral landmarks. The sagittal angles of the intervertebral joints were then computed from the ultimate estimates of the coordinates of the corners of the two adjoining intervertebral bodies [20].

2.5 Movement sequence patterns of the lumbar spine

Using the data predicted by the model, the movement sequencing of the intervertebral joints of the lumbar spine was determined for four movements—extension-neutral, neutral-flexion, flexion-neutral, and neutral-extension. In order to provide uniform data length for all movement–time curves, data was normalized with respect to time using the spline function, and expressed as percentage of the total time. An intervertebral joint was considered to start moving when the change in joint angle became greater than 1°. Movement sequence was determined from the order of movements using the proposal of Gatton and Pearcy [5]. Five categories of movement sequence were defined—“all together” (where all joints started movement simultaneously), “bottom up” (when the bottom of the lumbar spine L5/S1 was first to move, and the top T12/L1 last to move), “middle first” (when the middle joints, i.e., either L2/3 or L3/4, moved first), “top down” (when the T12/L1 joint moved first and the L5/S1 joint moved last), and “other” (not fitting anyone of the above categories).

3 Results

3.1 Validation of model

Based on the boundary conditions provided by the radiographic images at the end of flexion and extension, the Bayesian belief network model predicted the coordinates of the vertebral landmark-given observations (coordinates of radio-opaque markers on the sensors). Figure 3 is an example of the intervertebral joint motion of the lumbar spine of one subject that was predicted by the model. For the purpose of validation, the intervertebral angles from the radiographic image in the neutral posture were compared with those calculated from the estimation from the model. The correlation coefficient between the estimated movements and the actual movements recorded on radiographs was found to be 0.99 (p < 0.01), indicating a very high degree of agreement. Error was defined as the difference between measured and predicted values. The magnitude of error of the various intervertebral joints is shown in Fig. 4. The overall mean error was found to be −1.45° ± 3.34°, where the L2/3 and L5/S1 joints had the least and largest mean errors, respectively. The error was considered to be small, and the model accurate enough for predicting intervertebral joint motion.

Fig. 3
figure 3

The dynamic movement of the lumbar vertebrae estimated by the Bayesian network model using data from the Fastrak sensors. The final positions of flexion and extension are shown. x1–x12 and y1–y8, which are the the lumbar vertebra and the skin-mounted sensors, respectively

Fig. 4
figure 4

Prediction error for the intervertebral angles (unit: degree)

3.2 Movement sequence patterns

There were wide variations in the movement sequences of the intervertebral joints among subjects. Table 1 shows the frequency of occurrence of each movement pattern. Most subjects exhibited patterns, which were not classifiable and belonged to the category “others”. The most frequently observed sequence was the “bottom-up” pattern, and no subject exhibited the “top-down” pattern. It was noted that for subjects classified as the same sequence pattern, the rate of change of intervertebral angles (velocities) varied from subject to subject. Interestingly, it was also observed that some individual joints extended during flexion motion and vice versa.

Table 1 Movement sequence patterns of the intervertebral joints of the lumbar spine

4 Discussions

This study was the first attempt to determine the kinematics and movement patterns of the intervertebral joints using the Bayesian belief network model, a model, which is commonly used in artificial intelligence [13]. The degree of correlation between the measured and predicted movements was excellent with a mean error of less than 1.5°. Some previous studies [20, 34] had reported accuracy of similar magnitude, but they analyzed only intervertebral displacement information only in the static posture. The current model has many other potential benefits. For instance, it is easier to operate and computationally less complex when compared to the inverse kinematic model proposed by Sun et al. [34]. Unlike previous models, the Bayesian model makes no assumption about the geometry and structure of the kinematic chain, and is therefore highly robust for predicting spinal motions.

Measurements of spinal motions using skin-mounted sensors are subjected to considerable error due to soft tissue deformation between the spine and the sensors. The sensor displacements may not reliably represent the actual spinal motions. Due to the soft tissue deformation, the spatial association between the vertebrae and the sensors are constantly changing. It would be impossible to establish this association using traditional statistical methods such as correlation or regression because the soft tissue deformation is rather unpredictable. However, the present model solves this problem by estimating the spatial association between the vertebrae and the sensor through probabilistic inference using the belief propagation algorithm. This allows spinal motions to be successfully determined using surface information, as demonstrated by the small error observed in this study.

Table 1 shows that the L5/S1 joint was subjected to a larger error compared to the other intervertebral joints. This is mainly because one landmark on the S1 vertebra is remote to the sensor site (in Fig. 1, 2). In the model construction, two upper corners of the sacrum were chosen as landmarks to define the bone position. One of the element nodes for S1, x 11 (Fig. 2), was further away from the vertebra when compared to the other element nodes. This increased node distance may have accounted for the larger prediction error observed in the L5/S1 joint. Equations (1)–(4) of the model show that node distance is an essential factor in the algorithms for belief propagation. A smaller distance would result in a more reliable relationship between the element node and the observation. This problem can be minimized in future by choosing another bony landmark such as the S2 spinous process.

This study clearly shows that there is wide variation in the way that the lumbar spine moves between subjects. The variability in the motion sequence patterns is illustrated in Table 1. This is largely because the human spine has redundant degree of freedom, giving great flexibility in the performance of a movement. For instance, spinal flexion can be accomplished with infinite configurations of the intervertebral joints. Previous studies on lumbar spine kinematics assumed certain patterns in movement sequence of the lumbar spine. For instance, Kanayama et al. [12] suggested that during forward flexion the spine moves from the top first, though this was not observed in this study. McGill [25] assumed that all the intervertebral joints moved at the same time in his model, but in this study, only about 4% of examinations showed an “all together” pattern. Our study supports the findings of Gatton and Pearcy [5], who reported that different spines exhibit different movement sequences. Both our study and this earlier work showed that the “bottom-up” pattern was the most common pattern. However, our study may provide more reliable data because our work was based on the best estimates of intervertebral motions using probablisitic inference. The observation of Gatton and Pearcy was only based on information obtained from skin-mounted sensors and they did not estimate the underlying intervertebral motions.

The most interesting finding in this study was that some intervertebral joints move in a direction opposite to the direction of bending of the lumbar spine. That is, some joints exhibited extension during flexion of the spine and vice versa. Such aberrant motion has also been reported by previous studies [5, 22, 26, 28, 30, 33, 36]. The movement patterns of the spine are often oversimplified in many textbooks, and this study had provided a detailed description of subject variability in the spinal motion pattern. However, our sample size is small, and a larger study will be required to fully document the various patterns. For this purpose, our study does provide a valid and accurate model, which allows an easier assessment of spinal motion. The clinical implications of the aberrant motions are still unclear, and further studies will be required to address this issue.

Future biomechanical studies will benefit from the dynamic model developed in this study. The actual intervertebral rotations can be predicted by the model, and no assumptions have to be made regarding the movement pattern. The relationship among the various segments did not have to be assumed to be constant throughout the movement. The accuracy of the biomechanical model is thus tremendously improved. This will allow us to study how different kinematic patterns in different individuals produce different loading patterns in the spine.

Knowledge of the kinematic patterns of the spine may also be useful in the clinical assessment of spinal disorders such as back pain, spondylolysis or spondylolisthesis. A major attraction of the model is that it provides kinematic data and allows us to determine the higher derivatives of motions such as velocities and accelerations. Previous research showed a significant relationship between back pain and intervertebral motion [1]. The association between pain and higher movement derivatives is still unclear. This can be explored using the present model.

It should be pointed out that the present study was conducted in a group of elderly subjects. It is likely that the Bayesian belief network model could also employed in younger subjects with similar accuracy. However, the kinematic and movement sequence pattern observed in this study may not be generalized to the younger age group, and this will need to be clarified in future studies. It would also be interesting to study how activity level and quality of life may influence the kinematic patterns among the elderly subjects.

5 Conclusion

This is the first study, which employs a Bayesian belief network model to study the kinematic pattern of the intervertebral movement of the lumbar spine. The developed model is able to estimate the intervertebral angle reliably with a mean error of less than 1.5°. The technique is non-invasive and has the potential to be used clinically to measure kinematics of intervertebral movement. It will also be useful to biomechanical modelling when kinematic data are required as input variables. This study showed a wide variation in the kinematic patterns in the subjects studied. The present model may also be used to study velocity and acceleration of intervertebral joint providing further insight into the kinematics of the lumbar spine.