Introduction

Idiopathic scoliosis (IS) is a three-dimensional lateral curvature of the spine coupled with vertebral rotation for which there is no known cause. About 2–4% of the adolescent population has some degree of scoliosis1,2. Approximately 2.2% of these adolescents will require treatment, consisting of observation, orthotic (brace) treatment, or surgery3.

If scoliosis is left untreated and a large curve develops, it can injure both the lungs and heart causing significant health problems. Treatment decisions for scoliosis are based on consideration of the patient’s physiologic maturity, curve severity, curve location, cosmetic concerns, and the estimated potential for progression. The Cobb angle method4 is the gold standard to measure the curve severity. A Cobb angle less than 10° is not considered to be scoliosis. Spinal deformity with a Cobb angle of 10° to 25° will be monitored regularly until skeletal maturity or significant curve progression. If the Cobb angle is 25° to 45°, brace treatment is suggested. If the Cobb angle is greater than 45°, surgery is usually recommended3. Generally, 5° of change or more between two successive radiographs is considered as the indication of curve progression5.

To measure the Cobb angle, the end-vertebrae that tilt most severely toward the concavity of the spinal curve are identified. The Cobb angle can be manually measured by calculating the angle (θ) between the lines respectively drawn along the upper endplate of the superior end-vertebra and the lower endplate of the inferior end-vertebra, as shown in Figure 1. However, the manual measurement of Cobb angle depends on experience and judgment. Errors are due to selecting different end-vertebrae and estimating different slopes of the vertebrae. Even when the same end-vertebrae are used for measurement, the standard measurement error is 3° to 5° for the same observer and 5° to 7° for different observers57, which are beyond the 5° for progression assessment. In addition, the manual measurement is tedious and time-consuming.

Fig 1
figure 1

Cobb angle measurement.

Allen et al.8 developed an automatic Cobb measurement method based on active shape models, where the training set was required, and the measurement results heavily relied on the training data. Chockalingam et al.9 proposed a method to measure the Cobb angle automatically. Their program automatically generated eight horizontal lines over the region of interest (ROI) dividing the spine image into equally spaced regions. Users set two points on each line where it intersected the vertebrae edge. Connecting these midpoints on each of the line defined the midline of the spine. The Cobb angle was calculated based on the midline of the spine. The accuracy of this method depended on how well the edges of the vertebrae were identified. Xu et al.10 used a similar method, which was based on the spinal midline to measure the Cobb angle. Their method automatically detected the boundary of the spine joining the vertebrae. However, the accuracy of the study has not been reported.

This proposed work is a new approach which incorporates the vertebral shape prior into the fuzzy Hough Transform (FHT)11 to automatically detect the directions of the end-vertebrae on the posteroanterior (PA) radiograph. After the directions of the end-vertebrae were automatically determined, the Cobb angle is calculated. No training set is required, and minimum involvement is needed. The reliability and repeatability of this method was tested by two examiners in a 2-week period. The overall goal of this study was to reduce measurement error of the Cobb angle by reducing the judgment required in measuring the severity of idiopathic scoliosis.

Materials and Methods

Materials

Seventy-six PA radiographs of 71 females and 5 males age 14.7 ± 2.3 who attended one site between April and November 2007 using Fuji CR 5000R computed radiography with 10 pixels/mm resolution were selected. The selection criteria were (1) diagnosis of idiopathic scoliosis, (2) ages between 9 and 18 years, and (3) Cobb angle less than 90°. The exclusion criteria were patients who (1) had other musculoskeletal or neurological disorders, (2) prescribed a brace, or (3) had had surgery. This study received ethics approval from the local ethic board.

The maximum Cobb angle measured by the orthopedic surgeon was 68°. According to the Lenke classification12, there are 29 cases of type 1, 1 case of type 2, 14 cases of type 3, 16 cases of type 5, and 16 cases of type 6.

Methods

Although a fully automatically method is one of the goals of this study, the developed method still requires the user to select the two end-vertebrae of the curve as well as the brightness and contrast adjustment. As illustrated in Figure 1, once the directions of the lines that fit the endplates of the end-vertebrae (θ 1 and θ 2) are known, the Cobb angle θ can be calculated as θ = θ 1 + θ 2. In this approach, after the end-vertebrae were selected, the directions of these lines were automatically estimated based on fuzzy Hough transform. The developed algorithm was implemented in MATLAB 7.0 and Visual C++.NET. The whole processing procedures are illustrated in the flowchart (Fig. 2).

Fig 2
figure 2

Processing procedure.

Preprocessing

Because the quality of the spinal radiographs was inconsistent, and the vertebral sizes were variant in a large range, it was difficult to process the images directly by the computer. Therefore, the original radiograph was cropped from cervical vertebra (C7) to sacrum and was resized to a standard height of 1,000 pixels. Each resized images was manually enhanced by adjusting the brightness and contrast using Image J software [NIH, USA]. The most tilted end-vertebrae were selected from the enhanced images. The regions of interest (ROI of size 100 × 80 pixels, as shown in Figure 3a) containing the selected end-vertebrae were created by the algorithm.

Fig 3
figure 3

Preprocessing. a ROIs selection; b denoised ROIs by anisotropic diffusion; c edge detection by Canny operator; d edge images without anisotropic diffusion.

Because the radiograph contains a complex background due to various types of artifacts, it is necessary to denoise the images to facilitate the following processing. The anisotropic diffusion algorithm13 can remove noises while preserving edge information. It can be described as the partial differential equation:

$$\frac{{\partial I\left( {x,y,t} \right)}}{{\partial t}} = {\text{div}}\left[ {c\left( {x,y,t} \right) \cdot \nabla I\left( {x,y,t} \right)} \right]$$
(1)

where I is the image, x and y represent the spatial coordinates of each image pixel, t is artificial time parameter corresponding to the iteration index in the discrete implementation, div represents divergence operator, ▿ is the gradient operator, and c(x, y, t) is the diffusion conductance function, usually defined as

$$c\left( {x,y,t} \right) = c\left( {\left| {\nabla I\left( {x,y,t} \right)} \right|} \right) = e^{ - \frac{{\left| {\nabla I\left( {x,y,t} \right)} \right|^2 }}{{K^2 }}} $$
(2)

where K is the relaxation parameter which is chosen according to the noise levels. K was set as 6.0 in the experiments. The diffusion conductance function diffuses more in smooth areas and less across edges. Therefore, the anisotropic diffusion method can realize intra-region smoothing in preference to smoothing across edges. Figure 3b is the denoised images of the ROIs in Figure 3a.

Edge information is a prerequisite to implement the Hough transform. The Canny operator14 is the most commonly used method and is regarded as optimal for edge detection. The Canny edge detector is then applied to Figure 3b images, and the results are shown in Figure 3c. To illustrate the effect of the anisotropic diffusion, the edge images of the ROIs (without applying the anisotropic diffusion) obtained by the same Canny operator are shown in Figure 3d, where many noisy edges are found.

Fuzzy Hough Transform

Hough transform (HT)15,16 is originally a technique to detect straight lines. In the image space (xy plane), any line is represented as ρ = x cos θ + y sin θ, where ρ is the distance between the line and the origin, and θ is the angle of the vector from the origin to the closest point of the line, as illustrated in Figure 4.

Fig 4
figure 4

Representation of a straight line.

The (ρ, θ) plane is referred as Hough space. Accordingly, a point (x 0, y 0) in the image space corresponds to a curve ρ = x 0 cos θ + y 0 sin θ in the Hough space, and all the points belonging to a particular line (specified by (ρ 0, θ 0)) correspond to a family of curves passing through a common point (ρ 0, θ 0). Applying the HT to a set of edge points results in a two-dimensional function C(ρ, θ), which represents the number of edge points satisfying the linear equation ρ = x cos θ + y sin θ. In practical applications, the Hough space is quantized and the accumulator array C(ρ k , θ k ) is obtained. By finding the local peaks of C(ρ k , θ k ), the most likely lines can be detected.

However, in the conventional HT, even if an edge point is near the line specified by (ρ k , θ k ), it does not contribute to C for that (ρ k , θ k ). In other words, there is no difference between a point being far from the ideal line or being just a little off it. Because each side of the vertebra is not exactly a straight line, but is close to a straight line (as shown in Fig. 3c or d), it may be difficult to correctly find out the peaks in the Hough space to specify the line segments that fit to the vertebra sides. Han et al.11 developed the FHT, where around each point on the ideal line a small region R was defined, and each edge point in this region contributed more or less to the accumulator C depending on its distance (d) from the ideal line. Therefore, the peaks could be identified correctly even for distorted line structures. In addition, a faster algorithm11 was proposed to realize the FHT by convolving the accumulator C along the ρ axis with the membership function

$$g\left( d \right) = \left\{ {\begin{array}{*{20}c} {ke^{{{ - d^2 } \mathord{\left/ {\vphantom {{ - d^2 } {\sigma ^2 }}} \right. \kern-\nulldelimiterspace} {\sigma ^2 }}} } & {d <R} \\ 0 & {{\text{otherwise}}} \\ \end{array} } \right.$$
(3)

where k, σ, and R were constants that were chosen empirically as k = 2 and R = σ = 4 in the experiments.

The effectiveness of the FHT is illustrated in Figure 5, where a and d are the edge images, and b, c, e, and f show the detected lines by finding the maximum peaks in Hough space. In Figure 5d, e, and f, one dot represents one edge pixel. Figure 5b and c shows the detected lines of a by using the conventional HT and FHT, respectively, where the lines detected by the FHT are visually more convincing to humans. Figure 5e and f shows the detected lines of d by using the HT and FHT, respectively, where the conventional HT finds seven lines and each one passes through six edge points, while the FHT finds the single line that correctly fits to the edge points.

Fig 5
figure 5

Comparison of the conventional HT and FHT: edge images (a) and (d), detected lines of (a) by HT (b) and FHT (c), detected lines of (d) by HT (e) and FHT (f).

If the FHT can detect the line segments that fit the endplates of the end-vertebrae properly, the Cobb angle can be calculated from the directions of these lines as described previously. However, because of the complexity of vertebra images, the FHT sometimes failed to find the proper lines. As an example, Figure 6 shows the edge images of vertebrae (in the upper row) and the incorrect endplates detected by the FHT (in the lower row), where the artifacts from the structure within the vertebra (a), the neighboring vertebra (b), and the background (c) caused the lines (detected by finding the maximum peaks in Hough space) fail to fit to true endplates of the end-vertebrae. Our solution was to incorporate the vertebral shape prior into the FHT.

Fig. 6
figure 6

ac Edge images of vertebrae and incorrect line segments detected by FHT.

Vertebral Shape Prior in Hough Space

Let H 1 = (ρ 1, θ 1), H 2 = (ρ 2, θ 2),…, and H l  = (ρ l , θ l ) denote the l peaks of C(ρ k , θ k ). It can be observed that the vertebral sides satisfy the specific geometric relations in Hough space:

  1. 1.

    Let the peaks (ρ i , θ i ) and (ρ j , θ j ), respectively, correspond to the upper and lower endplates of a vertebra (the horizontal sides of the vertebra). For a minor-deformed vertebra, θ i and θ j are approximately equal, i.e., |θ i  − θ j| < T 1, (T 1 = 10 is the threshold), as two endplates are almost parallel. Similarly, the pair of peaks corresponding to the vertical sides of the vertebra satisfies the same condition.

  2. 2.

    Let \(\bar \theta _{\text{H}} \) and \(\bar \theta _{\text{V}} \) represent the average θ values of the pairs of peaks that respectively correspond to the horizontal sides and the vertical sides. For a minor-deformed vertebra, \(\left\| {\bar \theta _{\text{H}} - \bar \theta _{\text{V}} } \right.\left| { - 90} \right| <T_2 \), (T 2 = 10 is the threshold), as the horizontal sides and the vertical sides are close to perpendicular to each other.

  3. 3.

    The value of \(\bar \theta _{\text{H}} \) (corresponding to the pair of endplates) is less than 45° (half of 90°), as 45° is considered as the maximum angle of a single tilted vertebra, and 90° is considered as the maximum Cobb angle possibly met with the inclusion criteria of this study.

  4. 4.

    The difference between the ρ values of two peaks that correspond to the endplates is at the range from 30 to 60 pixels, i.e., 30 < | ρ i  − ρ j | < 60, as the height of the vertebrae in the resized images is within this range. Similarly, the pair of peaks corresponding to the vertical sides satisfies 40 < |ρ i  − ρ j | < 80.

Automatic Cobb Angle Measurement

Based on the vertebral shape prior, the Cobb angle can be calculated by the following steps.

  1. Step 1

    apply the FHT to the edge image. In Hough space, choose the peaks whose C values are beyond 60% of the maximum C value.

  2. Step 2

    pair the peaks that satisfy |θ i  − θ j| < T 1. Let \(\bar \theta \) denote the average angle of a pair of peaks, i.e., \(\overline \theta = {{\left| {\theta _j + \theta _j } \right|} \mathord{\left/ {\vphantom {{\left| {\theta _j + \theta _j } \right|} 2}} \right. \kern-\nulldelimiterspace} 2}\). Select the pairs, which satisfy \( - 45 <\overline \theta <45\) and 30 < | ρ i  − ρ j | < 60, as the horizontal candidate pairs; select the pairs, which satisfy \(\overline \theta >45\) or \(\overline \theta <- 45\) and 40 < |ρ i  − ρ j | < 80, as the vertical candidate pairs. Let \(\bar \theta _{{\text{H}}n} \) and \(\bar \theta _{{\text{V}}n} \), respectively, denote the average angles of the nth horizontal and vertical candidate pairs.

  3. Step 3

    compare all candidate pairs to find those satisfy that \(\left\| {\bar \theta _{{\text{H}}n} } \right. - \bar \theta _{{\text{V}}m} \left| { - 90} \right| <T_2 \). Identify the couple of pairs with the maximum average C values, of which the horizontal pair (\(\bar \theta _{\text{H}} \)) is considered as the peaks that correspond to the endplates. The corresponding angle \(\bar \theta _{\text{H}} \) is considered as the angle of the tilted vertebra. The detected line segments that fit to the endplates of the vertebrae in Figure 3c and in Figure 6 are respectively shown in Figure 7.

Fig 7
figure 7

ae Detected vertebra direction and line segments that fit to the endplates of the vertebrae.

Finally, the Cobb angle is calculated as the sum of the angles of two end-vertebrae.

Performance Analysis

The proposed method was tested and compared with the manual measurement method to evaluate its performance. The manual method was measured by an orthopedic surgeon specialized in scoliosis with 25 years experience. Two examiners were respectively asked to measure the Cobb angle on each radiograph by using the developed software twice over a period of 2 weeks. Examiner 1 (E1) was involved in scoliosis clinic for 20 years, and examiner 2 (E2) was the software developer with no clinical experience to measure Cobb angle manually. The results obtained from the surgeon were considered as the true values. All measurements on the same radiograph were performed on the same curve, although some spines had multiple curves.

Spearman correlation studies were used to assess the strength of the relation between two measurements. A higher correlation coefficient (r, varying between 0 and 1) indicates a better linear relation between two measurements. Paired t tests were performed to assess the significance of potential variation between two measurements. A p value (significance level) for Spearman correlation and paired t test was considered significant if less than 0.05. The intraclass correlation coefficient (ICC, varying between 0 and 1) with 95% confidence intervals (CI) was used to evaluate the reliability17. The Currier criteria18 for ICC values were adopted: 0.90–0.99 = high reliability, 0.80–0.89 = good reliability, 0.70–0.79 = fair reliability, <0.69 = poor reliability. As an error analysis, mean absolute difference of two measurements was also assessed, with the associated 95% CI. SPSS (SPSS Inc, Chicago, IL, USA) was used to perform these analyses.

Results and Discussion

To evaluate the validity of the proposed method, the automatic measurements from two examiners were respectively compared with the manual measurement from the surgeon. The results were given in Table 1. All Spearman correlation analyses showed r > 0.9 and p < 0.001, which indicated strong correlation between the automatic and manual measurements. The paired t tests results of p > 0.05 suggested that the differences between the manual and automatic measurements were not significant. All ICC values were higher than 0.95, with 95% CI between 0.924 and 0.981. The mean absolute differences between the automatic and manual measurements were all less than 5°, with 95% CI between 2.9° and 5.2°. These results demonstrated high agreement of the automatic method with the manual measurement method. These results also indicated that using the automatic method could obtain the similar results even by the examiner with little experience.

Table 1 Comparison Between Automatic and Manual Measurements

The results of the interobserver reliability analyses were presented in Table 2. It was shown that the measurements from two examiners were highly correlated with each other (r > 0.9 and p < 0.001). The paired t tests results of p > 0.05 indicated that the interobserver differences were not statistically significant. The ICCs > 0.95, with 95% CI between 0.945 and 0.982, showed high interobserver reliability. The mean absolute differences were less than 5° with 95% CI between 2.7° and 4.6°. These results suggested that the automatic method was consistent regardless of the experience of the examiners.

Table 2 Interobserver Analyses

The intraobserver reliability of the proposed method was assessed for each examiner. The results were presented in Table 3. The Spearman correlation results of r > 0.9 and p < 0.001 showed strong correlation between the measurements by the same examiner. The paired t tests results of p > 0.05 indicated that the measurements by the same examiner were not significantly different. Both intraobserver analyses produced ICCs > 0.95, with 95% CI between 0.966 and 0.990. The mean absolute differences were less than 3° with 95% CI between 2.1° and 3.5°. These results demonstrated high intraobserver reliability of the proposed method.

Table 3 Intraobserver Analyses

To investigate whether the severity of the curve would affect the automatic measurement, the radiographs were classified into three categories according to the Cobb angle measured by the surgeon: less than 25° (13 radiographs), between 25° and 45° (49 radiographs), and more than 45° (14 radiographs). The comparison between the manual and automatic measurements was then performed on three categories, respectively. All Spearman correlation analyses resulted in p < 0.001, which indicated that there was a strong correlation between the automatic and manual measurements regardless of the curve severity. The paired t tests results of p > 0.05 indicated that the intermethod differences were not statistically significant in any of the three categories. The ICCs and mean absolute differences were presented in Tables 4 and 5, respectively. It was shown in Table 4 that the lowest ICC value was 0.768 in the category of curve between 25° and 45°, and the highest ICC value was 0.953 in the category of curve more than 45°. Although the ICC values were different, the results still indicated that the severity of the curve did not cause the significant differences to the intermethod measurement. The mean absolute differences in Table 5 showed that the intermethod differences were still within 5° for any of the three categories. In the comparison between the automatic measurement from the E2 (no clinical experience) and the manual measurement from the experienced surgeon, the mean absolute differences was 5° in the category of curve more than 45°. It might be due to the selection of different vertebrae between the E2 and the surgeon.

Table 4 ICCs (95% CI) of Intermethod for Three Categories of Curves
Table 5 Mean Absolute Difference (95% CI) of Intermethod for Three Categories of Curves

Currently, both of the manual and the proposal methods depended on the quality of the radiographs. User’s manual adjustment to get good brightness and contrast images was required. The end-vertebrae selection was the relatively simple step during the user judgment. The manual method then required the user to draw two straight lines to pass through the end-plates of the vertebrae, but the computer algorithm automatically calculated the Cobb angle. The average computing time was less than half a minute after two end-vertebrae were selected. This was similar to the time that an experienced surgeon would need to determine the Cobb angle from a radiograph using the manual method. However, the computer method resulted in a smaller variability.

The proposed method required less user intervention, compared with the computerized method proposed by Chockalingam et al.9 where at least 16 points must be assigned manually to define the vertebral edges. Compared with another computerized method proposed by Allen et al.8, the proposed method did not require the training set. As long as each individual vertebra tilted less than 45°, the accuracy of the proposed method had no relations with the severity of the spinal curve. Although the shape constraints were reasonable for most radiographs, false detection might occur if a vertebra tilted more than 45°, or the vertebra had a severely deformed shape that did not satisfy the shape constraints. Although some of the endplates of vertebrae were more like a plate, as long as the two vertical sides of the vertebra were still close to parallel, the direction of the vertebra might be detected correctly. The algorithm would choose the most fitted couple of pairs of lines, which improved its robustness.

Conclusion

Although the proposed method still required user judgment to measure the Cobb angle, very little user interaction and skills are required. Manually drawing lines across the endplates of the end-vertebrae may be the major source for measurement errors. In the proposed method, the directions of these lines were automatically estimated to reduce the errors. The validity analyses demonstrated that this method highly agreed with the manual measurement method. The interobserver and intraobserver analyses indicated that the measurement errors were comparable to the threshold of changes that could influence treatment decisions. These results suggested that the proposed method could help orthopedic surgeons measure the Cobb angle more reliably and release their time during scoliosis clinic.