Introduction

Ossification of the posterior longitudinal ligament (OPLL) of the spine is characterized by ectopic bone formation in the spinal ligaments. In 1960, Tsukimoto [1] first reported OPLL in Japan and OPLL is a common disorder among Japanese and other Asian populations. The incidence of OPLL in Japan is about 3 % (1.8–4.1 %) [2], and the male/female ratio for patients diagnosed as having OPLL is 1.96 (1.1–3.0) [3]. OPLL causes compression of the spinal cord and leads to various degrees of myelopathy. Typical symptoms of OPLL are sensory and motor disturbance of the upper and lower extremities, abnormal reflexes, hyperresponsive deep reflexes, and bladder-bowel dysfunction. Various degrees of dysfunction, such as precise action and gait disturbance, lead to the restriction of activities involved in daily living and the deterioration of quality of life.

The occurrence and development of OPLL involve many environmental, systemic, and local factors. Examples of factors are diet, metabolic or endocrinological background, and mechanical stress [4]. Genetic susceptibilities to OPLL have been identified by several groups. COL11A2 [5], NPPS [6], and COL6A1 [7] have been reported as candidate genes for OPLL.

According to the report of the Investigation Committee on OPLL of the Japanese Ministry of Public Health and Welfare (now the Japanese Ministry of Health, Labour, and Welfare), OPLL of the cervical spine has been classified into four types by lateral plain radiographs [8, 9]: (1) continuous; a long lesion extending over several vertebral bodies, (2) segmental; one or several separate lesions behind the vertebral bodies, (3) mixed; a combination of the continuous and segmental types, and (4) circumscribed; mainly located posterior to a disc space (Fig. 1).

Fig. 1
figure 1

The classification for cervical OPLL

The guidelines committee of the Japan Orthopaedic Association proposed that the clinical diagnostic criteria for OPLL need the radiograph findings as well as the clinical symptoms (Table 1) [10]. A small ossification area that is not visible on radiographs but can be detected only by computed tomography (CT) images does not fulfill the diagnostic definition for OPLL. Therefore, it is sometimes difficult to differentiate OPLL from cervical spondylotic myelopathy (CSM) only by radiographs. Recently, sagittal CT images are used for diagnosis of OPLL, and we can evaluate the ossification area in detail with CT images. But CT images are not included in the clinical guidelines for OPLL. (As a proposal, the use of CT images is advised.) Accurate classification and diagnosis for cervical OPLL is important for treatment and prognosis, but the reliability of the classification has not been evaluated.

Table 1 Diagnostic criteria for cervical OPLL

The purpose of this study was to investigate the interobserver and intraobserver reliability of the classification and diagnosis for cervical OPLL by radiographs and CT images.

Materials and methods

Observers

Because experience in clinical practice was expected to affect the reliability of the classification and diagnosis, the observers were separated into two groups by their level of experience in orthopaedic surgery. One group included eight spine surgery specialists, and the other included eight residents in the orthopaedic residency programs. A total of 16 observers classified each patient’s image into five groups: continuous, segmental, mixed, circumscribed type OPLL, or CSM.

Patients and images

Fifty-seven patients (41 males and 16 females) with a diagnosis of OPLL or CSM were included in this study. The films of radiographs and CT images were scanned by computer (radiographs; 200 dpi, CT images; 300 dpi), and each image was displayed on a computer screen. The magnification of the images depended on each observer. To differentiate the ossification area clearly, we did not use CT myelography.

Evaluation for reliability

To evaluate interobserver reliability, the observers classified images only by radiographs, and next they classified by both radiographs and CT images. To evaluate intraobserver reliability, the observers followed the same procedure on another day (more than 24 h later). We used not only radiographs but also CT images to evaluate the degree of improvement of reliability, and we also evaluated difference in reliabilities between only radiographs and both radiographs and CT images. The reliability was examined with Cohen’s kappa values. Interpretation of the strength of agreement determined with the kappa values was given by adopting the criteria of Landis and Koch [11]: >0.81; Almost perfect, 0.61–0.80; Substantial, 0.41–0.60; Moderate, 0.21–0.40; Fair, 0–0.20; Slight.

Results

Interobserver and intraobserver reliability of the classification for cervical OPLL (Tables 2, 3)

In the first round, interobserver reliability was 0.528 with radiographs only (specialists 0.565, residents 0.476) and 0.633 using both radiographs and CT images (specialists 0.667, residents 0.623). In the second round, interobserver reliability was 0.574 with radiographs only (specialists 0.630, residents 0.526) and 0.658 using both radiographs and CT images (specialists 0.733, residents 0.622).

Table 2 Interobserver reliability of the classification for cervical OPLL
Table 3 Intraobserver reliability of the classification for cervical OPLL

In the first round, interobserver reliability with radiographs only showed moderate agreement, but interobserver reliability using both radiographs and CT images showed substantial agreement. In the second round, interobserver reliability was improved in comparison with the first round.

Intraobserver reliability was 0.477 with radiographs only (specialists 0.561, residents 0.392) and 0.605 using both radiographs and CT images (specialists 0.665, residents 0.544). Intraobserver reliability of residents with radiographs only was fair. Intraobserver reliability was improved by additional CT images.

Interobserver and intraobserver reliability of the diagnosis of either cervical OPLL or CSM (Tables 4, 5)

We also evaluated interobserver and intraobserver reliability of the diagnosis of either cervical OPLL or CSM. In the first round, interobserver reliability was 0.743 with radiographs only (specialists 0.758, residents 0.812) and 0.833 using both radiographs and CT images (specialists 0.710, residents 0.753). In the second round, interobserver reliability was 0.787 with radiographs only (specialists 0.817, residents 0.878) and 0.853 using both radiographs and CT images (specialists 0.832, residents 0.823). If the observers are separated into specialists and residents, interobserver reliability obtained using both radiographs and CT images was lower than that with radiographs only in each observer group. But interobserver reliability in all observers using both radiographs and CT images improved in comparison with radiographs only. By additional CT images, interobserver reliability of the diagnosis showed almost perfect agreement.

Table 4 Interobserver reliability of the diagnosis of either OPLL or CSM
Table 5 Intraobserver reliability of the diagnosis of either OPLL or CSM

Intraobserver reliability of the diagnosis was 0.613 (specialists 0.690, residents 0.537) with radiographs only and 0.802 (specialists 0.795, residents 0.808) using radiographs and CT images. Intraobserver reliability was improved by additional CT images.

Case presentation (Fig. 2)

Case 1

Using only radiograph, ten observers classified this case as CSM. Although the ossification area was unclear by radiograph, 15 observers classified it as cervical OPLL by additional CT image.

Fig. 2
figure 2

Case presentation

Case 2

All 16 observers classified this case as circumscribed cervical OPLL by additional CT image.

Case 3

By radiograph, eight observers classified this case as mixed cervical OPLL and the other eight observers classified it as continuous cervical OPLL. By additional CT image, 13 observers classified it as mixed cervical OPLL and the other three observers classified it as continuous cervical OPLL.

Discussion

In Clinical Guidelines for OPLL (Table 1) [10], diagnostic criteria for OPLL include requirements of radiographic images and clinical findings. According to the requirements of radiographic images, the ossification area must be visible in lateral plain radiograph, and a small ossification area that is visible in only CT images is not included in OPLL. If the lower cervical spine is not visible adequately by lateral plain radiograph, CT images are advised.

Chang et al. [12] reported inter- and intra-observer variability of cervical OPLL classification using reconstructed CT images. In their study, the inter- and intra-observer kappa values were 0.51 and 0.67 by the lateral radiograph, 0.70 and 0.85 by 2-D CT images. The reliability of their study was higher than that of our study.

In our study, interobserver and intraobserver reliability of classification for the cervical OPLL was improved by additional CT images. In lateral plain radiographs, facet joints of the cervical spine may overlap with the ossification area. Intervertebral bone spurs may also overlap with circumscribed OPLL. In contrast, we could not observe such overlaps in CT images. Interobserver reliability of specialists was higher than that of residents, and the reliability of residents was remarkably improved in comparison with that of spine surgery specialists by additional CT images. It seems that experience in clinical practice affects the interobserver and intraobserver reliability. The influence of experience in clinical practice may be reduced by CT images.

Interobserver reliability of the diagnosis for cervical OPLL was improved by additional CT images, and intraobserver reliability was improved remarkably. The criteria for OPLL seemed to affect the results. According to the current criteria for OPLL, it is difficult to evaluate an ossification area that is visible on CT image clearly but difficult to identify on plain radiograph.

It is well known that the ossification area is often progressive during the natural course of the disease. Shindo et al. [13] reported the long-term natural course of OPLL. Progression of the ossification was detected in 38 % of segmental OPLL, 75 % of continuous OPLL, and 55 % of mixed OPLL, but it was not related to aggravation of the myelopathy. OPLL often progresses after surgery, which may cause late-onset neurological deterioration. Kawaguchi et al. [14] reported the relationship between the progression of ossification of the OPLL and the clinical results following en bloc cervical laminoplasty. Young patients with mixed and continuous types OPLL had the greatest risk for progression. Some patients had neurological deterioration following an increase in the thickness of the ossification. Hori et al. [15] reported on 55 patients after cervical laminoplasty who were available for serial radiographs for more than 5 years. The patients were divided into three groups according to the pattern of OPLL progression. OPLL progression related to patient age or OPLL type. In genetic study, it was reported that mixed or continuous type OPLL had higher osteogenic differentiation potency than segmental or circumscribed type OPLL [16]. As above, patients with mixed or continuous OPLL had the greatest risk for progression of the ossification area. It is important for prognosis and diagnosis that reliability of the classification for cervical OPLL has high agreement. We will work out the new classification for cervical OPLL by advanced CT images.

CT images were very valuable on the classification and diagnosis for cervical OPLL; however, there were some problems. One problem was the cost. CT images of cervical spine cost about three times more than radiographs of cervical spine in Japan. Another problem was the radiation dosage to the patients. The radiation dose of CT images of cervical spine was about 17 times higher than that of radiographs of cervical spine in our institute (The radiation dosage of CT images of cervical spine was 61.8 mGy and that of radiographs of cervical spine was 3.6 mGy). In terms of the cost and the radiation dosage, CT images were not repeatable test.

This study suggested that interoberver and intraobserver reliability of the classification for cervical OPLL was improved by additional CT images, which also improved intraobserver reliability of the diagnosis for cervical OPLL. We propose that diagnostic criteria for OPLL include both radiographs and CT images.