Abstract
To assess whether application of a support vector machine learning algorithm to ancillary data obtained from posterior-anterior dual-energy X-ray absorptiometry (DEXA) studies could identify patients with lumbar spine (L1–L4) vertebral body fractures without additional DEXA imaging or radiation. Three hundred seven patients (199 without any fractures of the spine, and 108 patients with at least one fracture of the L1, L2, L3, or L4 vertebral bodies) who had DEXA studies were evaluated. Ancillary data from DEXA output was analyzed. The dataset was split into training (80%) and test (20%) datasets. Support vector machines (SVMs) with 10-fold cross-validation and different kernels were used to identify the best kernel based on the greatest area under the curve (AUC) and the best training vectors in the training dataset. The SVM with the best kernel was then applied to the test dataset to assess the accuracy of the SVM. Receiver operating characteristic (ROC) curves of the SVMs using different kernels in the test dataset were compared using DeLong’s test. The SVM classifier with the linear kernel had the greatest AUC in the training dataset (AUC = 0.9258). The AUC of the SVM classifier with the linear kernel in the test dataset was 0.8963. The SVM classifier with the linear kernel had an overall average accuracy of 91.8% in the test dataset. The sensitivity, specificity, positive predictive value, and negative predictive of the SVM classifier with the linear kernel to detect lumbar spine fractures were 81.8%, 97.4%, 94.7%, and 90.5%, respectively. The SVM classifier with the linear kernel ROC curve had a significantly better AUC than the SVM classifier with the cubic polynomial kernel (P = 0.034) for discriminating between patients with lumbar spine fractures and control patients, but not significantly different from the SVM classifier with a radial basis function (RBF) kernel (P = 0.317) or the SVM classifier with a sigmoid kernel (P = 0.729). All fractures identified by the SVM classifiers were not prospectively identified by the radiologist. SVM analysis of ancillary data obtained from routine DEXA studies can identify lumbar spine fractures without the use of vertebral fracture assessment (VFA) DEXA imaging or radiation, and identify fractures missed by radiologists.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Low bone mineral density (BMD) may result in osteoporosis or osteopenia and is associated with increased risk of fractures including vertebral body fractures and hip fractures [1,2,3]. Fractures of the hip have been associated with increased morbidity and mortality in the elderly [4,5,6]; therefore, identifying patients at risk for hip fractures is important [4,5,6]. A previous spine fracture increases the risk of a subsequent hip fracture [7]; therefore, identifying patients with spine fractures is clinically important. BMD is most commonly measured in clinical practice using dual-energy X-ray absorptiometry (DEXA) imaging [8]. Screening DEXA studies are recommended by the US Preventive Task Force (USPTF)/World Health Organization (WHO) [9,10,11] and the International Society for Clinical Densitometry (ISCD) for the clinical evaluation of postmenopausal women aged 65 years or greater, and for individuals at high risk for low BMD [9,10,11]. Approximately three million DEXA studies were performed in 2006 in the USA alone [12, 13].
Incidental findings are commonly found on imaging studies including DEXA studies [14]. These incidental findings may be detected by the radiologist and may change clinical management [14,15,16,17]. The most common incidental findings include osteoblastic lesions, Paget’s disease of bone, and lumbar spine fractures [14,15,16,17,18,19]. Fractures of the spine may result in decreased vertebral body height and in some cases focal increased BMD [14, 18]. Lumbar spine fractures are particularly important because they change the patient’s clinical management and result in the need for additional imaging of the spine and possibly surgical consultation. Therefore, there is a clinical need to identify patients with lumbar spine fractures.
A prior report showed that approximately 16% of DEXA studies have incidental findings including fractures, and none of these incidental findings were commented on by the interpreting radiologist [14]. We hypothesized that machine learning algorithms could be used to identify lumbar spine fractures from DEXA studies without analysis of image pixel data using the DEXA data output. The goal of this manuscript was to use a support vector machine (SVM) classifier to automatically detect L1–L4 lumbar spine fractures using ancillary data from routine posterior–anterior (PA) DEXA studies without requiring additional DEXA imaging or radiation.
Subjects and Methods
The study was approved by the local Institutional Review board (IRB) with a waiver of the need for signed informed consent.
We identified 307 patients who underwent a DEXA study. These patients were treated at a tertiary care academic healthcare center between January 1, 2010, and April 1, 2018. Of these 307 patients, 108 (35.2%) had at least one fracture of the upper lumbar spine (L1, L2, L3, or L4) vertebral body. Patients were classified as having fractures if there was prior imaging (either computed tomography (CT), magnetic resonance imaging (MRI), or radiographs of the lumbar spine) showing the lumbar spine fracture prior to the DEXA study (Fig. 1). These lumbar spine fractures were initially diagnosed based on at least 20% vertebral body height loss and were diagnosed by an independent radiologist. Another musculoskeletal radiologist confirmed the findings and reviewed the initial radiology report. The control patients had no lumbar spine fractures based on imaging (CT, MRI, or radiographs) of the lumbar spine obtained after the DEXA study was performed.
DEXA Imaging
All patients were imaged using the same General Electric (GE) DEXA Lunar Prodigy Advance system (General Electric Healthcare, Chicago, IL) in order to minimize intermachine variability [20]. Quality assurance on this machine was performed as recommended by the International Society for Clinical Densitometry (ISCD) [21].
The quantitative measures used in the analysis were the patient’s age, sex, height, weight, and the ancillary DEXA data output: L1 BMD, L1 T-score, L1 Z-score, L1 area, L1 width, L1 height, L2 BMD, L2 T-score, L2 Z-score, L2 area, L2 width, L2 height, L3 BMD, L3 T-score, L3 Z-score, L3 area, L3 width, L3 height, L4 BMD, L4 T-score, L4 Z-score, L4 area, L4 width, L4 height, neck BMD, neck T-score, neck Z-score, neck bone mineral content (BMC), neck area, Wards BMD, Wards T-score, Wards Z-score, Wards BMC, Wards area, trochanter BMD, trochanter T-score, trochanter Z-score, trochanter BMC, trochanter area, shaft BMD, shaft T-score, shaft Z-score, shaft BMC, shaft area, total hip BMD, total hip T-score, total hip Z-score, total hip BMC, and total hip area.
Statistical Methods
Qualitative and quantitative variables were compared between patients in each group.
The proportion of males in the control patients was compared to the proportion of males in the patients with spine fractures using Fisher’s exact tests. Quantitative variables including age, height, and weight were compared between control patients and patients with spine fractures using t tests with unequal variances. Pearson’s correlations were used to evaluate the association between lumbar spine BMD (measured at L1, L2, L3, and L4, respectively) with patient age, height, and weight, respectively. Pearson’s correlations were used to evaluate the associations between lumbar vertebral body height (measured at L1, L2, L3, and L4, respectively) each with patient age, height, and weight, respectively.
This dataset was divided into two smaller datasets—a training dataset (80%) and a test dataset (20%). The training dataset had 86 (35.0%) patients with at least one fracture of the L1–L4 lumbar spine, and 160 (65.0%) control patients. The test dataset had the remaining 22 (36.0%) patients with at least one fracture of the L1–L4 lumbar spine, and 39 (63.9%) control patients. Approximately 65.4% (161/246) of the patients in the training dataset were female, whereas 67.2% (41/61) of the patients in the test dataset were female. Males had significantly higher BMD at L1 (95% CI (0.03, 0.13), P = 0.001), L2 (95% CI (0.04, 0.15), P = 5.4 × 10−4), L3(95% CI (0.03, 0.15), P = 0.003), L4 (95% CI (0.03, 0.16), P = 0.003), and larger vertebral body heights at L1 (95% CI (0.12, 0.27), P = 1.11 × 10−6), L2 (95% CI (0.12, 0.30), P = 2.86 × 10−6), L3 (95% CI (0.13, 0.31), P = 1.82 × 10−6), and L4 (95% CI (0.17, 0.33), P = 2.59 × 10−9) than females. Patients’ clinical and demographic characteristics are shown in Table 1.
A SVM was used to identify the training vectors that best discriminated patients with fractures from control patients in the training dataset. SVM is a supervised learning model that is often used for pattern recognition, classification, and regression analysis [22]. C-classification with four different kernels (linear, cubic polynomial, radial basis function (RBF) and sigmoid) [22,23,24,25] with 10-fold cross-validation was utilized.
These training vectors were then used to classify each patient’s DEXA study into one of two categories: either fracture of the lumbar spine (F), or control DEXA study (N) in the training and the test datasets. DeLong’s test was used to compare receiver-operator characteristic (ROC) curves [26].
Statistics were performed using Rv3.4 (https://www.r-project.org/). The e1071 package was used for the SVMs, and the pROC library [27] was used for DeLong’s comparison of the ROC curves. All statistical tests were two-sided, and P values < 0.05 were considered statistically significant.
Results
Patients with lumbar spine fractures were on average older (P = 0.006) and weighed less (P = 6.42 × 10−9) than the control patients. There was no significant correlation, r, between patient age and the lumbar spine BMD. Vertebral body heights at each level decreased with patient age (P < 0.01 at each level). Vertebral body heights at each level were also positively associated with patient height (P < 0.001 at each level) and weight (P < 0.001). Lumbar spine BMD was positively associated with increased patient weight (P < 0.001 at each level) (Table 2).
The SVM classifier with linear kernel had an accuracy of 93.5% (230/246) and AUC of 0.9258 (Table 3) for discriminating patients with lumbar spine fractures from control patients in the training dataset, and had the highest accuracy of the kernels evaluated in the training dataset. The sensitivity and specificity of the SVM with the linear kernel for discriminating patients with lumbar spine fractures from control patients using DEXA ancillary data in the training dataset were 89.5% (77/86) and 96.6% (153/160). The SVM classifier with linear kernel had an accuracy of 91.8% (56/61) and an AUC of 0.8963 in the test dataset. The SVM classifier with RBF kernel was more accurate than the SVM with the linear kernel in the test data for detecting patients with L1–L4 lumbar spine fractures, but this was not statistically significant (DeLong’s test P = 0.317). The SVM classifier with the RBF kernel detected L1–L4 lumbar spine vertebral body fractures significantly better than expected by chance (P < 0.001, 95% CI (0.84, 0.98)). The sensitivity and specificity of the SVM classifier with the linear kernel for discriminating patients with lumbar spine fractures from control patients in the test dataset were 81.8% (18/22) and 97.4% (38/39), and comparable to that of the SVM classifier with the RBF kernel.
The SVM classifier with RBF kernel had a ROC curve with significantly higher AUC than the SVM classifier with cubic polynomial kernel ROC curve, for discriminating patients with lumbar spine fractures from control patients in the test data (DeLong’s test P = 0.013). There was no significant difference in between the AUCs of the ROC curves from the SVM classifier with the RBF kernel, and the SVM classifier with linear kernel (DeLong’s test P = 0.317) or the SVM classifier with sigmoid kernel (DeLong’s test P = 0.543). The SVM classifier with linear kernel had a significantly better AUC for discriminating patients with fractures from control patients than the cubic polynomial kernel ROC curve (DeLong’s test P = 0.034) (Fig. 2). The SVM with the sigmoid kernel had an AUC that was not significantly different from that of the AUC of the SVM with the cubic polynomial kernel (DeLong’s test P = 0.181) or the AUC of the SVM with the linear kernel (DeLong’s test P = 0.729).
Discussion
The results show that patients with fractures of the L1-L4 lumbar spine can be identified after obtaining a screening posterior-anterior lumbar spine DEXA study by analysis of the DEXA ancillary data. Lumbar spine fractures were detected by the SVM with the linear kernel with sensitivity of 81.8% and specificity of 97.4%. All kernels evaluated performed reasonably well apart from the cubic polynomial kernel.
This study has tremendous clinical implications. Firstly, all 108 fractures identified were not prospectively identified by the diagnostic radiologist interpreting the DEXA study. The analysis shows that the optimized SVM detected over 81% of these fractures in the test dataset. Application of the SVM required no additional imaging or radiation, and utilized routine PA DEXA ancillary data that are produced by DEXA manufacturers. This is distinct from the DEXA vertebral fracture assessment (VFA), which requires a lateral projection of the thoracolumbar spine to assess for spine fractures and results in additional radiation to the patient.
DEXA screening studies are often used to identify patients at risk for hip, spine, and wrist fractures because of low bone mineral density; however, as this analysis shows, the DEXA screening study can also detect patients with upper lumbar spine fractures. This is clinically significant because detection of a spine fracture changes clinical management and often prompts further imaging workup including MRI studies to assess for potential spinal cord compromise. In addition, a lumbar spine fracture in a patient with osteoporosis results in a diagnosis of severe osteoporosis.
To our knowledge, this is the first study utilizing the ancillary data from PA DEXA output to predict lumbar spine fractures. The other results are similar to previously published studies. Males have been shown to have higher BMD than females [28, 29], which is supported by the data. BMD was strongly associated with patient height, and weight [30,31,32]. Prior reports have shown that vertebral body fractures [14]) can be detected by radiologists reviewing images from DEXA studies. DEXAs have been utilized to identify patients at risk for vertebral body fractures related to low BMD [1,2,3]. The data show that patients with fractured L1–L4 vertebral bodies had lower BMD than control patients, which is similar to prior published reports [1,2,3].
The study has a few limitations. Firstly, it is a retrospective study based on clinical data at a single academic institution, and therefore subject to ascertainment bias. The study sample proportion of patients with lumbar spine fractures was enriched for patients with fractures and therefore is not necessarily representative of the true prevalence noted in the general screening population. The patients were restricted to patients with their DEXA studies performed on the same DEXA scanner made by a single manufacturer. This was done to eliminate systematic differences that have been found between DEXA scanners made by different manufacturers due to differences in calibration [33, 34]. However, we believe that these results will be generalizable to other DEXA systems and would probably be most accurate if the training and testing datasets come from the same DEXA scanner manufacturer.
While less than 16% of DEXA studies have been reported to have incidental findings, approximately 35.1% (108/307) of the patients in this study had lumbar spine fractures, which potentially could affect the performance of the SVM in clinical practice. Our sample was somewhat enriched in the prevalence of lumbar spine fractures to allow the SVM algorithm to better predict these rare findings. The small sample size also limits the power of the algorithm to better predict the rare incidental findings that can be detected on DEXA studies. The failures to detect fractures appeared to be primarily because of Schmorl’s nodes/endplate deformities that could not be detected using the frontal projection obtained routinely for DEXA studies, or because the technologist’s tracing of each vertebral body for measurement of vertebral body height and area were inaccurate. However, our study is the first to show that fractures can be identified using machine learning algorithms without analysis of image pixel data and that these algorithms can be used to aid radiologists in identifying these often missed fractures that change clinical management. Further research is required to validate our findings in larger studies.
In summary, SVM classifiers can use quantitative ancillary data routinely obtained from DEXA studies to identify L1–L4 lumbar spine fractures. Machine learning algorithms can be used as an adjunct to identify incidental findings and assist radiologists in the interpretation of DEXA studies.
References
Cefalu CA: Is bone mineral density predictive of fracture risk reduction? Curr Med Res Opin 20(3):341–349, 2004
Unnanuntana A, Gladnick BP, Donnelly E, Lane JM: The assessment of fracture risk. J Bone Joint Surg Am 92(3):743–753, 2010. https://doi.org/10.2106/JBJS.I.00919
McClung MR: The relationship between bone mineral density and fracture risk. Curr Osteoporos Rep 3(2):57–63, 2005
Schnell S, Friedman SM, Mendelson DA, Bingham KW, Kates SL: The 1-year mortality of patients treated in a hip fracture program for elders. Geriatr Orthop Surg Rehabil 1(1):6–14, 2010. https://doi.org/10.1177/2151458510378105
Panula J, Pihlajamäki H, Mattila VM, Jaatinen P, Vahlberg T, Aarnio P, Kivelä SL: Mortality and cause of death in hip fracture patients aged 65 or older: a population-based study. BMC Musculoskelet Disord 12:105, 2011. https://doi.org/10.1186/1471-2474-12-105
Cameron ID, Chen JS, March LM, Simpson JM, Cumming RG, Seibel MJ, Sambrook PN: Hip fracture causes excess mortality owing to cardiovascular and infectious disease in institutionalized older people: a prospective 5-year study. J Bone Miner Res 25(4):866–872, 2010. https://doi.org/10.1359/jbmr.091029.
Haentjens P, Autier P, Collins J, Velkeniers B, Vanderschueren D, Boonen S: Colles fracture, spine fracture, and subsequent risk of hip fracture in men and women. A meta-analysis. J Bone Joint Surg Am 85-A(10):1936–1943, 2003
Garg MK, Kharb S: Dual energy X-ray absorptiometry: Pitfalls in measurement and interpretation of bone mineral density. Indian J Endocrinol Metab 17(2):203–210, 2013. https://doi.org/10.4103/2230-8210.109659
Preventive Services US, Force T: Screening for osteoporosis: recommendation statement. Am Fam Physician 83(10):1197–1200, 2011
World Health Organization: Assessment of fracture risk and its application to screening for postmenopausal osteoporosis: technical report series 843. Geneva: WHO, 1994
Kanis JA, Gluer CC, for the Committee of Scientific Advisors, International Osteoporosis Foundation An update on the diagnosis and assessment of osteoporosis with densitometry. Osteoporos Int 200011192–202.
O'Malley CD, Johnston SS, Lenhart G, Cherkowski G, Palmer L, Morgan SL: Trends in dual-energy X-ray absorptiometry in the United States, 2000-2009. J Clin Densitom 14(2):100–107, 2011. https://doi.org/10.1016/j.jocd.2011.03.003.
Curtis JR, Laster A, Becker DJ, Carbone L, Gary LC, Kilgore ML, Matthews RS, Morrisey MA, Saag KG, Tanner SB, Delzell E: The geographic availability and associated utilization of dual-energy X-ray absorptiometry (DXA) testing among older persons in the United States. Osteoporos Int 20(9):1553–1561, 2009. https://doi.org/10.1007/s00198-008-0821-xEpub 2008 Dec 24
Bazzocchi A, Ferrari F, Diano D, Albisinni U, Battista G, Rossi C, Guglielmi G: Incidental findings with dual-energy X-ray absorptiometry: Spectrum of possible diagnoses. Calcif Tissue Int 91(2):149–156, 2012. https://doi.org/10.1007/s00223-012-9609-2Epub 2012 May 24
Gregson CL, Hardcastle SA, Cooper C, Tobias JH: Friend or foe: High bone mineral density on routine bone density scanning, a review of causes and management. Rheumatology (Oxford) 52(6):968–985, 2013. https://doi.org/10.1093/rheumatology/ket007Epub 2013 Feb 27
Vasireddy S, Halsey JP: Incidental detection of lumbar Paget’s disease by bone densitometry. Rheumatology (Oxford) 40(12):1424–1425, 2001
Cherian RA, Haddaway MJ, Davie MW, McCall IW, Cassar-Pullicino VN: Effect of Paget’s disease of bone on areal lumbar spine bone mineral density measured by DXA, and density of cortical and trabecular bone measured by quantitative CT. Br J Radiol 73(871):720–726, 2000
Martineau P, Bazarjani S, Zuckier LS: Artifacts and incidental findings encountered on dual-energy X-ray absorptiometry: Atlas and analysis. Semin Nucl Med 45(5):458–469, 2015. https://doi.org/10.1053/j.semnuclmed.2015.02.001
Mehta SD, Sebro R: Random forest classifiers aid in the detection of incidental osteoblastic osseous metastases in DEXA studies. Int J Comput Assist Radiol Surg 14:903–909, 2019. https://doi.org/10.1007/s11548-019-01933-1
Genant HK, Grampp S, Glüer CC, Faulkner KG, Jergas M, Engelke K, Hagiwara S, Van Kuijk C: Universal standardization for dual x-ray absorptiometry: Patient and phantom cross-calibration results. J Bone Miner Res 9:1503–1514, 1994
Lewiecki EM, Binkley N, Morgan SL, Shuhart CR, Camargos BM, Carey JJ, Gordon CM, Jankowski LG, Lee JK, Leslie WD: International Society for Clinical Densitometry best practices for dual-energy X-ray absorptiometry measurement and reporting: International Society for Clinical Densitometry Guidance. J Clin Densitom 19(2):127–140, 2016. https://doi.org/10.1016/j.jocd.2016.03.003Epub 2016 Mar 22
Shen T, Jiang J, Li Y, Wu P, Zuo C, Yan Z: Decision supporting model for one-year conversion Probability from MCI to AD using CNN and SVM. Conf Proc IEEE Eng Med Biol Soc 2018:738–741, 2018. https://doi.org/10.1109/EMBC.2018.8512398
Longato E, Acciaroli G, Facchinetti A, Maran A, Sparacino G: Simple linear support vector machine classifier can distinguish impaired glucose tolerance versus type 2 diabetes using a reduced set of CGM-based glycemic variability indices. J Diabetes Sci Technol 31:1932296819838856, 2019. https://doi.org/10.1177/1932296819838856
Cortes C, Vapnik VN: Support-vector networks. Mach Learn 20(3):273–297, 1995
Wajid SK, Elzaart A, Berbar MA. Eighth international conference computer graphics. 2011. 145–150. DOI:https://doi.org/10.1109/CGIV.2011.31
DeLong ER, DeLong DM, Clarke-Pearson DL: Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics. 44(3):837–845, 1988
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J, Müller M: pROC: An open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77, 2011
Nieves JW, Formica C, Ruffing J, Zion M, Garrett P, Lindsay R, Cosman F: Males have larger skeletal size and bone mass than females despite comparable body size. J Bone Miner Res 20(3):529–535, 2005 Epub 2004 Oct 11
Naganathan V, Sambrook P: Gender differences in volumetric bone density: A study of opposite-sex twins. Osteoporos Int 14(7):564–569, 2003 Epub 2003 Jun 26
Iwaniec UT, Turner RT: Influence of body weight on bone mass, architecture and turnover. J Endocrinol 230(3):R115–R130, 2016. https://doi.org/10.1530/JOE-16-0089Epub 2016 Jun 27
Caffarelli C, Alessi C, Nuti R, Gonnelli S: Divergent effects of obesity on fragility fractures. Clin Interv Aging 24(9):1629–1636, 2014. https://doi.org/10.2147/CIA.S64625.eCollection2014
De Schepper J, Derde MP, Van den Broeck M, Piepsz A, Jonckheer MH. Normative data for lumbar spine bone mineral content in children: Influence of age, height, weight and pubertal stageJ Nucl Med 1991;32(2):216–220.
Tothill P, Fenner JA, Reid DM: Comparisons between three dual-energy X-ray absorptiometers used for measuring spine and femur. Br J Radiol 68(810):621–629, 1995
Hui SL, Gao S, Zhou XH, Johnston, Jr CC, Lu Y, Glüer CC, Grampp S, Genant H: Universal standardization of bone density measurements: A method with optimal properties for calibration among several instruments. J Bone Miner Res 12(9):1463–1470, 1997
Funding
RS was supported by a Radiology Society of North America Research Scholarship.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
This study was approved by the senior author’s Institutional Review Board and the need for signed informed consent was waived.
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Mehta, S.D., Sebro, R. Computer-Aided Detection of Incidental Lumbar Spine Fractures from Routine Dual-Energy X-Ray Absorptiometry (DEXA) Studies Using a Support Vector Machine (SVM) Classifier. J Digit Imaging 33, 204–210 (2020). https://doi.org/10.1007/s10278-019-00224-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10278-019-00224-0