Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance

Spencer-Bonilla, Gabriela; Singh Ospina, Naykky; Rodriguez-Gutierrez, Rene; Brito, Juan P.; Iñiguez-Ariza, Nicole; Tamhane, Shrikant; Erwin, Patricia J.; Murad, M. Hassan; Montori, Victor M.

doi:10.1007/s12020-017-1298-1

Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance

Meta-Analysis
Published: 05 June 2017

Volume 57, pages 18–34, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Endocrine Aims and scope Submit manuscript

Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance

Download PDF

Gabriela Spencer-Bonilla^1,2,
Naykky Singh Ospina^1,3,
Rene Rodriguez-Gutierrez^1,4,
Juan P. Brito^1,5,
Nicole Iñiguez-Ariza⁵,
Shrikant Tamhane^1,5,
Patricia J. Erwin⁶,
M. Hassan Murad^1,7 &
…
Victor M. Montori^1,5

505 Accesses
8 Citations
Explore all metrics

Abstract

Background

Systematic reviews provide clinicians and policymakers estimates of diagnostic test accuracy and their usefulness in clinical practice. We identified all available systematic reviews of diagnosis in endocrinology, summarized the diagnostic accuracy of the tests included, and assessed the credibility and clinical usefulness of the methods and reporting.

Methods

We searched Ovid MEDLINE, EMBASE, and Cochrane CENTRAL from inception to December 2015 for systematic reviews and meta-analyses reporting accuracy measures of diagnostic tests in endocrinology. Experienced reviewers independently screened for eligible studies and collected data. We summarized the results, methods, and reporting of the reviews. We performed subgroup analyses to categorize diagnostic tests as most useful based on their accuracy.

Results

We identified 84 systematic reviews; half of the tests included were classified as helpful when positive, one-fourth as helpful when negative. Most authors adequately reported how studies were identified and selected and how their trustworthiness (risk of bias) was judged. Only one in three reviews, however, reported an overall judgment about trustworthiness and one in five reported using adequate meta-analytic methods. One in four reported contacting authors for further information and about half included only patients with diagnostic uncertainty.

Conclusion

Up to half of the diagnostic endocrine tests in which the likelihood ratio was calculated or provided are likely to be helpful in practice when positive as are one-quarter when negative. Most diagnostic systematic reviews in endocrine lack methodological rigor, protection against bias, and offer limited credibility. Substantial efforts, therefore, seem necessary to improve the quality of diagnostic systematic reviews in endocrinology.

Application of weighting methods for presenting risk-of-bias assessments in systematic reviews of diagnostic test accuracy studies

Article Open access 27 June 2021

Time to publication among completed diagnostic accuracy studies: associated with reported accuracy estimates

Article Open access 06 June 2016

Challenges of rapid reviews for diagnostic test accuracy questions: a protocol for an international survey and expert consultation

Article Open access 04 April 2019

Discover the latest articles, news and stories from top researchers in related subjects.

Medical Imaging

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Accurate diagnosis informs adequate treatment in clinical practice. Understanding the accuracy of diagnostic tests allows clinicians to discern between futile tests and those that can significantly advance the diagnostic process. Yet, clinicians often have inaccurate expectations regarding the benefits and harms of medical tests [1].

Despite the clinical importance of diagnostic tests, research regarding diagnosis usually lags behind its treatment counterpart, both in terms of quantity and quality [2, 3]. An analysis of research in the field of endocrinology suggests knowledge gaps (including those related to diagnosis) are not adequately addressed by ongoing research studies [4]. The Institute of Medicine’s (IOM) report on diagnostic errors, recognizes this knowledge gap, and highlights the need for the research community to improve the understanding of the diagnostic process [2].

Systematic reviews (SRs), summarizing the body of evidence, provide clinicians, patients, guideline panelists, and policymakers with best estimates of the accuracy of diagnostic tests and their usefulness in clinical practice. They allow clinicians to evaluate the body of available evidence instead of using only the latest, largest, or most well-known study to inform their practice. In order for clinicians to apply the results in practice, SRs should: (1) produce results that are clinically useful and (2) have credible and reproducible methods for synthesizing and reporting the evidence. Our logic model (Fig. 1) evaluates the presence of criteria that can help clinicians evaluate SRs in general (e.g., performing an assessment of risk of bias) and diagnostic reviews in particular (e.g., including only patients with diagnostic uncertainty) [5, 6].

In accordance with IOM’s recommendations, an assessment of the literature can help identify useful diagnostic tests in endocrinology and the conditions that warrant investment of scarce research resources to improve the diagnostic process. To this end, we identified all SRs and meta-analyses addressing the diagnostic accuracy of different tests in the field of endocrinology. Here, we summarize their findings along with the quality and reporting of the methods used in order to determine if the diagnostic evidence syntheses in endocrinology are: (a) clinically useful and (b) credible.

Methods

Eligibility criteria

We included SRs and meta-analyses reporting accuracy measures of diagnostic tests used in patients under evaluation for any endocrine condition. Endocrine conditions were defined as those that belonged to one of the following categories: (1) bone, (2) diabetes/glycemia, (3) neuroendocrine tumors, (4) pituitary–gonadal–adrenal, and (5) thyroid. Eligible studies reported clinically applicable diagnostic accuracy measures using: sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio, negative likelihood ratio, diagnostic odds ratio, diagnostic accuracy, or the area under the curve. Studies not reporting any of these diagnostic measures were excluded without author contact.

Search methods for identification of studies

A comprehensive search of MEDLINE, EMBASE, and Cochrane CENTRAL from inception to December 2015 without language restrictions was designed by an experienced medical librarian (P.J.E.) with input from the study’s principal investigators (N.S.O. and R.R.G.). Controlled vocabulary supplemented with keywords was used to search for SRs of diagnostic tests in endocrinology (Online Resource 1). We consulted Mayo Clinic experts in the field to identify references that could have been missed by our search.

Selection of studies and data management

The search results were uploaded into SR software (DistillerSR, Ottawa, Canada, https://distillercer.com/products/distillersr-systematic-review-software/). Six reviewers, working independently and in duplicate, reviewed all abstracts and titles for inclusion (J.P.B., N.I.A., R.R.G., N.S.O., G.S.B., and S.T.) and assessed the eligibility of full-text publications retrieved if at least one reviewer considered the abstract eligible. Disagreements were resolved by arbitration (a third reviewer decided whether the study should be included). Chance-adjusted agreement was quantified using the kappa statistic [7].

Data collection

Reviewers performed data collection independently and in duplicate using a standardized form and data extraction instructions after calibrating with two included full texts. Extracted data included: (a) general information about the review (first author, year of publication, country, condition of interest, diagnostic test, gold standard used, prevalence of the disease in the studied population, number of articles and patients included); (b) clinical performance of the diagnostic test (pooled summary statistics); and (c) methods and data reporting (search strategy, language restrictions, review of references, independent duplicate process, clear clinical question, selection criteria, summary of included studies, predefined subgroup analyses, methods used for data representation, method used for analysis, evaluation of risk of bias and tool used, author contact description, publication bias assessment, heterogeneity assessment, and assessment of the confidence warranted by the evidence).

In order to determine the clinical usefulness of diagnostic tests, subgroup analyses were performed including only studies in which a likelihood ratio (a measure of diagnostic accuracy) was provided or could be calculated from the available data (e.g., when sensitivity and specificity were provided). We chose likelihood ratios for this analysis since this diagnostic variable allows comparison of results across studies, can be applied regardless of the prevalence of a condition, and is easily understood and applied by physicians when expressed in non-technical language [8]. In cases in which a single diagnostic test had more than one review available, we included the study with the best clinical performance: a best-case scenario analysis.

We used descriptive statistics to summarize extracted variables. Authors’ descriptions of “risk of bias” and “quality” were considered to be reflecting the same construct [9]. Author ratings of studies as “high” or “moderate to high” quality were coded as low-risk of bias and “low” or “acceptable” quality ratings were coded as high-risk of bias. We considered assessments of “confidence in the evidence” to be only those that used the results of the review to assess the quality of the body of evidence. Bivariate and hierarchical models were considered to be adequate methods for statistical analyses [10]. When a description of methods for statistical analysis was provided, reviews that did not use these methods were coded as “other” if another method (e.g., Moses–Littenberg random effects) was used or “unclear” when the description was insufficient to ascertain the method used.

Results

The database search generated 2122 eligible reports. After abstract and title screening, 115 reports were identified for full-text review. After reproducible full-text screening (κ statistic = 0.89) [7], 84 studies were found to be eligible [11–95] (Fig. 2). The majority of the included studies evaluated diagnostic tests related to thyroid conditions (n-42, 50%), most commonly thyroid cancer, or diabetes/glycemia (n-19, 23%). Five SRs did not include a meta-analysis. General characteristics of the included studies are reported in Table 1.

Table 1 Summary of included systematic reviews

Full size table

Forty-four of the 84 included reviews, evaluating 65 diagnostic tests, provided a pooled estimate of the likelihood ratio or enough data to calculate it. The positive likelihood ratio (+LR) of these tests is shown in Fig. 3 and the negative likelihood ratio (−LR) in Fig. 4. Forty-seven percent of the 65 diagnostic tests had a positive likelihood ratio of ≥10. Only 35% of these studies (reporting helpful +LR), however, reported an overall statement of the risk of bias of the included studies. Online Resource 2 shows a summary of diagnostic test with +LR ≥ 10, considered to be most useful to “rule-in” a diagnosis in endocrinology.

Twenty-six percent of the tests had a negative likelihood ratio of ≤0.10, only five of which reported an overall statement of risk of bias. Online Resource 3 shows a summary of diagnostic tests with −LR ≤ 0.10, considered to be most useful to “rule-out” a diagnosis in endocrinology. The most useful test, when positive or negative was the composite of free cortisol, salivary midnight cortisol, and dexamethasone suppression test for ruling in or out Cushing’s syndrome.

The most commonly reported summary statistics were sensitivity and specificity (94 and 87%, respectively) (Online Resource 4). The most common way of representing findings was the use of forest plots (68%) followed by receiver operator curves (65%). Only one meta-analysis presented the results using a Fagan nomogram, and three used other graphical representations (Online Resource 5).

About half of the reviews clearly reported including only patients in which there was diagnostic uncertainty and one in three reported the prevalence of the condition in the study population. Only five studies assessed the overall confidence merited by the body of evidence with unclear methods and limited conclusions provided (Fig. 5, Online Resource 6).

Most of the reviews reported a clear gold standard (93%), a comprehensive search strategy, a well-defined clinical question, and clear inclusion criteria; reviewed primary studies for references; and provided a summary table of the included studies. Only about 31% reported that their search was unrestricted by language. Approximately one in five of the reviews reported contacting authors, with a mean author response rate of 57% (range 16–91%) (Fig. 5, Online Resource 6).

The majority of the reviews reported assessing the risk of bias of the included studies (69%), but only one-third provided readers with an overall statement regarding the risk of bias of the included studies. When available, this information was reported in the abstract of the study in 7% of the cases and in the conclusion of the manuscript in 22%. Assessment of publication bias was reported by 40% of authors; the most common method was use of funnel plots. When a conclusion about publication bias was made, most (87%) were found to have insignificant publication bias.

The statistical method used for the diagnostic meta-analysis was not reported or was unclear in 8 (10%) of the reviews. Adequate methods of statistical analysis were reported in 16 (20%) of the meta-analyses. Other methods were reported in 70% of the reviews; the most common description was “random effects model”. Pre-specified subgroup analyses were reported in 34% of the studies. The majority of the reviews assessed for heterogeneity (77%). Most commonly, the Cochrane’s Q or the I ² statistic method was used; and 86% of those with a conclusion reported significant or high heterogeneity. Exploration of heterogeneity was performed in 38% of meta-analyses mostly through subgroup analyses or meta-regression. Results changed in about half of the reviews that reported the results of exploring heterogeneity (14/30).

Discussion

We performed a systematic evaluation of reviews and meta-analyses evaluating diagnostic tests in endocrinology to determine the clinical usefulness of the included tests and the credibility of the syntheses. Most of the included reviews focused on glycemia/diabetes and thyroid conditions. Clinically, ~50% of the diagnostic tests had a positive likelihood ratio equal to or greater than 10, significantly changing the probability of disease and helping clinicians rule in a disease if the test is positive. About 25% of the tests had a negative likelihood ratio ≤0.10, and can significantly decrease the probability of disease and help clinicians rule out a disease process. These diagnostic tests (Online Resources 2 and 3) can therefore be considered among the most useful tests in day-to-day endocrine clinical practice. This conclusion, however, is limited by the quality of information regarding the risk of bias of the included studies.

Likelihood ratios were used as summary statistics in 43% of the reviews and applied in a Fagan Nomogram in only one. As in previous evaluations, the most common measures used to report diagnostic accuracy were sensitivity and specificity [96]. Studies using hypothetical scenarios have shown that physicians integrate diagnostic performance information into their clinical assessment more accurately when it is provided as a likelihood ratio described in non-technical wording [8] and that probability modifying plots and natural frequency trees might help clinicians more accurately interpret diagnostic test results [97].

Empirical evidence suggests that characteristics of the sampled population can bias the results of evaluations of diagnostic test performance [98, 99]. About half of systematic reviewers reported only including primary studies with patients with diagnostic uncertainty and one-third reported the prevalence of the condition in the study population. These findings limit clinician’s abilities to assess the applicability of findings of SRs in their clinical settings.

Although most reviews reported credible methods for study identification and selection, included a clear gold standard, and evaluated the risk of bias of the studies, a limited number used appropriate statistical methods, contacted authors, or provided an overall summary of the risk of bias of the included studies. In 2006, a SR that evaluated 89 reviews of diagnostic tests in oncology found that reviews commonly defined inclusion criteria, provided a summary of the included studies, and reported performing crucial steps in duplicate. Quality assessment of the included studies was performed in 61% of the studies and a formal assessment provided in only 30% [6]. Our results, in a different field of medicine and 10 years later, are quite similar.

The number of reviews of diagnostic tests in endocrinology that reported author contact (~20%) to obtain primary information was lower than in SRs in general (50–85%) [100], though this procedure can lead to changes in the estimates of clinical utility of tests [101]. In a study of 114 SRs, half of the authors conducted an explicit assessment of the quality of the individual studies that were included and in 64 of the cases the results of these evaluation were presented in a table [9]. We found that although most reviews evaluated the quality of the included studies, most failed to provide an overall statement for the reader and a minority (7%) reported this information in the abstract. In addition, the majority of the studies included some description of statistical methods used, but only 20% used bivariate or hierarchical models, recommended by experts for diagnostic meta-analyses. These methods are preferred because they simultaneously summarize sensitivity and specificity [10, 102–104].

SRs and meta-analyses should include assessments of the body of evidence based on the study design and risk of bias of the included studies, indirectness, inconsistency across results, imprecision, and risk of publication bias [105]. We found that most diagnostic SRs in endocrinology fail to provide an overall statement about study quality, do not commonly evaluate the risk of publication bias, explore inconsistencies, or provide an overall assessment of the confidence in the evidence. This limits the results of these reviews to a summary estimate without assessing the confidence merited by the results or providing actionable insight to improve the quality of the body of evidence (identification of knowledge gaps and areas for research).

Diagnostic SRs are critical in clinical practice to: (1) identify the most useful clinical tests and (2) help identify areas where further research is needed. These benefits can be hampered if the methodology used to perform and report these reviews is not adequate. Our review suggests multiple areas where authors of SRs of diagnostic tests could improve such as: author contact, exploration of heterogeneity (e.g., sensitivity analysis), use of advanced statistical methods, reporting of the overall risk of bias of included studies, and assessing the confidence in the body of the evidence.

We performed subgroup analyses to identify those studies that would be more useful in clinical practice; however, this was limited to the included tests (useful tests for which a SR has not been done are missing) and those that provided information that allowed us to calculate a likelihood ratio. Diagnostic accuracy estimates and risk of bias assessment are just a few of the components that can be used to determine if a test would actually benefit a patient. Diagnostic test studies are not usually linked to a patient important outcome; they provide only indirect evidence of potential benefits that depend on other factors such as: availability of treatment for a potential disease, patient context, and so on [105].

This SR audit summarizes the accuracy of diagnostic tests in endocrinology, the reporting of these reviews, and helps to identify knowledge gaps in diagnosis. Several limitations exist to the application of our results in clinical practice. First, our search may have missed SRs not indexed under endocrinology. Second, we did not assess the quality of studies included in SRs and relied (when available) on authors’ assessments of quality for our analysis. Importantly, while best methods to perform and report a SR and meta-analysis of healthcare interventions are delineated by PRISMA [106], best standards for SRs of diagnostic tests may differ [107].

Conclusions

Almost half of the tests in which a LR could be calculated or was provided produced significant variations in the pre-test probability of disease and are very likely to be helpful when positive; ~25% of the tests are very likely to be helpful when negative. In general, most reviews of diagnostic tests in endocrinology followed acceptable general methods for study identification, screening, and extraction. Most of the reviews reported evaluating the risk of bias (rarely providing an overall statement), but only 23% contacted authors and 20% used adequate statistical methods. As a result, the overall confidence in the diagnostic estimates provided by these studies is limited.

Progress in the field of diagnostic tests in endocrinology should be supported by standardized methods and reporting for SRs and meta-analyses. These standards should include use of adequate statistical methods and overall statements about the confidence in the body of evidence evaluating these tests. These evidence summaries should provide evidence to help clinicians and patients discuss the usefulness of the tests and the trustworthiness in the evidence producing these estimates.

References

T.C. Hoffmann, C. Del Mar: Clinicians’ expectations of the benefits and harms of treatments, screening, and tests: a systematic review. JAMA Intern. Med. (2017). doi:10.1001/jamainternmed.2016.8254
Article PubMed Google Scholar
J.R. Ball, E. Balogh, Improving diagnosis in health care: highlights of a report from the national academies of sciences, engineering, and medicine. Ann. Intern. Med. 164(1), 59–61 (2016). doi:10.7326/m15-2256
Article PubMed Google Scholar
P. Tricoci, J.M. Allen, J.M. Kramer, R.M. Califf, S.C. Smith Jr., Scientific evidence underlying the ACC/AHA clinical practice guidelines. J. Am. Med. Assoc. 301(8), 831–841 (2009). doi:10.1001/jama.2009.205
Article CAS Google Scholar
N. Singh Ospina, R. Rodriguez-Gutierrez, J.P. Brito, W.F. Young Jr., V.M. Montori, Is the endocrine research pipeline broken? A systematic evaluation of the endocrine society clinical practice guidelines and trial registration. BMC Med. 13, 187 (2015). doi:10.1186/s12916-015-0435-z
Article PubMed Central PubMed Google Scholar
L. Ge, J.C. Wang, J.L. Li, L. Liang, N. An, X.T. Shi, Y.C. Liu, J.H. Tian, The assessment of the quality of reporting of systematic reviews/meta-analyses in diagnostic tests published by authors in China. PLoS ONE 9(1), e85908 (2014). doi:10.1371/journal.pone.0085908
Article CAS PubMed Central PubMed Google Scholar
S. Mallett, J.J. Deeks, S. Halligan, S. Hopewell, V. Cornelius, D.G. Altman, Systematic reviews of diagnostic tests in cancer: review of methods and reporting. Br. Med. J. 333(7565), 413 (2006). doi:10.1136/bmj.38895.467130.55
Article Google Scholar
T. McGinn, P.C. Wyer, T.B. Newman, S. Keitz, R. Leipzig, G.G. For, Tips for learners of evidence-based medicine: 3. Measures of observer variability (kappa statistic). Can. Med. Assoc. J. 171(11), 1369–1373 (2004). doi:10.1503/cmaj.1031981
Article Google Scholar
L.M. Bachmann, D.B. Bischof, S.A. Bischofberger, M.G. Bonani, F.M. Osann, J. Steurer, Systematic quantitative overviews of the literature to determine the value of diagnostic tests for predicting acute appendicitis: study protocol. BMC Surg. 2, 2 (2002)
Article PubMed PubMed Central Google Scholar
P. Whiting, A.W. Rutjes, J. Dinnes, J.B. Reitsma, P.M. Bossuyt, J. Kleijnen, A systematic review finds that diagnostic reviews fail to incorporate quality despite available tools. J. Clin. Epidemiol. 58(1), 1–12 (2005). doi:10.1016/j.jclinepi.2004.04.008
Article PubMed Google Scholar
R.M. Harbord, P. Whiting, J.A. Sterne, M. Egger, J.J. Deeks, A. Shang, L.M. Bachmann, An empirical comparison of methods for meta-analysis of diagnostic accuracy showed hierarchical models are necessary. J. Clin. Epidemiol. 61(11), 1095–1103 (2008). doi:10.1016/j.jclinepi.2007.09.013
Article PubMed Google Scholar
M. Gotthardt, B. Lohmann, T.M. Behr, A. Bauhofer, C. Franzius, M.L. Schipper, M. Wagner, H. Hoffken, H. Sitter, M. Rothmund, K. Joseph, C. Nies, Clinical value of parathyroid scintigraphy with technetium-99m methoxyisobutylisonitrile: discrepancies in clinical data and a systematic metaanalysis of the literature. World J. Surg. 28(1), 100–107 (2004)
Article PubMed Google Scholar
S. Nayak, I. Olkin, H. Liu, M. Grabe, M.K. Gould, I.E. Allen, D.K. Owens, D.M. Bravata, Meta-analysis: accuracy of quantitative ultrasound for identifying patients with osteoporosis. Ann. Intern. Med. 144(11), 832–841 (2006)
Article PubMed Google Scholar
C. Caldarella, G. Treglia, M.A. Isgro, A. Giordano, Diagnostic performance of positron emission tomography using 11C-methionine in patients with suspected parathyroid adenoma: a meta-analysis. Endocrine 43(1), 78–83 (2013). doi:10.1007/s12020-012-9746-4
Article CAS PubMed Google Scholar
C. Caldarella, G. Treglia, A. Pontecorvi, A. Giordano, Diagnostic performance of planar scintigraphy using 99mTc-MIBI in patients with secondary hyperparathyroidism: a meta-analysis. Ann. Nucl. Med. 26(10), 794–803 (2012). doi:10.1007/s12149-012-0643-y
Article PubMed Google Scholar
K. Cheung, T.S. Wang, F. Farrokhyar, S.A. Roman, J.A. Sosa, A meta-analysis of preoperative localization techniques for patients with primary hyperparathyroidism. Ann. Surg. Oncol. 19(2), 577–583 (2012). doi:10.1245/s10434-011-1870-5
Article PubMed Google Scholar
W.J. Wei, C.T. Shen, H.J. Song, Z.L. Qiu, Q.Y. Luo, Comparison of SPET/CT, SPET and planar imaging using 99mTc-MIBI as independent techniques to support minimally invasive parathyroidectomy in primary hyperparathyroidism: a meta-analysis. Hell. J. Nucl. Med. 18(2), 127–135 (2015). doi:10.1967/s002449910207
Article PubMed Google Scholar
K.K. Wong, L.M. Fig, M.D. Gross, B.A. Dwamena, Parathyroid adenoma localization with 99mTc-sestamibi SPECT/CT: a meta-analysis. Nucl. Med. Commun. 36(4), 363–375 (2015). doi:10.1097/MNM.0000000000000262
Article CAS PubMed Google Scholar
J. Windeler, J. Kobberling, The fructosamine assay in diagnosis and control of diabetes mellitus scientific evidence for its clinical usefulness? J. Clin. Chem. Clin. Biochem. 28(3), 129–138 (1990)
CAS PubMed Google Scholar
J.E. Jensen, S.H. Nielsen, L. Foged, S.N. Holmegaard, E. Magid, The MICRAL test for diabetic microalbuminuria: predictive values as a function of prevalence. Scand. J. Clin. Lab. Invest. 56(2), 117–122 (1996)
Article CAS PubMed Google Scholar
A.L. Peters, M.B. Davidson, D.L. Schriger, V. Hasselblad, A clinical approach for the diagnosis of diabetes mellitus: an analysis using glycosylated hemoglobin levels. Metaanalysis Research Group on the Diagnosis of Diabetes Using Glycated Hemoglobin Levels. [Erratum appears in JAMA 1997 Apr 9;277(14):1125]. J. Am. Med. Assoc. 276(15), 1246–1252 (1996)
Article CAS Google Scholar
E.A. Whitsel, E.J. Boyko, D.S. Siscovick, Reassessing the role of QTc in the diagnosis of autonomic failure among patients with diabetes: a meta-analysis. Diabetes Care 23(2), 241–247 (2000)
Article CAS PubMed Google Scholar
B. Ewald, J. Attia, Which test to detect microalbuminuria in diabetic patients? A systematic review. Aust. Fam. Physician 33(7), 565–567, 571 (2004).
PubMed Google Scholar
G. Virgili, F. Menchini, A.F. Dimastrogiovanni, E. Rapizzi, U. Menchini, F. Bandello, R.G. Chiodini, Optical coherence tomography versus stereoscopic fundus photography or biomicroscopy for diagnosing diabetic macular edema: a systematic review. Invest. Ophthalmol. Vis. Sci. 48(11), 4963–4973 (2007)
Article PubMed Google Scholar
M.T. Dinh, C.L. Abad, N. Safdar, Diagnostic accuracy of the physical examination and imaging tests for osteomyelitis underlying diabetic foot ulcers: meta-analysis. Clin. Infect. Dis. 47(4), 519–527 (2008). doi:10.1086/590011
Article PubMed Google Scholar
Y. Feng, F.J. Schlosser, B.E. Sumpio, The Semmes Weinstein monofilament examination as a screening tool for diabetic peripheral neuropathy. J. Vasc. Surg. 50(3), 675–682, 682.e671 (2009). doi:10.1016/j.jvs.2009.05.017.
Article PubMed Google Scholar
B.A. Blomberg, M.C. Moghbel, B. Saboury, C.A. Stanley, A. Alavi, The value of radiologic interventions and 18F-DOPA PET in diagnosing and localizing focal congenital hyperinsulinism: systematic review and meta-analysis. Mol. Imaging Biol. 15(1), 97–105 (2013). doi:10.1007/s11307-012-0572-0
Article PubMed Google Scholar
Y. Ye, H. Xie, X. Zhao, S. Zhang, The oral glucose tolerance test for the diagnosis of diabetes mellitus in patients during acute coronary syndrome hospitalization: a meta-analysis of diagnostic test accuracy. Cardiovasc. Diabetol. 11, 155 (2012). doi:10.1186/1475-2840-11-155
Article PubMed Central PubMed Google Scholar
S.T. Tang, Q. Zhang, C.J. Wang, H.Q. Tang, T.X. Wu, [Glycosylated hemoglobin A1c for the diagnosis of diabetes mellitus: a meta-analysis]. Chung-Hua Nei Ko Tsa Chih Chinese. J. Intern. Med. 52(1), 21–25 (2013)
Article CAS Google Scholar
Q.W. Tian, C. Xuan, H.W. Wang, J.X. Zhao, W.L. Yu, G. Gao, B.B. Zhang, L.M. Lun, Diagnostic accuracy of glycosylated hemoglobin in Chinese patients with gestational diabetes mellitus: a meta-analysis based on 2,812 patients and 5,918 controls. Genet. Test. Mol. Biomarkers 17(9), 687–695 (2013). doi:10.1089/gtmb.2013.0099
Article CAS PubMed Google Scholar
S. Yan, S. Liu, Y. Zhao, W. Zhang, X. Sun, J. Li, F. Jiang, J. Ju, N. Lang, Y. Zhang, W. Zhou, Q. Li, Diagnostic accuracy of HbA1c in diabetes between Eastern and Western. Eur. J. Clin. Invest. 43(7), 716–726 (2013). doi:10.1111/eci.12098
Article CAS PubMed Google Scholar
J. Yang, R. Hao, X. Zhu, Diagnostic role of 18F-dihydroxyphenylalanine positron emission tomography in patients with congenital hyperinsulinism: a meta-analysis. Nucl. Med. Commun. 34(4), 347–353 (2013). doi:10.1097/MNM.0b013e32835e6ac6
Article CAS PubMed Google Scholar
G. Hirschfeld, Mv Glischinski, M. Blankenburg, B. Zernikow, Screening for peripheral neuropathies in children with diabetes: a systematic review. Pediatrics 133(5), e1324–e1330 (2014). doi:10.1542/peds.2013-3645
Article PubMed Google Scholar
X. Su, Z. Zhang, X. Qu, Y. Tian, G. Zhang, Hemoglobin A1c for diagnosis of postpartum abnormal glucose tolerance among women with gestational diabetes mellitus: diagnostic meta-analysis. PLoS ONE 9(7), e102144 (2014). doi:10.1371/journal.pone.0102144
Article CAS PubMed Central PubMed Google Scholar
A. Tsapas, A. Liakos, P. Paschos, T. Karagiannis, E. Bekiari, N. Tentolouris, P. Boura, A simple plaster for screening for diabetic neuropathy: a diagnostic test accuracy systematic review and meta-analysis. Metabolism 63(4), 584–592 (2014). doi:10.1016/j.metabol.2013.11.019
Article CAS PubMed Google Scholar
H.Y. Wu, Y.S. Peng, C.K. Chiang, J.W. Huang, K.Y. Hung, K.D. Wu, Y.K. Tu, K.L. Chien, Diagnostic performance of random urine samples using albumin concentration vs ratio of albumin to creatinine for microalbuminuria screening in patients with diabetes mellitus: a systematic review and meta-analysis. JAMA Intern. Med. 174(7), 1108–1115 (2014). doi:10.1001/jamainternmed.2014.1363
Article CAS PubMed Google Scholar
N. Xu, H. Wu, D. Li, J. Wang, Diagnostic accuracy of glycated hemoglobin compared with oral glucose tolerance test for diagnosing diabetes mellitus in Chinese adults: a meta-analysis. Diabetes Res. Clin. Pract. 106(1), 11–18 (2014). doi:10.1016/j.diabres.2014.04.010
Article CAS PubMed Google Scholar
L. Shi, H. Wu, J. Dong, K. Jiang, X. Lu, J. Shi, Telemedicine for detecting diabetic retinopathy: a systematic review and meta-analysis. Br. J. Ophthalmol. 99(6), 823–831 (2015). doi:10.1136/bjophthalmol-2014-305631
Article PubMed Google Scholar
G. Treglia, F. Bertagna, R. Sadeghi, F.A. Verburg, L. Ceriani, L. Giovanella, Focal thyroid incidental uptake detected by 18F-fluorodeoxyglucose positron emission tomography. Meta-analysis on prevalence and malignancy risk. Nucl. Med. 52(4), 130–136 (2013). doi:10.3413/Nukmed-0568-13-03
Article CAS Google Scholar
S.R. Puli, N. Kalva, M.L. Bechtold, S.R. Pamulaparthy, M.D. Cashman, N.C. Estes, R.H. Pearl, F.H. Volmar, S. Dillon, M.F. Shekleton, D. Forcione, Diagnostic accuracy of endoscopic ultrasound in pancreatic neuroendocrine tumors: a systematic review and meta analysis. World J. Gastroenterol. 19(23), 3678–3684 (2013). doi:10.3748/wjg.v19.i23.3678
Article PubMed Central PubMed Google Scholar
X. Yang, Y. Yang, Z. Li, C. Cheng, T. Yang, C. Wang, L. Liu, S. Liu, Diagnostic value of circulating chromogranin a for neuroendocrine tumors: a systematic review and meta-analysis. PLoS ONE 10(4), e124884 (2015). doi:10.1371/journal.pone.0124884
Article CAS Google Scholar
G.W. Boland, M.J. Lee, G.S. Gazelle, E.F. Halpern, M.M. McNicholas, P.R. Mueller, Characterization of adrenal masses using unenhanced CT: an analysis of the CT literature. Am. J. Roentgenol. 171(1), 201–204 (1998)
Article CAS Google Scholar
R.I. Dorin, C.R. Qualls, L.M. Crapo, Diagnosis of adrenal insufficiency. Ann. Intern. Med. 139(3), 194–204 (2003)
Article PubMed Google Scholar
M.B. Elamin, M.H. Murad, R. Mullan, D. Erickson, K. Harris, S. Nadeem, R. Ennis, P.J. Erwin, V.M. Montori, Accuracy of diagnostic tests for Cushing’s syndrome: a systematic review and metaanalyses. J. Clin. Endocrinol. Metab. 93(5), 1553–1562 (2008). doi:10.1210/jc.2008-0139
Article CAS PubMed Google Scholar
R. Kazlauskaite, A.T. Evans, C.V. Villabona, T.A. Abdu, B. Ambrosi, A.B. Atkinson, C.H. Choi, R.N. Clayton, C.H. Courtney, E.N. Gonc, M. Maghnie, S.R. Rose, S.G. Soule, K. Tordjman; Adrenal, I.C.f.E.o.C.T.i.H.-P., Corticotropin tests for hypothalamic-pituitary-adrenal insufficiency: a metaanalysis. J. Clin. Endocrinol. Metab. 93(11), 4245–4253 (2008). doi:10.1210/jc.2008-0710
Article CAS PubMed Google Scholar
T. Carroll, H. Raff, J.W. Findling, Late-night salivary cortisol for the diagnosis of Cushing syndrome: a meta-analysis. Endocr. Pract. 15(4), 335–342 (2009). doi:10.4158/EP09023OR
Article PubMed Google Scholar
A.F. Jacobson, H. Deng, J. Lombard, H.J. Lessig, R.R. Black, 123I-meta-iodobenzylguanidine scintigraphy for the detection of neuroblastoma and pheochromocytoma: results of a meta-analysis. J. Clin. Endocrinol. Metab. 95(6), 2596–2606 (2010). doi:10.1210/jc.2009-2604
Article CAS PubMed Google Scholar
G.W. Boland, B.A. Dwamena, M.J. Sangwaiya, A.G. Goehler, M.A. Blake, P.F. Hahn, J.A. Scott, M.K. Kalra, Characterization of adrenal masses by using FDG PET: a systematic review and meta-analysis of diagnostic test performance. Radiology 259(1), 117–126 (2011). doi:10.1148/radiol.11100569
Article PubMed Google Scholar
A. Hazem, M.B. Elamin, G. Malaga, I. Bancos, Y. Prevost, C. Zeballos-Palacios, E.R. Velasquez, P.J. Erwin, N. Natt, V.M. Montori, M.H. Murad, The accuracy of diagnostic tests for GH deficiency in adults: a systematic review and meta-analysis. Eur. J. Endocrinol. 165(6), 841–849 (2011). doi:10.1530/EJE-11-0476
Article CAS PubMed Google Scholar
S. Iliodromiti, T.W. Kelsey, R.A. Anderson, S.M. Nelson, Can anti-Mullerian hormone predict the diagnosis of polycystic ovary syndrome? A systematic review and meta-analysis of extracted data. J. Clin. Endocrinol. Metab. 98(8), 3332–3340 (2013). doi:10.1210/jc.2013-1393
Article CAS PubMed Google Scholar
V. Rufini, G. Treglia, P. Castaldi, G. Perotti, A. Giordano, Comparison of metaiodobenzylguanidine scintigraphy with positron emission tomography in the diagnostic work-up of pheochromocytoma and paraganglioma: a systematic review. Q. J. Nucl. Med. Mol. Imaging 57(2), 122–133 (2013)
CAS PubMed Google Scholar
Y. Shen, J. Zhang, Y. Zhao, Y. Yan, Y. Liu, J. Cai, Diagnostic value of serum IGF-1 and IGFBP-3 in growth hormone deficiency: a systematic review with meta-analysis. Eur. J. Pediatr. 174(4), 419–427 (2015). doi:10.1007/s00431-014-2406-3
Article CAS PubMed Google Scholar
C.F. Eustatia-Rutten, J.W. Smit, J.A. Romijn, E.Pvd Kleij-Corssmit, A.M. Pereira, M.P. Stokkel, J. Kievit, Diagnostic value of serum thyroglobulin measurements in the follow-up of differentiated thyroid carcinoma, a structured meta-analysis. Clin. Endocrinol. 61(1), 61–74 (2004)
Article CAS Google Scholar
V. Rufini, G. Treglia, F. Montravers, A. Giordano, Diagnostic accuracy of [18F]DOPA PET and PET/CT in patients with neuroendocrine tumors: a meta-analysis. Clin. Transl. Imaging 1(2), 111–122 (2013). doi:10.1007/s40336-013-0005-3
Article Google Scholar
L. Peng, M.J. Gu, Diagnostic value of conventional and ultrasound-guided fine-needle aspiration biopsy for thyroid nodules: a meta-analysis. [Chinese]. Acad. J. Second Mil. Med. Univ. 28(9), 968–972 (2007)
Google Scholar
Y. Peng, H.H. Wang, A meta-analysis of comparing fine-needle aspiration and frozen section for evaluating thyroid nodules. Diagn. Cytopathol. 36(12), 916–920 (2008). doi:10.1002/dc.20943
Article PubMed Google Scholar
P.G. Raijmakers, M.A. Paul, P. Lips, Sentinel node detection in patients with thyroid carcinoma: a meta-analysis. World J. Surg. 32(9), 1961–1967 (2008). doi:10.1007/s00268-008-9657-y
Article CAS PubMed Central PubMed Google Scholar
M.J. Dong, Z.F. Liu, K. Zhao, L.X. Ruan, G.L. Wang, S.Y. Yang, F. Sun, X.G. Luo, Value of 18F-FDG-PET/PET-CT in differentiated thyroid carcinoma with radioiodine-negative whole-body scan: a meta-analysis. Nucl. Med. Commun. 30(8), 639–650 (2009). doi:10.1097/MNM.0b013e32832dcfa7
Article PubMed Google Scholar
C. Stevens, J.K. Lee, M. Sadatsafavi, G.K. Blair, Pediatric thyroid fine-needle aspiration cytology: a meta-analysis. J. Pediatr. Surg. 44(11), 2184–2191 (2009). doi:10.1016/j.jpedsurg.2009.07.022
Article PubMed Google Scholar
J. Bojunga, E. Herrmann, G. Meyer, S. Weber, S. Zeuzem, M. Friedrich-Rust, Real-time elastography for the differentiation of benign and malignant thyroid nodules: a meta-analysis. Thyroid 20(10), 1145–1150 (2010). doi:10.1089/thy.2010.0079
Article PubMed Google Scholar
W. Iared, D.C. Shigueoka, J.C. Cristofoli, R. Andriolo, A.N. Atallah, S.A. Ajzen, O. Valente, Use of color Doppler ultrasonography for the prediction of malignancy in follicular thyroid neoplasms: systematic review and meta-analysis. J. Ultrasound Med. 29(3), 419–425 (2010)
Article PubMed Google Scholar
M.E. Miller, Q. Chen, D. Elashoff, E. Abemayor, M.S. John, Positron emission tomography and positron emission tomography-CT evaluation for recurrent papillary thyroid carcinoma: meta-analysis and literature review. Head Neck 33(4), 562–565 (2011). doi:10.1002/hed.21492
Article PubMed Google Scholar
D. Vriens, J.Hd Wilt, G.Jvd Wilt, R.T. Netea-Maier, W.J. Oyen, L.Fd Geus-Oei, The role of [18F]-2-fluoro-2-deoxy-d-glucose-positron emission tomography in thyroid nodules with indeterminate fine-needle aspiration biopsy: systematic review and meta-analysis of the literature. Cancer 117(20), 4582–4594 (2011). doi:10.1002/cncr.26085
Article PubMed Google Scholar
M. Bongiovanni, A. Spitale, W.C. Faquin, L. Mazzucchelli, Z.W. Baloch, The Bethesda system for reporting thyroid cytopathology: a meta-analysis. Acta Cytol. 56(4), 333–339 (2012). doi:10.1159/000339959
Article PubMed Google Scholar
X. Cheng, L. Bao, Z. Xu, D. Li, J. Wang, Y. Li, 18F-FDG-PET and 18F-FDG-PET/CT in the detection of recurrent or metastatic medullary thyroid carcinoma: a systematic review and meta-analysis. J. Med. Imaging Radiat. Oncol. 56(2), 136–142 (2012). doi:10.1111/j.1754-9485.2012.02344.x
Article PubMed Google Scholar
L.Ld Matos, A.B.D. Giglio, C.O. Matsubayashi, Md.L. Farah, A.D. Giglio, M.Ad.S. Pinhal, Expression of CK-19, galectin-3 and HBME-1 in the differentiation of thyroid lesions: systematic review and diagnostic meta-analysis. Diagn. Pathol. 7, 97 (2012). doi:10.1186/1746-1596-7-97
Article CAS PubMed Central PubMed Google Scholar
R. Tozzoli, M. Bagnasco, D. Giavarina, N. Bizzaro, TSH receptor autoantibody immunoassay in patients with Graves’ disease: improvement of diagnostic accuracy over different generations of methods. Systematic review and meta-analysis. Autoimmun. Rev. 12(2), 107–113 (2012). doi:10.1016/j.autrev.2012.07.003
Article CAS PubMed Google Scholar
L.M. Wu, H.Y. Gu, X.H. Qu, J. Zheng, W. Zhang, Y. Yin, J.R. Xu, The accuracy of ultrasonography in the preoperative diagnosis of cervical lymph node metastasis in patients with papillary thyroid carcinoma: a meta-analysis. Eur. J. Radiol. 81(8), 1798–1805 (2012). doi:10.1016/j.ejrad.2011.04.028
Article PubMed Google Scholar
S.A. Razavi, T.A. Hadduck, G. Sadigh, B.A. Dwamena, Comparative effectiveness of elastographic and B-mode ultrasound criteria for diagnostic discrimination of thyroid nodules: a meta-analysis. Am. J. Roentgenol. 200(6), 1317–1326 (2013). doi:10.2214/AJR.12.9215
Article Google Scholar
G. Treglia, C. Caldarella, E. Saggiorato, L. Ceriani, F. Orlandi, M. Salvatori, L. Giovanella, Diagnostic performance of (99m)Tc-MIBI scan in predicting the malignancy of thyroid nodules: a meta-analysis. Endocrine 44(1), 70–78 (2013). doi:10.1007/s12020-013-9932-z
Article CAS PubMed Google Scholar
N. Wang, H. Zhai, Y. Lu, Is fluorine-18 fluorodeoxyglucose positron emission tomography useful for the thyroid nodules with indeterminate fine needle aspiration biopsy? A meta-analysis of the literature. [Erratum appears in J Otolaryngol Head Neck Surg.2014;43():43]. J. Otolaryngol. Head Neck Surg. 42, 38 (2013). doi:10.1186/1916-0216-42-38
Article PubMed Central PubMed Google Scholar
B. Zhang, X. Ma, N. Wu, L. Liu, X. Liu, J. Zhang, J. Yang, T. Niu, Shear wave elastography for differentiation of benign and malignant thyroid nodules: a meta-analysis. J. Ultrasound Med. 32(12), 2163–2169 (2013). doi:10.7863/ultra.32.12.2163
Article PubMed Google Scholar
J.P. Brito, M.R. Gionfriddo, A.A. Nofal, K.R. Boehmer, A.L. Leppin, C. Reading, M. Callstrom, T.A. Elraiyah, L.J. Prokop, M.N. Stan, M.H. Murad, J.C. Morris, V.M. Montori, The accuracy of thyroid nodule ultrasound to predict thyroid cancer: systematic review and meta-analysis. J. Clin. Endocrinol. Metab. 99(4), 1253–1263 (2014). doi:10.1210/jc.2013-2928
Article CAS PubMed Google Scholar
M. Ghajarzadeh, F. Sodagari, M. Shakiba, Diagnostic accuracy of sonoelastography in detecting malignant thyroid nodules: a systematic review and meta-analysis. Am. J. Roentgenol. 202(4), W379–W389 (2014). doi:10.2214/AJR.12.9785
Article Google Scholar
L. Giovanella, G. Treglia, R. Sadeghi, P. Trimboli, L. Ceriani, F.A. Verburg, Unstimulated highly sensitive thyroglobulin in follow-up of differentiated thyroid cancer patients: a meta-analysis. J. Clin. Endocrinol. Metab. 99(2), 440–447 (2014). doi:10.1210/jc.2013-3156
Article CAS PubMed Google Scholar
G. Grani, A. Fumarola, Thyroglobulin in lymph node fine-needle aspiration washout: a systematic review and meta-analysis of diagnostic accuracy. J. Clin. Endocrinol. Metab. 99(6), 1970–1982 (2014). doi:10.1210/jc.2014-1098
Article CAS PubMed Google Scholar
Y. Jia, Y. Yu, X. Li, S. Wei, X. Zheng, X. Yang, J. Zhao, T. Xia, M. Gao, Diagnostic value of B-RAF(V600E) in difficult-to-diagnose thyroid nodules using fine-needle aspiration: systematic review and meta-analysis. Diagn. Cytopathol. 42(1), 94–101 (2014). doi:10.1002/dc.23044
Article PubMed Google Scholar
L. Li, B.D. Chen, H.F. Zhu, S. Wu, D. Wei, J.Q. Zhang, L. Yu, Comparison of pre-operation diagnosis of thyroid cancer with fine needle aspiration and core-needle biopsy: a meta-analysis. Asian Pac. J. Cancer Prev. 15(17), 7187–7193 (2014)
Article PubMed Google Scholar
Y.Y. Ma, X.G. Zhang, P. Paerhati, Y. Mu, N. Tieliewuhan, Value of ultrasonographic elastography in differential diagnosis of benign/malignant thyroid nodules in China: a meta-analysis. [Chinese]. Chin. J. Evid. Based Med. 14(5), 584–591 (2014). doi:10.7507/1672-2531.20140097
Article Google Scholar
N. Qu, L. Zhang, Z.W. Lu, W.J. Wei, Y. Zhang, Q.H. Ji, Risk of malignancy in focal thyroid lesions identified by (18)F-fluorodeoxyglucose positron emission tomography or positron emission tomography/computed tomography: evidence from a large series of studies. Tumour Biol. 35(6), 6139–6147 (2014). doi:10.1007/s13277-014-1813-4
Article CAS PubMed Google Scholar
B.S. Sheffield, H. Masoudi, B. Walker, S.M. Wiseman, Preoperative diagnosis of thyroid nodules using the Bethesda system for reporting thyroid cytopathology: a comprehensive review and meta-analysis. Expert Rev. Endocrinol. Metab. 9(2), 97–110 (2014). doi:10.1586/17446651.2014.887435
Article CAS PubMed Google Scholar
J. Sun, J. Cai, X. Wang, Real-time ultrasound elastography for differentiation of benign and malignant thyroid nodules: a meta-analysis. J. Ultrasound Med. 33(3), 495–502 (2014). doi:10.7863/ultra.33.3.495
Article PubMed Google Scholar
A. Wale, K.A. Miles, B. Young, C. Zammit, A. Williams, J. Quin, S. Dizdarevic, Combined (99m)Tc-methoxyisobutylisonitrile scintigraphy and fine-needle aspiration cytology offers an accurate and potentially cost-effective investigative strategy for the assessment of solitary or dominant thyroid nodules. Eur. J. Nuclear Med. Mol. Imaging 41(1), 105–115 (2014). doi:10.1007/s00259-013-2546-0
Article CAS Google Scholar
X. Wei, Y. Li, S. Zhang, M. Gao, Thyroid imaging reporting and data system (TI-RADS) in the diagnostic value of thyroid nodules: a systematic review. Tumour Biol. 35(7), 6769–6776 (2014). doi:10.1007/s13277-014-1837-9
Article PubMed Google Scholar
K. Wolinski, M. Szkudlarek, E. Szczepanek-Parulska, M. Ruchala, Usefulness of different ultrasound features of malignancy in predicting the type of thyroid lesions: a meta-analysis of prospective studies. Pol. Arch. Med. Wewn. 124(3), 97–104 (2014)
PubMed Google Scholar
D. Yu, Y. Han, T. Chen, Contrast-enhanced ultrasound for differentiation of benign and malignant thyroid lesions: meta-analysis. Otolaryngol. Head Neck Surg. 151(6), 909–915 (2014). doi:10.1177/0194599814555838
Article PubMed Google Scholar
Y. Zhang, Q. Zhong, X. Chen, J. Fang, Z. Huang, Diagnostic value of microRNAs in discriminating malignant thyroid nodules from benign ones on fine-needle aspiration samples. Tumour Biol. 35(9), 9343–9353 (2014). doi:10.1007/s13277-014-2209-1
Article CAS PubMed Google Scholar
F.J. Dong, M. Li, Y. Jiao, J.F. Xu, Y. Xiong, L. Zhang, H. Luo, Z.M. Ding, Acoustic radiation force impulse imaging for detecting thyroid nodules: a systematic review and pooled meta-analysis. Med. Ultrason. 17(2), 192–199 (2015). doi:10.11152/mu.2013.2066.172.hyr
Article CAS PubMed Google Scholar
S. Nell, J.W. Kist, T.P. Debray, Bd Keizer, T.Jv Oostenbrugge, I.H.B. Rinkes, G.D. Valk, M.R. Vriens, Qualitative elastography can replace thyroid nodule fine-needle aspiration in patients with soft thyroid nodules. A systematic review and meta-analysis. Eur. J. Radiol. 84(4), 652–661 (2015). doi:10.1016/j.ejrad.2015.01.003
Article PubMed Google Scholar
J.S. Pyo, J.H. Sohn, G. Kang, BRAF immunohistochemistry using clone VE1 is strongly concordant with BRAF(V600E) mutation test in papillary thyroid carcinoma. Endocr. Pathol. 26(3), 211–217 (2015). doi:10.1007/s12022-015-9374-7
Article CAS PubMed Google Scholar
L.R. Remonti, C.K. Kramer, C.B. Leitao, L.C. Pinto, J.L. Gross, Thyroid ultrasound features and risk of carcinoma: a systematic review and meta-analysis of observational studies. Thyroid 25(5), 538–550 (2015). doi:10.1089/thy.2014.0353
Article PubMed Central PubMed Google Scholar
V. Veer, S. Puttagunta, The role of elastography in evaluating thyroid nodules: a literature review and meta-analysis. Eur. Arch. Otorhinolaryngol. 272(8), 1845–1855 (2015). doi:10.1007/s00405-014-3155-7
Article PubMed Google Scholar
G.J. Zhou, M. Xiao, L.N. Zhao, J.G. Tang, L. Zhang, MicroRNAs as novel biomarkers for the differentiation of malignant versus benign thyroid lesions: a meta-analysis. Genet. Mol. Res. 14(3), 7279–7289 (2015). doi:10.4238/2015.July.3.3
Article CAS PubMed Google Scholar
G. Treglia, P. Castaldi, G. Rindi, A. Giordano, V. Rufini, Diagnostic performance of Gallium-68 somatostatin receptor PET and PET/CT in patients with thoracic and gastroenteropancreatic neuroendocrine tumours: a meta-analysis. Endocrine 42(1), 80–87 (2012). doi:10.1007/s12020-012-9631-1
Article CAS PubMed Google Scholar
P. Lin, M. Chen, B. Liu, S. Wang, X. Li, Diagnostic performance of shear wave elastography in the identification of malignant thyroid nodules: a meta-analysis. Eur. Radiol. 24(11), 2729–2738 (2014). doi:10.1007/s00330-014-3320-9
Article PubMed Google Scholar
L.M. Wu, X.X. Chen, Y.L. Li, J. Hua, J. Chen, J. Hu, J.R. Xu, On the utility of quantitative diffusion-weighted MR imaging as a tool in differentiation between malignant and benign thyroid nodules. Acad. Radiol. 21(3), 355–363 (2014). doi:10.1016/j.acra.2013.10.008
Article PubMed Google Scholar
H. Honest, K.S. Khan, Reporting of measures of accuracy in systematic reviews of diagnostic literature. BMC Health Serv. Res. 2, 4 (2002)
Article PubMed PubMed Central Google Scholar
Y. Ben-Shlomo, S.M. Collin, J. Quekett, J.A. Sterne, P. Whiting, Presentation of diagnostic information to doctors may change their interpretation and clinical management: a web-based randomised controlled trial. PLoS ONE 10(7), e0128637 (2015). doi:10.1371/journal.pone.0128637
Article CAS PubMed Central PubMed Google Scholar
J.G. Lijmer, B.W. Mol, S. Heisterkamp, G.J. Bonsel, M.H. Prins, J.H. van der Meulen, P.M. Bossuyt, Empirical evidence of design-related bias in studies of diagnostic tests. J. Am. Med. Assoc. 282(11), 1061–1066 (1999)
Article CAS Google Scholar
A.W. Rutjes, J.B. Reitsma, M. Di Nisio, N. Smidt, J.C. van Rijn, P.M. Bossuyt, Evidence of bias and variation in diagnostic accuracy studies. Can. Med. Assoc. J. 174(4), 469–476 (2006). doi:10.1503/cmaj.050090
Article Google Scholar
R.J. Mullan, D.N. Flynn, B. Carlberg, I.M. Tleyjeh, C.C. Kamath, M.L. LaBella, P.J. Erwin, G.H. Guyatt, V.M. Montori, Systematic reviewers commonly contact study authors but do so with limited rigor. J. Clin. Epidemiol. 62(2), 138–142 (2009). doi:10.1016/j.jclinepi.2008.08.002
Article PubMed Google Scholar
S.S. Selph, A.D. Ginsburg, R. Chou, Impact of contacting study authors to obtain additional data for systematic reviews: diagnostic accuracy studies for hepatic fibrosis. Syst. Rev. 3, 107 (2014). doi:10.1186/2046-4053-3-107
Article PubMed Central PubMed Google Scholar
M.M.G. Leeflang, A.W.S. Rutjes, J.B. Reitsma, L. Hooft, P.M.M. Bossuyt, Variation of a test’s sensitivity and specificity with disease prevalence. Can. Med. Assoc. J. 185(11), E537–E544 (2013). doi:10.1503/cmaj.121286
Article Google Scholar
Y. Takwoingi, B. Guo, R.D. Riley, J.J. Deeks Performance of methods for meta-analysis of diagnostic test accuracy with few studies or sparse data. Stat. Methods Med. Res. (2015). doi:10.1177/0962280215592269
Article PubMed PubMed Central Google Scholar
E.A. Ochodo, J.B. Reitsma, P.M. Bossuyt, M.M.G. Leeflang, Survey revealed a lack of clarity about recommended methods for meta-analysis of diagnostic accuracy data. J. Clin. Epidemiol. 66(11), 1281–1288 (2013). doi:10.1016/j.jclinepi.2013.05.015
Article PubMed Google Scholar
H.J. Schunemann, A.D. Oxman, J. Brozek, P. Glasziou, R. Jaeschke, G.E. Vist, J.W. Williams Jr., R. Kunz, J. Craig, V.M. Montori, P. Bossuyt, G.H. Guyatt, G.W. Group, Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. Br. Med. J. 336(7653), 1106–1110 (2008). doi:10.1136/bmj.39500.677199.AE
Article Google Scholar
A. Liberati, D.G. Altman, J. Tetzlaff, C. Mulrow, P.C. Gotzsche, J.P. Ioannidis, M. Clarke, P.J. Devereaux, J. Kleijnen, D. Moher, The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J. Clin. Epidemiol. 62(10), e1–e34 (2009). doi:10.1016/j.jclinepi.2009.06.006
Article PubMed Google Scholar
W.L. Deville, F. Buntinx, L.M. Bouter, V.M. Montori, H.C. de Vet, D.A. van der Windt, P.D. Bezemer, Conducting systematic reviews of diagnostic studies: didactic guidelines. BMC Med. Res. Methodol. 2, 9 (2002)
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

G.S.B. was supported by CTSA Grant Number TL1TR000137 from the National Center for Advancing Translational Science (NCATS) and Grant Number 3R01HL131535-01S1 from the National Heart Lung and Blood Institute (NHLBI). V.M.M. was partially supported by Grant Number UL1TR000135 from the NCATS, a component of the National Institutes of Health (NIH). Its contents are solely the responsibility of the author and do not necessarily represent the official view of the NIH.

Author information

Authors and Affiliations

Knowledge and Evaluation Research Unit, Mayo Clinic, Rochester, MN, USA
Gabriela Spencer-Bonilla, Naykky Singh Ospina, Rene Rodriguez-Gutierrez, Juan P. Brito, Shrikant Tamhane, M. Hassan Murad & Victor M. Montori
School of Medicine, University of Puerto Rico Medical Sciences Campus, San Juan, PR, USA
Gabriela Spencer-Bonilla
Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, University of Florida, Gainesville, FL, USA
Naykky Singh Ospina
Division of Endocrinology, Department of Internal Medicine, University Hospital “Dr. Jose E. Gonzalez”, Autonomous University of Nuevo Leon, Monterrey, MX, USA
Rene Rodriguez-Gutierrez
Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, MN, USA
Juan P. Brito, Nicole Iñiguez-Ariza, Shrikant Tamhane & Victor M. Montori
Mayo Medical Library, Mayo Clinic, Rochester, MN, 55905, USA
Patricia J. Erwin
Division of Preventive, Occupational, and Aerospace Medicine, Mayo Clinic, Rochester, MN, USA
M. Hassan Murad

Authors

Gabriela Spencer-Bonilla
View author publications
You can also search for this author in PubMed Google Scholar
Naykky Singh Ospina
View author publications
You can also search for this author in PubMed Google Scholar
Rene Rodriguez-Gutierrez
View author publications
You can also search for this author in PubMed Google Scholar
Juan P. Brito
View author publications
You can also search for this author in PubMed Google Scholar
Nicole Iñiguez-Ariza
View author publications
You can also search for this author in PubMed Google Scholar
Shrikant Tamhane
View author publications
You can also search for this author in PubMed Google Scholar
Patricia J. Erwin
View author publications
You can also search for this author in PubMed Google Scholar
M. Hassan Murad
View author publications
You can also search for this author in PubMed Google Scholar
Victor M. Montori
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Victor M. Montori.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Naykky Singh Ospina and Gabriela Spencer-Bonilla contributed equally to this work.

Electronic supplementary material

Supplementary Information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Spencer-Bonilla, G., Singh Ospina, N., Rodriguez-Gutierrez, R. et al. Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance. Endocrine 57, 18–34 (2017). https://doi.org/10.1007/s12020-017-1298-1

Download citation

Received: 24 January 2017
Accepted: 01 April 2017
Published: 05 June 2017
Issue Date: July 2017
DOI: https://doi.org/10.1007/s12020-017-1298-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Systematic reviews of diagnostic tests in endocrinology: an audit of methods, reporting, and performance