Introduction

The optimal test to diagnose secondary adrenal insufficiency (AI) remains a matter of debate [1, 2]. Both, the low dose (1 µg) and the standard dose (250 µg) cosyntropin tests have been used for evaluation of adrenal function, and have their own proponents and critics, although several studies show similar diagnostic performance [1, 3, 4]. Oelkers et al reported a subcutaneous cosyntropin dose of 20–30 µg producing near maximum concentrations of cortisol over 1 h and Adrenocorticotropic hormone (ACTH) levels in the 200–350 pg/ml range [5]. This range of ACTH closely resembles levels achieved during the Insulin Tolerance Test (ITT) and Metyrapone tests [6, 7]. Accordingly, we had been using 25 µg cosyntropin dose IM for Cosyntropin Stimulation Tests (CST) in most patients in our Neuroendocrine clinic since 2004.

There have been many studies exploring the value of serum total cortisol during CSTs, however there are only a handful studies looking at the value of serum free cortisol measurement in an outpatient setting to assess secondary AI [8, 9].

We conducted a randomized, prospective pilot trial comparing the serum total and free cortisol responses of three different cosyntropin doses; 1 μg IV low dose cosyntropin test (LDCT), 25 μg IM medium dose cosyntropin test (MDCT), and 250 μg IM standard dose cosyntropin test (SDCT) to evaluate secondary AI. We used the results of ITT as our laboratory gold standard and assessed the total and free cortisol cut-off values during the CSTs and ITT. Since some studies suggest that the same cortisol cut-off cannot be used at 30 min and 60 min time points during the SDCT [10,11,12], we investigated the cut-offs at both time points separately for the SDCT.

Materials and methods

Patients and controls

The study had two groups of patients 18 to 65 years of age. The first group (n  = 10) included patients with hypothalamic/pituitary disease with at least one pituitary axis deficiency other than ACTH deficiency (see electronic supplementary material for diagnoses and exclusion criteria). The second group (n  = 12) comprised healthy volunteers. The three CSTs and ITT were done in random order. A minimum and maximum time interval of 48 h and 3 months were allowed in the study protocol to accommodate participants’ schedule.

Cortisol levels were measured at 30 and 60 min during CSTs. A peak cortisol cut-off of 18 μg/dL was used as the pass criterion during ITT. The result of total cortisol levels during ITT were used to determine free cortisol cut-offs during CSTs and ITT. None of the patients had pituitary surgery within 6 weeks prior to enrollment and none of the women were on estrogen. Patients had to be on at least 3 months of stable hormone replacement for hormone deficiencies. All controls had normal TSH, free T4 and prolactin levels. All premenopausal women in the control group had a history of regular, age appropriate menses and none took birth control pills within 3 months of study entry. Postmenopausal subjects had appropriately elevated FSH concentration. All men in the control group had normal FSH and total testosterone levels.

Study procedures

All participants were instructed to fast for 12 h prior to testing. Six patients were on glucocorticoid replacement and were instructed to hold their medication for 24 h prior to their testing. All the tests were conducted at the Clinical Research Unit on site at the Cleveland Clinic. The study was approved by the Cleveland Clinic Institutional Review Board. All subjects signed an informed consent.

Cosyntropin Stimulation Tests and ITT : see electronic supplementary material for procedure details.

Assays

The total and free cortisol were analyzed at Quest Diagnostics Nichols Institute, San Juan Capistrano, CA 92675, USA. Both serum free and total cortisol levels were measured using an isotope dilution mass spectrophotometric method. Briefly, total cortisol and serum free cortisol (from dialysate after separation by equilibrium dialysis) were directly measured using a two-dimensional separation system, including on-line extraction by high turbulence liquid chromatography followed by analytical liquid chromatography separation and detection by a tandem mass spectrometer. The limit of quantitation (LOQ) for the serum free cortisol was 0.03 µg/dL; intra-assay variation (CV) was 7.4 and 9.3 % at serum free cortisol of 0.36 and 2.20 µg/dL, respectively; inter-assay CV was 9.4 and 9.8% at serum free cortisol of 0.36 and 2.17 µg/dL, respectively. For total cortisol the LOQ was 0.1 µg/dL; intra-assay CV was 3.0 and 4.6% at total cortisol of 15.9 and 202.7 µg/dL, respectively; inter-assay CV was 7.9 and 6.6% at total cortisol of 15.2 and 189.9 µg/dL, respectively. Of 21 common steroid compounds tested for interference, at 200 µg/dL, only 20 alpha-dihydroxycortisol and 20 beta-hydroxycortisone were detected at 1.4 and 1.4%, respectively.

Statistical analysis

Patient groups were compared on categorical factors using Pearson χ 2 tests and Fisher exact tests. Continuous measures were evaluated using Wilcoxon rank sum tests. Sensitivity (SE) and specificity (SP) for each of the CSTs were estimated using optimal cut-offs from ROC curves, with respect to ITT as the gold standard. Youden’s Index, which identifies the largest combination of SE and SP for each test was used to identify the optimal cut-off for each test. Tests were compared on SE and SP using McNemar’s test. Pearson’s correlation coefficient was used to measure the strength of association of peak cortisol levels among pairs of testing methods. Ninety-five percent confidence intervals were produced for all the correlation, SE, and SP estimates. Comparisons of the correlations were performed using the methods by Meng, Rosenthal, and Rubin (1992) to compare the correlations and calculate 95% confidence intervals for the difference among the tests [13]. Lower bounds of these intervals above a zero value was considered statistically significant evidence that the 25-µg test is a better test, while lower bounds at least as high as −0.15 was used to determine non-inferiority at those respective margins. Analyses were performed using R software (version 3.1; Vienna, Austria) and SAS software (version .3; Cary, NC).

Results

The median (range) age for patients and volunteers was 54 (23–62) and 33 (21–51), respectively (p = 0.021). The F/M sex ratio for patients and volunteers were 2/8 and 6/6, respectively (p  = 0.20). The time interval between two consecutive tests was 10 days (2–48 days). While, one participant had 64 day interval between the first and last test, 80% of patient group and 84% of control group had all four tests completed within 30 days.

The peak total cortisol levels during ITT in patients and controls were 16.6 μg/dL (1.1–26.1) and 23.2 μg/dL (20.2–25.9), respectively. The peak total cortisol levels for patients and controls during the CSTs were as following: LDCT 14.5 μg/dL (2.2–28.4) and 23.4 μg/dL (16.7–31.9); MDCT 17.4 μg/dL (2.0–24.9) and 22.0 μg/dL (16.4–28.6); 30 min SDCT 16.1 μg/dL(1.8–25.8) and 22.3 μg/dL (18.8–27.8); 60 min SDCT 19.1 μg/dL (2.4–29.4) and 25.0 μg/dL (21.9–31.5), respectively (Fig. 1).

Fig. 1
figure 1

Boxplots show the relationships, by test, between patient groups on total cortisol. The thick line in the middle of the boxes reflects the median, while the box edges are the quartiles (interquartile range). Whiskers extend to the smallest and largest values within 1.5 times the interquartile range above the third quartile and below the first quartile

The best total cortisol cut-offs for diagnosis of secondary AI were 14.6 μg/dL (100% SE & SP) for LDCT, 18.7 μg/dL (100% SE, 88% SP) for MDCT, 16.1 μg/dL (100% SE & SP) for 30 min SDCT and 19.5 μg/dL (100% SE & SP) for 60 min SDCT. There was no difference in the ROC curve for cortisol values between the CSTs (p  > 0.41). Using a cortisol cut-off of 18 μg/dL as the pass criteria during CSTs, only the cortisol level at 30 min during SDCT provided discrimination similar to ITT.

All subjects had peak total cortisol at 60 min during the SDCT. During MDCT and LDCT, 81% and 50% of subjects had peak cortisol at 30 min, respectively. Pearson correlations for peak cortisol levels between the ITT and the CSTs is shown in Fig 1 in electronic supplementary material. Peak cortisol level during SDCT correlated better with ITT compared to MDCT (p  = 0.044). Among CSTs, LDCT had the lowest correlation with ITT, and was significantly lower than the SDCT (p  = 0.005), but not the MDCT (p = 0.32).

The peak free cortisol levels during ITT in patients and controls were 0.95  μg/dL (0.06–2.2) and 1.9 μg/dL (1.2–2.7), respectively. The peak free cortisol levels for patients and controls during CSTs were as following: LDCT 0.77 μg/dL (0.07–2.2) and 1.9 μg/dL (1.0–3.7); MDCT 0.91 μg/dL (0.05–1.7) and 1.8 μg/dL (0.74–3.0); 30 min SDCT 0.89 μg/dL (0.06–2.2) and 1.8 μg/dL (1.1–3.1); 60 min SDCT 1.3 μg/dL (0.06–2.6) and 2.3 μg/dL (1.7–3.8), respectively (Fig. 2).

Fig. 2
figure 2

Boxplots show the relationships, by test, between patient groups on free cortisol. The thick line in the middle of the boxes shows the median, while the box edges are the quartiles (interquartile range). Whiskers extend to the smallest and largest values within 1.5 times the interquartile range above the third quartile and below the first quartile. Responses outside these ranges are plotted as individual circles and represent potential outliers

The best peak free cortisol cut-offs were 1 μg/dL for ITT, 0.9 μg/dL for LDCT (100%SE and 100%SP), 0.9 μg/dL for MDCT (100%SE and 94%SP), and 0.9 μg/dL (100%SE and 100%SP) and 1.3 μg/dL (100%SE and 100%SP) for 30 min and 60 min SDCT, respectively. Pearson correlations for peak free cortisol levels between the ITT and the CSTs is shown in Fig 2 in electronic supplementary material. There was no difference in the area under the ROC curve for free cortisol values between the CSTs (p  = 0.36).

Discussion

We compared total and free cortisol levels during CSTs using three different doses of cosyntropin with ITT as the gold standard test in a randomized, prospective pilot trial. All CSTs performed well compared to ITT as long as appropriate cut-offs were used. Specifically, we did not find any advantage for using 25 μg cosyntropin dose compared to 1 and 250 μg doses.

Over the years, several different tests have been proposed to evaluate patients for secondary AI. The dilemma for an ideal test has been discussed in detail by Grossman in a review article [14]. The CST is used routinely to assess secondary AI, compared to ITT, since it is safe and relatively convenient. We found here, in patients with established hypothalamic pituitary pathology, CSTs correlated well with the ITT.

250 μg CST (SDCT)

One critique of the SDCT is that 250 µg of cosyntropin produces supraphysiologic ACTH levels in the 1000–60,000 pg/ml range [6, 15]. The concern is that adrenal cortex is maximally stimulated leading to false negative results in partial AI [16,17,18,19,20,21]. Case reports have been published with patients having AI based on clinical symptoms and abnormal ITT, who responded normally to the 250 µg dose [22, 23].

There is significant variability in the reported cut-off levels of serum cortisol during SDCT ranging from 18.1 µg /dL to 22.4 µg /dL [2, 3, 11, 24]. The limitations of the traditional cut-offs of SDCT in pregnant women have been discussed by Yuen et al. and higher cut-offs are recommended for that population [25]. Almost always, patients achieve higher cortisol levels at 60 min and therefore there is little evidence for the use of the same cortisol cut- off for both 30 and 60 min time points. In our study, both time points provided excellent diagnostic accuracy as long as appropriate cortisol cut-offs were used. In addition, a cortisol level of 18 μg /dL at 30 min during SDCT provided discrimination similar to the ITT. Accordingly, one may argue that 30 min cortisol value may be adequate during the SDCT. This is similar to the findings in the series of 200 patients which suggested to use the 30 min time point during the SDCT [10].

1 μg CST (LDCT)

The proponents of the 1 µg cosyntropin dose reported higher sensitivity in patients on chronic glucocorticoid treatment [16]. This has been supported by other investigators [1, 24, 26,27,28,29]. However, some studies have reported lower specificity with the 1 µg cosyntropin dose [30,31,32]. Fleseriu et al found abnormal cortisol levels in 23 out of 26 patients during the LDCT who had passed the SDCT. These patients were asymptomatic, without any glucocorticoid coverage, during a follow-up period that lasted as long as 2 years [33]. In addition, there is a concern for the reproducibility of 1 µg dose at different times of the day. Nieman et al reported 20 and 63% false positive results with LDCT in healthy volunteers who underwent the test at 0800 h or 1600 h, respectively [34].

Furthermore, the 1 µg cosyntropin is not available as a commercial preparation. It is prepared by diluting 250 µg vials, with the potential for dosing errors. Lower doses of cosyntropin stored in glass tubes have been shown to be unstable; and therefore, plastic tubes are recommended [16]. However, longer plastic tubing has also been reported as a factor for the lower specificity of LDCT due to loss of ACTH in the walls of the tubing [34].

Similar to SDCT, LDCT has been reported with wide variations in cut-off, ranging from 17 to 22  µg/dL (470–600 nmol/l) [2, 24, 32]. In the current study, LDCT provided 100% sensitivity and specificity as long as a lower cortisol cut-off of 14.6 µg /dL was used as the pass criterion, which suggests that the higher false positive results in some studies may be related to the use of a cortisol cut- off of 18 µg /dL, which is generally used as the pass criteria for the LDCT [32].

Other doses of cosyntropin used for CST

More than 50 years ago, Landon et al. reported that a much lower dose of cosyntropin can produce maximal adrenal stimulation [35]. There have been dose-response studies highlighting that lower doses of synthetic ACTH produce peak cortisol levels similar to the 250 µg dose [5, 16, 36, 37]. Contreras et al. reported that mean serum cortisol concentrations 30 min after 25 µg and 250 µg ACTH IM were not different (p  = 0.158) [38]. Similarly, Oelkers et al. showed that near maximum cortisol concentration is achieved following 20 to 30 µg (subcutaneous) cosyntropin administration [5]. Accordingly, we designed this pilot study to investigate whether the use of 25 µg cosyntropin dose would provide any advantage over 1 and 250 µg doses. We found that the peak cortisol level during SDCT correlated better with ITT compared to MDCT (p  = 0.044). In summary, the MDCT did not provide an advantage over LDCT and SDCT.

Free cortisol during CST and ITT

There is a sound pathophysiological basis for measurement of serum free cortisol rather than total cortisol level in patients with low binding proteins such as advanced liver disease, nephrotic syndrome or critically ill patients, or in states associated with elevated binding proteins such as pregnancy and in women taking estrogen [9, 39]. However, due to limited number of studies a widely accepted cut-off for free cortisol during CSTs or ITT, in an outpatient setting has not been established.

In a cross-sectional study of 295 outpatients, a free cortisol of 0.9 and 1.2 µg/dL at 30 and 60 min during SDCT was suggested as the pass criteria for the diagnosis of AI [8]. In another study of 60 healthy volunteers and 21 patients with pituitary disease, free cortisol levels of 1.1 µg/dL, 1.2 and 1.5 µg/dL were suggested as the pass criteria during LDCT, and at 30 and 60 min time points during SDCT, respectively [9]. Our study found similar free cortisol cut-off values of 0.9 µg/dL for LDCT and 30 min time point during SDCT, and 1.3 µg/dL for 60 min time point during SDCT. The lack of wide availability and long turnaround time are limitations for free cortisol in clinical practice.

There may be clinically significant differences in cortisol cut-offs when different assays are used. Serum cortisol values in this study were measured using a LCMS assay. In one study the total cortisol levels were 12% lower when measured by immunoassay compared to LCMS [9]. One may expect similar differences in free cortisol assays measured by LCMS.

This is the first, prospective, randomized study to compare MDCT to SDCT, LDCT, and ITT. However, our study has limitations. The number of participants is small, which limits understanding of potential relationships between total and free cortisol levels and other factors, such as age or sex. We excluded patients who may benefit the most from measurement of free cortisol levels such as those with liver disease, nephrotic syndrome, or women on estrogen or those who were pregnant, so as to establish baseline values in patients with pituitary disease and in normal controls. Larger studies are needed to establish the suggested cut-off levels for serum free cortisol.

Conclusion

The current pilot study does not support an advantage of the use of 25 mcg cosyntropin during CST. All three dose-schedules of the CSTs performed well and had excellent correlation with ITT as long as appropriate cut-offs were used. A free cortisol cut-off of 0.9 μg/dL may be used as the pass criteria during the LDCT, MDCT, and at 30 min time point during SDCT. A higher free cortisol cut-off of 1.3 μg/dL is required for 60 min time point during SDCT.