Introduction

Autism is a neurodevelopmental disorder with the hallmark symptoms of social interaction and communication difficulty and restricted and repetitive behaviors [1]. Autism can be further classified into classic autism, Asperger’s syndrome, and pervasive developmental disorder-not otherwise specified (PDD-NOS). A recent study by the Centers for Disease Control and Prevention (CDC) reported the prevalence of autism among children in the USA to be ~1.13 % [2]. In their review of epidemiological surveys for autism across the world, Elsabbagh et al. [3] reported the global mean prevalence for autism to be 62 per 10,000. Though considerable variability in autism prevalence was found across different regions of the world, the authors noted that autism prevalence did not differ significantly between continental America and Europe. Autism prevalence has been reported to be ~1 % in the UK [4]. Some parts of the world have been found to have a higher prevalence of autism than the USA. Using a population-based sample in Korea, Kim et al. [5] found the prevalence of autism among children aged 7–12 years to be 2.64 %. Notwithstanding the differences in case definition and ascertainment, these results generally indicate that autism may be more prevalent than commonly thought. In the 6-years period between 2002 and 2008, autism prevalence increased by 78 % in the USA [2]. Currently, autism is more prevalent than pediatric diabetes, cancer, and AIDS combined [6], thereby making it a significant public health concern. Autism is five times more common among males as compared to females [2]. The true cause of autism remains unknown; however, recent literature suggests the occurrence of the disorder to be a result of genetic predisposition coupled with an environmental trigger [7].

Since autism is a chronic disorder, the responsibility of providing constant care to children with the disorder falls on the caregivers (generally their parents). This takes a tremendous toll on the physical and emotional health of the caregivers. An abundance of literature has highlighted the negative impact of caring for a child with autism on caregivers’ health and well-being [812]. One autism caregiving outcome that has received increasing attention over the past few years is health-related quality of life (HRQOL) [1316]. An individual’s HRQOL encompasses his/her physical, psychological, and social health, and these domains may be influenced by the experiences and perceptions of the individual [17]. The HRQOL of caregivers not only affects their own personal health, but may also impact the quality of care that they provide to the child with autism. In their study of HRQOL among autism caregivers, Khanna et al. [15] found lower physical and mental health scores among caregivers of children with autism as compared to the general US adult population. Other studies have also reported lower HRQOL scores among caregivers of children with autism in comparison with the general adult population or parents of children with typical development [13, 14, 16]. These studies have been consistent in terms of their use of the Medical Outcomes Study (MOS) Short-Form Health Survey (SF-36 or SF-12) instrument for HRQOL measurement. The SF-36 and SF-12 are commonly used generic health profile instruments.

Although these studies provide useful information regarding the general health status among autism caregivers, limited literature currently exists regarding the preference-based scores for health outcomes among caregivers of these patients. Assessing health state preferences offers certain advantages as compared to the information gleaned from generic health profile measures [18]. Instead of assessing disease presence, absence, or symptom severity, preference-based measures determine an individual’s preference for a being in a particular health condition. Preference-based measures combine both positive and negative aspects of health state and present them as a single number that typically ranges from 0 (death) to 1 (perfect health). By providing a common unit, preference-based measures provide researchers the flexibility to make comparisons across different health populations. Preferences may be determined under conditions of certainty (without any underlying risk) or uncertainty (in the presence of risk). When measured using the former approach, the resulting preferences are termed ‘values,’ while those measured through the later are termed ‘utility.’ Preference values can be ascertained through approaches such as rating scales and time trade-off (TTO), whereas utilities are measured through the standard gamble approach. Preference values (or utilities) are used in calculating quality-adjusted life years (QALYs), which are central to economic evaluations such as cost-utility analysis [19]. Given the scarcity of healthcare resources, data on cost per QALY gained enable adequate resource allocation to treatments or interventions that provide the biggest gain. So unlike health status (as measured using SF-36 or SF-12), preference values and utilities can be used by policy makers or researchers in making decisions in the presence of resource limitations or under uncertain conditions [20].

Preference values and utilities have been traditionally used in patient outcomes assessment; however, they are considered to be equally important parameter in caregiving population. In an insightful piece of work examining the estimation of QALYs for caregivers, Al-Janabi et al. [21] make a strong argument as to why it is essential to ascertain QALYs among caregivers, even when the healthcare interventions are targeted toward the patients. The authors contend that economic evaluations that do not consider all QALY (including health) gains, including among those related to patients, may result in inadequate distribution of healthcare resources. In the context of caregiving population, preference data could be applied toward determining the usefulness of supporting interventions or programs. For example, when examining the cost-effectiveness of a Systemic Care Program for Dementia (SCPD) that focuses on dementia caregivers’ sense of competence against usual care, Spijker et al. [22] calculated the quality of caregivers life (QALY) using preference values elicited through EuroQol Five Dimensional Questionnaire (EQ-5D-3L) [23, 24]. Preference scores for caregivers are also needed in valuations of costs associated with informal care [25]. Changes in preference scores for caregivers may be useful in determining the true cost-effectiveness of a drug aimed at alleviating disease severity of the care recipient [26]. Estimation of health state preferences among caregivers serves a useful purpose and must be incorporated as a critical outcome measure in addition to patient healthcare assessment.

An individual’s preference value or utility can be assessed using generic preference-based measures such as the EuroQol Five Dimensional Questionnaire (EQ-5D-3L) [23, 24], Health Utilities Index (HUI-I, II, III) [27], Quality of Well-Being Scale (QWB) [28, 29], and Short Form-Six Dimensions (SF-6D) [30]. These measures have underlying differences with respect to the health dimensions captured, number of questions, and health state valuations [31]. One of the key differences among these instruments is in the valuation technique used to derive preferences for the health states that these instruments capture. The EQ-5D-3L health state valuations is based on TTO, while valuation for HUI and SF-6D is based on standard gamble. As a result, the preference values (or utilities) generated from these measures typically vary. Despite these differences, these measures have a distinct advantage over generic health profile measures in that they not only capture HRQOL information, but also summarize the HRQOL as a single number that typically ranges from a state of perfect health to death.

Given its simplicity and ease of use, the EQ-5D-3L is one of the most commonly used preference-based measures [32]. The EQ-5D-3L includes two components: descriptive system (EQ-5D-3L index) and visual analog scale (EQ-5D-3L VAS) [23, 24]. The EQ-5D-3L index provides information of an individual’s health profile in five domains, with each domain having three response options ranging from ‘no problem’ (1) to ‘extreme problem’ (3). A single summary index score can be derived from the EQ-5D-3L health states. The summary index score for EQ-5D-3L may range from −0.59 (worse than death) to 1.00 (perfect health) [33]. The EQ-5D-3L VAS assesses self-rating of health on a scale of 0–100.

Although the EQ-5D-3L has never been previously used in caregivers of autistic patients, it has been used to study caregiver outcomes in other neurological disorders across different regions of the world [3439]. Researchers typically use EQ-5D-3L to assess the health impact (HRQOL) of caregiving on caregivers and to study the relationship between HRQOL and other caregiving outcomes (burden, depression, etc.). For example, when studying the HRQOL of caregivers of individuals with Alzheimer’s disease, Serrano-Aguilar et al. [34] found higher frequency of problems in each of the five domains of EQ-5D-3L among caregivers as compared to the general population in Spain. The authors also found an inverse association between caregiver burden and caregivers’ HRQOL. In another similar study, Martinez-Martin et al. [35] found burden and depression scores to be inversely related to EQ-5D-3L index and VAS scores among caregivers of individuals with Parkinson’s disease. Several researchers have also tested the psychometric profile of EQ-5D-3L among patients with neurological and psychiatric disorders such as anxiety disorders, schizophrenia, and Parkinson’s disease [4047]. König et al. [43] found the EQ-5D-3L to have adequate construct validity and moderate responsiveness among patients with anxiety disorders. Given its low respondent burden and ease of administration, the authors suggested the inclusion of EQ-5D-3L as an assessment measure for overall HRQOL and health state valuations in patients with anxiety disorders. In a sample of patients with schizophrenia, Prieto et al. [40] found the EQ-5D-3L scores to be correlated with measures of patients’ clinical status. The EQ-5D-3L was found to be able to discern HRQOL differences among schizophrenia patients by their degree of severity of illness. These results, along with those of other such studies, reflect that the EQ-5D-3L instrument is a viable option to measure health outcomes across different patient/caregiver populations.

For the EQ-5D-3L to be used in the assessment of preference scores among autism caregivers, it is imperative that the psychometric profile of the instrument be tested in this population. The purpose of this study was to determine the psychometric properties of the EQ-5D-3L instrument among caregivers of children with autism. Specifically, we tested the convergent validity, discriminant validity, known-groups validity, and internal consistency reliability, and assessed floor and ceiling effects of the EQ-5D-3L instrument. Since preference scores in this caregiving population have not been previously determined, studying the psychometric properties of a commonly used preference instrument, that is, EQ-5D-3L, could provide confidence in its use for future research.

Methods

Study sample and procedure

This study was part of a larger project aimed at studying health utilities among caregivers of children with autism. Only the information relevant to study purpose is included in the current study. A cross-sectional online survey of primary caregivers of children with autism was conducted for data collection. Study participants were recruited from families registered with the Interactive Autism Network (IAN) in the USA. The IAN is an online autism community that enables collaboration between researchers and individuals and families affected by autism throughout the USA [48]. It is run by the Kennedy Krieger Institute and funded by Autism Speaks and the Simons Foundation. More than 38,000 participants are registered with the IAN, including over 14,000 individuals with autism. Through IAN portal, autism researchers can gain access to families of individuals with autism who meet study criteria and are willing to participate in research. Among families registered with the IAN, those having (1) a child with autism less than or equal to 18 years of age and (2) no more than one child diagnosed with autism were identified and emailed a cover letter. To protect member identity, the identification of families based on study criteria and cover letter mailing was conducted by the IAN. The cover letter explained the study purpose, emphasized voluntary participation, and included the survey link. Within each family, participation in the study was restricted to the primary caregiver, which was classified as the individual in the family who had the main responsibility for providing daily care to the child with autism. The online survey was administered via Qualtrics online survey software (Qualtrics Inc., Provo, UT). As a token of appreciation, a $15 Amazon gift card was offered to participants who completed the survey. The online survey was closed once the desired number of responses (set at 300) for the larger project was achieved. A total of 326 responses were received during the period May 7–11, 2012. The study received approval from the University of Mississippi (UM) Institutional Review Board (IRB) under exempt status.

The survey included four sections. There were other instruments included in the actual survey besides those described in the study. Since the other measures are not related to present study purpose, they have not been reported here. Section I consisted of instruments used in the assessment of caregiver health status (SF-12v2) [49] and caregiver burden [Caregiver Strain Questionnaire (CGSQ)] [50]. Section II consisted of an assessment of severity of autism in the child [Social Communication Questionnaire (SCQ)] [51]. Section III consisted of items aimed at assessing the relevant sociodemographic information of the caregiver and child with autism. Section IV consisted of the EQ-5D-3L instrument [23, 24]. An online (Web-based) version of the EQ-5D-3L instrument was used in the current study. Participants were re-directed after section III from the Qualtrics online system to the EuroQol Group server system to complete section IV (EQ-5D-3L) of the survey.

Study measures

EuroQol Questionnaire (EQ-5D-3L)

The EQ-5D-3L is a generic HRQOL instrument that includes the EQ-5D-3L index and EQ-5D-3L VAS [23, 24]. The EQ-5D-3L index has five dimensions assessing the domains of mobility, self-care, usual activities, pain/discomfort, and anxiety/depression. Each item has three possible responses: ‘1, no problems’; ‘2, moderate problems’; and ‘3, extreme problems.’ The response to the five items provides an indication of the degree to which an individual has problems in each domain. An individual’s health status can be described as a five-digit numeral, created by combining the response to the five items. A total of 35 (243) possible health states are captured by the EQ-5D-3L index. Using a set of population-based preference weights, an index score can be generated from the self-reported health states. For the current study, we used the scoring algorithm developed by Shaw et al. [52] to calculate the EQ-5D-3L index score. The preference-weighting system for EQ-5D-3L was generated using a representative sample of 4,048 adults 18 years of age or older in the USA. Through TTO exercises, Shaw et al. [52] established US population-based preference weights for the 243 unique health states of EQ-5D-3L. Index scores could range from −0.11 for health state ‘33333’ (worst health state) to 1.00 for health state ‘11111’ (perfect health state). Luo et al. [53] provided the EQ-5D-3L norms for self-reported health status of the general adult population in the USA.

The EQ-5D-3L VAS includes a vertical graph scale (thermometer shaped) with end points 0 (worst health status) and 100 (best health status). Participants were asked to choose a number from (inclusive) 0 to 100 that best describes their current health state.

Medical Outcomes Study Short-Form Health Survey version 2 (SF-12v2)

The SF-12v2 is a generic health profile instrument that includes 12-items [49]. Similar to its parent survey the SF-36, the SF-12v2 provides summary scores for physical HRQOL (physical component summary [PCS] score) and mental HRQOL (mental component summary [MCS] score). In addition, scores for eight sub-domains—physical functioning, role physical, bodily pain, general health, vitality, social functioning, role emotional, and mental health—are also provided. Standard US norm-based scores for the two summary scales and eight subscales were calculated using the QualityMetric SF Health Outcomes™ Scoring Software. Norm-based scores for the SF-12v2 have a mean of 50 and standard deviation of 10 in the general US population. Higher scores indicate better health status [49].

Caregiver Strain Questionnaire (CGSQ)

The 21-item Caregiver Strain Questionnaire (CGSQ) was used to assess the burden experienced by caregivers of children with autism [50]. A 5-point Likert scale is used to measure each item and includes endpoints ‘not at all a problem’ (1) and ‘very much a problem’ (5). Overall burden score can range from 3 to 15, with higher score indicating greater caregiver burden [50]. The CGSQ has been found to have good reliability and validity for the assessment of burden among caregivers of children with autism [54].

Social Communication Questionnaire (SCQ)

The 40-item SCQ (Lifetime version) was used to determine the severity of autism in the child [51]. The parent report SCQ assesses severity in the domains of reciprocal social interaction, communication, and restricted, repetitive, and stereotyped patterns of behavior. Items are measured on a ‘yes/no’ format. A score of 1 is given in the presence of abnormal behavior, while 0 is given in the absence of abnormal behavior. A total score was calculated by summing the scores on individual items [51], with higher score indicating greater severity.

Statistical analysis

Survey data were checked for missing values. Surveys with more than 15 % missing data were removed from analysis. Of the 326 responses received, 10 were removed based on missing data criterion. Thus, a total of 316 responses were analyzed for study purposes. There were no missing values for EQ-5D-3L instrument. The Web-based EQ-5D-3L survey requires users to choose a response option for each element (EQ-5D-3L index and EQ-5D-3L VAS) before proceeding, thereby resulting in complete data. Missing data for all other study measures except SF-12v2 were mean substituted. Although mean substitution can affect estimates for variance and correlations [55], it is considered to be an appropriate approach when the degree of missing data is small [55, 56]. Since the percentage of missing data among the final set of 316 responses was low (0.28 %), mean substitution was considered to an appropriate technique for missing data replacement in the context of this study. The proportion of missing data for the SF-12v2 was 0.32 %. Missing values for SF-12v2 were handled by the missing score estimation technique developed by QualityMetric and undertaken using the QualityMetric SF Health Outcomes™ Scoring Software.

Convergent validity refers to the extent to which different scales measuring the same underlying construct correlate. In order to ascertain the convergent validity of the EQ-5D-3L instrument, the correlation of EQ-5D-3L five index domains with SF-12v2 domains and CGSQ was examined. Spearman rank correlation was used to examine the relationship between the hypothesized constructs. Correlations between 0.1 and 0.29 were considered small, between 0.3 and 0.49 as moderate, and above 0.5 as strong [57]. Based on the general EQ-5D-3L literature [46, 5862], moderate to strong correlations (direction listed in parenthesis) were hypothesized between the following sets of constructs: EQ-5D-3L ‘mobility’ domain and SF-12v2 ‘physical functioning’ domain (−), EQ-5D-3L ‘anxiety/depression’ domain and SF-12v2 ‘mental health’ domain (−), EQ-5D-3L ‘anxiety/depression’ domain and CGSQ ‘caregiver burden’ (+), EQ-5D-3L ‘pain/discomfort’ domain and SF-12v2 ‘bodily pain’ domain (−), EQ-5D-3L ‘usual activities’ domain and SF-12v2 ‘bodily pain’ (−) and ‘role physical’ domain (−), EQ-5D-3L ‘self-care’ domain and SF-12v2 ‘bodily pain’ (−) and ‘role physical’ domain (−), EQ-5D-3L index and SF-12v2 PCS (+) and MCS (+), EQ-5D-3L index and CGSQ ‘caregiver burden’ (−), EQ-5D-3L VAS and SF-12v2 ‘general health’ domain (+), PCS (+), and MCS (+), and EQ-5D-3L VAS and CGSQ ‘caregiver burden’ (−). A low and insignificant correlation among measures of theoretically unrelated constructs evinces discriminant validity. Discriminant validity was determined by observing the correlation of EQ-5D-3L ‘mobility’ and ‘self-care’ scores with SF-12v2 ‘role emotional,’ ‘mental health,’ and MCS scores. A low and statistically insignificant correlation was expected among these dissimilar constructs.

In the context of this study, the known-groups validity refers to the ability of an instrument to differentiate among participants by severity level. To determine the known-groups validity of the EQ-5D-3L instrument, variation in EQ-5D-3L index and EQ-5D-3L VAS scores by level of autism severity in the child was examined. Prior studies of EQ-5D-3L psychometrics in different patient populations have examined known-groups validity through an examination of variation in EQ-5D-3L index and VAS scores by level of disease severity [46, 63, 64]. Also, studies in autism have shown care recipient disease severity to be a predictor of caregiver HRQOL [15]. Therefore, in this study, we hypothesized that caregiver EQ-5D-3L index and VAS scores would decrease with an increase in autism severity among the care recipient. Caregivers were categorized into tertiles based on the care recipients’ autism severity scores that were measured using the SCQ instrument. Using the Kruskal–Wallis test, the EQ-5D-3L index and EQ-5D-3L VAS scores of caregivers were compared across the tertile categories. A nonparametric test was used because of the non-normal distribution of EQ-5D-3L scores. Post-hoc analyses were conducted using the Mann–Whitney U test. A gradient relationship between caregiver utility scores and level of autism severity in the child was expected, such that utility scores decrease with an increase in autism severity.

The internal consistency reliability of EQ-5D-3L instrument was determined using Cronbach’s alpha. A Cronbach’s alpha value of greater than or equal to 0.70 is considered adequate [65]. Corrected item-total correlations were also determined and used to observe the reliability of individual items. A correlation value greater than 0.20 of an item with the total score sans item is considered acceptable [66]. Floor and ceiling effects for the EQ-5D-3L instrument were assessed by determining the percentage of participants who received the lowest (floor effect) and highest (ceiling effect) possible scores. These effects are considered to be absent when the percentage of participants with lowest (floor) or highest (ceiling) possible score is less than or equal to 20 % [67].

For tests of statistical significance, P values less than 0.05 were used. Data analyses were performed using Statistical Package for Social Sciences (SPSS) version 20.0 (SPSS Inc., Chicago, IL).

Results

Sample characteristics

Table 1 lists the sociodemographic characteristics of caregivers and care recipients. The final sample included 316 primary caregivers of children with autism. The mean age of participants was 41.3 years (±6.9). In terms of gender, race, and marital status, a majority of participants were female (93.7 %), white (87.3 %), and married (80.4 %), respectively. In order to put into perspective the characteristics of caregivers in our study, we compared our sample characteristics to those of caregivers of children with special health care needs in the USA [68]. The average of our sample was similar to national caregiver population (41.3 vs. 40.6 years). When compared to the national caregiver population for children with special health care needs, a higher proportion of our sample included females (93.7 vs. 72 %) and white (87.3 vs. 60 %). A higher proportion of our sample were married (80.4 vs. 53 %), college graduate (36.4 vs. 17 %), working full-time (38.3 vs. 35 %), and had income $50,000 or more (72.7 vs. 37 %) as compared to national caregiver population for children with special health care needs. Most (91 %) of the caregivers in our sample were mothers of children with autism, which is consistent with the general autism literature [69, 70]. As per the child with autism, 60.8 % had classic autism, 18.4 % had Asperger’s syndrome, and 20.3 % had PDD-NOS. Though the exact distribution of different forms of autism is not available in the literature, a higher prevalence of classic autism has been indicated [71]. Almost 80 % of children with autism were male, which is consistent with the general prevalence estimates for this disorder [2]. The average age of the care recipients was 9.5 years (±3.9).

Table 1 Study characteristics of participants and the care recipients (N = 316)

EQ-5D-3L descriptives

Figure 1 depicts the response received for the EQ-5D-3L instrument. Almost three-fifths of participants reported having problems in the domain of ‘anxiety/depression.’ Half of the participants reported having problems with ‘pain/discomfort,’ and one-fourth indicated problems with ‘usual activities’. For each of the five domains, very few participants indicated having ‘extreme problems.’ Least problems were indicated in the ‘self-care’ domain. The mean EQ-5D-3L index score of caregivers was 0.82 (±0.16), and the mean EQ-5D-3L VAS score was 76.00 (±16.11).

Fig. 1
figure 1

Distribution of the EQ-5D-3L health states among study participants (N = 316)

Table 2 lists the five most common EQ-5D-3L health states among caregivers of children with autism. Almost 24 % of participants reported being in perfect health state (‘11111’) and had an index score of ‘1.00’. These participants had an average EQ-5D-3L VAS score of 87.83 (±7.5). Cumulatively, the top five EQ-5D-3L health states accounted for more than three-fourths (78.2 %) of participants. The average EQ-5D-3L index score for participants in the top five health states was 0.8723 (±0.0873), while the average VAS score for this group was 79.78 (±12.94).

Table 2 Top five self-reported health states (EQ-5D-3L)

Convergent and discriminant validity

Table 3 describes the correlation between EQ-5D-3L domains and relevant study constructs (SF-12v2 and CGSQ). All hypothesized relationships for convergent validity except for those between EQ-5D-3L ‘self-care’ domain and SF-12v2 ‘bodily pain’ and ‘role physical’ domain were supported. Low, but statistically significant, correlation was observed between EQ-5D-3L ‘self-care’ and SF-12v2 ‘bodily pain’ (−0.225, P < 0.001) and ‘role physical’ (−0.166, P = 0.003) domain. Among the hypothesized relationships, the strongest correlation was observed between the EQ-5D-3L VAS and SF-12v2 ‘general health’ domain (0.663, P < 0.001). The EQ-5D-3L index and VAS were significantly correlated with SF-12v2 PCS and MCS domains. Higher caregiver burden scores were associated with lower EQ-5D-3L index (−0.358, P < 0.001) and VAS (−0.322, P < 0.001) scores. As hypothesized for indices of discriminant validity, a low and statistically insignificant correlation was observed between scores for EQ-5D-3L ‘mobility’ and ‘self-care’ domains and SF-12v2 ‘role emotional,’ ‘mental health,’ and MCS domains.

Table 3 Correlation of EQ-5D-3L domains, EQ-5D-3L index, and EQ-5D-3L VAS with SF-12v2 and CGSQ scores

Known-groups validity

Results from Kruskal–Wallis test revealed significant differences in caregivers EQ-5D-3L index and EQ-5D-3L VAS scores by care recipients’ levels of autism severity (Table 4). Post-hoc analysis revealed that the median (range) index scores were lower among the third tertile (i.e., most severe) autism severity group as compared to the first (i.e., least severe) [0.81 (0.20–1.00) vs. 0.83 (0.26–1.00), P = 0.005] and second [0.81 (0.20–1.00) vs. 0.84 (0.29–1.00), P = 0.012] tertile groups. Similarly, EQ-5D-3L VAS scores were lower among the third tertile group as compared to the first and second tertile groups. No significant differences were noted between the first and second tertiles on either the index or the VAS scores.

Table 4 Variation in caregivers’ EQ-5D-3L health scores by care recipients level of autism severity

Reliability

The five-item EQ-5D-3L instrument had internal consistency reliability (Cronbach’s alpha) of 0.63, which was less than the acceptable criteria of 0.70. However, all corrected item-total correlations were higher than the minimal acceptable value of 0.20 (Table 5).

Table 5 Reliability analysis of EQ-5D-3L

Floor and ceiling effects

Roughly 24 % of participants had the highest possible index score of 1.00, indicating the presence of ceiling effect for EQ-5D-3L index. These participants reported ‘no problems’ in the five health domains (‘11111’) and had scores indicative of a perfect health state. Floor effect was absent, since none of the participants reported the lowest possible health state (‘33333’). When observing the distribution of EQ-5D-3L VAS, no floor or ceiling effects were seen. Roughly 3 % of participants had the highest possible score (100) and none had the lowest score (0).

Discussion

This study assessed the psychometric properties of the EQ-5D-3L for use among primary caregivers of children with autism. The EQ-5D-3L is a commonly used generic utility instrument, which has been previously validated in several different patient populations [42, 7275]. A thorough review of the literature did not reveal any previous study that has determined the psychometric profile of the EQ-5D-3L among autism caregivers. The conciseness of the EQ-5D-3L offers low respondent burden. Further, the availability of both online and paper versions provides researchers the flexibility of using this instrument in different settings. These factors, together with the EQ-5D-3L instrument’s ability to provide information concerning patient preferences, makes it an appealing choice for studies, including the current one, aimed at HRQOL measurement. The convergent validity, discriminant validity, known-groups validity, internal consistency reliability, and floor and ceiling effects of the EQ-5D-3L instrument were evaluated in this study. Considering that there is no autism-specific utility instrument currently available, the EQ-5D-3L presents a psychometrically sound option for researchers who aim to assess HRQOL among this growing population of caregivers.

Study results revealed EQ-5D-3L to have good construct (convergent, discriminant, and known-groups) validity. All except two of the hypothesized relationships of EQ-5D-3L with relevant constructs in the study were supported, thereby providing good evidence of its convergent validity. The EQ-5D-3L domains correlated well with similar domains in the SF-12v2 instrument. Both instruments are among the most commonly used in the patient-reported outcomes literature. The SF-12v2 has been previously used in studies of autism caregiving [13, 15, 16] and indicated to perform well. The EQ-5D-3L provides good alternative to measure caregiver HRQOL, with the added advantage of ascertaining caregiver preferences for health states. As expected, the EQ-5D-3L index and VAS scores were significantly related to caregiver burden. This is the first study to highlight the relationship between caregiver burden and general utility instrument. Future studies may use the EQ-5D-3L to further investigate the role played by burden and other caregiving situational factors in influencing caregivers’ health state preferences. In the present study, an insignificantly low correlation was found among EQ-5D-3L ‘mobility’ and ‘self-care’ scores and SF-12v2 ‘role emotional,’ ‘mental health,’ and MCS scores, thereby demonstrating support for EQ-5D-3L discriminant validity.

As hypothesized, the EQ-5D-3L index and VAS scores were lower for caregivers who had a child with autism with extreme severity, that is, had SCQ score in third tertile, as compared to those with a child with low (first tertile) or moderate (second tertile) severity. This provides some evidence of the EQ-5D-3L instrument ability to distinguish between known groups of autism caregivers. However, this result should be interpreted in the presence of the fact that the scores for the first and second tertile were statistically insignificant. A plausible explanation for this finding could be that the classification of autism severity among the care recipient into tertiles was arbitrary and not based on scoring structure of the autism severity instrument (SCQ) used in the study. Alternatively, it is possible that the EQ-5D-3L was not sensitive enough to discern differences in preference values among caregivers having child with ‘low’ and ‘moderate’ autism severity. When studying HRQOL among caregivers of elderly patients with Alzheimer’s, Bell et al. [76] found a lack of variation in preference scores (captured using HUI-II) among caregivers by care recipient disease severity. The authors indicated that generic preference-weighted instruments may not be sensitive enough to capture variation in health state preferences by disease severity. In contrast, Tilford et al. [77] were able to detect differences in preference scores (captured using QWB) among caregivers of children with spina bifida by level of disease severity. Our results suggest that the EQ-5D-3L is sensitive in terms of distinguishing preference scores among caregivers at the higher end of disease severity with those at lower and medium end, but is unlikely to perform well in comparing preferences among the latter two groups.

The internal consistency reliability of the EQ-5D-3L instrument was lower (0.63 Cronbach’s alpha) than the acceptable limit of 0.70 [65]. However, reliability assessed through corrected item-total correlations was satisfactory. A true interpretation of internal consistency of the EQ-5D-3L is unclear, considering that the instrument includes one question for each health domain [78]. It may also be reflective of some heterogeneity among these domains. Ceiling effects were present in index scores, with almost one in five participants reporting a perfect health state. Prior studies of EQ-5D-3L psychometric profile in different patient populations have also reported the presence of ceiling effects [45, 79]. The presence of ceiling effects may present some challenges in EQ-5D-3L responsiveness, but is unlikely to limit its ability to serve as a useful generic utility instrument. Floor effects were absent for index scores. Floor and ceiling effects were absent for the EQ-5D-3L VAS.

There are few limitations in the study. The study was conducted among caregivers of children with autism who were registered with the IAN. It is likely that families of children with autism registered with the IAN have greater access to resources and increased awareness of the disorder. With more than 38,000 members including individuals with autism, family members, and caregivers, the number of individuals registered with the IAN still constitutes a small proportion of the ~2 million individuals that are affected with autism in the USA. Therefore, the results of the proposed study may not be generalizable to all families of children with autism. When comparing our sample characteristics to national profile of caregivers of children with special healthcare needs [68], differences were observed. Our sample had more females and whites, and generally ranked higher on indicators of socioeconomic status. It should be noted that our sample included only primary caregivers of children with autism, while the comparator group included any caregiver that provided unpaid care to a child with special healthcare needs [68]. Further, autism has been found to be more prevalent among whites [2] and in families of higher socioeconomic status [80]. When examining the relationship between autism and socioeconomic status among a population-based sample of children in the USA, Durkin et al. [81] found an increasing prevalence of autism with socioeconomic status, reflecting a dose–response curve. The authors suggested that there may be a causal link between socioeconomic advantage and the risk of developing autism. Another large epidemiological investigation involving more than 4,000 children with autism in the state of California found the risk of autism to increase with maternal education [82]. These factors may explain some of the differences, particularly the higher socioeconomic status, observed among our sample as compared to the national profile of caregivers of children with special healthcare needs. Since a cross-sectional design was used for study purposes, we could not test the criterion validity (predictive validity) and test–retest reliability of the EQ-5D-3L instrument. Future studies could test the usefulness of EQ-5D-3L among autism caregivers using a longitudinal research design.

Conclusion

To the best of our knowledge, this is the first study to test the psychometric properties of the EQ-5D-3L among caregivers of children with autism. The EQ-5D-3L was found to have good convergent and discriminant validity and met study hypothesis. The EQ-5D-3L domains correlated well with theoretically similar domains from other instruments (convergent validity) and had low correlation with unrelated constructs (discriminant validity). Known-groups validity was good, and the EQ-5D-3L scores varied as expected across known-groups. Mixed results were seen as per the homogeneity of EQ-5D-3L items. Though the Cronbach’s alpha was slightly lower than the generally acceptable 0.70 value, the corrected item-total correlations were adequate. Floor effects were absent, but ceiling effects were seen for EQ-5D-3L index scores. The EQ-5D-3L VAS did not have floor and ceiling effects. As the research on studying HRQOL outcomes among caregivers of children with autism gains increasing prominence, it may be of value to also document their preferences for health states. The EQ-5D-3L is a viable preference-based generic instrument that has been widely used in the health outcomes literature. As found in the current study, the instrument also works well in autism caregiving population.