Introduction

Obesity is a serious public health issue which causes important health problems such as cardiovascular disease and diabetes, and affects quality of life negatively [1, 2]. It has become a global epidemic. In Europe, its prevalence is between 32–79% among adult men, and 28–78% among adult women [3]. Albania, Bosnia-Herzegovina, and England are European countries where the prevalence of obesity is especially high [2].

According to a pre-study report of ‘National Nutrition and Health Examination 2010’, the prevalence of being overweight is 31.35%, while obesity prevalence is 12.02%. 28.93% of the women are overweight, but 14.49% of them are obese and in men 33.64% are overweight, 9.70% are obese in Turkey [4].

The main risk factors for obesity are low physical activity, eating habits, age, being a woman, number of births, marriage, quitting smoking, and drinking alcohol. The quality of life in obese women is lower than in obese men. For example, in obese women, stigmatization, anxiety, and depression are more prevalent [5,6,7]. Choo et al. found that women with increased body mass index had lower self-efficacy, more severe depressive symptoms and greater deterioration in quality of life [7].

Quality of life is the satisfaction of an individual with his/her own life [8,9,10]. One of the most important factors which influence satisfaction of an individual with life is being healthy. Protection and improvement of health would enable increased quality of life [9]. Obesity in adults is associated with impaired Health-Related Quality of Life (HRQoL). One of the most important factors which influence quality of life among women is being at a healthy weight [11]. Various studies support the relation between body mass index and quality of life. For women, being overweight influences physical wellbeing, self-esteem, social interactions, and family relations, sex life, and work life [10,11,12].

The World Health Organization (WHO) defines quality of life as “individuals’ perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards, and concerns.” It is a comprehensive concept affected in a complex way by individuals’ physical health, psychological state, and their relationship to salient features of their environment [13]. Culture can be defined as the behaviors, beliefs, and characteristics of a particular social or age group. For social and ethnic groups, these behaviors, often learned during childhood, are values that represent unconscious factors in the motivation of individual behaviors [14, 15]. In prevalence studies conducted in Turkey, it was determined that the prevalence of obesity is higher in females compared to males [16,17,18]. A study reported that obesity among women is almost two times higher than men (41.0%, 20.5%). Studies from Europe reported obesity prevalence among men between 32 and 79% while among women between 28 and 78% [3].

There are some scales that are/can be used to measure quality of life [8, 13, 19], but there are few scales that can be used to measure quality of life in obese women [20]. The aim of the present study was to develop a scale to assess the quality of life of overweight women. It was thought that this scale would be useful to measure health status and evaluate the outcomes of interventions to improve health.

Methods

Study design

This study presents the results of methodological research that derives evaluation items and tests the validity and reliability of the scale to measure quality-of-life of overweight women.

Procedure

The present researcher conducted focus group interviews before this study, and developed an item pool which analyzes the quality of life among women using focus group data, their experiences and the contents of interview covering all aspects of QOL based on detailed literature review [1, 10, 12, 18, 19]. The focus group study aimed to capture the experiences of women currently receiving primary health care. Three group interviews were conducted with 18 women. Scale items were developed to evaluate the relationship between obesity and physical health, psychological dimensions (self-perception), difficulties in performing and maintaining a job, social relations (spouse and important others) and relations with their surroundings. Thus, it should be ensured that the grading scale is a complete structure that includes the physical, spiritual, and social dimensions of health and communication with the environment. The semi-structured questions included: (1) Does obesity affect your physical activity and ability? If yes, How? (2) Does obesity affect your social relationships (with spouse, family and others)? If yes, How? (3) Does obesity affect your sex life? If yes, How? (4) What is your opinion about obesity? Does your weight affect your work life? When compared to others, how do you feel about yourself? Then women’s statements were converted to sentences. The following methods were used when creating items:

  1. 1.

    The words the women spoke were turned into smooth sentences,

  2. 2.

    All items were created so as not to include factual statements,

  3. 3.

    Expressions were written in simple present time and in plain language style,

  4. 4.

    Each item includes only one idea/opinion.

The statements about the quality of life of overweight women, comprising 56 items, were categorized into six sub-groups: (1) physical activity (12 items), (2) public distress (14 items), (3) sex life (9 items), (4) self-perception (9 items), (5) work life (9 items), and (6) self-esteem (5 items). Six experts determined the content validity of the scale. Of those five experts were professional academics at university, studying in this field. The other expert was an academic at the biostatistics department who took part in the scale development studies. The item pool was sent to the experts via e-mail by the authors. They were asked to answer the following two questions: (1) Do the items have the quality of an expression of question? (2) Do the items assess the criteria of quality of life? Content Validity Ratios (CVR) were calculated using Lawshe technique according to the views of the experts for each item as “appropriate or highly appropriate” using the following formula [20, 21].

$${\text{CVR}}=\frac{{{\text{number}}\;{\text{of}}\;{\text{experts}}\;{\text{stating}}\;{\text{`suitable'}}\;{\text{and}}\;{\text{`quite}}\;{\text{suitable'}}/{\text{number}}\;{\text{of}}\;{\text{experts}}}}{{{\text{number}}\;{\text{of}}\;{\text{experts}}}}{\text{.}}$$

It was stated that the scale had sufficient content validity since 80% of experts evaluated items with 3–4 points and the scale had 0.80 and above CVI. It is suggested that CVI values of items should not be below 0.80 [20, 21]. The CVI of 15 items was 0.80 and others were 1.00. The total CVI rate of the scale was 0.925 and since this CVI is higher than 0.80, it was accepted as suitable.

Sampling

Participants in the study were from public education centers in a city. The sample was composed of women whose body mass indices were 30 and above, and who had no disease associated with obesity.

Research eligibility criteria

The main eligibility criteria were being 18 years and older, overweight (BMI 30 and over), and willing to comply with the study. In addition, diseases related to obesity and non-pregnancy complaints were included.

Data collection

The data were collected between March and April 2014 through face-to-face interviews on weekdays at each center. The participants completed the data collection tools individually, and each participant spent about 15 min filling out the forms. There are 56 items in the item pool prepared for the Scale of Quality of Life of Overweight Women (SMQLOW). It is recommended that the sample size should be 5–10 times more than the number of items [22, 23]. We collected the data from 506 women in the research period, this number complies with the recommended sample size.

The measurements of height and weight were done in a room prepared by the public education centers with the same measurement instruments whose calibration was done before application. The measurement of height was done without shoes; heels leaned behind, and back stiff and head in a normal anatomic position. The measurement of weight was done by taking off extra clothes and shoes. The body mass index was calculated by dividing the body weight in kilograms per square of height in meters. All the measurements were done in the morning before breakfast.

Data analysis

The data were evaluated using SPSS 20 Package Program (SPSS, Statistical Package for the Social Science). The content validity index was calculated after expert views for content validity. Factor analysis was applied to reveal the factor structure to test the structure validity of the scale. Parallel analysis was used for determining the number of components. Among many techniques proposed to determine the number of factors to retain, Horn’s Parallel Analysis (PA) is the most strongly recommended one. The PA method is implemented by generating a large number of data matrices from random data. Each matrix is generated in parallel with the real data which means that matrices with the same number of cases and variables are created. The rationale underlying PA is that components from real data with a valid underlying factor structure should have larger Eigenvalues than those derived from random data having the same sample size and number of variables. Thus, researchers would not be interested in a factor that does not account for more variance than the parallel factor obtained from random numbers because meaningful components extracted from actual data should have larger eigenvalues than parallel eigenvalues obtained from random data factors are retained in the real data as long as they are greater than the mean eigenvalue generated from the random data matrices [24, 25].

The construct validity of the scale was examined by exploratory factor analysis using principal component with varimax rotation method and Kaiser Normalization. Factor analysis was performed in four stages: investigating whether the data were suitable for factor analysis (using Kaiser–Meyer–Olkin), designating factors, rotating factors, and labelling factors.

Cronbach alpha coefficient was used to determine internal consistency of items in the scale. Item analysis was applied to determine contribution of the items to the scale. To this aim all item-total correlation coefficients in the scale were calculated. Moreover, reliability coefficients were calculated again for the rest of the items when an item is deleted from the scale.

Results

The mean age of the participants was 44.9 ± 11. The scale, made up of 56 items at first, was reduced into 40 in parallel with item analysis. Of the respondents, 0.8% was at normal weight, 22.3% of them were overweight, and 44.5% of them were obese at first grade. 30.2% of them were obese at second grade. The mean scores of the respondents according to BMI were 163 ± 8.87; 145 ± 3.05; 138 ± 2.32; 128 ± 2.72, respectively. The difference between the groups was significant. F = 6.101; p < 0.001 and the differences between all groups were significant according to post hoc test.

There were 16 items with coefficients 0.30 which were deleted because of low correlation. Therefore, the final scale consisted of 40 items. The scale was structured as four factors based on the results of the parallel analysis (Table 1). Rotation sums of squared loading of factors and the percentage of the total variance explained by factors, respectively, were 9706 (24.264%); 6751 (16.87%); 6463 (16.157%); 3359 (8.397%). Thus, total variance explained by four factors was determined as 65.694%. Factor loads of the items forming the scale varied between 0.546 and 0.840 (Table 2). KMO value was 0.959 and the result of Bartlett’s Test of Sphericity analysis was found to be χ2 = 14164.9; p < 0.001.

Table 1 Eigen values of actual and simulative data
Table 2 Provision of the item names

Corrected item-total correlations ranged between 0.564 and .839. Cronbach’s alpha values were determined as 0.967 for the whole scale, 772–961 for the subscales. While Cronbach’s alpha value was 0.939 in the first half, Spearman Brown split-half test correlation of the SMQLOW was 0.951 in the second half and the Spearman Brown value was determined as 0.884.

Labelling and content of factors

The factors in the scale were labelled according to the items included in their sub-dimensions. Hence the first factor, composed of 16 items, was labelled as ‘self-care and physical activity’ since it included items relating to the status of respondents’ self-care and physical activity and problems related to them. The second factor contained nine items which was labelled as ‘sex life’ since the items here reflected problems experienced by overweight women in relation to their sex life. The third factor contained 11 items because these items are related to the distress caused by the environment and this factor was labelled as ‘public distresses’. The fourth factor, comprised four items, was labelled as ‘self-perception’ because the items of this factor are related with the individual’s own perception.

Discussion

This study was carried out to develop a quality-of-life scale for overweight women, and to test validity and reliability of the scale. Three factors were defined in validity and reliability of the quality-of-life scale for overweight women. These factors are composed of 40 items and explain 65.694% of total variance.

Construct validity

Content and construct validity methods were used for the validity of SMQLOW.

Content validity was explained in the “Methods” section. The items whose CVI scores were three and four were included in the scale [20, 26]. Exploratory factor analysis (EFA) was used in construct validity. Exploratory factor analysis (EFA) is a multivariate statistical technique which aims to bring many variables related to each other and find new variables (factors/dimensions) which are few in number and conceptually meaningful [21, 27, 28]. Before was EFA done to determine construct validity, the Kaiser–Meyer–Olkin (KMO) coefficient and Bartlett test were done to evaluate the compatibility of the data. It is expected that the Bartlett test result should be meaningful (p < 0.05) and the KMO value should be more than 0.50 [8, 29]. If the KMO value is between 0.50 and 0.70 the sampling number is at medium level, if t is between 0.80 and 0.90, it is quite good and if it is 0.90 and above, it is accepted to be perfect. If the KMO value is below 0.50, it is stated that number of sampling is insufficient [23, 29]. KMO values in this interval show that the sample was large enough. The Bartlett test result is also meaningful which indicates the existence of a correlation between scale items and shows that the data set is suitable for EFA [30]. The analysis result of Bartlett’s Test of Sphericity was found sufficient. This value which was found to be statistically significant (χ2 = 141164.9; p < 0.001) indicates that the data were normally distributed.

According to the EFA results in this study, the scale is composed of three factors. In EFA, correlation of an item with a factor and correlation of an item with the whole scale is observed; this process analyzes how much of the total variance can be explained by the factors. It is accepted as an important variable included in the analysis that should explain two-thirds of the total variance. However, it is difficult to reach this rate in social and behavioral sciences. The variance rates between 40 and 60% are acceptable for social sciences [28]. When the explained variance is high, it is regarded as an indicator that related terms or constructs measure what is to be measured well. In this study, the factors explain 65.69% of total variance and this rate is acceptable. Four factors obtained as a result of EFA, carried out with varimax rotation and factor loads of the items were collected. The factor loads point out the correlation between items and factors. If the factor load of an item is low, then it means that items do not have a powerful correlation. There are studies which adopt the idea that the factor of items should not be below 0.40 [30, 31] and those stating that load values should not be below 0.30 [20, 21, 32, 33]. Therefore, SMQLOW has construct validity (Table 2).

Reliability of scale

Reliability is the power of a measurement scale in giving compatible and coherent results [30]. It shows how well an instrument gives decisive and renewable results [32]. In testing the reliability of SMQLOW, item-total correlation coefficients, Cronbach’s alpha internal consistency analysis, and split-half reliability methods were used. In a Likert-type scale, it is expected that the reliability coefficient should be as close as possible to one [27]. The reliability coefficient being around 0.90 is accepted as ‘perfect’, being 0.80 means ‘very good’ and being around 0.70 means ‘sufficient’ [22]. Consequently, the Cronbach’s alpha coefficient of the scale obtained in this study is above 0.70 [20].

The reliability coefficient obtained by dividing a scale into two is referred as equivalent split-half reliability. If the scale has sub-dimensions, each sub-dimension is accepted as a whole in itself and the method can be applied for sub-dimensions as well, which is the most common method among scale reliability determination methods [22]. Internal consistency refers to the general agreement between multiple and scaled items. According to these results, the internal consistency of the SMQLOW was considered highly reliable.

The item-total score correlation is based on the calculation of the correlation between each item and the total score. If the total scale of an item and value of correlation coefficient is minus, zero or close to zero; then the item should be excluded. If the value is low, it means that the item measures a different quality from others [21, 27]. In this study, the item-total correlation coefficient of items in the final scale was between 0.564 and .839 (Table 3) If the items’ total correlation is 0.30 and above, it means they can identify individuals well in the sense of measured characteristics [32]. Hence, each item in this scale generally serves to measure the expected quality and each item is distinctive at acceptable level. Also, the significant negative correlation between the BMI and SMQLOW scores of the participants (− 0.114; p < 0.05) and significant difference in the mean SMQLOW scores according to BMI (F = 6.101; p < 0.001) show that the scale is distinctive.

Table 3 Descriptive statistics and reliability analysis of the scale

Conclusion

Findings obtained in the study indicate that SMQLOW is valid and reliable. The scale can be used to identify the quality of life among overweight people and the evaluation of the effectiveness of activities for improving health. Developed in this way, SMQLOW could indirectly contribute to protecting and improving women’s health.