QueryDevelopment and understanding of the concept of mindfulness in psychology was initiated by the work of Jon Kabat-Zinn since the late 1970s. Since then, various operational definitions have been devised for the purpose of research (Krägeloh et al., 2019). Many definitions have revolved around the importance of attention in characterizing the mindfulness construct (Brown & Ryan, 2003). Mindfulness is considered to be an active state of consciousness wherein an individual pays attention to each moment’s experience, which is continually refined by a persistent practice of meditation and its application in everyday life (Kabat-Zinn, 2005). Other important elements considered to be characteristic of mindfulness include non-judgmental observation (Baer, 2003) and present-moment awareness (Shapiro et al., 2008). Thus, mindfulness has been defined as “the awareness that emerges through paying attention on purpose, in the present moment, and non-judgmentally to the unfolding of experience moment by moment” (Kabat-Zinn, 2003, p. 145).

In the understanding of mindfulness and measuring/assessing its components, various self-report measures have been developed in various contexts based on the state and trait mindfulness concept. State mindfulness in general refers to the present-moment condition, whether an individual is able to cultivate mindfulness in the present (Lau et al., 2006), whereas trait mindfulness refers to a stable characteristic of a person to be able to be mindful daily or enter those mindful states frequently (Baer et al., 2006). Recently, advancements in research suggest that mindfulness brings about both state (temporary) and trait (personality, long-term practice) changes (Medvedev et al., 2017a, b).

Presently, there are eight trait-mindfulness scales, namely the Freiburg Mindfulness Inventory (FMI; Buchheld et al., 2001), the Mindful Attention Awareness Scale (MAAS; Brown & Ryan, 2003), the Cognitive and Affective Mindfulness Scale and then a revised version (CAMS-R; Feldman et al., 2004), the Southampton Mindfulness Questionnaire (SMQ; Chadwick et al., 2008), the Kentucky Inventory of Mindfulness Scale (KIMS; Baer et al., 2004), the Five Facet Mindfulness Questionnaire (FFMQ; Baer et al., 2006), and the Philadelphia Mindfulness Scale (PHLMS; Cardaciotto et al., 2008). Two state mindfulness scales in widespread use are Toronto Mindfulness Scale (Lau et al., 2006) and State Mindfulness Scale (Tanay & Bernstein, 2013). Even though the scales are widely available and utilized, they posit several issues as discussed in a review paper by Bergomi et al. (2013) suggesting that no single questionnaire can be accountable as a true measure of mindfulness and applicable to all populations across the world.

Currently, the most widely used assessment tool is the Five-Facet Mindfulness Questionnaire (FFMQ) which has been developed through factorial analysis after pooling of items from five different scales: the Freiburg Mindfulness Inventory (Buchheld et al., 2001), the Mindful Attention Awareness Scale (Brown & Ryan, 2003), the Mindfulness Questionnaire (Chadwick et al., 2008), the Kentucky Inventory of Mindfulness Skills (Baer et al., 2004), and the Cognitive and Affective Mindfulness Scale (Feldman et al., 2004). The FFMQ has five factors named Observing, Describing, Acting with Awareness, Nonjudging, and Nonreactivity. Each factor has been assigned with eight items each except for Nonreactivity, which consists of seven items, thus altogether combining to a 39-item questionnaire.

In the original development and validation of the FFMQ in English with US samples using exploratory factor analysis (EFA), confirmatory factor analysis (CFA), and a higher-order CFA, Baer et al. (2006) confirmed that the four of the facets (Acting with Awareness, Describing, Nonjudging, and Nonreactivity) can be described within the multifaceted mindfulness construct but results for the fifth facet (Observing) showed inconclusive results. When analyzed separately for a sample of meditators, the five-factor model showed acceptable fit, but misfit was reported for a dataset from respondents who were not regularly meditating. Later studies, however, reported that a five-factor hierarchical model worked well for both meditators and non-meditators, both when using CFA with item parcels (Baer et al., 2008) and without parcelling (Christopher et al., 2012).

Further research continued to show inconsistent factor solutions when the English-language version of the FFMQ was administered to different samples from various cultural backgrounds and geographical locations. Williams et al. (2014) engaged three different samples from the United Kingdom: adults who were recruited via the online settings from the community using a convenience sampling technique, including (group 1) adults practicing meditation recruited via online meditation sites, local centers, (group 2) the Exeter Mindfulness Network, and (group 3) adults diagnosed with recurrent depression based on the DSM-IV criteria recruited from primary care services. They found that CFA analysis with item parcelling showed a four-factor hierarchical model without the Observing factor to be tenable for the community adults and the clinical sample, whereas a five-factor model fitted to data from the meditators group well, consistent with the initial validation study by Baer et al. (2006). Siegling and Petrides (2016) reported in their study with participants from the United Kingdom that the Observing facet does not fit the factor solution using CFA (with item parcelling) and argued that the removal of this facet from the instrument could be justified. Similarly, Gu et al. (2016) recruited samples with a history of recurrent depression from the United Kingdom. They collected and analyzed data using CFA (with item parcelling) before and after a mindfulness-based cognitive therapy (MBCT; Segal et al., 2002, 2013) intervention and found that a four-factor hierarchical model (excluding the Observing facet) fits well with the sample pre-MBCT whereas a five-factor model showed acceptable fit for post-MBCT data. In other cases, in contrast, a five-factor model appeared suitable even for participants without formal mindfulness experience. In a study with a New Zealand sample, Medvedev et al., (2017a, b) used a subtest approach in their Rasch analysis. The five-factor model was found to provide adequate fit, although two items were required to be deleted.

Despite the historical link to mindfulness within Indian cultures, surprisingly little research has explored the concept of mindfulness in these populations. One exception is a study by Mandal et al. (2016) who developed and validated a Hindi version of the FFMQ using a sample of 300 non-meditators recruited from Varanasi, Uttar Pradesh, India. They had established an adequate fit of a four-factor model without the Observing facet and finalized a 28 item questionnaire to utilize with the Hindi-speaking population of India. The Hindi-language version of the FFMQ showed good internal consistency for the scale and the facets, ranging from 0.61 to 0.82. While the Hindi version of the FFMQ certainly has utility in India for the assessment of mindfulness, evaluation of versions in some of the other languages spoken in the country is also necessary. This includes English, which is widely utilized in India in educational, organizational, government, and administration settings (Montaut, 2010). India is a very large country with approximately 460 different languages and English is considered to be the “subsidiary official language nationwide” (Joshi, 2020, pg.2). It is either the secondary or third language for many in the Southern and North-East regions of India. In some regions, it is not uncommon to encounter some resistance toward the use of Hindi, and in such cases, the preferred language (apart from the local language) is English. In that sense, English may be regarded to function as a lingua franca in India (Kachru, 1986; Mehrotra, 2003). In our psychometric evaluation of the English-language FFMQ in India, our work is also able to provide a psychometric baseline from which to commence investigations into the extent to which the construct of mindfulness as measured by the FFMQ can be generalized to Indian populations.

In the past literature, classical test theory approaches including EFA and CFA have been predominantly applied to ascertain factor structures, with comparatively fewer studies utilizing an item-response theory (IRT) approach (Medvedev et al., 2017a, b). The present study utilized Rasch analysis due to its ability to provide detailed information about item performance. Other advantages include investigation of ordering of response thresholds, differential item functioning by demographic factors, and local dependency of items. Using a so-called subtest approach, Rasch analysis is able to differentiate between response dependency due to method effects and due to dimensionality, in a similar way to bi-factor modelling in CTT (Lundgren-Nilsson et al., 2013). Rasch analysis has been documented to be an advantageous strategy in assessing, understanding, and improving the precision of psychometrics of mindfulness instruments and other health-related outcome measures (Hobart & Cano, 2009; Medvedev et al., 2017a, b; Siegert et al., 2010). Lastly, the focus of our analyses was not necessarily on identifying the best functioning model as this may result in idiosyncratic factor solutions and thus an inflation of different scoring systems available around the world. A solution may thus be preferable if it has slightly inferior psychometric properties but nevertheless meets criteria for an adequate solution. The present exploration of the English-language FFMQ in India thus tested the conventional five-factor solution. This study thus explored the suitability of the FFMQ to assess mindfulness in India.

Method

Participants

The total sample for the present study comprised 300 participants recruited from the general population in New Delhi, India. The inclusion criteria for the sample recruitment required participants to be in the age range of 20 to 35 years. This was to ensure a homogeneous sample could be obtained in terms of sufficient proficiency in English. Exclusion criteria included presence of any significant psychiatric illness, neurological trauma or brain disease, intellectual disability, and current or past history of substance abuse. Participants were also required to be unfamiliar with the practice of meditation. This was to ensure that the sample is again homogeneous and that the scale could be evaluated for its ability to assess trait mindfulness as one would typically do in cross-sectional studies or as part of assessment prior to a mindfulness-based intervention.

Based on these criteria, the total sample constituted 163 females (54.3%) and 137 males (45.7%) with mean age of 27.14 years (SD = 4.04). The majority of the participants were salaried employees (62.3%) with an annual average income of at least 300,000 Rupees (41.0%).

Procedure

The participants for this cross-sectional study were initially recruited via the researchers’ networks. They were briefed about the purpose of the study as well as the inclusion and exclusion criteria to assist in recruiting further participants utilizing a purposive snowball sampling technique, keeping in mind the inclusion and exclusion criteria for the purpose of the study. Each participant received a phone call from the researcher to confirm their participation in the present study, and, along with rapport building, the relevance and information pertaining to the study was explained to them. This was also outlined in an information sheet with accompanying consent form addressing India’s data protection act, confidentiality, and right to discontinue participation at any given time. Finally, a suitable time was decided to meet participants individually or in a group for the purpose of completion of the questionnaire to ensure that the questionnaire was completed in full. The participants were met either at their work place or home/hostel settings. They were briefed again about the procedure outlined above, and then they were asked to complete the demographic details and were also given a general instruction sheet to begin marking the responses on the FFMQ. The total time taken to administer the test was approximately 20 minutes. No incentives were provided to the participants for their participation in this study. Upon completion, all the participants were thanked for their participations and de-briefed.

Measures

FFMQ (Baer et al., 2006) presents 39 self-report items that are rated on a five-point Likert-type scale ranging from “1 = never or very rarely true” to “5 = very often or always true.” The questionnaire assesses the five facets of mindfulness. The Observing facet captures the tendency to notice or attend to internal and external experiences, such as sensations, emotions, and cognitions. Examples of items in this facet are “When I take a shower or bath, I stay alert to the sensations of water on my body” and “I notice how foods and drinks affect my thoughts, bodily sensations, and emotions.” The Describing facet reflects the tendency to explain or label one’s experience in words, such as “I am good at finding words to describe my feelings” and “It’s hard for me to find the words to describe what I’m thinking.” Acting with Awareness facet brings full awareness and undivided attention to current activity or experiences. Example statements are “When I do things, my mind wanders off and I’m easily distracted” and “I don’t pay attention to what I’m doing because I’m daydreaming, worrying, or otherwise distracted.” The Nonjudging facet assesses the extent to which one maintains a non-evaluative stance toward inner experiences. Examples of statements for this facet are “I criticize myself for having irrational or inappropriate emotions” and “I make judgments about whether my thoughts are good or bad.” The fifth facet of this questionnaire, Nonreactivity, reflects the tendency to allow thoughts and feelings to come and go, without getting caught up in them, such as “In difficult situations, I can pause without immediately reacting” and “I watch my feelings without getting lost in them.” All items of Acting with Awareness, Nonjudging, and Nonreacting and three items of Describing (total 19 items) were negatively phrased and needed to be reverse coded prior to analysis so that higher scores reflect a higher level of mindfulness. The FFMQ has been reported to have adequate reliability, convergent and discriminant validity, and incremental validity in the prediction of psychological symptoms (Baer et al., 2006). The subscales have strong internal consistency ranging from Cronbach’s alpha = 0.75 to 0.92 with samples of meditators and non-meditators (Baer et al., 2008).

Data Analysis

All descriptive statistical analyses were performed using SPSS Version 25.0., and RUMM2030 (Andrich et al., 2009) was used for Rasch analysis (Fig. 1, overview of the analysis strategy). A likelihood-ratio test ascertained the suitability of the unrestricted Partial Credit model for our dataset with all 39 FFMQ items. Individual item analysis was conducted initially as a baseline model, followed by investigation of the higher-order structure of the FFMQ using a subtest analysis (Lundgren-Nilsson et al., 2013). This approach has been found to be an effective strategy in a previous evaluation of the FFMQ using Rasch analysis (Medvedev et al., 2017a, b). Combining facet items into subtests can address local dependency that arises between items due to their shared item content. This approach involves creating summary scores of locally dependent items and testing a further iteration of the model (Lundgren-Nilsson et al., 2013). If this subtest solution provides a satisfactory unidimensional fit to the Rasch model, it can be concluded that the subtests constitute elements within a higher-order structure of mindfulness. Careful consideration at each step of the iterative analysis followed the guidelines comprehensively outlined by Siegert et al. (2010) and Balalla et al. (2019). The present study examined goodness of fit for each model on the basis of the following criteria: item-trait interaction examined by chi square value was not significant, item thresholds were not disordered, individual item fit residuals were between the acceptable range of − 2.50 and + 2.50, item location mean approximated 0.00, and person and item fit residuals approached 0.00 with SD = 1.00. Differential item functioning (DIF) was also examined for the four socio-demographic categories available for the dataset (age, gender, occupation, and annual income). This investigated to what extent items contribute to the latent trait (in this case mindfulness) in the same way for different demographic groups. Unidimensionality was investigated using Smith’s (2000) test. When applied to individual subscales, we thus tested to what extent subscales present one single construct. When applied to subtests of item clusters by subscale, a unidimensional solution would indicate that a single higher-order factor of mindfulness can be postulated. Lastly, within Rasch analysis, reliability is expressed as person separation index (PSI), which is interpreted in the same way as Cronbach’s alpha (Tennant & Conaghan, 2007).

Fig. 1
figure 1

Overview of the analysis strategy employed in the present study

To further corroborate our findings, we investigated the factor structure yielded from the Rasch analysis using CFA with the software package LISREL Version 8.80 (Jöreskog & Sörbom, 1993). Even though items exhibited acceptable skewness and kurtosis, CFA was performed using the asymptotically distribution free method of diagonally weighted least squares, which is the most appropriate for ordinal-level data (Flora & Curran, 2004). We did not use item parcelling and employed a conservative approach of not correlating the error co-variances of items. The following fit-index cut-off values were considered to evaluate model fit: root mean square error of approximation (RMSEA) < 0.06, comparative fit index (CFI) > 0.95, and standardized root mean square residual (SRMR) < 0.08 (Marsh et al., 1988; McDonald & Ho, 2002). CFA was performed to provide continuity with other past research such as the Hindi version (Mandal et al., 2016), Dutch version (Bruin et al., 2012), Italian version (Giovannini et al., 2014), and the English-language version (Aguado et al. 2015; Gu et al., 2016; Gordon, 2018). Another reason for performing CFA was to lend support to the findings yielded from Rasch analysis to investigate the higher-order structure using another analytic approach.

Results

Baseline Model Fit

The baseline analysis included all 39 items without any higher-order factor structure suggested by any subtests (Table 1, 5F), which did not show an acceptable fit (χ2 (195) = 479.50, p < 0.01). Three items of the Describing facet showed elevated significant (Bonferroni adjusted) fit residuals (Table 2): item 12 (7.40), item 16 (6.14), and item 22 (7.14). Smith’s (2002) test indicated evidence for multidimensionality. No DIF was found for any socio-demographic (person) factors. Due to their high degree of misfit, these three items of the Describing facet required deletion prior to continuing further investigation of the higher-order factor structure. Thus, in the subsequent analysis of the five-factor model, items 12, 16, and 22 were not included (Table 1, 5F1). This model showed an acceptable fit (χ2 (180) = 184.45, p > 0.05) but failed the test of unidimensionality (Table 1).

Table 1 Summary of fit statistics for the baseline and the final Rasch analyses of the FFMQ-item fit residual (value and standard deviation), person fit residual (value and standard deviation), item-trait interaction, goodness of fit (χ2 and p value), person separation index (PSI), and results from Smith’s (2000) test of unidimensionality. The three models were baseline model; baseline (5F1) after deleting three misfitting items from the Describing facet (items 12, 16, and 22), and the final five-factor final model using subtest analysis approach (5F2)
Table 2 Item number, wording of the items, item location, standard error (SE), item fit residual, and chi square for the FFMQ 39-item baseline model (5F)

Five-Factor Model Fit

Using the subtest analysis approach as outlined by Medvedev et al., (2017a, b), five subtests were created named Observing, Acting with Awareness, Nonjudging, Describing, and Nonreactivity. Each subtest had 8 items each as categorized in the original instrument but guided by our first analysis, the three misfit items were not included in the Describing subtest. No significant DIF was found for any person factors (Fig. 2, person-item distribution plot). The analysis showed an acceptable fit (χ2 (25) = 28.49, p = 0.28). For this five-factor higher-order structure, unidimensionality was confirmed (Table 1, 5F2). Although, compared to models 5F and 5F1, PSI dropped when using subtests in model 5F2, the value still indicated good reliability. However, with PSI = 0.81, the value was slightly below the 0.85 criterion used to indicate the scale is suitable for assessment of within-participant changes, and instead may preferably be used for between-group comparisons (Tennant & Conaghan, 2007).

Fig. 2
figure 2

Person-item thresholds distributions for the FFMQ with three items discarded and facet subtests

Confirmatory Factor Analysis: Five-Factor Model Fit

Keeping in view the aim of the present study to investigate the higher-order model of mindfulness for its suitability in India, to further support our findings derived from the Rasch analysis and to provide continuity with past research utilizing classical test theory approaches, CFA was performed. The CFA model corroborated our findings for the higher-order five-factor model without the three misfitting items of the Describing facet. RMSEA (0.05), CFI (0.96), and SRMR (0.09) values confirmed the acceptable five-factor model fit as evidenced through Rasch analysis. Considering that we did not correlate any item error variances, the fact that SRMR was found to be marginally above the cut-off levels can be deemed as acceptable (Hermida, 2015). Some of the item factor loadings of the items were found to be at the lower end of the ideal range (between 0.40 and 0.50): items 1 and 11 of the Observing facet, item 3 of the Nonjudging facet, and items 4 and 33 of the Nonreactivity facet (Fig. 3, factor loadings).

Fig. 3
figure 3

Factor loadings

Discussion

The aim of the present study was to investigate the higher-order structure of mindfulness using the FFMQ-English-language version in a sample recruited from India. It also aimed to provide an acceptable model fit for the recruited sample which would open the pathway for future research to ascertain the cross-cultural generalizability and validity of the mindfulness instrument. Overall, a five-factor solution without three misfitting items of the Describing facet totalling to 36 items was found to provide an acceptable fit for use with the present sample from India. This finding from Rasch analysis was subsequently confirmed with CFA.

When testing a baseline model, three items from the Describing facet were found to be misfitting, which was followed up by a subtest analysis (Medvedev et al., 2017a, b) to investigate a higher-order factor structure. Medvedev et al., (2017a, b) found acceptable fit of five factors but two misfitting items (items 24 and 32) were removed from the original questionnaire, whereas in the present analysis, three items (items 12, 16, and 22) from the Describing facet were removed due to elevated fit residuals. The final five-factor model using the subtest analysis approach met the criteria of unidimensionality based on Smith’s (2002) test confirming that these five factors without the three misfitting items adequately assess the higher-order construct of mindfulness in a sample of non-meditators from India.

In the previous literature, issues related to the FFMQ factor structure have often been reported about the Observing facet for the English-language version (Aguado et al. 2015; Gu et al., 2016; Gordon, 2018). Inconsistent factor structures such as reported for the English-language version of the FFMQ were also noted for translations into various different languages. While CFAs of the Dutch (Bruin et al., 2012), Chinese (Deng et al., 2011), Japanese (Sugiura et al., 2012), and Italian versions (Giovannini et al., 2014) pointed toward a five-factor solution, a four-factor model appeared more suitable in other cases. For Portuguese (Ramos et al., 2018) and German versions (Tran et al., 2013), the Nonreactivity facet was discarded resulting in a 26-item Portuguese version of the FFMQ and a 20-item German version of the FFMQ questionnaire. For the Polish (Radon 2014), Hindi (Mandal et al., 2016), and French versions (Heeren et al., 2011), the Observing facet was discarded yielding a four-factor solution. When utilizing the Dutch version of the FFMQ in a sample of pregnant women, Truijens et al. (2016) concluded that a three-factor solution without Observing and Describing and reducing the number of items to 12 was required.

The present study using the English-language version of the FFMQ in Delhi did not confirm the issues reported when the Hindi version was validated in another part of India (Varanasi). Mandal et al. (2016) reported that the Observing facet did not fit well within the overall factor structure of the FFMQ-Hindi, but in the present study, the five-factor model was adequate as long as three misfitting items of the Describing facet had been discarded. These three items may reflect specific features of mindfulness as expressed by the KIMS (Baer et al., 2004) items that were included in the FFMQ during its development. The KIMS was specifically designed to assess certain skills which are cultivated with the practice of mindfulness-based interventions such as dialectical-behavioral therapy (DBT; Hayes et al., 2004) involving Observing, Describing, and Nonjudging aspects of the approach. As the participants in the present study were not directly exposed to mindfulness-based interventions and practice, there could be possible variability in the comprehension and understanding of the item statements. Additionally, as argued by Rudkin et al. (2018), FFMQ Observing facet does not sufficiently address aspects of emotional awareness. The misfit for the three Describing items could also potentially be accounted for by differences in the style of communication adopted by the Indian communities and personality traits that are considered to be socially desirable. The Indian community has been categorized as generally adopting a high-context communication style (non-verbal, introvert, silent, and not believing in long talks) as described by Hall (1976) in his initial understanding of the communication style which was further reiterated by Lewis (2005). This reflects their response toward describing their feelings and emotions to others as assessed by the describe facet of FFMQ by the statements such as “I can easily put my beliefs, opinions and expectations into words,” “Even when I’m feeling terribly upset, I can find a way to put it into words,” and “My natural tendency is to put my experiences into words.” Nishimura et al. (2008) explained that the major drive to describe and communicate for the Indian communities is to preserve harmony and unity in relationships rather than prioritizing conveyance of accurate information. Medvedev et al. (2016) also argued that the statements of the Describing facet are ambiguous in the sense that they insufficiently differentiate between tendency and ability.

Haas and Akamatsu (2019) concluded that cultural differences are relevant in explaining the difference in the conceptual understanding of mindfulness in the East and the West. They also suggested that further studies should attempt to utilize and investigate the efficacy of Rasch analysis in the investigation of cross-cultural differences and generalizability. The present study provided some further steps toward this goal by utilizing Rasch analysis to demonstrate the suitability of the higher-order factor of mindfulness in India. The steady increase and popularity in utilizing the Rasch model has been illustrated in a systematic review (Leung et al., 2014) but only recently, the thoroughness in reporting the outcomes of Rasch studies has improved. Leung et al. (2014) suggested that the recommended way of reporting the Rasch results is to be able to present tables that permit conversion of ordinal-level data to interval-level scores, which will thus assist in increasing the precision of measurement scales. This recommendation has now recently been implemented in mindfulness research, such as for the KIMS (Medvedev et al., 2016), FFMQ (Medvedev et al., 2017a, 2017b), the Comprehensive Inventory of Mindfulness Experience (CHIME; Medvedev et al., 2019), and the Interpersonal Mindfulness in Parenting-Korean version (IM-P-K; Kim et al., 2019). In the present study, our primary rationale was to start an investigation of the generalizability of the higher-order construct of mindfulness in India. Given the diversity and size of the country, a more diverse sample from various parts of the country will need to be obtained before suitable conversion tables can be developed. The present study has demonstrated that the five-factor structure of the FFMQ appears tenable in an Indian sample, giving confidence that further studies in India will be able to confirm that. The advantage of the English-language version is that it may be applicable in a wide range of regions where, apart from local languages, English may occasionally be preferred over Hindi when completing questionnaires. However, more work will also need to be conducted to translate the FFMQ into other languages of India and to explore to what extent the five-factor structure can be replicated there.

Limitations

The following limitations need to be acknowledged: The study recruited a homogeneous sample within a limited age range, with no prior exposure and experience with meditation, and from a localized region of North India. Past literature showed the variability of model fit to be related to prior experience in meditation. Thus, further research should aim to recruit a more heterogeneous sample with a wider age range as India is a diverse country with varied cultural and geographical locations. India provides an interesting context for mindfulness, not only due to its historical links to mindfulness but also due to its cultural diversity. Using the English-language version of the FFMQ permits investigation of cross-cultural generalizability, but additional translations are necessary to explore the effect of linguistic variation.