Introduction

Rumination is a complex, multi-faceted construct, which has been studied from different theoretical perspectives (Smith & Alloy, 2009). Perhaps the most prolific of these is the response styles theory (RST), which defines rumination as “repetitively focusing on the fact that one is depressed; on one’s symptoms of depression; and on the causes, meanings, and consequences of depressive symptoms” (Nolen-Hoeksema, 1991, p. 569). As becomes evident from this definition, the RST conceptualizes rumination as essentially maladaptive and focused on depression (Lyubomirsky & Tkach, 2004). Certainly, even though rumination is present in a myriad of mental health problems, it is more strongly related to depression (Cernvall et al., 2016; Iqbal & Dar, 2015).

Rumination was originally proposed as a mediator of the association between gender and depression (Nolen-Hoeksema, 1987). Indeed, small but consistent differences have been found between men and women’s levels of rumination (Johnson & Whisman, 2013), and the mediation hypothesis has been successfully tested in some studies (Grant et al., 2004; Ricarte Trives et al., 2016). Consequently, rumination is part of state-of-the-art models of gender differences in depression (Hyde & Mezulis, 2020). Moreover, it seems to mediate the relationship between gender and other disorders such as eating pathology (Opwis et al., 2017).

The Ruminative Response Scale

Many measures of rumination have been proposed; however, the Ruminative Response Scale (RRS) is the one that has received the greatest attention (Luminet, 2004). It was originally part of the Response Styles Questionnaire (RSQ), a wider instrument that also assessed other three styles of reacting to negative mood (Nolen-Hoeksema & Morrow, 1991). As time went by, however, researchers started to focus on the 22 items of the RRS, which were eventually studied as an independent measure.

The 22-item RRS was used extensively to study rumination in relation to depression levels (Nolen-Hoeksema et al., 1994), the onset of depression (Just & Alloy, 1997), and the gender differences observed in this disorder (Nolen-Hoeksema et al., 1999). These results notwithstanding, some authors also raised concerns regarding the dimensionality of the scale. Specifically, they pointed out to the fact that many items of the 22-item RRS seemed to be confounded with depression itself (Cox et al., 2001; Roberts et al., 1998; Segerstrom et al., 2000). In response to these criticisms, Treynor et al., (2003) proposed a new version of the scale, which was composed of only 10 items and had two underlying dimensions: brooding and reflection. According to these authors, the latter—characterized by self-focus—was an adaptive aspect of rumination in the long term, since it was longitudinally associated with lower levels of depression. This short form of the RRS (hereafter simply referred to as “the RRS”) will be the focus of the present report.

Factor Structure of the RRS

Several studies of the RRS reported the use of principal component analysis (PCA) as a technique to perform exploratory factor analysis (Cowdrey et al., 2011; Dinis et al., 2011; Hernández-Martínez et al., 2016; Lee & Kim, 2014; Roelofs et al., 2006; Shin et al., 2015; Thanoi & Klainin-Yobas, 2015). Although this is a variable reduction technique, it considers both the common variance and the error variance in the analysis, while EFA only includes the common variance between the item and the rest of the items (Lloret-Segura et al., 2014). Thus, the use of PCA could also affect the results of the factorial structure.

Exploratory research, either with PCA or EFA, has repeatedly identified a two-factor structure of the RRS (Armey et al., 2009; Dinis et al., 2011; Hernández-Martínez et al., 2016; Lee & Kim, 2014; Shin et al., 2015; Thanoi & Klainin-Yobas, 2015; Zanon et al., 2018), even though most of these studies had to remove indicators in order to achieve a clear factorial solution. This suggests that the RRS is still subject to improvement by adding some new items and deleting others. For instance, it has been observed, both in exploratory and confirmatory analyses, that item 5 (Write down what you are thinking and analyze it) performs poorly, thus its elimination seems justified (Dinis et al., 2011; Parola et al., 2017; Xavier et al., 2016).

Many confirmatory studies found mediocre fit (both CFI < 0.90 and RMSEA > 0.08) of the proposed two-factor structure (Arana & Rice, 2017-Argentinian sample; Schoofs et al., 2010; Whisman et al., 2018; Xavier et al., 2016). Others had better fit but still far from the desired CFI > 0.95 and RMSEA < 0.06 (Arana & Rice, 2017-American sample; Hasegawa, 2013; Lucena-Santos et al., 2018; Ruiz et al., 2017; Thanoi & Klainin-Yobas, 2015). Finally, only a small set of results were close to the desired criteria of good fit (Lei et al., 2017; Parola et al., 2017). Due to the lack of clarity regarding validity evidence based on the factorial structure, the original model proposed by Treynor et al., (2003) still needs further assessment.

Measurement Invariance of the RRS

Rumination was proposed to help explain the gender differences in depression (Nolen-Hoeksema, 1987). It is, therefore, necessary that rumination measures function equivalently between genders, so that meaningful comparisons can be made (e.g., Johnson & Whisman, 2013). In factor analytical terms, such equivalence is named measurement invariance and implies three aspects: metric (equal loadings), scalar (equal intercepts or thresholds), and strict (equal residual variances) invariance (Bowen & Masa, 2015; Dimitrov, 2010), even though some authors suggest that metric and scalar invariance be tested together (Sass, 2011). Some authors found evidence of metric invariance of the RRS but did not test any further (Ruiz et al., 2017; Xavier et al., 2016). Others, on the other hand, found scalar (Whisman et al., 2018) and even strict invariance (Lei et al., 2017; Liang & Lee, 2019) of the RRS between genders.

Latent Correlation Between Brooding and Reflection

Not all studies reported the latent correlation between the two factors (i.e., brooding and reflection). However, those who did found large or very large correlations, ranging from 0.63 to 0.95 (Lucena-Santos et al., 2018; Ruiz et al., 2017; Thanoi & Klainin-Yobas, 2015; Xavier et al., 2016). This raises concerns as to whether the significant correlations of the RRS’s factors and other variables are due to their specific or shared variance. For instance, it has been observed that brooding has larger correlations with depression, even though the correlation of reflection and depression is significant too (Armey et al., 2009; Dinis et al., 2011; Hernández-Martínez et al., 2016; Lee & Kim, 2014; Schoofs et al., 2010; Shin et al., 2015). However, it remains unclear whether these correlations are due—at least partly—to an underlying general dimension. This issue can be addressed by modelling both specific (i.e., brooding and reflection) and general (i.e., global rumination) factors.

The RRS and Depression

Rumination as a construct was originally developed in depression research (Nolen-Hoeksema, 1987, 1991). Therefore, it is a valid question to ask whether brooding and reflection are equally related to depression. A first answer can be found in Treynor et al., (2003) work itself. These authors found that brooding was more associated with depression when measured cross-sectionally; however, when depression was measured 1 year later, and after adjusting for various confounders, reflection was the only significant predictor. The former result has been repeatedly replicated: In all but one of the studies reviewed by the authors, brooding had a stronger correlation to depression than reflection (Armey et al., 2009; Dinis et al., 2011; Lee & Kim, 2014; Schoofs et al., 2010; Shin et al., 2015; but Lei et al., 2017).

The Bifactor Approach

As reported above, large correlations are routinely observed between brooding and reflection. This suggests that the variance shared by the factors may be larger than each one’s separate variance (Hair et al., 2019). Furthermore, it is possible that an unmodelled general factor explains this shared variance; in other words, a global rumination factor seems plausible (Furr & Bacharach, 2014). On the other hand, as already mentioned, brooding and reflection are not equally related to relevant outcomes. Therefore, a case can be made that they should not be completely subsumed under a single global dimension.

The situation just described can be addressed taking a bifactor approach. It involves creating a model with both a general factor (i.e., rumination) and specific factors (i.e., brooding and reflection), all of which are orthogonal to each other (Markon, 2019; Reise, 2012). One advantage of modelling the RRS as bifactor is that each factor’s unique contribution can be evaluated while simultaneously controlling for the others (Chen et al., 2006). Thus, it is possible to examine whether the specific factors are still relevant when the general factor is included in the model (Reise et al., 2010; Rodriguez et al., 2016). Moreover, if a relevant outcome (e.g., depression) is added to the model, its associations with both the general and specific factors can be estimated (Chen et al., 2006).

In spite of the popularity of the RRS, few studies have analyzed its items with a bifactor methodology. For example, Topper et al., (2014) used only the brooding items of the RRS as part of a bifactor model that included worry as a second specific factor. These authors found that brooding, as a specific factor, did not predict depression after controlling for the global factor of repetitive negative thinking. On the other hand, McEvoy and Brans, (2013) did find that both brooding and reflection predict depression even after controlling for a general factor. Given the limited evidence, further research is needed to examine the RRS from a bifactor perspective.

Purpose of the Present Study

This study sought to evaluate the dimensionality of a modified version of the RRS by means of examination of its factor structure, measurement invariance, and association with depression. Thus, the following objectives were developed: (a) to examine the factor structure of the RRS with exploratory factor analysis (EFA); (b) to compare possible factor structures with confirmatory factor analysis (CFA); (c) to appraise the relevance of each factor of the bifactor model; (d) to assess evidence of measurement invariance of the RRS with regard to sex (as a proxy for gender); and (e) to estimate how well each dimension of rumination predicts depression.

Method

Sample 1

A total of 1096 Peruvian undergraduate university students from a public university located in Lima (Peru) completed the questionnaire. Prior to data collection, we decided that the sample size should be at least 1000, so that each institutional academic field be represented by approximately 200 students. No formal power analysis was conducted. Fifty-two participants were excluded due to missing data, leaving a study sample of n = 1044. This was 50.1% male and 49.3% female; six participants did not report their sex. The mean age of the participants was 21.02 (SD = 3.19; range: 18–57). Some of them (18.6%) had a paid job. Also, most were freshmen (29.5%) or sophomores (31.4%). The majority belonged to programs in the engineering (25.6%), economic-business (20.8%), and healthcare areas (19.5%). Sample 1 was randomly split into two equally sized subsamples (named 1a and 1b) for exploratory and confirmatory analyses.

Sample 2

The sample consisted of 441 Peruvian undergraduate university students (46.6% female) from the same university as sample 1. This was a convenience sample with no a priori sample size determination. Ten individuals were excluded, leaving 431 participants as the final sample. Their mean age was 21.40 (SD = 2.52; range: 18–43). Most of them were second- or third-year students (33.2% and 33.6%, respectively) and enrolled in programs in the economic-business area (43.40%). Participants were selected through convenience non-probability sampling.

Measures

RRS (Treynor et al., 2003)

The RRS is a self-administered rumination questionnaire. For this study, the Spanish version of the RRS proposed by Cova et al., (2007) was used. Given the reported problems with the psychometric properties of the RRS, these authors added four new items to the original 10-item RRS (Cova et al., 2009). One of these items, however, performed poorly in their study (F. Cova, personal communication, August 23, 2016). Therefore, we decided to replace this item with a different one, which was developed based on rational criteria by the first author. The final 4 additional items were as follows: I try to understand why I feel this way (Reflection, created by the first author), I think over and over again about the bad things that have happened to me (Brooding), I try to find out why I feel like this (Reflection), and I remember all the times when bad things happened to me (Brooding). Also, minor adjustments were made to some items after unstructured preliminary testing. The resulting 14 items are responded on a 4-point Likert-type scale (1 = almost never to 4 = almost always). As will be described in the “Results” section, the scale was later shortened to 12 items with good psychometric properties, the details of which will be presented throughout this article.

The Beck Depression Inventory–II (BDI–II; Beck et al., 2006)

The BDI–II is a self-report questionnaire measuring depressive symptoms in the past 2 weeks. In this study, the Argentinian version, validated by Brenlla and Rodríguez (2006), was employed. It consists of 21 items that are scored from 0 to 3, and its psychometric properties have been examined in Peruvian undergraduate samples before (e.g., Sánchez-Villena & Cedrón, 2019). Even though the dimensionality of the BDI-II has been under debate, the evidence suggests that it is an essentially unidimensional measure (Brouwer et al., 2013). For this study, we conducted a series of exploratory and confirmatory analyses to select a set of items that best represented depression as a global latent variable. These items (1, 3, 7, 10, 12, 13, 14, and 21) reflected a single dimension with acceptable reliability (ωNL = 0.76).

Procedure

Data from both samples were collected in paper-and-pencil format. Potential participants were approached in their classrooms after asking the instructor in charge for permission. A booklet that contained the RRS was handed. It also included other measures that were not part of the present study: the Acceptance and Action Questionnaire II, the Positive and Negative Affect Schedule, the Emotion Regulation Questionnaire, and the Escala de Bienestar Psicológico para Adultos. Different versions of the booklet were randomly handed, so that the instruments were presented in different order. Overall, participants took approximately 15 min to complete all the scales. The first page of the booklet described the aims of the study, and explicitly stated that participation was voluntary and anonymous. Only participants who consented to participate completed the questionnaires. They received no financial or academic reward for their participation. This study was part of a project approved by the Vice-Rectorate for Research of Universidad Nacional Mayor de San Marcos.

Data Analysis

Two EFAs were conducted on subsample 1a. First, two factors were extracted using the MINRES method with Oblimin rotation. Second, a bifactor EFA was conducted with the Direct Schmid-Leiman approach proposed by Waller, (2018), which, according to a recent simulation study, is a recommended method (Giordano & Waller, 2020). In both cases, polychoric correlations were used.

After two possible solutions (oblique and bifactor) were derived from the EFA, they were tested on subsample 1b with CFAs using a robust weighted least squares estimator (WLSMV). Model fit was considered acceptable based on the following criteria: CFI > 0.95, TLI > 0.95, RMSEA < 0.06 (Hu & Bentler, 1999), and WRMR < 1 (DiStefano et al., 2018). Additionally, the bifactor model was also examined with the additional indices suggested by Rodriguez et al., (2016). Specifically, (1) if omega hierarchical (ωH) of the general Rumination factor was large compared to omega hierarchical subscale (ωHS) of the specific factors (Brooding and Reflection), (2) construct reliability (H) was larger than 0.70 for the general but not for the specific factors, and (3) explained common variance (ECV) by the general factor was 0.60 or larger (Reise et al., 2013), then the scale could be judged to be essentially unidimensional.

Next, measurement invariance was tested via multigroup CFA using sample 1 as a whole. The configural (baseline) model was compared to the more restricted scalar model, in which both factor loadings and item thresholds were constrained to be equal. Following best practices for ordinal indicators, a separate metric model (with only factor loadings set to be equal across groups) was not tested (Chen et al., 2020; Sass, 2011). A significant reduction in the Δχ2 test would be considered evidence of noninvariance. Approximate fit indices (e.g., ΔCFI) were not used for evaluating invariance because they have shown to be inadequate when categorical indicators are used (Sass et al., 2014). However, they are reported for the interested reader.

Finally, a full structural equation model was tested in sample 2. A new latent variable (Depression) was added to the bifactor model, and regression paths from both Rumination (general factor) and Brooding (specific factor) were estimated. Finally, a regression path from Reflection to Depression was also tested in an exploratory way. Since measurement invariance between sexes could not be established in sample 1, these analyses were conducted on males and females separately.

All the statistical analyses were computed in R 4.0.3. The following packages were used: psych 2.0.8, fungible 1.96.3, and lavaan 0.6–7.

Results

Exploratory Factor Analysis

The oblique EFA showed a pattern of factor loadings very similar to the original structure of the RRS. The only exceptions were items 4 (a Reflection item which had a stronger loading on Brooding) and 5 (a Reflection item which had negligible loadings on both factors). As shown in Table 1, similar results were found for the bifactor EFA, where each item (except for the two aforementioned problematic items) had non-negligible factor loadings on both the general Rumination factor and one specific factor.

Table 1 Factor loadings of the exploratory and confirmatory factor analyses of the Ruminative Response Scale

Confirmatory Factor Analysis

The two competing models obtained from the previous analyses were tested in subsample 1b with CFA. The oblique two-factor model had acceptable fit according to the CFI and the TLI, but not according to the RMSEA and the WRMR, χ2(53) = 267.46, p < 0.001, CFI = 0.96, TLI = 0.95, RMSEA = 0.09, WRMR = 1.33. On the other hand, the bifactor model had better fit on most indices (except for the RMSEA), χ2(42) = 198.88, p < 0.001, CFI = 0.97, TLI = 0.95, RMSEA = 0.09, WRMR = 0.99. The factor loadings of both models are presented in the last columns of Table 1.

The bifactor model was further examined with additional indices. First, the omega hierarchical of the general Rumination factor (ωH = 0.75) was notably larger than the omega hierarchical subscale of both Brooding (ωHS = 0.17) and Reflection (ωHS = 0.41). On the other hand, construct reliability was strong for Rumination (0.88) and Reflection (0.70), but not for Brooding (0.55). Finally, explained common variance of the general factor was also large (ECV = 0.65). All in all, these results give partial support to the essential unidimensionality of the RRS. However, given that the specific factors do not seem to be purely residual, we decided to keep the bifactor model for the subsequent analyses.

Measurement Invariance

When tested separately on females (n = 515), the bifactor model had acceptable or marginally acceptable fit on most indices, χ2(42) = 191.32, p < 0.001, CFI = 0.97, TLI = 0.95, RMSEA = 0.08, WRMR = 1.02. Males (n = 523), on the other hand, showed results that were slightly better, χ2(42) = 135.94, p < 0.001, CFI = 0.98, TLI = 0.97, RMSEA = 0.07, WRMR = 0.80. A configural model in which both groups were included in a single CFA had similar acceptable but suboptimal fit, χ2(84) = 330.60, p < 0.001, CFI = 0.98, TLI = 0.96, RMSEA = 0.08, WRMR = 1.30. However, when scalar invariance was tested by constraining loadings and thresholds to be the same in both groups, model fit decreased significantly, Δχ2(45) = 100.64, p < 0.001, ΔCFI ≈ 0, ΔRMSEA ≈ 0.01. Even though we tried to achieve partial invariance by freeing several parameters, we were not able to do so. Consequently, measurement invariance between sexes was not supported.

Full Structural Equation Models

A new latent variable (Depression) was added to construct a full SEM. Two models were evaluated separately in men and women with sample 2 data. First, Depression was regressed on both Rumination and Brooding, but not on Reflection. Second, a model that also included a regression path from Reflection to Depression was also tested in an exploratory way. This second model did not significantly improve fit in the male sample (n = 228), Δχ2(1) = 0.05, p = 0.816. Thus, the final model for men was the one presented in Fig. 1, χ2(156) = 234.78, p < 0.001, CFI = 0.97, TLI = 0.96, RMSEA = 0.05, WRMR = 0.88.

Fig. 1
figure 1

Structural equation model of the associations between rumination and depression in men (n = 228). Note. The model was estimated with the WLSMV method. Except for the loadings of rrs1, rrs3, and rrs6 on Brooding, all the coefficients presented were statistically significant (p < .05)

When the second SEM was compared to the first one in the female sample (n = 198), model fit did show a significant improvement, Δχ2(1) = 5.15, p = 0.023. As shown in Fig. 2, the additional coefficient of this model indicates an inverse association between Reflection and Depression (β =  − 0.25). Model fit was acceptable for this SEM, χ2(155) = 211.02, p = 0.002, CFI = 0.98, TLI = 0.98, RMSEA = 0.04, WRMR = 0.78.

Fig. 2
figure 2

Structural equation model of the associations between rumination and depression in women (n = 198). Note. The model was estimated with the WLSMV method. Except for the loadings of rrs3 and rrs6 on Brooding, as well as the loadings of rrs2 and rrs10 on Reflection, all the coefficients presented were statistically significant (p < .05)

Discussion

This study aimed to examine the factor structure and measurement invariance of the RRS, as well as its relationship with depression. The bifactor approach allowed us to examine the role of brooding and reflection while controlling for the effect of global rumination. Our results suggest that this bifactor structure is adequate and has better fit than the original two-factor oblique solution. However, measurement invariance of the bifactor model could not be established with regard to sex, so the RRS may not be measuring the same constructs in male and female participants. Two structural equation models were evaluated in men and women separately. In both cases, brooding had the strongest association with depression, followed by general rumination. With regard to reflection, it had a negative and significant association on the women sample, while it was nonsignificant for men.

The fact that the bifactor model was the best-fitting one is in line with previous research that found large correlations between brooding and reflection, thus suggesting the existence of a strong general factor (i.e., rumination; Lucena-Santos et al., 2018; Ruiz et al., 2017; Thanoi & Klainin-Yobas, 2015; Xavier et al., 2016). This fact notwithstanding, it should be also acknowledged that bifactor models tend to have better fit per se, even in situations in which the data are not bifactor (Markon, 2019). Consequently, it is important to examine additional indices that evaluate the relevance of all the factors of such a model (Rodriguez et al., 2016). When this was done, we found partial evidence that the RRS may be a unidimensional measure. However, it was also observed that the specific factors (especially Reflection) did explain an important amount of variance even after controlling for global rumination. Moreover, since we intended to test the claim that brooding and reflection show different associations with mental health criteria (Treynor et al., 2003), we decided to keep the bifactor model for further analyses.

This bifactor model was not invariant in men and women. This finding is in contrast to previous studies that found measurement invariance of the RRS across genders (Lei et al., 2017; Liang & Lee, 2019; Ruiz et al., 2017; Whisman et al., 2018; Xavier et al., 2016). Two possible explanations arise. First, it should be acknowledged that our model (i.e., bifactor) is different from the model tested in previous studies (i.e., oblique). Our model has additional parameters, is more complex and, therefore, it is less likely that all of them should be invariant across groups. Moreover, measurement invariance of such complex bifactors models has still been scarcely studied (Khojasteh & Lo, 2015). Second, it should also be noted that most existing studies on the invariance of the RRS were conducted using the maximum likelihood estimator (Lei et al., 2017; Liang & Lee, 2019; Whisman et al., 2018; Xavier et al., 2016). However, it is debatable whether such an approach is appropriate for Likert indicators that only have four response options (Brown, 2015; Rhemtulla et al., 2012).

It is clear from our data that the lack of invariance between men and women is statistically significant. There are no clear guidelines, however, to judge the practical significance of this difference. Even though the change in approximate fit indices (e.g., ΔCFI) is routinely used for this purpose (Cheung & Rensvold, 2002), these guidelines may be inappropriate for nonlinear models with categorical indicators (Sass et al., 2014). Thus, the lack of empirical criteria to evaluate the practical importance of the observed invariance prevents us from making claims in this regard. In consequence, we decided to proceed in a conservative way and our full SEM was analyzed in men and women separately.

The first notable finding of the SEM was that brooding was a stronger predictor of depression than global rumination. Thus, contrary to our hypothesis, it seems that the distinction between brooding and reflection is not a methodological artifact. Instead, as Treynor et al., (2003) initially proposed, they are distinct constructs that remain important even after their shared variance is taken into account.

The second surprising finding of the SEM was that reflection had a significant negative association with depression in the female sample. This result was unexpected and should not be taken for granted before it is replicated with larger samples. To the best of our knowledge, it is the first time that such a result is reported from cross-sectional data. Even though Treynor et al., (2003) did find an inverse association between reflection and depression, this was only observed in longitudinal data. Indeed, when these variables were examined cross-sectionally, the vast majority of studies found a positive association between them (Armey et al., 2009; Dinis et al., 2011; Lee & Kim, 2014; Schoofs et al., 2010; Shin et al., 2015).

It is possible that the negative association between reflection and depression was not evident in previous studies due the influence of global rumination on simple composite scores. Indeed, one advantage of the bifactor approach is that each factor can be examined separately while controlling for the others (Chen et al., 2006). As an approximate test of this possibility, we also examined the correlation between the composite scores of reflection and depression on the same female sample. Similar to previous studies, this correlation was positive and small-to-moderate, r(199) = 0.20, p = 0.005, thus giving some support to our tentative hypothesis.

The fact that the negative association between reflection and depression was only significant for women eludes a simple interpretation. First, it should be borne in mind that the bifactor model of the RRS was not invariant between sexes. Therefore, it is possible that the latent variable named “reflection” is not the same in both groups (i.e., we are measuring a different construct in men and women) (Dimitrov, 2010). Second, if we assume that the lack of invariance in our data is of no practical relevance, more substantial explanations can be forwarded. The following commentaries, however, shall remain speculative until further research addresses this point in more detail.

Past research found that the association between positive reframing and depression is significantly stronger in women, but sex was not a significant moderator of the relation between self-blame and depression (Kelly et al., 2008). Indeed, it has been suggested that engaging in self-reflection does not influence self-clarity in men, contrary to what is observed in women (Csank & Conway, 2004). Also, people who score high on masculinity tend to have less complex representations of their emotional life (Conway, 2000). These findings seem to imply that culturally based gender differences exist regarding the way in which people experience and regulate their emotions (Nolen-Hoeksema, 2012). Moreover, these differences seem to develop throughout the lifespan beginning with early interactions with parents (Root & Denham, 2010). For instance, it has been observed that parents discuss more emotional content with their daughters than with their sons (Fivush et al., 2000). Therefore, it could be that women are better trained to reflect on their emotional experiences and thus regulate their distress levels.

Limitations

Some limitations of the present study need to be considered. First, it should be noted that the participants from both samples were undergraduate students from one single university. This limits the generalization of our findings and requires them to be replicated in different populations. Second, all the variables were measured with self-reports. Even though this is a common practice in the field, future studies would benefit from including other data collection methods. Third, we did not collect data on gender, but relied on self-reported biological sex instead. Fourth, the sample size of sample 2 did not allow us to conduct a new test of invariance for the full structural equation models. However, since invariance was not found for the measurement model in sample 1, it can be assumed that the structural model was not invariant either. Finally, it should be noted that the two SEMs conducted on males and females separately may be under-powered; thus, caution should be taken when interpreting nonsignificant results. These limitations notwithstanding, the present study provides a new approach and new exploratory findings that should encourage further research.

Conclusion

The present study achieved a clear factorial structure of the RRS after some slight modifications. However, the fact that the bifactor model was not invariant between sexes should be examined in more detail in future studies. Users of the RRS should be aware that it may not be measuring the same constructs in men and women. Finally, the unexpected finding that reflection predicts depression after controlling for brooding and global rumination must be replicated in other samples before substantial conclusions can be made.