Introduction

Obsessive-compulsive disorder (OCD) is characterized by a preoccupation with repeated obsessions that are unrelated to intention, as well as compulsions that are performed to relieve the discomfort and anxiety caused by the obsessive ideas. According to a survey from the World Health Organization, OCD is ranked as one of the 10 most disabling disorder (Michaud et al. 2006). The treatment methods for OCD, which have been verified to be effective, include pharmaceutical therapy with antidepressants, such as Selective Serotonin Reuptake Inhibitors, and Cognitive Behavioral Therapy, mainly exposure and response prevention that has been recognized with strong evidence on its effectiveness.

Typical Obsessive-compulsive (OC) symptoms include washing and checking; however, the severity of each of these dimensions differs among individuals. Compared to other psychiatric disorders, there have been greater advancements in the understanding of OCD, particularly on a physiological basis. For example, extensive neuroimaging studies have been actively conducted (Gilbert et al. 2008; Hirose et al. 2016; Koch et al. 2012; Yagi et al. 2016). Previous studies suggested that many areas in the brain were related to OCD; however, because symptoms vary extensively across patients, the possibility that different dimensions are related to different neural pathways has been proposed (Mataix-Cols et al. 2004). Evaluating and investigating OC symptoms based on different dimensions can help bring a greater understanding of the actual pathological condition of OCD.

The symptom checklist of the Yale-Brown Obsessive Compulsive Scale (Y-BOCS) (Goodman et al. 1989) and the Dimensional Yale-Brown Obsessive-Compulsive Scale (DY-BOCS) (Rosario-Campos et al. 2006) are available for the evaluation of OC symptoms. Also, the Obsessive- Compulsive Inventory (OCI) (Foa et al. 1998) is available for assessing the severity of the OC symptoms; however, it is different from the other scales. The OCI is a self-administered scale, and it can be used for both patients and healthy individuals, while the Y-BOCS and the DY-BOCS are interview-rating scales, developed for assessing OC symptoms among patients. In addition, compared to the Y-BOCS and the DY-BOCS, the OCI is expected to be used as a more comprehensive evaluation tool with the following advantages: it does not require interviewers to undergo special training, it can be used to assess obsessive thoughts and behaviors in the general population, and it allows for the measurement of the severity of each of the subscales of symptomatic dimensions that include Washing, Checking, Doubting, Ordering, Obsessing, Hoarding, and Neutralizing (Ishikawa et al. 2014). However, because the number of items included in each of the OCI subscales varies, it is difficult to identify which OC symptoms are the main symptoms. Therefore, Foa et al. (2002) developed the Obsessive-Compulsive Inventory-Revised (OCI-R) by performing revisions, such as simplifying the subscale scoring and reducing duplication across the subscales. Currently, the OCI-R is the most widely used self-administered scale for OCD studies worldwide, including neuroimaging studies (Gilbert et al. 2008; Hirose et al. 2016; Koch et al. 2012; Yagi et al. 2016) Although the reliability and validity of the Japanese version of the OCI have been confirmed (Ishikawa et al. 2014), there have been no studies regarding the reliability and validity of the Japanese version of the OCI-R. The OCI-R has 18 questions, is easy for therapists to use, and places a little burden on patients compared to the OCI. Therefore, an investigation of the reliability and validity of the Japanese version of the OCI-R is needed for promoting advanced biological and clinical research of OCD in Japan. Therefore, we investigated the factor structure, and the reliability and validity of the Japanese version of the OCI-R.

Methods

Participants

A questionnaire survey was administered to 254 university students enrolled at two universities in a metropolitan area. The survey was conducted from mid-October through late October, 2016. Excluding incomplete responses, a total of 214 responses from the participants (106 men and 108 women, average age = 19.95 ± 1.21 years) were used for analysis. Of these, 38 participants (18 men and 20 women, average age = 20.18 ± .98 years) were available for follow-up and provided responses to the Japanese version of the OCI-R two weeks after the initial survey. Participation was voluntary, and no payment or course credits were offered.

Materials

Demographic Data

Data on age, sex, and faculty were obtained. We asked for the last five digits of each participant’s mobile phone number to confirm the identification of participation to ensure the reliability of re-assessment.

Japanese Version of the OCI-R

The OCI-R, used for the assessment of the severity of OC symptoms, was developed by Foa et al. (2002) as a revised version of the OCI (Foa et al. 1998). There are 18 items, including “I check things more often than necessary” and “I wash my hands more often and longer than necessary.” It is made up of six subscales, including Washing, Checking, Ordering, Obsessing, Hoarding, and Neutralizing. We extracted 18 items from the Japanese version of the OCI (Ishikawa et al. 2014), which were same as the original OCI-R, and we used these 18 items as the Japanese version of the OCI-R. Translation procedure would be found in Ishikawa et al. (Ishikawa et al. 2014). Scores are calculated using a five-point scale [Not at all (0 point) to Extremely (4 points)].

Maudsley Obsessional-Compulsive Inventory (MOCI)

The MOCI (Hodgson and Rachman 1977) is a self-administered scale that measures OC symptoms. It consists of 30 yes/no questions. A total score and one for each of the four subscales (Checking, Cleaning, Slowness, and Doubting) are calculated. Yoshida et al. 1995verified the reliability and validity of the Japanese version. We used the MOCI to examine the convergent validity. On the basis of the previous study (Foa et al. 2002; Hajcak et al. 2004), we expected to find the strong positive correlation between the OCI-R and the MOCI.

Patient Health Questionnaire-9 (PHQ-9)

The Japanese version of the PHQ-9 is a simplified assessment tool related to major depressive disorder (Muramatsu et al. 2007). Responses were scored using a four-point scale [Not at all (0 point) to Nearly every day (3 points)], and a total score was calculated (0–27 points). We used the PHQ-9 to check concurrent validity. On the basis of the previous study (Foa et al. 2002; Ghassemzadeh et al. 2011; Hajcak et al. 2004), we expected to find moderate to strong positive correlation between the OCI-R and the PHQ-9.

Generalized Anxiety Disorder-7 (GAD-7)

The Japanese version of the GAD-7 is a simplified assessment tool for generalized anxiety disorder (GAD) (Muramatsu et al. 2010). Responses were scored as scored using a four-point scale [Not at all (0 point) to Nearly every day (3 points)], and a total score was calculated (0–21 points). We used the GAD-7 to check concurrent validity. On the basis of the previous study (Sica et al. 2009; Woo et al. 2010), we expected to find the moderate positive correlation between the OCI-R and the GAD-7.

Procedure

The questionnaires for the survey were filled out anonymously. After obtaining approval from the lecturers, the questionnaires were distributed during or after the lectures at universities and collected on the same day. Of these, 35 participants, who were confirmed to be the same individuals based on the last five digits of their mobile phone number, were followed up two weeks later for a second survey.

Statistical Analysis

SPSS (Ver. 22.0) and AMOS (Ver. 22.0) were used to perform statistical analyses. We performed confirmatory factor analysis to confirm the factor structure. We used the criteria recommended by Vandenberg and Lance (2000) (Root Mean Squared Error of Approximation [RMSEA] ≤ .08, Tucker-Levis index [TLI] ≥ .90, Standardized Root Mean Square Residual [SRMR] ≤ .15, and Comparative Fit Index [CFI] ≥ .90). For the reliability, internal consistency and test-retest reliability were investigated. For the validity, convergent validity and concurrent validity were investigated by calculating Pearson product-moment correlation coefficient. We used the Cohen’s criteria criteria to determine the degree of correlation, considering |r| ≥ .50 as strong correlation, |r| ≥ .30 as moderate correlation, and |r| ≥ .10 as weak correlation (Cohen 1988).

Results

Confirmatory Factor Analysis

A confirmatory factor analysis was performed using a hypothetical model with the same six factors as the original version and items corresponded to each factor. The results showed that fit index were χ2 (120) = 248.29 (p < .01), RMSEA = .07, TLI = .87, SRMR = .06, and CFI = .90. Based on the relevance criteria adopted by Vandenberg & Lance (Vandenberg and Lance 2000), the values we obtained were generally within a permissible range (Fig. 1). It can, therefore, be concluded that the same six factors are valid as in the original version.

Fig. 1
figure 1

Result of confirmatory factor analysis of the Japanese version of the OCI-R

Internal Consistency

Cronbach’s α coefficient was calculated to determine the internal consistency, and the overall internal consistency was high at α = .87. The results of each of the subscales were as follows: Washing, α = .77; Checking, α = .74; Ordering, α = .63; Obsessing, α = .83; Hoarding, α = .60; and Neutralizing, α = .49. Although some subscales such as Ordering, Obsessing, and Hoarding showed lower values than the values reported by Foa et al. (2002), the overall values for internal consistency were favorable.

Test-Retest Reliability

The interclass correlation coefficient (ICC [2, 1]) was calculated to exmine the test-retest reliablity. The results showed that the total ICC score was high at .86 (confidence interval [CI]: .75–.93). In the subscales, the ICC values for Checking, Obsessing, Washing, Ordering, Neutralizing, and Hoarding were .74 (CI: .55–.86), .86 (CI: .75–.93), .81 (CI: .66–.89), .89 (CI: .79–.94), .41 (CI: .11–.64), and .86 (CI: .74–.92), respectively. These values were generally considered favorable.

Convergent and Concurrent Validity

Regarding the convergent validity, the total score of the OCI-R showed a strong positive correlation with the total score of the MOCI (r = .67). In addition, as for the concurrent validity, the total score of the OCI-R was strongly positively correlated with the total score of the PHQ-9 and the total score of the GAD-7 (r = .53; r = .61). These results are shown in Table 1.

Table 1 Correlations between the OCI-R and other measures

Discussion

The objective of the present study was to investigate the factor structure, and the reliability and validity of the Japanese version of the OCI-R. The Japanese version of the OCI-R was determined to have the same six-factor structure as the original version, and adequate reliability was demonstrated with favorable values for internal consistency and test-retest reliability. In agreement with the previous studies (Foa et al. 2002; Fullana et al. 2005; Hajcak et al. 2004; Woo et al. 2010), the Cronbach’s α of the Neutralizing subscale was lower than that of the other subscales. Previous studies suggested that the modification of the Neutralizing subscale might be needed, because these three items in this subscale seem to be heterogeneous (Abramowitz and Deacon 2006; Foa et al. 2002; Fullana et al. 2005; Gönner et al. 2008; Hajcak et al. 2004; Woo et al. 2010). For example, item 4 (“I feel compelled to count while I am doing things”) and item 10 (“I feel I have to repeat certain numbers”) are asking about the counting coumpulsions or behaviors, while item 16 (“I feel that there are good and bad numbers”) is asking about superstitious thinking, which may be more common in non-clinical samples rather than counting compulsions or behaviors. In our study, average score of item 4 and item 10 were .47 (SD = .89) and .29 (SD = 72), while average score of item 16 was .79 (SD = 1.13). In fact, item 16 showed the lowest multiple correlation coefficients (R2 = .08), and Cronbach’s α of the Neutralizing subscale was improved after removing item 16 (α = .51). Therefore, replacing item 16 to more overt counting compulsions or behaviors might resolve the heterogeneity in this subscale, although it is important to note that low internal consistency of the Neutralizing subscale could be specific for non-clinical samples (Foa et al. 2002; Fullana et al. 2005).

In terms of convergent validity, correlation with the MOCI was examined. The result showed a strong positive correlation between the total scores of the Japanese version of the OCI-R and the MOCI. As the strong positive correlation was also observed between the total score of the Japanese version of the OCI and the MOCI (Ishikawa et al. 2014), it is believed that the revised version also have equivalent convergent validity. In terms of concurrent validity, the correlations between the total scores of the Japanese version of the OCI-R and the PHQ-9, and the Japanese version of the OCI-R and the GAD-7, were calculated. As a result, strong positive correlations were shown. Previous studies have also reported strong correlations with depressive symptoms (Foa et al. 2002; Hajcak et al. 2004), suggesting a strong association with depressive episodes, which are often observed in patients with OCD. The correlation between the Japanese version of the OCI-R and the GAD-7 was relatively high. As Gönner et al. (2008) mentioned, the high correlations between negative feeling and OC symptoms would be found in the sample of patients diagnosed with comorbid OCD and depression or anxiety disorders than in the samples of patients with OCD alone. In our study, the participants were non-clinical samples, and their average score of the Japanese version of the OCI-R was not high relative to OCD patients. Therefore, we could not find distinguishing characteristics regarding OC symptom, and the correlation between OC symptoms and anxiety might be relatively high. Despite the unexpected strong correlation between OC symptoms and anxiety, we confirmed that the Japanese version of the OCI-R has adequate validity.

One of the limitations of the present study was that the analyses were performed in healthy university students instead of patients with OCD. The participants’ age were between 18 and 25 years, and most of them were between 19 and 21 years. The psychometric properties of the Japanese version of the OCI-R need to be examined in various age samples in the future. In addition, further studies are needed to investigate differences in the profiles and continuity of obsessive symptoms between clinical and non-clinical groups using the Japanese version of the OCI-R. Establishing a cut-off value is expected to increase the utility of the Japanese version of the OCI-R in research and clinical settings in Japan. Finally, although this study showed important information about the cross-cultural validation of the OCI-R, our aim was not to compare the differences of OC symptoms among different countries. Further research comparing multiple cultural group could help to identify the cultural differences in OC symptoms.

Conclusions

The Japanese version of the OCI-R has the same factor structure as the original one, and it has a good reliability and an adequate validity. Our results showed that it is a reliable and valid instrument to measure OC symptoms among Japanese population.