Significance

The Perceived Maternal Parenting Self-efficacy (PMP S-E) questionnaire is based on the theory of Bandura and is identified as the scale with the most robust psychometric properties for use with parents of hospitalized populations. This study is the first to translate this instrument into Spanish with primiparous women of term babies that can be used for practice and research purposes in the area of maternal and child care; to readily measure maternal self-efficacy during hospital stay, at discharge and during postpartum follow-up to assess the effectiveness of healthcare interventions.

Introduction

Becoming a mother is a process during which development is determined by both environmental factors and the characteristics of parent and child (Mercer 2004). When women become mothers they frequently need to develop an independence and self-reliance (Leahy-Warren 2005) that are integral components of motherhood and are greatly relevant for primiparous women who may face many challenges. In order to enhance self-reliance it is important to understand the beliefs women have in their ability to successfully perform the tasks and activities related to their parenting self-efficacy (Bandura 1997).

Considerable evidence exists that links maternal self-efficacy to women’s psychological and child health outcomes. For example, research has illustrated that low maternal self-efficacy is linked to an increased risk of postpartum depression and maternal stress (Leahy-Warren and McCarthy 2011; Law et al. 2019), poorer maternal bonding (Gharaibeh and Hamlan 2012) and in some cases child neglect (Garay-Gordovil 2013). A recent systematic review by Albanese et al. (2019) suggests that parental self-efficacy may impact parent–child attachment too as a result of the level of support they feel they have from their partner (Casse et al. 2016), that it may mediate between internalized stigma and postpartum depression (Mickelson et al. 2017), and to child development such that low parental self-efficacy has been associated with increased child hospitalizations (Holland et al. 2011).

Furthermore, Wittkowski et al. (2017) have identified the most psychometrically robust scales measuring parental self-efficacy including those used with hospitalized/non-hospitalized populations. Whilst some of these measures are strong psychometrically, few have been translated and culturally adapted into Spanish. According to Bandura (2006), scales used to measure self-efficacy must be adapted to the specific domains of interest. A person can consider him/herself very effective in certain situations and very ineffective in others, depending on situational requirements. The Perceived Maternal Parenting Self-Efficacy (PMP S-E) questionnaire (Barnes and Adamson-Macedo 2007) is based on the theory of Bandura and has the most robust psychometric properties. The PMP S-E questionnaire was originally validated with mothers of hospitalized preterms during the neonatal period, but others have gone on to use it in the postnatal period and with primiparous women (Leahy-Warren et al. 2012; Gharaibeh and Hamlan 2012; Aliabadi et al. 2013). However, it has not been translated adapted for and validated with Spanish speaking. The objective of this study was to determine the validity and the reliability of the Perceived Maternal Parenting Self-efficacy tool translated into Spanish and adapted to be used among primiparous women of term babies.

Methods

Overview of Study Design

This was a scale development and re-validation study conducted in Colombia in 2018. The research was approved by the Bioethics Committee of the Maternal and Child Clinic San Luis of Colombia and has been carried out in accordance with the ethical standards established in the Helsinki Declaration of 1964 and its subsequent amendments. All participants were required to give their informed consent before participating.

Participants

The participants were recruited from outpatient clinics of a maternal and child care institution in Colombia. Women were included if they were primiparous, with a full term healthy baby aged ≤ 6 months postpartum. Ten women were included in the pilot study to test the linguistic and cultural adaptation. Ten women more were included in the face validity. The mean age of the twenty women was 24.85 years (SD ± 5.71), 6 (30%) reported low socioeconomic status, 8 (40%) medium and 6 (30%) high; 6 (30%) of women attended to elementary school and 14 (70%) to technical or university education. The mean age of the babies was 3.35 months (SD ± 1.60). In addition, twenty-seven experts participated in the first round and four experts participated in the second round, i.e., face and content validity, whom met adjusted Fehring criteria (Fehring 1994), and were selected through a nationwide call for participants. Sample size was based upon a criterion of ten participants per number of items in the questionnaire (i.e., 10 × 20 = 200) (Morales 2012), plus an additional 10 in case of study attrition. A final sample of 210 women took part in the construct validity and reliability testing. The mean age of 210 participating women was 24.39 (SD ± 5.66). Other sociodemographic characteristics are presented in Table 1. The mean maternal self-efficacy perceived among the participants was 69.43 (SD ± 9.24).

Table 1 Sociodemographic characteristics of the participants in construct validity and reliability (N = 210)

Instrument

The PMP S-E questionnaire (Barnes and Adamson-Macedo 2007) is a 20 item measure of parental self-efficacy and was originally adapted from two existing measures (Teti and Gelfand 1991; Zahr and Cole 1991) with similar objectives, based upon Bandura’s self-efficacy theory, a review of the literature, and the authors’ experience and expertise. The PMP S-E tool is domain-specific, self-report and requires approximately 10 min to complete. Items are distributed across 4 subscales: care taking procedures (e.g., changing a nappy, evoking behaviour(s) (e.g., making the baby calm), reading behaviour(s) or signaling (e.g., knowing baby is tired, and situational beliefs (e.g., that their baby responds well to them. The responses to each item are recorded on a Likert scale ranging from strongly disagree (score 1) to strongly agree (score 4). Scoring ranges from 20 to 80 where a higher score indicates a higher self-efficacy. The PMP S-E in its original version (Barnes and Adamson-Macedo 2007) demonstrated strong psychometric properties including Cronbach alpha of 0.91 for the full questionnaire. Other UK based studies report Cronbach alphas of 0.89 (Leahy-Warren et al. 2012; Shorey et al. 2015); in its Arabic version 0.94 (Gharaibeh and Hamlan 2012), 0.86 in Brazilian samples (Tristão et al. 2015), and 0.93 in Italy (Pedrini et al. 2019) for the full questionnaire.

Procedures and Analysis

Translation and Backtranslation of the PMP S-E Tool with Cultural Adaptation

The guidelines for translating or adapting tests by Muñiz et al. (2013) and (2015) was followed. The translation, back translation and cultural adaptation of the questionnaire is summarized in Fig. 1. Accordingly, two bilingual nurses and one certified translator, all worked separately, and translated the original version of the PMP S-E from English into Spanish. Once the three translations were completed, the review committee (composed of one certified translator, one philologist and three nursing experts in maternal and child care) compared the translations and reached a consensus. Agreement was reached based upon the translation by checking that meaning had not been lost and that comprehension was maintained for each item.

Fig. 1
figure 1

Translation, backtranslation and cultural adaptation of the PMP S-E

The Spanish version was then back translated by another certified translator and two further bilingual nurses, who were not acquainted with the English version. Once the three back translations were completed, the Review Committee reached a consensus on the initial translation. The back translated version was then sent to the original authors of the PMP S-E questionnaire to ensure parity across versions. Adjustments were made as requested and the review committee came to a final consensus.

Selection of Experts

The adjusted criteria of Fehring’s classification system were used (Fehring 1994), which consist in assigning weights in terms of points as follows: PhD (4 points), Master’s Degree (3 points), Specialization (2 points), paper published (1 point), teaching experience (1 point), professional experience (2 points), research (1 point). It was necessary a minimum of five points out of the total 14 points to be considered as an expert. Twenty-seven experts from 17 universities corresponding to 11 capital cities of Colombia participated in the first round. The expert group was made up of 7 PhD, 16 Master and 4 specialization degrees. Average years of teaching experience in maternal and child care was 12.00 (SD ± 7.89). Average years of professional experience in maternal and child care were 16.78 (SD ± 11.16). Average number of articles published on maternal and child care were 4.59 (SD ± 4.68). Average number of investigations conducted on maternal and child care were 5.04 (SD ± 5.15). And four experts participated in the second round. The highest education level was PhD: (1), Master’s Degree: (2), Specialization: (1). Average years of teaching experience in maternal and child care were 13 (SD ± 1.63). Average years of professional experience in maternal and child care were 15 (SD ± 0.82). Average number of articles published on maternal and child care were 4 (SD ± 0.82). Average number of investigations conducted on maternal and child care were 6 (SD ± 0.82).

Face Validity

The comprehension criterion was assessed in a group of primiparous women with the following rating: (1) I don't understand it, (2) I understand it a little, (3) I understand it. In addition to this criterion, the experts also assessed the clarity criterion with the following rating: (1) It is not clear, (2) It is not very clear, (3) It is clear. Finally, the experts also assessed the precision criterion with the following rating: (1) It is not precise, (2) It is not very precise and (3) It is precise. (Hernández Sampieri et al. 2014). The degree of comprehension, clarity and precision of the items was determined using percentages as follows: high: equal to or greater than 85%, medium: 80–84.9%, low: equal to or less than 79%.

Content Validity

The experts assessed the pertinence criterion with the following rating: (1) Not pertinent at all, (2) Not very pertinent, (3) Pertinent, (4) Highly pertinent, and the relevance criterion with the following rating: (1) Not relevant, (2) Not very relevant, (3) Relevant, (4) Highly relevant, with a ratio of values given by: 1 = 0.25, 2 = 0.50, 3 = 0.75, 4 = 1 (Polit et al. 2007).

The content validity index (CVI) (Polit and Beck 2004) was calculated for each expert with the following formula: number of items with a score of between 3–4/total number of items. The CVI of each item was determined for pertinence and relevance. The calculation was made using the formula: number of experts agreeing upon the relevance or pertinence value of each item/total number of experts. Upon completion, the above estimations were exported to an Excel Worksheet for average and percentage estimations. Finally, the results were imported into Stata v12 for analysis. The quantitative analysis of content validity was carried out taking into consideration that scores equal to or greater than 0.80 have high content validity (Polit and Beck 2004). Subsequently, the observations made during the first round of experts to face validity and content items were qualitatively analyzed as follows: in order to control information selection bias (i.e., the inclination to give priority to their observations), in the second round a second group of experts in maternal and child care distinct from the previous one reviewed the observations and consensually made the adjustments to the different items as required.

Construct Validity

This research used factor analysis, a multivariate analysis that attempts to explain the tendency of a set of variables called factors to group. It allows to establish which items or variables are grouped forming domains or factors within the measurement. Based on this information, the items can be removed or regrouped (Raykov and Marcoulides 2008). Factorial principal component analysis was initiated exploring the total correlations of the items using Pearson correlation coefficient, followed by Bartlett sphericity test, and calculating sample adequacy with Kayser–Meyer–Olkin (KMO) statistics, considering acceptable a coefficient greater than 0.65. Varimax orthogonal rotation was implemented. Factor extraction was performed taking into account the percentage of extracted variance, expecting a minimum of 90% cumulative percentage of variance explained. Stata v12.0 was used for these analyses. Continuous variables were presented with central tendency measures and categorical variables were reported with absolute and relative frequencies.

Reliability

Cronbach's alpha coefficient was used to calculate internal consistency estimates in the total sample and subscales. A coefficient of 1.00 indicates perfect reliability (Khoja et al. 2007).

Results

Cultural Adaptation of the Questionnaire of the Spanish Version of the PMP S-E

The following cultural adaptations were made: I believe that I have control over my baby by “I believe that I have control over my baby’s care”. I believe that my baby and I have a good interaction with each other by “I believe that my baby and I have a good interaction”. I am good at getting my babies attention by “I am good at getting my baby’s attention”.

Face Validity and Content Validity of the Spanish Version of the PMP S-E

Ten primiparous women participated in the face validity part of the process, who scored the questionnaire as follows: high comprehension (99%). The score of face validity given by experts was: medium comprehension (84.1%), medium clarity (83.9%) and medium precision (80%). Concerning relevance, the CVI was 0.96 (“highly relevant”) and concerning pertinence the CVI was 0.93 (“highly pertinent”). In the second round, other group of experts made the adjustments of the items by consensual agreement.

Construct Validity of the Spanish Version of the PMP S-E

Factor 1 accounted for 33% of the variance, factor 2 accounted for 22% of the variance, factor 3 accounted for 19% of the variance and factor 4 accounted for 17% of the variance, for a total of 91% of the variance accounted for by the four factors were identified. Sampling adequacy was estimated with Kaiser–Meyer–Olkin statistics (KMO = 0.97), and data adequacy for reduction was estimated with Bartlett sphericity test (X2 = 4972.01, p < 0.01) (Table 2).

Table 2 Varimax rotation method with Kaiser normalization of the Spanish version of the PMP S-E

Reliability

Internal Consistency Reliability of the Spanish Version of the PMP S-E

Cronbach’s alpha for the entire instrument was 0.98 (95% CI, 0.97–0.98). The values for each item and the values by subscales are shown in Table 3. The Spanish version of the PMP S-E is available in Online Appendix A.

Table 3 Varimax rotated factors, factor loading, item-test correlation and Cronbach’s alpha of subscales and if item is removed of the Spanish version of the PMP S-E

Discussion

The PMP S-E instrument has been translated from its original English version into Arabic, Persian, Portuguese, Kannada and Italian (Aliabadi et al. 2013; Gharaibeh and Hamlan 2012; Paul et al. 2018; Pedrini et al. 2019; Tristão et al. 2015). This study involved the translation and cultural adaptation of the instrument from English into Spanish, in adherence to the highest quality standards to ensure its adequacy to Spanish speakers; the second most widely spoken language from across the world. These linguistic and cultural modifications in some items have also occurred according to the specific needs of each country in the studies mentioned above.

Several studies (Aliabadi et al. 2013; Barnes and Adamson-Macedo 2007; Gharaibeh and Hamlan 2012; Leahy-Warren et al. 2012; Pedrini et al. 2019; Shorey et al. 2015), including this one, clearly evidence that the PMP S-E questionnaire is both clear and easily understandable, and has strong face validity. The Spanish version of the PMP S-E obtained high scores of the CVI with respect to pertinence and relevance when evaluated by leading experts from Colombia, consistent with the high content validity observed in the studies conducted in the UK, Jordan and Singapore (Barnes and Adamson-Macedo 2007; Gharaibeh and Hamlan 2012; Shorey et al. 2015).

Construct validity in this investigation also identified four factors, coinciding with all validity studies conducted in different countries (Barnes and Adamson-Macedo 2007; Pedrini et al. 2019). This supports the fact that the Spanish version of the PMP S-E provided by this work, in addition to measuring very well the maternal self-efficacy construct, also measures well differentiated aspects of self-efficacy. The study conducted in Italy did not single out one factor of situational beliefs but divided the reading of behaviors or signals into physical and emotional needs. This also occurred with the Spanish version of the PMP S-E; the items in the factor ‘situational beliefs’ were distributed among other factors and items grouped formed two factors then designated following the denomination used in Italy (Pedrini et al. 2019), as follows; ‘reading and managing emotional cues’ and ‘reading and managing bodily cues.’

In comparison to the study conducted by the authors (Barnes and Adamson-Macedo 2007), the first two factors in the present work (Evoking behaviour(s) and Care taking procedures) also exhibited the highest percentage of variance accounted for. In the study by Tristão et al. (2015), factor one obtained the highest percentage of variance accounted for. Therefore, these first two factors are more significant at accounting for the variability in perceived maternal self-efficacy. These studies involved both women of hospitalized preterm babies and women of term babies, and the time of application of the instrument ranged from the first days of postpartum to 6 months after childbirth. Despite these differences, the measurement of the perceived self-efficacy construct has been similar.

The analysis of the rotated components of the Spanish version of the PMP S-E in comparison to the original English version of the UK (Barnes and Adamson-Macedo 2007) exhibited different possibilities of distribution of the following items: 5. I can make my baby happy, 6. I believe that my baby responds well to me, 7. I believe that my baby and I have a good interaction, 8. I can make my baby calm when he/she has been crying, 13. I am good at understanding what my baby wants, 15. I am good at knowing what activities my baby does not enjoy, 16. I am good at keeping my baby occupied, 20. I can show affection to my baby. Although they also found a different distribution of some items, the investigators of the study conducted in Brazil (Tristão et al. 2015) decided to maintain the original structure, unlike Pedrini et al. (2019), who also found different distribution of some items on the factors and who did not use the factor of situational beliefs. The present study used the same designation used in Italy for these factors: the factor care-taking procedures refers to the mother’s perception of her ability to perform activities and tasks related to basic needs. The factor evoking behaviours refers to the mother’s perception of her ability to elicit certain changes in her baby’s behaviour. The factor reading and managing bodily cues refers to the mother’s perception of her ability to identify and understand the body cues of the baby.

The factor reading and managing emotional cues refers to the mother’s perception of her ability to identify and manage changes in her baby’s behaviour based on emotional cues (Pedrini et al. 2019). Nevertheless, the authors of the PMP S-E have recommended to take into account the total score as an overall indicator of the level of maternal self-efficacy and use the subscales to understand individual perceptions; such valuable information could provide support to further develop good quality intervention programmes.

As to reliability, the high value of the Cronbach’s alpha exhibited by the Spanish version of the PMP S-E both overall and in the subscales and the correlations of each item in the four factors where they were grouped, the alpha coefficients revealed high internal consistency. The present study confirmed the adequate fit of a model with related factors to our data. Both reasons, consistency with the literature and statistical support our decision to recommend this factorial structure, the modified model, in the Spanish version of the questionnaire. Also, possible explanations for the differences found in the factor structure, variance explained, and eigenvalues with other authors could be sample size, culture, demographics and clinical characteristics, among others.

Furthermore, as with several previous studies regarding reliability, we used the same number of items of the original version, namely, 20, unlike the study conducted in Italy (Pedrini et al. 2019) where a panel of experts decided to remove three items. In sum, the psychometric properties of the PMP S-E exhibit high validity and reliability in different languages, including the Spanish version used in this study.

Strengths and Limitations

The processes of linguistic and cultural adaptation and validity and reliability measurements followed rigorous methodology involving the participation of leading experts. This study had an adequate sample size for participating women, who self-completed the questionnaire in order to prevent the bias of giving socially desirable answers. A limitation of this study was the absence of test–retest reliability and that only the factor analysis was used for the construct validity, because there are also other methods for its calculation that could be explored in future studies. Other limitation of this study was that the children of most of the available participants were under one month of age avoiding a stratified sampling by age. However, in different studies the psychometric properties of the PMP S-E have been consistent throughout the first six months of the postpartum period. Since the questionnaire was translated into Spanish, the Spanish version of the PMP S-E is a valuable tool that can be used by Spanish speakers. In turn, it is recommended that each Spanish-speaking country performs its corresponding cultural adaptation.

Conclusion

The present study provides a Spanish translation, with cultural adaptation, and psychometric testing measurements of the original English version of the PMP S-E. This is the first instrument in Spanish for primiparous mothers of term babies; it can be used for practice and research purposes in the area of maternal and child care, given the robust capacity of the Spanish version of the PMP S-E to readily measure maternal self-efficacy.