Introduction

For the last two decades, engagement has been conceived as an important factor affecting students’ learning and academic success (Fredricks et al. 2004; Kuh 2003; Newmann et al. 1992) and refers to the observable and unobservable qualities of students’ involvement and participation in learning activities (Ryan and Patrick 2001). Many researchers have suggested that the qualities of students’ participation in learning activities could involve observable behaviors (Zumbrunn et al. 2014), internal cognitions (Greene et al. 2004), as well as personal emotions (Fredricks et al. 2004) in school contexts. These studies have supported a multifaceted conceptualization of engagement, and four dimensions of engagement have been identified, namely behavioral engagement, emotional engagement, cognitive engagement, and social engagement (Fredricks et al. 2016a, b). Although previous research has provided evidence for the effects of every single dimension or a set of dimensions on students’ learning, the understanding of the relationships among these dimensions of engagement with respect to learning outcomes is very limited. Additionally, engagement in subject areas has recently been receiving increasing attention (e.g., Wang et al. 2016). In science education, scientific inquiry has been considered as a valuable and inseparable part of science teaching and learning (National Research Council [NRC] 1996, 2000) but relatively little is known about the role of students’ engagement in their inquiry performances. Therefore, to construct a model of engagement in scientific inquiry, this study examined the interplay among the four dimensions of students’ engagement, inquiry-related curiosity, and inquiry abilities of students.

Why should inquiry-related curiosity be considered as a potential predictor of engagement? Engagement could be affected by motivation-based factors, such as belonging, self-efficacy (Zumbrunn et al. 2014), and intrinsic motivation (Froiland and Worrell 2016), and curiosity (Wu et al. 2018). Among these factors, curiosity has been valued and encouraged in science education because students’ scientific curiosity about how the world works or why a phenomenon happens may motivate them to conduct investigations and to find these answers through scientific inquiry (NRC 2000, 2012). Curiosity can be defined as a psychological disposition for the unknown, novelty, ambiguity, and uncertainty (Litman 2005; Mussel 2010) and motivate students to pursue novel information and challenging experiences (Sinha et al. 2017). Additionally, Luce and Hsi (2015) argued for “science-relevant curiosity” such as wonderment about casual mechanisms and inconsistent observations in science. Students’ curiosity about science phenomena and inquiry activities could “develop into a deeper, sustained, persistent pursuit of scientific reasoning and science learning” (p. 74) and then lead to higher engagement. For example, when students have wonderment about a phenomenon, a knowledge gap between what they already know and what they want to know may occur (Chin and Osborne 2008). Such a knowledge gap could motivate students to explore how and why the phenomenon happens and to demonstrate inquiry behaviors. Thus, inquiry-related curiosity was examined in this study because such a psychological state and disposition could facilitate students’ engagement in authentic and meaningful science learning.

On the other hand, developing students’ inquiry abilities has been emphasized as a key objective of science learning in many policy documents (NRC 2000) and research studies (Krajcik et al. 1998). These fundamental abilities enable students to coordinate science process skills and knowledge, to conduct meaningful scientific investigations, and to successfully participate in scientific inquiry. This study thus focuses on students’ abilities in four inquiry activities including asking scientific questions, planning experiments, analyzing data, and formulating scientific explanations. However, numerous empirical studies have indicated that students usually lack these fundamental abilities and encounter great difficulties in the process of inquiry (e.g., Bell 2002; Jeong et al. 2007). To understand whether and how qualities of students’ participation in inquiry activities could enhance their abilities, this study aims at exploring how the four dimensions of engagement (i.e., cognitive, behavioral, emotional, and social) are driven by inquiry-related curiosity and how they affect inquiry abilities. To accomplish this purpose, we employed structural equation modeling to analyze data collected from 605 11th graders, including their responses to items in an online questionnaire and their performances on a computer-based assessment of scientific inquiry abilities.

Theoretical and empirical underpinnings

Definition of engagement

Engagement has been viewed as an important construct in educational research because it could influence students’ learning processes and outcomes (Astin 1984; Engle and Conant 2002; Finn 1989). Although it is commonly accepted that engagement can be regarded as the qualities of students’ involvement and participation in learning activities (Ryan and Patrick 2001), different aspects of engagement have been introduced for various tasks, purposes, and contexts. For example, Astin (1984) advocated the notion of student involvement to investigate students’ energy and time investment in both physiological and psychological aspects. On the other hand, Engle and colleagues adopted the notion of disciplinary engagement to illustrate students’ involvement in the particular subject matter in classroom settings (Engle and Conant 2002; Forman et al. 2014). Furthermore, engagement can be used to describe and explain students’ behaviors and participation in schools. With a higher degree of school engagement, students would demonstrate more willingness to search for the strategies to perform better (Finn 1989) and show less anxiety and resistance to school activities (Archambault et al. 2009).

To highlight and address the different aspects of engagement, researchers have argued for a multi-dimensional definition of engagement (Archambault et al. 2009; Newmann et al. 1992). Four dimensions identified by previous research include cognitive engagement, behavioral engagement, emotional engagement, and social engagement. Cognitive engagement refers to students’ proficiency of tasks, use of strategies, or pursuit of achievement to satisfy the requirements of learning tasks (Greene et al. 2004). Behavioral engagement involves students’ attentiveness, persistence, and investment of time (Kuh 2009; Newmann et al. 1992). Emotional engagement refers to students’ perceptions, values or feelings about learning activities and environments (Fredricks et al. 2004). Social engagement refers to students’ interactions with their peers, teachers or staff members in schools (Järvelä et al. 2016). These four dimensions allow a comprehensive and analytical look at students’ participation in learning.

Relationships among the dimensions of engagement

As different dimensions of the same construct, behavioral, cognitive, emotional, and social engagement could interact with each other, investigations of the interplay between the dimensions could assist researchers and educators in understanding how to facilitate a certain dimension of engagement. Archambault et al. (2009) developed a questionnaire to assess the behavioral, cognitive, and emotional engagement of students, and conducted model comparisons to investigate the relationships among them. Although they found that “the three dimensions of engagement were highly covariant” (p. 665), especially the emotional and cognitive dimensions, the study did not illustrate how the dimensions of engagement affect each other. On the other hand, a longitudinal study done by Li and Lerner (2013) indicated that “behavioral and emotional engagement were related bidirectionally” (p. 20) and that cognitive engagement could be predicted by behavioral engagement. In addition, by comparing results from Grades 9, 10, and 11, Lin and Lerner found a significant pathway from the emotional engagement at Grade 10 to cognitive engagement at Grade 11. Gunuc and Kuzu (2015) reconfirmed the predictive link of behavioral engagement to cognitive engagement, and further revealed that emotional engagement could affect behavioral engagement in a technology-based learning environment. Moreover, in a collaborative learning environment, students’ emotions could influence their social and behavioral engagement in small groups (Linnenbrink-Garcia et al. 2011); for example, positive emotions were positively associated with positive group interactions (e.g., actively working to support group members’ engagement).

Taken together, the aforementioned studies suggest a hypothesis whereby emotional engagement could predict behavioral and social engagement, and may directly or indirectly influence cognitive engagement. These relationships constitute a central part of our hypothesized model (Fig. 1) and so far relatively little research has directed its attention to these relationships in disciplinary engagement. Furthermore, this study addresses the issue of how the dimensions of engagement affect students’ performances in scientific inquiry.

Fig. 1
figure 1

A hypothesized model of relationships among students’ inquiry-related curiosity, dimensions of engagement, and inquiry abilities

Engagement and student performances

Previous research has investigated how engagement could affect students’ learning, cognitive, and school performances. Among the four dimensions, cognitive engagement has been the most investigated and its positive association with learning achievement has been well documented (e.g., Greene et al. 2004). The explanation of the association is that cognitive engagement involves the use of learning strategies and cognitive processing, which could result in different levels of achievement, such as test scores (Greene et al. 2004) and literature achievement (Nystrand and Gamoran 1991). Other dimensions of engagement could also predict students’ school performances. Archambault et al. (2009) found that the overall engagement had a significant but negative predictive link to school dropout and that their further examination showed that only students’ behavioral engagement was significantly associated with school dropout. Additionally, in their mixed-methods research, Wu and Huang (2007) revealed that students’ emotional engagement in computer-based science learning had no impact on their achievement. Yet, the findings from Authors’ qualitative data analyses suggested interactions among students’ achievement levels, cognitive engagement, and behavioral engagement. For instance, the high-achieving group was more engaged in the cognitive and behavioral dimensions.

Although the association between engagement and learning performances has been confirmed by previous research, it is still unclear whether social engagement affects students’ learning performances and how the different dimensions of engagement mediate or influence the relationships between engagement and learning performances. Therefore, situated in the context of disciplinary engagement, this study takes the four dimensions of engagement into consideration and hypothesizes that they could positively affect students’ inquiry abilities directly and indirectly (Fig. 1).

Definition and framework of inquiry abilities

Inquiry has been viewed as an essential component of science learning. Scientific inquiry refers to multifaceted activities similar to what scientists do to construct understandings about the world, including questioning, experimenting, designing, planning, analyzing, interpreting, explaining, arguing, and communicating (Barrow 2006; Duschl et al. 2007; Krajcik et al. 1998; NRC 1996, 2000; Pedaste et al. 2015). Fundamental abilities that combine knowledge with skills are required to undertake these inquiry activities (NRC 2000). These abilities are defined as inquiry abilities in this study, and have been emphasized as key learning objectives of science education in policy documents internationally (e.g., NRC 2000; Ministry of Education [MOE] 2018).

To develop an assessment framework that involves the nature of inquiry and reflects the process of inquiry, we reviewed science education standards (NRC 1996, 2000, 2012), science curriculum guidelines (MOE 2018), and research studies (e.g., Wu and Huang 2007; Wu et al. 2015; Krajcik et al. 1998; Pedaste et al. 2015). These documents presented a number of frameworks for science inquiry. One of the most recent and influential frameworks was proposed by the National Generation Science Standards (NRC 2012), in which three spheres of activity for scientists were identified: Investigating (asking questions, making observations and doing experiments), Evaluating (arguing, critiquing, and analyzing), and Developing Explanations (developing theories and models and formulating hypotheses). Additionally, by reviewing 32 articles on inquiry phases and cycles, Pedaste et al. (2015) provided a synthesized framework that is constituted of five distinct general inquiry phases: Orientation, Conceptualization (questioning and hypothesis generation), Investigation (exploration, experimentation, and data interpretation and analysis), Conclusion, and Discussion (communication and reflection). The two frameworks reflect the multifaceted nature of inquiry and cover a variety of inquiry activities. Although they categorized some inquiry activities into different phases or spheres, four inquiry activities that were identified in both frameworks are asking questions, conducting investigations, analyzing data, and developing explanations and conclusions. These four major inquiry activities thus were selected as the backbone of our framework.

Our assessment framework of inquiry abilities focuses on four main abilities and related sub-abilities: (1) Questioning (sub-abilities: formulating questions, identifying questions, and making predictions); (2) Experimenting (sub-abilities: identifying controlled and manipulated variables, planning experimental procedures, and selecting appropriate measurements); (3) Analyzing (sub-abilities: identifying relevant data, and transforming data); and (4) Explaining (sub-abilities: making a claim, using evidence, reasoning from evidence to the claim, and offering and evaluating alternative explanations). We also developed and validated a computer-based assessment that integrated animations, videos, pictures, and simulations into the test and task design to measure students’ inquiry abilities (Wu et al. 2015; Kuo et al. 2015).

Inquiry abilities, engagement, and inquiry-related curiosity

Although inquiry abilities are foundational to science learning, previous research has indicated that students encounter difficulties developing the abilities. Regarding the questioning ability, for example, students may not be able to express ideas and questions clearly (Krajcik et al. 1998), and the questions raised by students could be too divergent to predict, and cannot be answered by experimental results (Apedoe 2008). When doing experiments, students may collect ineffective experimental data because they lack the ability to control variables (Krajcik et al. 1998). Additionally, some middle school students do not have the ability to appropriately organize and interpret data (Jeong et al. 2007). They may not understand the components of scientific explanations (e.g., claim, reasoning, and evidence) and may not be able to use data as evidence to support their claims (McNeill and Krajcik 2008).

To explore possible ways to foster students’ inquiry abilities, researchers have identified factors that may associate with the development of students’ inquiry abilities (Wu et al. 2018; Luce and Hsi 2015). By examining data collected from more than 2000 eighth and 11th graders, Wu et al. (2018) revealed a positive relationship between students’ engagement in school laboratory activities and their inquiry abilities. However, Wu et al. (2018) focused only on students’ behavioral engagement in laboratory activities and did not provide information about how other dimensions of engagement could influence inquiry abilities.

Furthermore, Wu et al. (2018) indicated that students’ scientific curiosity could predict their inquiry abilities. Curiosity can be defined as a psychological disposition for the unknown, novelty, ambiguity, and uncertainty (Litman 2005; Mussel 2010) and measured as “the threshold of desired uncertainty in an environment which leads to exploratory behavior” (Jirout and Klahr 2012, p. 150). Luce and Hsi (2015) argued for the importance of characterizing students’ curiosity expressions with respect to scientific inquiry, and the relationship between curiosity and engagement in inquiry activities has been suggested in previous studies (e.g., Callanan and Jipson 2001; Callanan and Oakes 1992; Engel 2011). One possible explanation is that students’ science curiosity could initiate them into questioning and exploration (Chin and Osborne 2008) and drive their engagement in inquiry activities such as explaining and experimenting; this, in turn, may facilitate the development of students’ inquiry abilities. Thus, our hypothesized model in Fig. 1 also includes the potential impact of inquiry-related curiosity on the dimensions of engagement and inquiry abilities.

Research questions and proposed model

The purpose of this study is to examine how the four dimensions of engagement (i.e., cognitive, behavioral, emotion, and asocial engagement) are driven by students’ inquiry-related curiosity, and how they affect their inquiry abilities. Four research questions guided this study.

  1. (1)

    What are the relationships among the four dimensions of engagement?

  2. (2)

    What are the effects of the dimensions of engagement on inquiry abilities?

  3. (3)

    Is students’ inquiry-related curiosity associated with the dimensions of engagement?

  4. (4)

    Do the dimensions of engagement mediate the association between students’ inquiry-related curiosity and their inquiry abilities?

On the basis of the theoretical and empirical underpinnings, we proposed the model in Fig. 1 and formulated hypotheses for the research questions as follows. (1) Emotional engagement predicts behavioral and social engagement, and directly or indirectly affects cognitive engagement (Archambault et al. 2009; Li and Lerner 2013; Linnenbrink-Garcia et al. 2011). (2) The four dimensions of engagement positively influence inquiry abilities in a direct or indirect manner (Wu and Hang 2007; Greene et al. 2004). (3) Inquiry-related curiosity has a positive impact on the four dimensions of engagement (Wu et al. 2018; Engel 2011). (4) The dimensions of engagement play a mediating role in the association between students’ inquiry-related curiosity and their inquiry abilities (Chin and Osborne 2008). By examining the relationships among high school students’ curiosity, engagement and inquiry abilities, this study could provide insight into how to foster students’ inquiry abilities by enhancing different dimensions of engagement and promoting students’ curiosity.

Methods

Sampling

To achieve the purpose of this study, we recruited 11th grade students from senior high schools in three northern cities of Taiwan to participate in this study. The sample size was determined based on the suggestion of Westland (2010). The appropriate sample size in structural equation modeling should consider both the ratio of indicators to latent variables and the minimum effect in SEM at a given significance and power. For the structural model with six latent variables and 40 indicators in this study (1 ability, 32 engagement, 5 curiosity, and 2 computer experience indicators), according to Westland (2010), the suggested sample size was 675 for power = 0.80, α = 0.05, and the medium effect size (R2) = 0.15.

Using a two-stage stratified cluster design, we first divided the 78 senior high schools of the three cities into eight strata based on their students’ percentile ranks in the Basic Competence Test for Junior High School Students. In the second stage, we sampled one school from each stratum. The class size at high schools in Northern Taiwan was usually between 40 and 45 so we invited two classes in each sampled school (a total of 16 classes) to achieve the suggested sample size of 675. Yet, some classes were smaller than we expected, and the 16 classes with the total of 647 students were recruited to participate in this study. Among the students, 42 offered no responses to the administered questionnaires; their data were therefore excluded, resulting in a sample of 605 11th graders.

Measures

Three instruments were used to collect data in this study: a computer-based assessment of scientific inquiry abilities (39 items), a 32-item questionnaire of engagement, and a questionnaire for collecting information of students’ inquiry-related curiosity, computer experience, and socioeconomic status. Table 1 shows the descriptions of indicators for the latent variables in this study.

Table 1 Descriptions of indicators for the latent variables

Inquiry abilities

A computer-based assessment was developed to evaluate high school students’ questioning, experimenting, analyzing, and explaining abilities (Kuo et al. 2015). The assessment used scenario-based tasks to address the four inquiry abilities and incorporated simulations, animations, and videos to engage students in meaningful inquiry situations. For example, in the Camera Task (see Fig. 2), a simulated camera was provided and allowed students to change the aperture range and shutter speed, to observe the brightness of the photo, and to investigate the relationships between variables. The item pool of the assessment contained 101 items in 26 tasks which were developed to examine students’ inquiry abilities across four science content areas (i.e., chemistry, physics, biology, and earth science).

Fig. 2
figure 2

The camera task in the computer-based assessment of scientific inquiry abilities

The validity and reliability of the assessment items were established by our previous studies (Wu et al. 2014; Kuo et al. 2015). In this study, we selected 39 items from the pool to assess students’ four inquiry abilities: 11 items for questioning, 10 for experimenting, 10 for analyzing, and 8 for explaining. Three professors of science education were invited to evaluate the items for content validity. The content validity among the three experts was excellent (kappa = 0.88–0.96) and indicated their agreement that these tasks were sufficiently relevant to inquiry abilities.

Moreover, scoring rubrics were developed to evaluate the open-ended items. These rubrics included the level codes to indicate the performance levels in our framework, the score codes for actual points, and the examples of anticipated responses (Table 2). Three experienced researchers who had joined our previous studies were invited to score the students’ responses. The scoring researchers were trained to establish interrater reliability. The process involved two runs of grading 60 students’ responses to all 39 items, and the interrater agreements in scoring students’ open-ended answers ranged from 81.2 to 95.0%. After the agreements were achieved, each rater rated the students’ responses by using the scoring rubrics. Student responses from the 39 items were scored and gathered. Unidimensional Rasch models were used for analysis to derive five sets of plausible values (PVs) as the indicator of students’ inquiry abilities (Wu and Adams 2007). By doing so, the students’ inquiry abilities were accurately estimated (Wu et al. 2016). The reliability of expected a posteriori estimation based on the plausible values (EAP/PV) was 0.88, which showed that the test could explain a high percentage of the variation in the students’ inquiry abilities.

Table 2 The scoring rubrics of making a claim for the camera task

Engagement in scientific inquiry

This study adapted items designed by Fredricks and her colleagues (Fredricks et al. 2016a, b; Wang et al. 2016) to measure the four dimensions of students’ engagement in scientific inquiry activities. In the questionnaire, the learning context was described as “science laboratory classes” because the students may not be familiar with the term “scientific inquiry” and the learning opportunities of scientific inquiry have presumably been offered in school laboratory classes (Hofstein and Lunetta 2004).

The participating students were asked to reflect on the degree of their behavioral, cognitive, emotional and social interactive states in science laboratory classes (see the questions in Table 1). Nine indicators were used to evaluate their cognitive engagement (CE) such as thinking about different ways to solve a problem. Eight indicators were to gather information about behavioral engagement (BE), including items about attention, on-task behaviors, and efforts. In the part of emotional engagement (EE), nine indicators included positive and negative emotions, interest, and perceptions of the value of the science laboratory. Six indicators evaluated their social engagement (SE) with peers such as understanding different perspectives, building on others’ ideas, and helping peers. Students rated each item from never (1) to always (4).

A pilot study was conducted to validate the questionnaire and a high school located in northern Taiwan (but outside of the three northern cities where the formal study was conducted) participated in the study. All of the 11th graders in the school were invited to answer the questionnaire, and data from 114 students were collected. The Cronbach’s alpha for the four dimensions of engagement ranged from 0.80 to 0.85, and the Cronbach’s alpha of all 32 items was 0.94. The reliability results suggested that these items provided a high quality of internal consistency to evaluate students’ engagement in laboratory classes. Moreover, the confirmatory factor analysis (CFA) showed the composite reliabilities (CR) of all items ranging between 0.83 to 0.85 (higher than 0.60), which demonstrated a sufficient internal consistency of the latent variables (Bagozzi and Yi 1988). The final version contained 32 items and had suitable reliability and validity to measure students’ inquiry-related engagement.

Inquiry-related curiosity

We developed five items to measure students’ inquiry-related curiosity (Table 1). Students were asked to rate the five statements on a 4-point scale, from never (1) to always (4). These items probed the degree to which the students were curious about the feasibility of their research questions, the rationale of the experiment design, and the possible interpretations and explanations of the experiment data. Our previous study validated these five items (Wu et al. 2018) and suitable reliability was obtained (α = 0.923). In this study, reliability was reexamined and the value of composite reliability was 0.93 (Table 3), indicating a high internal consistency for the measure.

Table 3 Parameters of the initial and re-specified measurement models

Computer experience

Because the questionnaires and assessment were administered on computers, students’ computer experience and familiarity with the computer interface could influence their performances (Scheuermann and Björnsson 2009). To focus on the relationships in our research questions, in this study, students’ computer experience was measured and later controlled in the data analysis. Two items were developed to assess the students’ computer experience. In the first item, students reported the frequency with which they used computer devices outside of school from never (1) to everyday (5). In the second item, students indicated whether web browsing, among three other computer activities (i.e., social networking, online gaming, and word processing), was the most frequent computer activity that they engaged in. Dummy coding was used for this item (engaged = 1; not engaged = 0).

Socioeconomic status

To determine the relationships between curiosity, engagement, and inquiry abilities, this study also measured and controlled students’ socioeconomic status because it has been identified as an influential factor affecting students’ academic performances (May 2006). Four items were used to assess socioeconomic status. Two items asked students to indicate their parents’ education levels from no education (0) to having graduate degrees (6). One item was to estimate their family income according to seven income ranges (coded as 1–7). The last item reported the availability of home possessions and family conditions including rooms owned by students, cars, video recorders, two or more televisions, two or more digital computing devices (i.e., desktop, laptop and tablet computers), and mobile phones owned by their families. The z-scores of the four items were calculated and summed up as a SES variable.

Data analysis

Structural equation modeling

We took a two-step modeling approach (Anderson and Gerbing 1988) to test our hypothesized model. In the first step, CFA was employed to examine construct validity and to investigate whether the hypothesized relationships between indicators and the associated latent variables in our measurement model were valid. The relationships among items of the same latent variable were examined by convergent validity, and the factor loading must be larger than 0.55 (Comrey and Lee 1992; Harrington 2008). Furthermore, the reliability was demonstrated by composite reliability coefficients with a satisfactory value greater than 0.60 (Fornell 1982). The minimum cut-off of average variance extracted (AVE) for each latent variable was 0.50 to indicate that a greater common variance is captured by a construct than the amount of variance due to measurement error (Bagozzi and Yi 1988; Fornell 1982).

In the second step, the latent relationships in our structural model were tested using structural equation modeling with the validated latent variables. To avoid the confounding effect of students’ computer experiences and socioeconomic status, the two variables were used as controlling variables in the models to control all of the examined effects above and beyond the effects of the two variables.

This study also used measures suggested by previous research to evaluate the goodness of fit of our structural model. For the \(\frac{{\chi}^{2}}{df}\) ratio, as indicated by Bollen (1989), “there is no consensus on what represents a ‘good’ fit, with recommendations ranging from ratios of 3, 2 or less […] to as high as 5” (p. 278). We took the recommendation from Kelloway (1998) and West et al. (2012) set 5 as the cut-off value. Other indications of good fit used in this study included a root mean square error of approximation (RMSEA) below 0.08, a standardized root mean square residual (SRMR) less than 0.08, and a comparative fit index (CFI) greater than 0.95 (Bentler 1990; Bowen and Guo 2011; Schreiber et al. 2006). These indices and suggested criteria were employed to test the fitness between the collected data and our structural model.

Weighting

In this study, when the two-stage stratified cluster sampling design was applied, the probabilities of individual students being sampled were unlikely to be equal because of the varied sizes of schools and classes. Such unequal probabilities may lead to under- or over-representation of the target population and to biased estimates from such a sample. Therefore, to better reflect the representativeness, sampling weights were calculated by using the inverse of individual students’ probability of participation within the population stratum where the students belonged. The house weights were then estimated by linearly transforming the sampling weights so that the sum of the house weights was equal to the sample size. Finally, the house weights were applied and appropriated for the analyses that were sensitive to sample size (Rutkowski et al. 2010).

Controlling variables

As mentioned previously, the variables of computer experience and socioeconomic status were measured and controlled in this study in order to accurately determine the relationships between curiosity, engagement, and inquiry abilities. We used the strategy suggested by Spector and Brannick (2011) to eliminate the effects caused by computer experience and socioeconomic status through controlling the covariance of these two variables in the structural model. That is, we connected the two control variables to all the latent variables in the model. We then explained and reported the relationships among curiosity, dimensions of engagement, and inquiry abilities after the impact of the two control variables was considered and extracted.

Results

Descriptive and Correlational Results

The descriptive statistics of the data showed that the values of Kolmogorov–Smirnov of all indicators were significant ("Appendix A"). This result showed that the indicators in this study were normally distributed (Justel et al. 1997). Thus, the maximum likelihood (ML) was used to estimate the parameters of the structural equations (Bollen 1989).

Additionally, correlations between the indicators were examined. In general, most of these indicators were significantly related to inquiry abilities and related to each other (see "Appendix B"). However, some correlations of the indicators measuring the same latent variables were lower than their correlations with indicators of other latent variables. This implied that some indicators of the same variable (e.g., CE_5, CE_7) might not converge. In addition, we examined the reliability of the 37 indicators of curiosity and engagement ("Appendix C"). The initial Cronbach’s α values ranged from 0.72 to 0.89. Although the values revealed that the reliability of our instrument was acceptable (Cronbach 1951), combining the reliability data with the correlational results, we removed nine items of engagement that indicated low convergence and low internal consistency (i.e., CE_5, CE_6, CE_7, BE_5, BE_6, EE_8, EE_9, SE_5, and SE_6). The Cronbach’s α values were recalculated and ranged between 0.80 and 0.89 after nine indicators were deleted, which suggested a high internal consistency of the variables ("Appendix C"). Five indicators for inquiry-related curiosity and 23 for engagement (a total of 28 indicators) were used for the subsequent modeling analysis.

Measurement modeling

CFA was employed to confirm the validity of the five latent variables (i.e., inquiry-related curiosity, cognitive engagement, behavioral engagement, emotional engagement, and social engagement) based on the reliability and coherence of indicators within each latent variable. The CFA results presented the information about the convergent and discriminant patterns of the indicators and their corresponding latent variables in our measurement model. Table 3 shows the standardized factor loadings and residual variances of the indicators for each latent variable in both the initial and re-specified measurement models.

In our initial model, the factor loadings of the 28 indicators were significant. However, according to the suggested criteria, only 22 indicators were considered as reliable enough because their loadings were greater than 0.55. Six indicators of engagement (BE_4, BE_7, BE_8, EE_5, EE_6, and EE_7) with lower loadings were excluded from further analysis. Thus, the re-specified loadings of indicators revealed a satisfactory value to their corresponding variables.

Additionally, the fit indices derived from the CFA confirmed that the re-specified measurement model had an overall adequate model-data fit. Although the chi-square test of the re-specified measurement model showed a significant discrepancy between the re-specified model and the data (χ2 = 820.38, df = 199, p < 0.001), the ratio of chi-square per degree of freedom (\(\frac{{\chi}^{2}}{df}\) = 4.12) below 5 suggested that the re-specified measurement model fitted well to the data (Kelloway 1998; Lin and Tsai 2008). Moreover, the other indices indicated that our measurement model had an overall good fit with the data (RMSEA = 0.072 < 0.080, SRMR = 0.049 < 0.08, and CFI = 0.98 > 0.95).

After the fit indices were considered, the composite reliability (CR) and average variance extracted (AVE) indexes were estimated to examine the convergence of the five multi-indicator latent variables (i.e., four dimensions of engagement and inquiry-related curiosity). All of the CRs ranged between 0.76 and 0.93 and exceeded 0.60 (Table 3) so the five variables demonstrated sufficient internal consistency (Bagozzi and Yi 1988). In the final specified model, while the AVEs of the other four latent variables ranged from 0.51 to 0.72, the AVE of cognitive engagement was lower than 0.50. Yet, Fornell and Larcker (1981) indicated that a slightly lower AVE could be acceptable if the composite reliability of the latent variable was larger than 0.6 because, according to their simulations, the convergent validity of the variable was still adequate. As can be seen in Table 3, although the AVE of cognitive engagement was 0.46, the composite reliability was 0.83, much higher than the cut-off value of 0.6. Thus, together the AVEs and composite reliabilities revealed that the convergent validity of the latent variables in this study was acceptable.

Furthermore, the square roots of all five AVEs were calculated to estimate the discriminant validity as presented in Table 4. All but one of the square roots were greater than the correlations between each of the two latent variables. The correlation between cognitive engagement and social engagement reached 0.80, which was higher than 0.68 (\(\sqrt{AVE}\) of CE) and 0.71 (\(\sqrt{AVE}\) of SE). This implies a low discriminant validity and suggests that cognitive and social engagement had a strong inter-correlation with each other. We further examined the Heterotrait-Monotrait (HTMT) ratio to ensure the discriminant validity of the two constructs (Henseler et al. 2015). The HTMT of cognitive and social engagement was 0.807 in this study, lower than the suggested threshold of 0.85. These results confirmed the discriminant validity of all constructs.

Table 4 Discriminant validity for the measurement model

Structural modeling

In the second phase of our analysis, the structural relationships proposed in Fig. 1 were examined and the fit indices were derived to support the fits between our structural model and the data. The ratio of chi-square per degree of freedom was less than 5.0 (χ2 = 1035.82, df = 304, p < 0.001; = \(\frac{{\chi}^{2}}{df}\)3.41) and other fit indices indicated that our structural model had an overall good fit with the data (RMSEA = 0.064 < 0.080, SRMR = 0.057 < 0.08, and CFI = 0.97 > 0.95). The estimated model in Fig. 3 was thus adequate to describe the structural relationships between inquiry-related curiosity, the four dimensions of engagement, and inquiry abilities.

Fig. 3
figure 3

The structural equation model of engagement in scientific inquiry. A bold line presents a significant association, a dotted arrow line presents an insignificant association, and the arrow presents the direction. The value presents the factor loading between the two variables. *p < .05. **p < .01. ***p < .001

Interplay among the dimensions of engagement

The first research question was to examine the relationships among the four dimensions of engagement based on our hypothesized model. While Table 5 shows that the total effects of the behavioral, social, and emotional engagement on cognitive engagement were all significant, Fig. 3 reveals that the three dimensions affected cognitive engagement through different paths. The results showed that behavioral and social engagement were directly associated with cognitive engagement (effect = 0.20, SE = 0.05, p < 0.001 for behavioral engagement, and effect = 0.57, SE = 0.08, p < 0.001 for social engagement). On the other hand, emotional engagement did not directly link to cognitive engagement (direct effect = 0.04, SE = 0.03, ns) but affected it through the other two dimensions of engagement (indirect effect(EE→BE→CE) = 0.12, SE = 0.03, p < 0.001 and indirect effect(EE→SE→CE) = 0.24, SE = 0.05, p < 0.001). The total indirect effects of emotional engagement on cognitive engagement was significant (total indirect effect = 0.36, SE = 0.05, p < 0.001). Furthermore, a comparison between the two indirect effects of emotional engagement revealed no significant difference between the two pathway effects (difference = 0.12, SE = 0.07, ns). This result indicated that behavioral and social engagement had similar power to mediate the effect between emotional and cognitive engagement. Overall, the results partially supported our hypothesis by showing that emotional engagement could predict behavioral and social engagement, but emotional engagement could only indirectly affect cognitive engagement.

Table 5 The standardized effects of behavioral, social, and emotional engagement on cognitive engagement

Effects of the four dimensions of engagement on inquiry abilities

To answer the second research question, the effects of the four dimensions of engagement on inquiry abilities were investigated. As shown in Table 6, our hypothesis about the positive influence of the four dimensions of engagement on inquiry abilities was not fully confirmed. Only the total effects of cognitive and emotional engagement were significant (total effect = 0.17, SE = 0.08, p < 0.05 for cognitive engagement and total effect = 0.14, SE = 0.04, p < 0.001 for emotional engagement). Among the four dimensions, cognitive and emotional engagement played a relatively important role in enhancing students’ inquiry abilities, and directly affected their inquiry abilities (effect(CE→ability) = 0.17, SE = 0.08, p < 0.05 and effect(EE→ability) = 0.17, SE = 0.08, p < 0.05). Both of the two variables had the same explanation of variance (R2 = 2.89%) to predict students’ inquiry abilities, separately. Yet, a close examination of the effects revealed that although the total effects of behavioral and social engagement on inquiry abilities were not significant, they indirectly affected students’ inquiry abilities, and their significant indirect influence on inquiry abilities was completely mediated by cognitive engagement (effect(BE→CE→ability) = 0.03, SE = 0.02, p < 0.05 and effect(SE→CE→ability) = 0.09, SE = 0.05, p < 0.05).

Table 6 The standardized effects on inquiry abilities

Inquiry-related curiosity affecting the dimensions of engagement

The third research question focused on the associations between students’ inquiry-related curiosity and the dimensions of engagement. Figure 3 and Table 7 illustrate that the total effects of curiosity on the four dimensions of engagement were all significant (total effect (Curiosity → BE) = 0.27, SE = 0.05, p < 0.001; total effect (Curiosity → SE) = 0.58, SE = 0.04, p < 0.001; total effect (Curiosity → EE) = 0.19, SE = 0.06, p < 0.001; total effect (Curiosity → CE) = 0.75, SE = 0.04, p < 0.001). The results supported our hypothesis that inquiry-related curiosity had a positive impact on the four dimensions of engagement. Especially, the total effect of curiosity on cognitive engagement was the strongest. The coefficient indicated that inquiry-related curiosity could explain the 56.25% covariance of cognitive engagement with the mediation of emotional, behavioral, and social engagement.

Table 7 The standardized effects of inquiry-related curiosity on the dimensions of engagement

Additionally, Table 7 shows that inquiry-related curiosity significantly affected all four dimensions of engagement directly and that its direct effect on social engagement was particularly strong. Moreover, curiosity was indirectly, significantly associated with cognitive, behavioral, and social engagement. As shown in Fig. 3, its indirect effects on behavioral and social engagement were through emotional engagement (effect(Curiosity→EE→BE) = 0.11, SE = 0.03, p < 0.01 and effect(Curiosity→EE→SE) = 0.08, SE = 0.02, p < 0.01). On the other hand, the indirect influence on cognitive engagement was through multiple pathways (Table 7). Among them, the strongest indirect effect was through social engagement (effect(Curiosity→SE→CE) = 0.29, SE = 0.04, p < 0.01). Together these results suggested that curiosity was an important factor in promoting engagement in science activities.

Direct and indirect effects of curiosity on students’ inquiry abilities

The fourth research question addressed the impact of curiosity on students’ inquiry abilities. The results showed the significant direct, indirect, and total effects of curiosity on inquiry abilities (direct effect = 0.16. SE = 0.05, p < 0.001; indirect effect = 0.06. SE = 0.02, p < 0.01; total effect = 0.22. SE = 0.02, p < 0.01, for details see in Table 6). Our hypothesis on the mediating role of engagement was not fully confirmed; as can be seen in Table 2, the results suggested that while some pathways mediated by engagement were significant, others were not. Among them, the pathways involving cognitive engagement were all significant except for one (Curi → EE → CE → Ability). Overall, Fig. 3 illustrates that the associations between curiosity and the four dimensions of engagement played a partially mediated role in the relationship between students’ curiosity and inquiry abilities.

Discussion and implications

Interplay among the dimensions of engagement

Previous studies have suggested inter-relationships among some dimensions of engagement (e.g., Gunuc and Kuzu 2015; Li and Lerner 2013) and this study extends current knowledge by demonstrating the internal dynamics of engagement in a disciplinary context. While this study echoes the findings from the previous study by confirming the predictive links from emotional engagement to behavioral and social engagement, our results do not support a direct effect of emotional engagement on cognitive engagement, but suggest an indirect influence through behavioral and social engagement. That is, during scientific inquiry, students’ positive perceptions and feelings about inquiry activities may not determine their use of cognitive strategies and pursuit of achievement; rather, students’ emotional engagement could trigger their attentiveness and on-task behaviors and enhance their collaboration with others, which in turn would increase their cognitive efforts invested in scientific inquiry.

These results related to the internal dynamics of engagement can provide useful information for teachers and designers of science learning environments to facilitate a certain dimension of engagement. Teachers could design interesting and meaningful science activities to enhance students’ perceptions and values of learning environments. Such high emotional engagement could lead to students’ attentiveness and investment of time, and may further support their cognitive engagement. Additionally, encouraging students to work in groups and increasing their interactions with peers could promote social engagement that may energize them to engage cognitively and to accomplish learning tasks.

Enhancing inquiry abilities through engagement

Consistent with the findings from previous research, this study supports the positive association between cognitive engagement and inquiry abilities. The significant direct effect of emotional engagement on inquiry abilities suggests that students’ positive feelings and perceptions of the inquiry activities in science laboratory classes may lead to better performances in asking questions, analyzing data, doing experiments, and making explanations during the scientific inquiry.

However, this result is not consistent with the finding of Wu and Hang (2007), which indicated that the level of emotional engagement did not affect students’ achievement. There may be several reasons for the different results of the two studies. First, to examine whether emotional engagement affected students’ learning achievement, Wu and Hang (2007) divided students into three groups based on their scores on the questionnaire of emotional engagement, and conducted a 2 (instructional approach) × 3 (emotional engagement level) two-way ANOVA test on the achievement post-test. That is, Wu and Hang (2007) categorized the continuous variable of emotional engagement to compare group differences; however, according to Maxwell and Delaney (1993) and Altman and Royston (2006), such a grouping method could cause a reduction in statistical power and lead to false statistical significance. This statistical weakness caused by grouping was resolved in this study, which may have allowed a more accurate estimation of the effect from emotional engagement. Additionally, the statistical methods (2-way ANOVA) used in Wu and Hang (2007) did not consider effects from other dimensions of engagement and curiosity. On the other hand, this study controlled possible confounding variables and used SEM to identify direct and indirect effects of latent variables. The statistical methods used in this study may be more appropriate to estimate the effects of engagement on learning. Thirdly, the learning contexts were different in the two studies. While the Wu and Hang (2007) study was set in technology-enhanced science learning environments, this study focused on science laboratory classes. The nature of engagement may be different in these two types of learning contexts, possibly leading to diverse effects. Finally, the two studies measured different learning outcomes. The learning outcome measured in this study was inquiry abilities, whereas the learning achievement in Wu and Hang (2007) was students’ conceptual knowledge. Different learning outcomes may have different associations to engagement. By employing rigorous statistical methods, this study shows an evident association between emotional engagement and inquiry abilities, and emotional engagement is significant in promoting science learning when students participate in inquiry-based, hands-on activities to develop skills or abilities. An important practical implication may be that an encouraging learning environment where students’ positive perceptions about science inquiry are facilitated may support students’ development of inquiry abilities.

Additionally, the question of whether social engagement affects students’ learning achievement has been relatively under-investigated in early research. This study indicates that although social engagement did not directly affect students’ inquiry abilities, its indirect effect was significant and completely mediated by cognitive engagement. These results support the benefits of collaborative learning. The results also imply that students’ interactions with peers could enhance their use of cognitive strategies and increase intellectual investment in inquiry activities, which may then promote better performances in inquiry.

Inquiry-related curiosity affecting engagement and inquiry abilities

Previous studies have indicated that curiosity can stimulate students to explore why and how a phenomenon occurs, and suggested that such a mechanism-searching process could facilitate their learning in science (Chin and Osborne 2008; Luce and Hsi 2015). This study advances current knowledge of how curiosity affects engagement and inquiry abilities by differentiating the pathways from students’ curiosity to the four dimensions of engagement and inquiry abilities. First, the results of this study underscore the critical role of inquiry-related curiosity in promoting students’ engagement in all four dimensions. The significant direct and indirect relationships imply that students’ curiosity about scientific inquiry could evoke students’ positive emotions, sustain their on-task behaviors, encourage interactions with peers, and initiate the use of cognitive strategies. Through different dimensions of engagement, students’ inquiry abilities could be facilitated. Additionally, the direct effect on inquiry abilities indicates that inquiry-related curiosity may activate the development of inquiry abilities because students with a higher level of curiosity could be more willing to formulate scientific questions, conduct experiments and generate scientific explanations.

If curiosity is such an important indicator in scientific inquiry, how can teachers and educators promote students’ curiosity in scientific inquiry? Increasing the frequency of students’ science laboratory classes and their learning opportunities in informal science-designed settings (e.g., science camps, museums, and environmental centers) may be beneficial (Wu et al. 2018). Furthermore, teachers could identify the topics or questions that intrigue students, and design inquiry tasks around those topics (Krajcik and Czerniak 2007). Allowing students to explore questions they are curious about could sustain their engagement in scientific inquiry and could subsequently promote the development of their inquiry abilities.

Limitations of the study

Although this study advances understanding of the relationships between curiosity, engagement, and inquiry abilities, the results of this study were subject to the following limitations. The first limitation is regarding our assessment framework. The inquiry abilities covered in this study were limited to four inquiry phases: asking scientific questions, planning experiments, analyzing data, and formulating scientific explanations. Other inquiry abilities, such as modeling, argumentation, and communication, were not examined in this study. In addition, this study viewed the inquiry abilities as a unidimensional construct, and unidimensional Rasch models were used to estimate students’ inquiry abilities. This study did not estimate plausible values for each of the four abilities because our previous study (Kuo et al. 2015) supported a unidimensional structure and showed that “the correlations among the four inquiry components were higher than the person separation reliabilities of the four components.” Future studies may thus consider including more inquiry abilities in their assessment frameworks and when more abilities are evaluated, the structure of the constructs may change and different inquiry abilities may be viewed as having multiple dimensions.

A second limitation is from our choice of factors that affect students’ inquiry abilities. Although the four dimensions of engagement significantly affected inquiry abilities directly or indirectly (Table 6), the effect sizes were small and the covariance of inquiry abilities that could be explained by engagement was low. The results suggest that other motivational and cognitive variables may be associated with inquiry abilities. For example, Nehring et al. (2015) used the hierarchical regression analysis to investigate how multiple student charateritics contribute to inquiry skills in chemistry. They found that the cognitive variables (e.g., conceptual knowledge in chemistry, intelligence, and perceived cognitive load) predicted 47% of the inquiry skills while the motivational variables (e.g., interest in scientific investigation and self-concept in chemistry) explained these skills up to 25%. Future research may draw upon different theoretical perspectives, take more variables into consideration, and establish a more comprehensive model to explain the development of inquiry abilities.

Thirdly, in the engagement questionnaire, instead of using the term “inquiry,” we described the learning context as a “science laboratory” (Table 1). The research literature in science education has made a clear distinction between science laboratory and inquiry (e.g., Hofstein and Lunetta 2004); however, previous research also revealed the variety of meanings associated with the term inquiry (e.g., Abd-El-Khalick et al. 2004). In Taiwan, the term inquiry was not formally introduced to teachers and students until the new science curriculum guidelines were launched in 2019. When this study was conducted, considering students’ unfamiliarity with “inquiry,” we decided to carefully describe the activities in the items and use common terms in science classes in Taiwan, such as experiment and laboratory, to refer to inquiry. This was a difficult research decision because while the choice of inappropriate terms may undermine the validity of the questionnaire, participants’ misinterpretations and lack of knowledge of the terms may reduce the quality of the data (Krosnick and Presser 2010). This limitation could be resolved in our future study as new inquiry-based science curricula are implemented and Taiwanese teachers and students become more familiar with science inquiry.

Conclusion

Researchers and educators in science education advocate a focus on scientific inquiry, and promoting students’ inquiry abilities has been identified as a key objective of science education. This study sheds light on how to foster students’ inquiry abilities by enhancing different dimensions of engagement and promoting students’ curiosity. This study constructed a model of engagement in scientific inquiry and explored how four dimensions of engagement (i.e., cognitive, behavioral, emotional, and social) were driven by inquiry-related curiosity and affected students’ inquiry abilities. The results showed that students’ curiosity was associated with their inquiry abilities, and such an association was partially mediated by the four dimensions of engagement in science laboratory classes. Moreover, among the four dimensions of engagement, only cognitive and emotional engagement had significant total effects on students’ inquiry abilities, and the influence of behavioral and social engagement on inquiry abilities was completely mediated by cognitive engagement. These results suggest a critical role played by emotional engagement, cognitive engagement, and curiosity in science inquiry. Therefore, to develop students’ inquiry abilities, teachers and educators should create an encouraging and collaborative learning environment where students could follow their curiosity, have positive perceptions of science inquiry, and be willing to make cognitive efforts in inquiry activities.