Introduction

Self-regulation in learning (SRL) is considered critical for 21st century success, both academically and later in life (Pintrich 2000; Zimmerman 2008). Overall, self-regulation processes are built on both metacognitive and motivational strategies in a learning context that helps students to consider not only what and how they learn, but also whether their gains attain their goals (Moos and Ringdal 2012; Zimmerman 2008).

Fortunately, research has indicated that despite either complexity, SRL skills can be taught while pinpointing the vital role of scaffolding programs in promoting self-regulatory processes among students (Mevarech and Kramarski 2014; Tzohar-Rozen and Kramarski 2014). However, it is important to pay attention to how and when scaffolding should be faded or removed, when learners have acquired the desired skills or concepts after a fixed period of scaffolding and are expected to become autonomous learners who can implement these skills without scaffolds (Pea 2004; Puntambekar and Hubscher 2005). However, the fading conditions under which these skills are effective after removing the scaffolds have not been sufficiently investigated. Particularly lacking is a comparison between immediate effect at the end of the program (short term) and delayed effect (long-term) as a lasting effect (Ge et al. 2012).

This study addressed these issues by examining whether fixed (i.e., continuous) metacognitive scaffolds, compared to unique fading scaffolds related to graduated reduction, facilitate SRL and sense making of mathematical problem solving among young students (age 9–10). We now elaborate on the theoretical background underpinning the research variables assessed at the end of the program and three-months later.

SRL: self-regulated learning

SRL involves proactive, constructive processes that identify the three main components of SRL, cognition, metacognition and motivation (Pintrich 2000; Zimmerman 2000, 2008). Researchers indicate the efficiency of SRL in the context in which the learning takes place (e.g., Pintrich 2000; Zimmerman 2000). For the purpose of the current study, the content of sense making of mathematical problem solving was selected.

Cognition and metacognition components

Cognition is the process involved in knowing, that refers to the use of simple strategies like memorization, information processing, and higher-level strategies such as problem-solving and critical thinking. Metacognition is a form of cognition, a second or higher-order thinking process for understanding the task and the solution strategy (Flavell 1979; Pintrich 2000; Schraw 1998; Zimmerman 2008). Metacognition contains three aspects: knowledge of cognition, judgments and regulation of cognition (Pintrich et al. 2000). Knowledge of cognition is comprised of declarative knowledge about strategy/task (“what”?), procedural knowledge used on various cognitive strategies (“how”?), and conditional knowledge (“when”? and “why”?) that is important for the flexible and adaptive use of various cognitive strategies (Pintrich 2002; Schraw 1998).

Compared to the metacognitive knowledge aspect that has a static nature, metacognitive judgments are more process-related and reflect metacognitive awareness/experiences in ongoing metacognitive activities that individuals may engage in as they monitor and perform a task (Pintrich et al. 2000). Retrospective confidence judgements (CJ), refer to learners’ confidence in their responses, which are mostly realized at the end of the learning process (“I’m sure I was right”; Mihalca et al. 2017; Roebers et al. 2014). Studies indicated a high tendency of overestimation in CJ among students (Labuhn et al. 2010). The current study focuses on bias of confidence judgments, that is, the difference between actual performance and the judged performance (henceforth calibration of CJ, Schraw 2009, see elaboration in the Method section). Previous research indicated that realistic CJs can be developed among younger children, aged between 9 and 10 years (e.g., Roebers et al. 2014).

Similar to metacognitive judgments, the regulation of cognition refers to a dynamic strategic processing activity (Pintrich et al. 2000) that helps control one’s thinking or learning (Nelson and Narens 1994) through three phases. In the planning (forethought) phase, learners set goals for their own planning of specific activities, to complete a specific task. Next, in the monitoring (performance) phase, learners use their goals to monitor the process and move it along to use goals as checkpoints for progress along tasks. Finally, in the reflection (evaluation) phase, learners use information gained from the completed task to improve performance on the next task, what worked, what did not work and why (Pintrich 2000; Zimmerman 2008). The knowledge of cognition belongs predominantly to domain-specific strategies, and regulation of cognition is domain-general for processing strategic activation and application via the three phases (Pintrich 2000; Schraw 1998; Veenman et al. 2006). Research literature indicates that SRL is not achieved naturally, so students should be motivated to engage in a particular task and to be strategic in that process that can influence different processes of self-regulation (Dignath et al. 2008; Meece 1994).

Motivation component

A general assumption shared by most models of self-regulated learning is that there is some type of goal, criterion, or standard against which to assess the operation of the learning process and which guides regulatory processes (Boekaerts and Corno 2005; Winne 1996; Zimmerman 2000). It is assumed that learners set standards or goals to strive for in their learning, monitor their progress toward these goals, and then adapt and regulate their cognition, motivation and behavior in order to reach these goals (Pintrich 2000). The Achievement Goals Orientation Theory which has emerged in recent years as one of the dominant theories of academic motivation linked directly to the SRL theory (Efklides 2011; Pintrich 2000; Wolters 2004; Zusho and Edwards 2011). The theory focuses on the reasons/goals students have for engaging or achieving in a learning situation (Ames 1992; Dweck 1986; Elliot 1997; Kaplan and Maehr 2002). This theory originally stressed two general orientations to achievement: mastery and performance goals (Ames, 1992; Dweck and Leggett 1988).

The present study adopted the basic distinction as suggested by Ames (1992) and Dweck (1986), namely, mastery and performance goal-orientations that function in an achievement situation. The primary difference between these two types of goal-orientations is whether learning is valued as an end in itself, or as a means to reach some external goals. In particular, students with mastery goals value learning for its own sake and prefer situations where they can expand new skills and gain new knowledge (Ames 1992). Mastery goals assist learners to improve their performance, advance their learning, and achieve deep level strategy use (Nolen 1988). On the other hand, students with performance-approach goals value ability and prefer situations where they can demonstrate their ability and compare it with other students (Ames 1992). Students with performance-avoidance goals aim to avoid exhibiting lack of ability, invest minimal effort and use surface strategies.

Many studies found that mastery-oriented students report higher levels of effective cognitive and metacognitive strategies (Pintrich 2000). Planning, monitoring and regulatory strategies, were associated with mastery goal-orientation (Al-Harthy and Was 2010; Chatzistamatiou et al. 2015; Vrugt and Oort 2008; Wolters 2004). However, the link between performance goals and SRL is less straightforward. Most previous studies examining the relation between goal-orientation and learning strategies focused on adolescent and college students (e.g. Weinstein et al. 2011; Wolters 2004). There is some evidence regarding the relations between goal-orientations and self-regulation strategies in elementary schools (e.g., Patrick et al. 2007), indicating that mastery goals were strongly related to self-regulation strategies used by fifth-grade students. Additionally, Seo and Kim (2001) found that in elementary school students, both mastery and performance-approach goals related positively to metacognitive strategies as well as to mathematics achievement. However, further research with a focus on younger students with SRL scaffolds (fixed vs. faded) is needed in order to understand better the above relations at earlier stages of SRL development in the context of mathematical problem solving.

Sense making of mathematical problem solving: short and long-term effects

Mathematical problem solving is known to be one of the most difficult topics for students to grasp (e.g., Mevarech and Kramarski 2014; Schoenfeld 1992; TIMSS 2011). This ability is defined as a sense making process that demands “understanding of a situation, context, or concept by connecting it with existing knowledge or previous experience” (NCTM 2009, p. 4). Studies have indicated that despite numerous efforts over the years to train students in sense making of mathematical problem solving (henceforth “sense making”), many learners of different ages still have difficulty in internalizing these processes for immediate use (short term effect), and in transferring them over time after long-term practice, thus indicating a lasting effect (Schoenfeld 1992; TIMSS 2011; Watts et al. 2016). These difficulties are explained by numerous researchers as resulting from students’ limited cognitive information-processing, due to a lack of brain working memory (WM) space for new knowledge acquisition, that increases students’ cognitive load and affects learners’ mental effort (Ariës et al. 2014; Baddeley 1992; Goldberg 2010; Sweller 1988).

Research findings show that decreasing cognitive load can be attained by increasing learners’ ability to self-regulate their knowledge and effort during matching between SRL and solution phases (Ariës et al. 2015; Goldberg 2010; Kramarski and Fridman 2014). This raises the need to investigate the desired SRL scaffolds effects and its fading conditions, in the short (immediate) and long terms (i.e., lasting effect).

SRL fixed versus faded scaffolds

The research literature contains diversified SRL scaffolding programs recommending explicit scaffolds and self-regulation strategy modeling in the mathematical context, which encourage autonomous learning, and then lead to improved SRL metacognition and goal orientation motivation (e.g., Adler et al. 2016) and mathematical achievements (e.g., Mevarech and Kramarski 2014).

Fixed scaffolds are a continuous “forced” method that allows practicing metacognitive scaffolds directed at the proper moment to each SRL phase, while solving various problems. The research generally supports effects of continuous scaffolds “as a catalyst” for eliciting self-regulation strategies and enhancing transfer performance in the context of mathematical problem solving (e.g., Kramarski et al. 2013; Hoffman and Spatariu 2008). Furthermore, whereas some studies found that metacognitive scaffolds reduce students WM overload in problem solving (e.g., Ariës et al. 2015; Kramarski and Fridman 2014), other researchers argue that a continuous repetition option in metacognitive scaffolds slows their autonomy in solving the problems alone without help, which may add to students’ cognitive WM overload (Kramarski and Fridman 2014; Kramarski et al. 2013; Bannert and Mengelkamp 2013; Mayer 2009). Therefore, studies suggest that an effective metacognitive scaffold program should offer students some fading (reduction) of the scaffolds (Pea 2004; Puntambekar and Hubscher 2005).

Fading scaffolds is an extremely important method aspect of scaffolding, relating to how the supports are faded over time (Pea 2004; Puntambekar and Hubscher 2005), but is often a component missing in intervention programs. Most fading approaches relate to fading support all at once at the end of the program, while an advanced model for fading offers graduated reduction of skills to enhance students’ autonomous activity in attaining the desired skills/aspects (e.g., Puntambekar and Hubscher 2005). The initial practice of the fading graduated reduction model contains detailed support (full aspects/skills) and then involves lessened support over time (Bulu and Pedersen 2010; McNeill et al. 2006). However, there are still open questions about what, when and how-to fade out (e.g., Jaakkola and Veermans 2018; Tawfik et al. 2018).

Studies on fixed versus faded scaffolds effects

Effects of the fixed and faded scaffolds have led to contradictory findings. For instance, Lee and Songer (2004) by intervention support for domain-specific scaffolding in technology enhanced science, compared fixed and graduated faded scaffolds (i.e., withdrawn along the curriculum) provided by three kinds of explanation (exemplars, questions, and sentence), among primary school students (fifth and sixth grade). They found that while learners in both groups exhibited a gain in content knowledge, the fixed group outperformed the faded group in writing scientific explanations from authentic data, which is a higher order thinking skill.

Bulu and Pedersen (2010) compared fixed and graduated faded scaffolds in the context of hypermedia science learning (sixth grade). Three types of scaffolds, examples, question prompts and sentence starters, were presented to the students in both groups (fixed vs. faded). Over the course of the program, scaffolds were withdrawn gradually from the faded group, one at a time, starting from the withdrawal of examples, then question prompts, and finally sentence starters. The study found that students with fixed scaffolds performed better than those with faded scaffolds, in terms of developing solutions and making justifications.

Jaakkola and Veermans (2018) investigated the effects of concreteness fading on learning and transfer across three grade levels (4–6) in elementary school science education, in comparison to learning with fixed (constant) concrete representations with a computer-based simulation environment. The study found a significant interaction between condition and grade level in relation to learning outcomes, suggesting that the outcomes generally improved as a function of grade level. It was found that learning with fixed concrete representations either took less time or resulted in better learning compared to concreteness fading. A surprising finding was that the concrete condition succeeded at least as well as the fading condition on transfer tasks.

Tawfik et al. (2018) compared the effect of fixed (sustained) versus graduated faded scaffolds with four kinds of prompts: Problem-representation, developing solutions, making justifications, and monitoring evaluation at the end of a computer environment program on undergraduate students’ argumentation in ill-structured problem-solving. They found that students in the fixed (sustained) scaffolding condition performed significantly better than the students in the faded condition in scientific argumentation.

In contrast, McNeill et al. (2006) when comparing fixed versus graduated faded scaffolds in supporting students’ construction of scientific explanations, found that students (seventh grade) who underwent graduated faded scaffolding gained better comprehension in scientific explanation (i.e., claim, evidence, and reasoning) in the post-test than students who were exposed to the fixed scaffolds. They concluded that fading written scaffolds better equipped students to write explanations when they were not provided with support.

Similarly, Kester and Kirschner (2009) investigated fixed versus graduated faded scaffolds in two types of support, conceptual (concept map) and strategic (flow chart) in a student-centered e-learning environment to help adult learners to reach hypertext navigation accuracy skills. It was hypothesized that fading support during practice helps learners navigate more accurately during practice and achieve a higher practice and test performance as compared to learners receiving full support or no support during practice. The study confirmed the beneficial effects of fading support on navigation accuracy, but not on effects of fading in similar practice tasks during the program and transfer items (unfamiliar test items) in a regulate written format (no navigation).

To summarize, findings regarding effects of fixed versus faded scaffolds are inconclusive. Studies among primary school students (fourth-seventh grade) have been conducted mainly in the science domain, focusing on the cognitive level of the strategy scaffold but not on the mathematics domain. These studies tested metacognitive scaffolds (fixed and faded) and their effect on both SRL components: metacognition and motivation (Pintrich 2000; Zimmerman 2000). To gain better insight into the effects of these two scaffold types s, this study investigates the development of SRL and problem solving as a consequence of a metacognitive prompt set serving as an instructional scaffold to foster SRL.

Prompts as an instructional scaffold for fostering SRL

Prompts are external stimuli questions or statements that indicate to learners when and how to engage in a productive processing engagement, with the objective of activating metacognition and motivational regulation processes (Bannert and Reimann 2012; Davis 2003; Panadero et al. 2012; Müller and Seufert 2018). Prompts are powerful instructional means to activate learners’ existing self-regulation skills, by providing the balance between necessary external support and desired internal regulation (Mevarech and Kramarski 1997, 2014; Koedinger and Aleven 2007). Based on SRL theoretical relations and the effects found, prompts should not only increase the specific self-regulatory processes that are prompted as metacognition, but should also affect learners’ perceptions of complementary components as motivation (Panadero et al. 2012; Pintrich 2000). Metacognitive prompts may lead learners to reflect on their skills and resources. As a consequence, learners might feel equipped and motivated for learning in an environment that requires learners’ self-regulation when problem solving. Additionally, prompts often lead to learning success, which reflects mastery experiences (Devoldr et al. 2012; Zheng 2016), providing a major source for perceptions of motivational achievement goals (Ames 1992).

Scaffolding prompts can differ in their delivery mode and may be embedded or non-embedded, fixed or adaptive, direct or indirect (Devoldr et al. 2012; Zheng 2016). Embedded scaffolds are integrated in the learning environment so that students are obliged to pay attention to them, while non-embedded scaffolds are initiated by students themselves (Narciss et al. 2007).

The current study addresses the issue of fostering SRL (metacognition and motivation) with IMPROVE metacognitive self-question prompts oriented to what, when, how and why to use or withdraw scaffolds, and the effect of fading versus fixed scaffolds (Pea, 2004; Puntambekar and Hubscher 2005) in a unique faded reduction model, by focusing on younger students (grade four).

IMPROVE metacognitive self-question prompts in fixed versus faded scaffolds

The IMPROVE self-questioning method for mathematical teaching is one of the known methods for cultivating the learner’s SRL phases to enhance mathematical problem solving. This model (Kramarski and Mevarech 2003; Mevarech and Kramarski 1997, 2014) aims at scaffolding key aspects of sense making in problem solving by using generic, self-directed question prompts: Comprehension, Strategy and Reflection oriented to the three SRL phases of planning, monitoring, and reflection (see Fig. 1).

Fig. 1
figure 1

The IMPROVE metacognitive prompts oriented to the SRL phases: planning, monitoring and reflection

IMPROVE metacognitive prompts explicitly directed to the SRL phases have the potential to promote any of the metacognitive processes (planning, monitoring, reflection), to affect goal-orientation motivation that reflects “… ways of approaching, engaging and representing to achievement situations” (Ames 1992, p. 261). Previous studies conducted mainly in secondary schools with IMPROVE prompts on a fixed continuous scaffolding period in mathematics, were successful in metacognition and in sense making of problem solving (Kramarski and Mevarech 2003; Kramarski et al. 2013; Mevarech and Kramarski 1997, 2014), and showed a decrease in students’ cognitive problem solving load (Kramarski and Fridman 2014).

However, these powerful effects were compared to control groups with no scaffolds (Mevarech and Kramarski 2014). Nevertheless, little research has been done to understand the mechanism relating to when and how to withdraw metacognitive scaffolds to encourage student’s autonomy to set motivational individual mastery goals, and to enhance sense making of mathematical problem solutions after fading the prompting support, particularly for long-term effects.

Study design and the goals

This study examines the relative effectiveness of IMPROVE metacognitive self-questioning prompts in facilitating young students’ SRL (metacognition, motivation), under the fixed versus faded (graduated reduction) scaffolding conditions and relating to sense making in a problem-solving context. The Fixed Group was exposed explicitly to the IMPROVE metacognitive prompts (comprehension, strategy, reflection) oriented to SRL phases (planning, monitoring, reflection) in a continuous scaffold (Fig. 1). The Faded Group was also exposed to the same prompts, phases and problems, but used them in accordance with the unique faded graduated reduction model (see elaboration in the Method section and Fig. 2).

Fig. 2
figure 2

Fixed versus Faded question prompts oriented to SRL phases embedded in a set of four sense making tasks

The study examines the following questions:

Does the use of IMPROVE metacognitive self-question prompts with different levels of scaffolds: Fixed versus Faded, with the graduated reduction model, have a short-term effect on the students at the end of the program, and after a fading period of the metacognitive scaffolds (three-months, long-term lasting effect) with regard to-

  1. a.

    SRL: metacognition, CJ and motivation?

  2. b.

    Sense making of problem solving?

Several theoreticians note that the graduated fading model provides optimal autonomous support to the learner in various domain content aspects and skills (Ge et al. 2012; Kester and Kirschner 2009; McNeill et al. 2006; Pea 2004; Puntambekar and Hubscher 2005). In this manner, it is expected that the Faded Group will assimilate the SRL phases more than the Fixed Group, thereby contributing to internalization of the metacognitive process, and sense making of mathematics, particularly in the long-term effect (Mevarech and Kramarski 2014; Kester and Kirschner 2009; Lee and Songer 2004; Schonfeld 1992). This is in line with research showing a greater impact on metacognitive effects in follow-up learning sessions than measured earlier in short term effects (Bannert et al. 2015).

Taking into account the theoretical relatedness between the SRL components (metacognition and motivation) we expect to find effects of the IMPROVE prompts also on students motivation (achievement goal orientation) in the Faded group compared to the Fixed group. Following the mixed effects research findings about the two scaffolding programs (fixed and faded), direct scaffold prompts on metacognition and not on motivation, and the young age of the participants (grade four; 9–10), we didn’t formulate an explicit hypothesis regarding the effects of the two scaffolding types on SRL self-reported measures across the three intervals (T1, T2, T3).

Method

Participants

The current quasi-experimental study included 134 students from 4 classes (grade 4, ages 9–10) who attended two schools selected randomly in the same city in northern Israel with mid-level socioeconomic status, that were similar in their achievements on mathematics standardized measures tested by the Ministry of Education. Two classes, both fixed or faded with all their students (~ 33 students per class) in each school were randomly assigned into one of the two study groups: The Fixed Group (68 students; 50% boys and 50% girls), and the Faded Group (66 students; 51,5% boys and 48.5% girls). Assigning the two classes to one study group was done in order to minimize the confounding variable of the study groups. The two study groups did not differ by gender, χ2 (1) = .03, p = .86; pretests as presented in the results section and teachers’ professional background are presented in the training section. In particular, the two groups are similar in their class heterogeneity (SD; Cohen’s d) on each of the measured variables. The study was approved by the Chief Scientist for Operation at the school.

Measures

Measures on SRL aspects (metacognition, calibration of CJ, motivation) and sense making of mathematics were administrated three times: Pre-intervention (T1), post-intervention (T2) for short-term (immediate) effect and after 3 months (T3) to assess long-term lasting effect.

Metacognition

The self-report Junior Metacognitive Awareness Inventory developed by Sperling et al. (2002) was administrated. The questionnaire contained 24 items, which produced two principal factors (Brown 1987): Knowledge of cognition (10 items, e.g., “I know when I understand something”) and regulation of cognition (14 items, e.g., “I find myself pausing regularly to check my understanding”) scored on a five-point Likert scale ranging from 1, “Never” to 5, “Always”. Cronbach’s Alpha reliability of the 24 items was α = .89, α = .76 for the knowledge of cognition and α = .81 for the regulation of cognition.

Calibration of CJ

A “self-estimation ruler” of calibration of metacognitive CJ was presented at the end of the students’ mathematical problem solution (see Appendix B). The ruler was integrated into the test at three-time points (T1, T2 and T3), with two problem-solving tasks in each test to examine the effects of the two interventions on participants’ CJ. Participants were asked to estimate their success in reaching the solution with the following instruction: “Indicate on the ruler the degree of your success in solving the problem”. The ruler was developed by Huff and Nietfeld (2009) and used a scale ranging from 0 (low estimation) to 100 (high estimation). The participants were given an explanation of the self-estimation role as part of metacognition self-awareness (what and why), and a demonstration of how to mark the judgments on the ruler. They practiced their CJ on a lesson before the beginning of the intervention, after solving a task and then compared their estimation with the correct solution. The analysis was conducted using a calculation calibration bias according to the third formula of Schraw (2009).

$$ {\text{Bias Index}} = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} (c_{i} - p_{i} ) $$

Calibration bias is the difference between the learner’s self-estimation (i.e.,\( c_{i} \) scores) in executing the task and the actual performance (i.e., \( p_{i} scores) \). Accordingly, a difference aspiring to zero indicates a high level of accuracy, while a difference distant from zero towards the positive or negative direction, indicates inaccuracy of metacognitive judgment ability. A positive difference indicates a higher CJ than the actual score (given by the teacher), whereas a negative difference indicates a low CJ, meaning the learner’s self-estimation was lower than the real score (Schraw 2009). Cronbach’s Alpha reliability relating to calculation of the calibration bias was α = .76.

Motivation

A self-report questionnaire based on achievement goals theory was administered at three time points (T1, T2 and T3). The questionnaire was developed by Midgley and colleagues (Midgley et al. 2000), including 19 statements relating to achievement goals, and used a five-point Likert scale ranging from 1, “Not true at all” to 5, “Very true.” The respondent ranked each statement according to how true it was for him or her when solving sense making problems. Statements were divided into three groups: six statements about mastery goals (for example, “It is important for me to understand clearly what I learn in class”), where an answer of 1 showed a low mastery-goals orientation and an answer of 5 indicated a high mastery-goals approach; six statements relating to performance-approach goals (for example, “It is important for me that students in my class think that I am good at tasks”), where 1 showed low performance-approach goals and 5 indicated high performance-approach goals; and seven statements relating to performance-avoidance goals (for example: “It is important for me not to look stupid at school”), where 1 indicated low performance-avoidance goals and 5 indicated high performance-avoidance goals, which is less effective for self-regulation processes. The measure of performance-avoidance goal is inverted; thus, higher scores indicate less avoidance. Cronbach’s Alpha reliability of the 19 items was α = .88, α = .80 for the mastery goals, α = .89 for the performance-approach goal, and α = .77 for the performance-avoidance goals.

Sense making of mathematics

Tasks were taken from the standardized Meitzav exam for the fourth grade, developed by the Israeli Ministry of Education (2005, Version A). At each time point (T1, T2 and T3) the test consisted of 7 tasks on the same level of complexity (routine and novel), adapted by changing the numbers in each time version (see Appendix B). The scores ranged from 0 to 100 where “100” was awarded for a correct solution with a correct solution path that indicates their thinking strategy; “50” for a partial solution without the solution path or a wrong solution path, and “0” for a wrong answer without a solution path or a wrong solution path. The tests and indicators for the task were approved by the supervisor of mathematics teaching at the Ministry of Education and were recommended for use with fourth graders. Cronbach’s Alpha reliability for the sense making test was α = .75.

Intervention program structure

Teachers’ background and instruction

The four female teachers who taught the participants each held a bachelor’s degree in mathematics education. They have also similar experience in professional development programs for mathematics teaching methods in primary schools to enhance students’ meaningful learning. Each teacher had more than 10 years of math teaching experience and was considered by the principal to be an expert teacher in the school according to official tests and surveys of student satisfaction.

Teachers were instructed in each school by the first author on their intervention program structure (6 h’; Kramarski 2017; Kramarski and Revach 2009). Instructions was supported with written protocols, scripts for presenting the theoretical SRL framework, rationale and techniques for using and discussing it in class (see Fidelity section below for details). Teachers were informed that they were participating in an experiment (12 lessons in 7 weekly sessions) in which a new program and materials were being designed for teaching mathematics based on theories about SRL, autonomous learning in the context of sense making in mathematics.

For both groups, the first part of the instruction focused explicitly on the rationale for SRL (metacognition, CJ, motivation) and its phases (Pintrich 2000; Zimmerman 2008), using the IMPROVE metacognitive self-question prompts (comprehension, strategy and reflection; Mevarech and Kramarski 1997, 2014). In the second part, teachers were exposed explicitly to the rationale, techniques and modeling of their program (fixed or faded). Teachers practiced the problem-solving process with a set of the 4 sense making tasks with the IMPROVE prompts along the SRL phases (fixed or faded) as presented below in Fig. 2 and discussed possible difficulties that students could face in implementing such a model related to their answers on the prompts, and on the sense making of the solution.

Finally, a short explanation was given regarding the testing measures (SRL, CJ, sense making), students’ folders, students’ navigation cards and how to summarize the program in the last meeting as presented on the time line of the program (Fig. 3).

Fig. 3
figure 3

Overview of the Fixed versus Faded groups’ study program and timeline

Fixed versus faded groups structure

Each intervention lasted for 7 weekly sessions, including 12 lessons (the first-introduction and last-summary session included one lesson and 5 sessions included double lessons). The duration of the program’s problem-solving structure is set in accordance with the scope of hours allocated to learning sense making of problem solving in the national curriculum for grade 4. The detailed structure of each session is presented on Fig. 2. Students in both groups in each session solved 4 mathematical sense making problems at the same level of complexity related to the official curriculum scaffold by the IMPROVE self-question prompts oriented to the SRL phases (planning, monitoring, reflection). The last session practice was based on higher complex sense making problems (see example type on Appendix B).

The Fixed Group was exposed in each problem to the full IMPROVE self-question prompts (comprehension, strategy, reflection; Fig. 1) oriented respectively to each SRL phase (planning, monitoring, reflection). Students were asked to answer in the prompts by writing and solving the problem (see Fig. 2; problem 1 for a full prompts solution process). The Faded Group was prompted according to the graduated fading model as follows (see elaboration in Fig. 2):

  • The first problem was presented similarly to the Fixed Group, with the three metacognitive prompts fully oriented respectively to each of the three phases. Students were asked first to reply to the planning and monitoring question prompts by writing, to solve the problem and finally, to reply to the reflection prompt.

  • The second problem was prompted with two questions oriented respectively to each of the two phases: planning and monitoring; students were asked to reply to the two question prompts by writing and by solving the problem. Finally, they named and wrote out the withdrawn question prompt for reflection and replied to it.

  • The third problem was prompted with one question oriented to the planning phase, and students were asked to name and write out the two withdrawn question prompts for monitoring and for reflection. Then, to reply to the question prompts by writing and to solve the problem in the same order as explained in the previous problems.

  • The fourth problem was not prompted, but was solved autonomously by students, while naming and writing out the withdrawn question prompts oriented to the three phases, replying to the question prompts and solving the problem in the same order as they had done previously.

Students then took turns to read their answers aloud to the self-questions prompts and problem solution after solving it. The teacher led a class discussion about these answers and wrote examples on the board. Each student collected his work pages in a personal folder. The teacher kept the folders in the classroom during the intervention and presented them to the researcher in his program fidelity visit. In order to help the students in both groups to assimilate the metacognitive questions and phases, they were given a navigation card as presented in Fig. 1, with all of the self-questions to use during the program. The same duration time for solving the problem was given in both groups.

Fidelity of the program

During the course of the intervention, the first author observed each second lesson in both groups. In each observation, the researcher rated the extent (1-low to 4-high) to which the instructor implemented the intervention precisely according to the program’s scripts, tasks, personal folders; procedures, instruction, metacognitive prompts, and discussion time allocation, according to the goals of the study group. Overall, implementation fidelity was high (for the total observations: M = 3.93, SD = 0.29, range: 3–4). Deviations were minor, infrequent, and easily corrected (e.g., only on two occasions was time allocation rated less than 4 for discussion time allocation—rated 3 once for one Fixed Group teacher and once for one Faded Group teacher). At the end of each lesson, the researcher gave the teacher feedback regarding the observed criteria.

Results

A preliminary MANOVA conducted on the pretest measures indicated no differences between the groups on SRL aspects (metacognition, CJ, motivation) and sense making of problem solving. Thus, mixed ANOVA with repeated measures analyses on time and on group were conducted on the post test. In each analysis, the group (Faded and Fixed) was treated as the between-subjects variable and the time (T1, T2, T3) as the within-subjects variable. Mauchly’s sphericity test was conducted to test the assumption regarding the condition where the variances of the differences between all possible pairs of within-subject conditions (i.e., levels of the independent variable) are equal. The test result was significant, and we reported the adjustment results from the Greenhouse–Geisser test. The design was balanced so that the number of observations in each cell was equal, thereby avoiding any violation of the homogeneity assumption.

Metacognition

To examine scores in the metacognition scales (Schraw and Dennison 1994): knowledge of cognition and regulation of cognition, two 2 × 3 repeated measures analyses of variance (mixed ANOVA) were conducted.

Knowledge of cognition

The two way interaction of group x time was significant, F(2, 131) = 4.26, p < .05, η 2p  = .06 (Table 1). Simple effect analyses comparing the three time points for each study group separately indicated significant differences between the three time points in both groups, [F(2, 64) = 4.23, p < .05, η 2p  = .12 for the Faded Group and F(2, 66) = 5.12, p < .01, η 2p  = .13 for the Fixed Group]. Bonferroni analyses indicated that while higher scores were found in T2 and T3 compared to T1 (d = 0.30; d = 0.34 respectively) in the Faded Group (p < .01), with no significant differences in scores between T2 and T3 (p = .86), a significant decrease in scores was found in T3 compared to T2 (d = − 0.37) in the Fixed Group (p > .05), but no significant differences in scores were found between T1 and T2 (p = .13) or between T1 and T3 (p = .99). Comparing the two study groups at each time point reveals that while no significant differences were found between the two groups in T1 (p = .74) and T2 (p = .98), higher scores in the Faded Group, compared to the Fixed Group, were found in T3 (p < .05, d = 0.43) (Fig. 4).

Table 1 Means (and SD) and F-score of the metacognition and sense making variables by groups: fixed (n = 68) and faded (n = 66) and three-time points
Fig. 4
figure 4

Metacognition scores of two study groups for three-time points

Regulation of cognition

Data in Table 1 show slight differences between the three-time measures in each group. Both groups increased their self-report regulation of cognition ability at the end of the program (T2; d = 0.18). Whereas the Fixed Group decreased that measure on lasting effect (T3; d = − 0.18), the Faded Group maintained the same achievement (d = 0.06). However, repeated measures analyses of variance (mixed ANOVA) indicated that the two way interaction of group x time was also not significant, F(2, 131) = 1.29, p = .28, η 2p  = .02 (Table 1).

Confidence judgment

To examine scores in the calibration of confidence judgment, a 2x3 repeated measures analysis of variance (mixed ANOVA) was conducted. A main effect of time was found, F(2, 131) = 50.03, p < .001, η 2p  = .43, indicating differences in T3 compared to T1 (p < .01) and T2 (p < .001) (d = 0.30 and d = 0.87, respectively), and in T2 compared to T1 and T3 (ps < .001) (d = − 0.47, d = − 0.87, respectively). In continuous to over confidence judgments’ bias on T1, both groups (Fixed & Faded) further increased on T2 their over confidence judgments’ bias of their solution success on the short—term effect (end of the program), whereas on the long-term effect in T3 (3-months later) their confidence judgment was more realistic. In T3, the Fixed Group had a higher index bias score (M = − 4.49) compared to the Faded Group (M = 2.99) indicated greater accuracy in self-estimation. Finally, the main effect of group, F(1, 132) = 1.60, p = .21, η 2p  = .01, and the two way interaction of group × time, F(2, 131) = .43, p = .65, η 2p  = .01 were not significant (Table 1).

Motivation

To examine scores on the motivation scales, three 2 × 3 repeated measures analyses of variance (mixed ANOVA) were conducted for the three motivational tested sub-goals: mastery goal, performance-approach goal and performance-avoidance goal. In each analysis, group (Faded and Fixed) was treated as the between-subjects variable, and time (T1, T2, T3) as the within-subjects variable.

Mastery goal

The two way interaction of group x time was significant, F(2, 131) = 4.95, p < .01, η 2p  = .07 (Table 2). Simple effect analyses comparing the three-time points for each study group separately indicated that while significant differences were found in the Fixed Group between the three time points, F(2, 66) = 5.83, p < .01, η 2p  = .15, no significant differences were found between the three time points in the Faded Group, F(2, 64) = .99, p = .38, η 2p  = .03. Bonferroni analyses indicated that a decrease in scores between T1 and T3 (d = − 0.42) was found in the Fixed Group (p < .05), with no significant differences in scores between T1 and T2 (p = .26) and between T2 and T3 (p = .16). Comparing the two study groups at each time point reveals that while no significant differences were found between the two groups in T1 (p = .08) and T2 (p = .85), higher scores in the Faded Group, compared to the Fixed Group, were found in T3 (p < .05, d = 0.33) (Fig. 5).

Table 2 Means (and SD) and F-score of the motivation variables by the study groups fixed (n = 68) and faded (n = 66) and three-time points
Fig. 5
figure 5

Motivation scores of two study groups for three-time points

Performance-approach goal

The two way interaction of group × time was significant, F(2, 131) = 5.17, p < .01, η 2p  = .07 (Table 2). Simple effect analyses comparing the three-time points for each study group separately indicated no significant differences between the three time points in both groups, F(2, 64) = 2.34, p = .11, η 2p  = .07 for the Faded Group and F(2, 66) = 2.86, p = .07, η 2p  = .08 for the Fixed Group. Comparing the two study groups at each time point reveals that while no significant differences were found between the two groups in T1 (p = .36) and T2 (p = .33), higher scores in the Faded Group compared to the Fixed Group were found in T3 (p < .01, d = 0.56) (Fig. 5).

Performance-avoidance goal (scale inverted)

The two way interaction of group x time was significant, F(2, 131) = 7.59, p < .001, η 2 p  = .10 (Table 2). Simple effect analyses comparing the three time points for each study group separately indicated significant differences between the three time points in both groups, [F(2, 64) = 5.72, p < .01, η 2p  = .15 for the Faded Group and F(2, 66) = 3.20, p < .05, η 2p  = .10 for the Fixed Group]. Bonferroni analyses indicated that the Faded Group manifested less avoidance in T3 compared to T1(d = 0.34) and T2 (ps < .05; d = 0.33), with no significant differences in scores between T1 and T2 (p = .99). However, the Fixed Group manifested significant high-avoidance in T3 compared to T2 (d = − .34), with no significant differences between T1 and T2 (p = .95) and between T1 and T3 (p = .66). Comparing the two study groups at each time point reveals that while no significant differences were found between the two groups in T1 (p = .74) and T2 (p = .95), higher scores in the Faded Group, compared to the Fixed Group, were found in T3 (p < .001, d = 0.57) (Fig. 5).

Sense making of problem solving

To examine the scores in the sense making of problem solving, a 2x3 repeated measures analysis of variance (mixed ANOVA) was conducted, with group (Faded and Fixed) as the between-subjects variable, and time (T1, T2, T3) as the within-subjects variable. The two way interaction of group x time was significant, F(2, 131) = 3.04, p < .05, η 2p  = .05 (Table 1). Simple effect analyses comparing the three time points for each study group separately indicated significant differences between the three time points in both groups, [F(2, 64) = 16.54, p < .001, η 2p  = .34 for the Faded Group and F(2, 66) = 6.76, p < .01, η 2p  = .17 for the Fixed Group]. Bonferroni analyses indicated that while higher scores were found in T3 compared to T1 d = 0.60) and T2 d = 0.68) in the Faded Group (ps < .001), in the Fixed Group, higher scores were found in T3 compared to T2 (p < .001 d = 0.45), but not to T1 (p = .36). No significant differences were found in either group between T1 and T2 (p = .99) for the Faded Group and p = .21 for the Fixed Group. Comparing the two study groups at each time point reveals that while no significant differences were found between the two groups in T1 (p = .55) and T2 (p = .48), higher scores in the Faded Group compared to the Fixed Group were found in T3 (p < .05, d = 0.40) (Fig. 6).

Fig. 6
figure 6

Sense making of two study groups for three-time points

Discussion

The major finding by this study on Fixed versus Faded scaffolds on SRL and sense making benefit for the two metacognitive IMPROVE training groups, was that the faded scaffolds were more effective compared to fixed scaffolds on the tested variables, particularly on lasting effect (T3). These findings indicate that the opportunity to match the scaffolds autonomously to each SRL phase in the faded group was more effective than the static use of those scaffolds by the fixed group (Narciss et al. 2007). We discuss further the main findings for each variable.

Metacognition: knowledge and regulation of cognition

Differences between the groups were found particularly in the knowledge of cognition long-term awareness measure (T3), While the Faded Group exhibited a higher significant increase, the Fixed Group exhibited a significant decrease compared to the short-term effect (T2). But not for the regulation of cognition measure, whereas neither group manifested a statistically significant increase across the three-time intervals (T1, T2, T3). These findings raise two questions -

Q1

Why didn’t the Fixed Group benefit from the short/lasting effect relating to the metacognition measures –knowledge and regulation aspects?

This result is surprising in light of previous studies in mathematics and in other knowledge fields which indicated that fixed metacognitive self-questions scaffolds of the What, When, How, and Why improve metacognition by reducing cognitive load, thus helping to increase WM capacity (Ariës et al. 2015; Kramarski and Fridman 2014). These contradictory findings can be explained in light of the different ages of participants in those studies. Whereas previous studies were mainly implemented with secondary/high school students and adults, the current study focuses on younger ages (9–10). Indeed, at that stage students begin to be mature enough to achieve initial awareness of metacognition, but the brief intervention (7 sessions) and the intensive continuous use of metacognitive self-questions in the Fixed Group might be too loading for the young students’ knowledge attainment. As an initial stage of metacognition, lack of the knowledge aspect may further prevent the regulation of cognition aspect (Schraw 1998; Veenman et al. 2004).

Q2

Why did the Faded Group benefit significantly only on the knowledge of cognition (short/long-term) level, but not on the regulation of cognition? It seems that the autonomous learning that the Faded scaffold provided for these young students was beneficial for the first level of metacognition development (i.e., domain-specific knowledge focus), that is characterized as a lower path of thinking (Schraw 1998), but was not effective enough in short intervention to cope with the higher order level of metacognition (i.e., domain-general regulation focus).

Some research findings in mathematics emphasize the effect of visualized thinking strategies for assimilation of metacognitive problem-solving phases (Abdullah et al. 2014; Cleary et al. 2017; Rittle-Johnson and Star 2007). The lack of regulation aspect attainment could be explained by the linear visual strategy thinking that the prompting model raised (Fig. 1), that succeeded in building a linear mental model of knowledge awareness in comprehension (what?), strategy selection (how?) and initial reflection ability (why?) (Schraw 1998), but missed understanding of the regulation loop. Perhaps a systematic visual, cyclical practice of the SRL phases could help in internalizing that loop (Cleary et al. 2017; Pintrich et al. 2000; Zimmerman 2008). Future studies should further investigate the Fixed versus Faded scaffold effects with the two visualized scaffolds to prompt SRL, by comparing the groups once with the cyclical emphasis and then with the linear emphasis.

Confidence judgment

Q3

Why did both scaffold groups over-estimate their performance on T2 and then decrease on T3? In light of the importance of process measures to assess SRL in real time events (Azevedo 2014; Greene and Azevedo 2010), the calibration of confidence judgments’ measure adds authentic data to students’ metacognitive awareness of their performance on sense making, that adds understanding of the self-report metacognitive measures (knowledge and regulation). Interestingly, while students from both scaffolds groups manifested over-estimation of self-accuracy at the beginning of the study (T1) that further increased significantly at the end of the study (T2; short—term effect), they later became more realistic on the lasting effect (T3).

The dramatic confidence judgments’ bias increase at the end of both scaffolds periods (T2) is explained in the literature as overwhelming from a new scaffold experience that raised young student’s self-efficacy feelings of success in solving the task (e.g., Pintrich 2000). However, the realistic self-estimation on T3 can be explained as a result of internalizing the monitoring and reflection mechanism they were trained to realize accurately their own success. This conclusion is supported by findings of previous studies, according to which practicing confidence judgments’ accuracy while teaching a content field improves the ability in this type of judgement among nine-year-old students (Gidalevich and Kramarski 2017; Roderer and Roebers 2010). They also extend the findings by Zimmerman and colleagues (2011), who found that confidence judgments’ ability among college students was improved during a longer practice (15 weeks) compared to our fourth graders (7 weeks). This finding has practical implications for young students’ achievement improvement. Because a learner with high confidence judgment is unlikely to go back to recheck and correct his mistakes, as such he may mistakenly fall into an at-risk category as a low achiever (Labuhn et al. 2010; Shin et al. 2007).

Motivation: mastery goal, performance-approach goal, performance- avoidance goal

Q4

Why did the Fixed Group decrease on mastery goal measure compared to the Faded Group on the lasting effect (T3)?, while the Faded Group manifested higher mastery goal, performance-approach goals and less performance-avoidance goals in learning mathematics compared to the Fixed Group. This finding supports the underpinning rationale of this study, that autonomous engagement in learning with the faded scaffolds raises students’ motivational goal-orientation (Adler et al. 2016).

Furthermore, the motivational achievements can be explained regarding the groups’ metacognition gains. It seems that in the Faded Group the positive trend on the three kinds of goals, mastery, performance-approach and avoidance is a result of the metacognitive knowledge and initial self-regulation improvement (Tables 1, 2).

This is in line with theoretical assumptions about relations between metacognition and motivation (Pintrich 2000). Metacognition is regarded as a superordinate ability to direct and regulate cognitive, motivational and behavioral processes in order to achieve specific goals (Ifenthaler 2012; Pintrich 2000).

The similar findings trend in metacognition (knowledge aspect) and motivation support the assertion that SRL dimensions and processes are linked (Kim and Pekrun 2014; Pintrich et al. 2000; Schunk 1991) and may be influenced by the same specific instructional intervention’s prompts (Ifenthaler 2012). This conclusion is supported also by the correlations between metacognition and motivation in Tables 6 and 7 in Appendix C.

Q5

Why didn’t the Faded Group increase significantly their mastery goal along the time intervals? The lack of mastery goal achievements increase can be explained by the nature of the indirect scaffold on motivation that hindered the effect on the mastery goal achievement that as such is conceived of as a transfer skill (Kistner et al. 2010). Future studies should further compare the effects of two faded scaffold groups: one directed explicitly to motivation and the other directed explicitly to metacognition as implemented in the study, to find which of the indirect SRL aspects, metacognition or motivation, is more affected by the faded scaffolds.

The SRL findings (metacognition, CJ, motivation) add understanding with regard to the role of the faded scaffolds in young students in fostering SRL in mathematics as a proactive process that does not merely happen to students, but rather occurs autonomously within themselves (Zimmerman 2008). It expands previous findings on IMPROVE self-questioning prompts as a fixed scaffold with young students (e.g., fifth grade; Tzohar-Rozen and Kramarski 2014), that focused only on SRL effect at the end of the program, without allowing time for assimilation and preservation of the program and testing its long-term effect, as in the current study.

Sense making of problem solving: short term and long-term effect

Q6

What makes the difference between the two scaffolds types: Fixed versus Faded on the tested variables? Findings indicated that both groups increased their sense making only in the long-term effect (T3) compared to the immediate effect (T2), whereas a greater increase was found in the Faded Group (d = 0.68) compared to the Fixed Group (d = 0.45). This is in line with research showing a larger impact of metacognitive effects in follow-up learning sessions than earlier measures of short-term effects (Bannert et al. 2015).

These findings support previous conclusions, that metacognitive scaffolds can serve as a learning strategy that enhances long-term memorization and re-use of these learning activities to develop awareness about the relevance of learning activities (e.g., Ariës et al. 2015; Goldberg 2010; Kuhn and Dean 2004;). The differences between the two scaffold types (Fixed vs. Faded) on the sense making of problem solving can be explained by relating to the way that the information processing in each group is stimulated. The fixed scaffolds enabled learners to walk through SRL problem solving phases with explicit SRL question prompts step by step, thereby emphasizing local (concrete) processes guided by less autonomy in self-selected prompts that should be adapted to the SRL phases. This naturally leads to low-road transfer (Salomon and Perkins 1989) to the sense making of problem-solving long-term retention (T3). In contrast, the Faded graduated model offered more scaffolding to abstract information processing by students’ exposure to only one example of full problem-solving prompting phases. This was followed by isomorphic tasks to be solved with self-complementing the withdrawn prompts matched to the relevant phases in the problem-solving process. Such autonomous mindful abstraction establishes the high road (generalization) to far transfer (T3; Salomon and Perkins 1989).

Our findings extend earlier findings on faded scaffolds that were achieved mainly in the science domain but did not lead to conclusive results (e.g., Bulu and Pedersen 2010). The Faded Group’s sense making gains seems to be a result of the positive effects that emerged in the SRL (metacognition and motivation) measures, showing the synergic effect of scaffolding explicitly and directly on one aspect of the SRL model as metacognition awareness and its indirect effect on related aspects of motivation. This finding is most important, because if students are not motivated to engage in a particular task, they are unlikely to be strategic in that process which assists students to achieve a deeper understanding (Efklides 2011).

Conclusions, implications, limitations and future research

This study makes an important contribution to the literature on young students’ (9–10), indicating that the fading effect with the graduated model is more significant compared to the fixed effect, particularly in the lasting effect. Theoretically, the current findings add evidence to the two scaffold type models (Fixed vs. Faded), and transfer issues (long-term effect) that offer merits in advancing our understanding about scaffolding theory. Current scaffolding literature does not clearly indicate what kinds o scaffolds to maintain and what kinds of scaffolds to fade at a certain point in time (Belland et al. 2017; Bulu and Pedersen 2010).

Methodologically, the study contributes findings on a set of SRL self-report measurements (metacognition, motivation), authentic self-assessment (calibration of CJ), and performance in a non-prompted sense-making task at the end of the program (short-term effect), and three-months later to assess lasting effect. Practically, the study offered two programs based on self-question models (Fixed, Faded), to guide acquisition, activation, and application to proactively promote metacognition of young students, and are adaptable to different ages, various mathematical topics and to other learning domains. This is in line with recent studies asserting that metacognition should begin from the primary school level (Kistner et al. 2010).

Despite this study’s potential contributions, several of its limitations deserve consideration. In this study SRL was measured via self-reports in two classes of each group that indicated relatively small/moderate significant effect sizes, that might express a first type error increase. Further investigation is suggested with a larger sample in order to determine the fading contribution to scaffolding programs.

Indeed, findings indicated that the two groups were homogeneous in relating to the measures implemented in the study (see Appendix A). However, there might be additional measures that were not tested and could affect the homogeneity of the groups, such as norms of class talk and verbal abilities that can affect problem solving performance. Future studies should also incorporate class background measures in a follow-up study.

In light of the importance of involving trace methodology in SRL studies (e.g., Zimmerman 2008) an additional measure of metacognition improvement is suggested, as students’ thinking aloud while solving a task or videotaped discourse in class to shed light on sense making. This may help triangulate the testing variables and uncover students’ sequential and temporal metacognitive and motivational aspects exhibited during various solving activities, by focusing on when each aspect appeared and by whom over the timeline of the learning activity, its duration and how students act (Azevedo 2014; Winne and Perry 2000). Finally, an additional measure to shed light on brain working processes such as cognitive load and WM while occurring in the short term versus long-term effect is suggested to assess the two study groups (Ariës et al. 2015; Goldberg 2010).

In line with research showing a larger impact of metacognitive effects in follow-up learning sessions than measured earlier in short-term effects (Bannert et al. 2015), we recommend that this study be replicated by focusing on a longer duration of process assessment with a larger sample, while testing immediately after the training as well as following up on two time lines of lasting effects, to exhibit how students internalize and practice metacognitive habits without training (Kramarski et al. 2013).

Future studies should examine additional kinds of fading, conditions, domains, SRL components (i.e., motivation) and ages. In the current study the faded progression was based on the SRL metacognition prompts oriented to the three phases practiced by a set of 4 tasks in each session, starting with a fixed prompting example and continuing with the graduated faded phases. It might be that this approach is confounded by student’s content experience. To face this limitation, we suggest that fading scaffolds could also be tested if the intensity of removal is not fixed, but adaptable to the individual needs of the learner (Cabello and Sommer Lohrmann 2018).

Furthermore, in the current study the faded sequence of SRL phases oriented to the task solution started with complementing the reflection prompt that is more general and abstract than the planning prompt, that is specific and more concrete. In future studies it is suggested to test an inverse fading sequence by starting with the given abstract prompting phase (reflection) to complementing the missing concrete prompting phase. Finally, studies should also focus on the implications of the metacognitive scaffolds (fixed/faded) for different students’ ages and profiles (e.g., SRL, achievements), to learn how to adapt the models to learners with diverse learning needs, because novices might need a fuller set of scaffolds for a longer period to support their problem-solving performance (Tomlinson 2005).

To conclude , the study is unique in comparing the value of the fixed versus graduated fading scaffold directed to SRL and not to domain knowledge. It supports theoretical claims that fading scaffolds have a greater effect on students’ autonomous SRL (metacognition, motivation) and improvements in sense making, particularly on the long-term retention effect (Ge et al. 2012; Puntambekar and Hubscher 2005).