Introduction

Since the 2004 special issue in Educational Psychology Review on “Measuring Studying and Learning in Higher Education—Conceptual and Methodological Issues” (Lonka et al. 2004), a lot of research effort, from a variety of research traditions, has been invested in exploring the ways in which students learn in higher education. The focus of the present paper is on the contribution of the research in the student approaches to learning (SAL) tradition. Research in this tradition generally focuses on the different ways students engage in learning or handle learning tasks as reported by the students themselves (Biggs 2003; Entwistle 2009; Lonka et al. 2004; Schmeck 1988). For at least 40 years (Marton and Säljö 1976a, b), SAL has been interested in the different qualitative ways students go about learning and studying.

Approaches to Learning in Higher Education

Today’s higher education faces the challenge of not only having to teach students a bulk of domain-specific frameworks and disciplinary insights but also having to foster skills that will enable them to become versatile experts in their own fields and lifelong learners. In order to reach these goals, successful learning and studying in higher education is expected to involve students in deep approaches to learning and towards more meaningful and critical learning instead of just repeating knowledge (Asikainen 2014; Biggs 2003). Although several studies have the (at least implicit) assumption that students’ approaches to learning develop towards more deep approaches in higher education (Baeten et al. 2010; Entwistle and Ramsden 1983; Prosser and Trigwell 1999), there seems to be no clear theoretical foundation for this assumption and no clear empirical evidence. The rationale behind this assumption seems to be largely based on the idea that higher education is indeed increasingly required (and working towards) producing work-ready graduates who are prepared for a life as lifelong learners and that, given the scholarly nature of higher education, part of that should be the development of deep approaches to learning (Lake and Boyd 2015). Despite a general agreement on the basic assumptions, a multitude of conceptual models can be discerned within the SAL tradition, placing emphasis on different aspects of learning, using a variety of different but related concepts, and using a multitude of methods and measures (largely self-report questionnaires) to measure empirically their concepts (Coffield 2004; Richardson 2000; Entwistle and McCune 2004; Vanthournout et al. 2014). The present paper aims to put some “piles” in what is called by some “the swamp of student learning in higher education.”. Therefore, we will systematically investigate the research on how students’ approaches to learning develop during higher education and what conceptual and methodological aspects are related to this development.

In the next paragraphs, we will first situate the SAL tradition within the wider literature on student learning in higher education by giving a historical overview of the research from which the concept of approaches to learning originated and how it has been put into operation and measured longitudinally over the years.

Historical Situation

In the 1970s, Marton and Säljö (1976a) explored 30 educational psychology students’ ways of processing a scientific article and the relationship with their learning outcomes. They found two different ways of processing: surface processing meant that students concentrated on memorizing the text as it was presented, whereas deep processing indicated that students concentrated on the meaning of the text or the main message of it (Marton and Säljö 1976a). In a related publication, Marton and Säljö (1976b) found similar differences in study processes but also found that students changed their study processes according to their perception of the requirements. They found that deep processing could be changed to surface processing by asking students the kind of questions that provoke it. In a further elaboration of this study, Svensson (1977) found similar qualitative differences in students’ study processes: students applying a holistic approach concentrated on the author’s intention and tried to understand the meaning of the text and students applying an atomistic approach focused on separate details in the text and tried to memorize them. In addition, Pask (1976) also found similar categories to Marton and Säljö (1976a, b): students applying a serialistic strategy in studying focus on details and concentrate on learning linearly one thing at a time: A student using a holistic strategy tries to understand the connections between different matters and to form an understanding of matters as a whole.

Deep and Surface Approaches to Learning

These early studies concentrated on the processing, but moving forward from the early studies of Marton and Säljö, research now generally differentiates between two qualitatively different approaches to learning: the surface and the deep approach (Biggs 2003; Marton 1976; Marton and Säljö 1984; Prosser and Trigwell 1999). The deep approach to learning is associated with students’ intentions to understand and to engage appropriately in meaningful learning, focusing on the main themes and principles and using strategies that are appropriate for creating such meaning. The surface approach to learning, on the other hand, refers to students selectively memorizing, based on motives or intentions that are extrinsic to the real purpose of the task, such as a fear of failure or keeping out of trouble (Vanthournout et al. 2014). A strategic (Entwistle and Ramsden 1983) or achieving (Biggs 1987) approach to learning has also been identified which referred to how students organized their studying or study according to the assessment of the course. Conceptually, the achieving/strategic approach differs from the deep and surface approach in that the deep and surface approach to learning describes ways in which students engage in learning, whereas the achieving/strategic approach to learning deals with how students organize their learning (Vanthournout et al. 2014). Some studies on approaches to learning refer to organized studying (Entwistle and Peterson 2004; Entwistle 2009) by emphasizing the organized effort in studying more and not emphasizing the effect of the assessment (Entwistle and Peterson 2004; Entwistle 2009).

Marton and Säljö (1976a, b) initially talked about ways of processing in reading an academic text, which were based on experimental data. These studies did not consider the way students generally go about studying, as it is assumed sometimes (see Richardson 2015) but the processing strategies of students and how these were affected by their perceptions of the teaching-learning environment. Marton (1976) first introduced the concepts of deep and surface approach to describe students’ approaches to learning in everyday studies. These were built on studies based on both experimental data and student interview data about how students go about learning and studying. When talking about processing, Marton and Säljö (1976a, b) did not talk about everyday studies, but the concept approach referred to students’ everyday studying. Svensson (1977) also used the concept of approach in his studies by referring to students’ approaches in general studies. This is how the concept has been used in the large body of quantitative research.

Measuring Approaches to Learning

The studies from Marton, Säljö, and Svensson in Göteborg, discussed above, influenced John Biggs in Australia and Noel Entwistle and Paul Ramsden in the UK to explore the way students usually go about their studying on a more general level. Biggs (1978) developed a Study Process questionnaire (SPQ) which differentiated between three different dimensions of study processes which include students’ motivation and their study strategies for a specific course: (1) utilizing, (2) internalizing, and (3) achieving. Utilizing describes meeting minimum requirements in studying and using reproduction though rote learning, internalizing refers to students’ interest in the academic subject and relating ideas with previous knowledge, and the achieving approach relates to competitive motives in learning and organized studying. Biggs later renamed these approaches to learning as surface, deep, and achieving approaches (Biggs 1987). Furthermore, in Lancaster, Entwistle and Ramsden (1983) also explored students’ study processes on a more general level. The purpose of their study was to explore approaches to learning in larger contexts by developing an instrument, Approaches to Studying Inventory (ASI). The work of Biggs (1978) as well as of Marton and Säljö (1976a, b) and Pask (1976) influenced the development of ASI, and they found similar dimensions of approaches to learning. The deep approach to learning was defined as intention to understand and a critical process in learning which were measured with two components: relating ideas and use of evidence (Entwistle and Ramsden 1983).

In the early 1990s, Jan Vermunt in the Netherlands developed the learning patterns model in an attempt to provide a more comprehensive and integrated account of learning by bringing together four different learning components: cognitive processing strategies, regulation strategies, conceptions of learning, and orientations to learning (Vermunt 1996; Vermunt and Vermetten 2004). We do not elaborate on this model in this contribution, but refer to the paper by Vermunt and Donche in this special issue and the chapter by Vanthournout et al. (2011) for a review on the longitudinal research using this model.

The Teaching-Learning Environment

From the early studies of Marton and Säljö, it has been evident that the teaching-learning environment has an effect on students’ approaches to learning. Or, to be more precise, students’ experiences of the teaching-learning environment are related to students’ approaches to learning (Ramsden 1997). Approaches to learning are thus seen as context specific (Entwistle and Ramsden 1983). In general, positive perceptions of the teaching-learning environment have been found to be related to a deep approach to learning and, respectively, negative perceptions, to the surface approach to learning (Entwistle and Ramsden 1983; Entwistle et al. 2003; Parpala et al. 2010; Richardson 2005, 2006).

Longitudinal Perspective

Learning Outcomes

The first longitudinal studies concerning students’ learning outcomes were done in the 1970s, for example, Marton (1975) and Dahlgren (1975) had already explored students learning outcomes in a pre-post test design. Also, the seminal studies by Marton and Säljö (1976a, b) were longitudinal in nature. They found that students who used deep processing also remembered the main point of the text better after a while (Marton and Säljö 1976a). On the other hand, the students who used surface processing recalled just some details about the text in both cases even though they were provided with the information or correct answers in the first measurement. In the second study (Marton and Säljö 1976b), students’ learning outcomes and ways of processing were measured and manipulated by giving them assignments about the text which required surface- or deep-level processing (Marton and Säljö 1976b). It was shown that the way of processing and the learning outcomes could be influenced by the assessment demands. The group who received factual questions started to concentrate on details and tables more. In addition, some students in the group, who received questions requiring deep-level processing, started to concentrate on the main ideas of the text and deep-level processing. However, not all students did so. In conclusion, it was shown that the way of processing can be impacted by changing the demands of the task.

Learning Processes

Although the nature of the studies mentioned above were longitudinal, none of these studies measured students’ way of processing longitudinally, but measured students’ level of learning outcomes. However, Svensson (1977) used a longitudinal design in his study and also reported longitudinal data on students’ normal ways of studying. His procedure built on the study by Marton and Säljö (1976b) and included three measurement points over a period of 5 weeks. He found that students’ ways of processing the learning material were stable across the occasions. However, there were fewer students applying a holistic approach and some applying more of an atomistic approach after the first 5 weeks. This drop was not so evident in the next procedure concerning the second text.

It seems that interest in the development of student learning took a pause after these studies and concentrated more on developing the questionnaires and large-scale cross-sectional studies. The interest in studying the development of general-level approaches to learning rose again in the mid-1980s. Despite the emphasis on the contextual nature of the approaches to learning or processing in the early studies, a lot of longitudinal studies have been set up from a more general perspective. The first longitudinal quantitative study on the development of approaches to learning was conducted by Watkins and Hattie (1985). They explored 540 Australian tertiary students’ approaches to learning in their first year and in their third year using the ASI, and found a decline in the deep approach during their studies. Since then, longitudinal studies have been conducted, but they seem to have had contradictory results. Some longitudinal studies suggest that the deep approach does not develop during studies in higher education (Ballantine et al. 2008; Lietz and Matthews 2010; Rodriguez and Cano 2007; Watkins and Hattie 1985; Watkins and Ismail 1994; Zeegers 2004). Other longitudinal design studies have also found a decline in the deep approach (Lietz and Matthews 2010; Wilding and Andrews 2006). In addition, some studies have found a decrease in the surface approach to learning during studies in higher education (Hall et al. 2004; Gordon and Debus 2002; Rodriguez and Cano 2007) but also an increase of the surface approach has been reported (Geitz et al. 2016; Zeegers 2001). Several longitudinal studies have also suggested that students’ approaches to learning are relatively stable during studies (e.g., Edmunds and Richardson 2009; Zeegers 2001). However, a recent review of the literature in the field of health sciences indicated that higher education institutes that adopted a curriculum-wide problem-based approach to learning would foster their students’ development of more deep approaches to learning (Dolmans et al. 2016).

The Present Study

In the present study, we will systematically review the longitudinal research on how students’ approaches to learning develop during higher education and will take into account conceptual and methodological differences between the studies.

Method

Criteria for Inclusion

Before searching the literature, we determined the criteria for inclusion in our analysis. First, each study needed to be empirical, meaning that each paper needs to report on the data collection and analyses of qualitative or quantitative data. Second, the studies needed to include an explicit reference to student approaches to learning (SAL) as a research model; this could include a reference to either approaches to learning, processing strategies, or study strategies. As noted earlier, research using the learning pattern model (Vermunt and Vermetten 2004) was excluded. Third, the focus is on students in higher education. Fourth, longitudinal data about students’ approaches to learning needed to be collected and reported upon, meaning that the study needed to report data on student (approaches to) learning with the same students for at least two measurement moments. Finally, the paper needed to include at least one research question or explicit results about the development of student (approaches to) learning.

For our literature search, the following databases were included: ERIC (Ebsco), PsycINFO, and the Web of Science (WoS). The search terms were “Students” and “learning approach,” “studying approach,” “learning strategies,” “learning orientations,” “learning patterns,” and “processing strategies.” Given the focus of this review study, these search terms were combined with the search terms “longitudinal” or “changes over time.” The search was further delimited to students participating in higher education or post-secondary education. Only peer-reviewed publications published in the English language were selected. This initial search retrieved 543 articles from the ERIC database, 44 articles from PsycINFO, and 653 from the Web of Science. After screening the abstracts of these papers against the criteria for inclusion, 62 studies were taken to the next step in the selection procedure. In this next step, the authors read the full papers and evaluated these papers again in the light of the five criteria for inclusion described above. One study was excluded at this stage because the full paper could not be retrieved through the subscriptions held by the authors’ institutions or through requests from the authors by email and Research Gate. The systematic search provided us with 26 papers that met the criteria for inclusion. In addition to this systematic search, potential studies were also sought by going through the references of the selected papers (snowballing). We also contacted several researchers active in the field of approaches to learning and asked them to provide relevant studies or to identify additional sources of studies. This additional search yielded 17 extra studies. Altogether, 43 studies were included for a systematic analysis.

In the final step, following the guidelines of Aveyard (2014), the papers were re-read and the relevant information from the papers was summarized in a review table (see Table 1). The following methodological and conceptual aspects of each paper were included in the table and will be discussed in the results section of this paper where relevant: number of measurements, level of education, time of measurement, field of study/subject, number of participants, number of courses, duration of the study, measurement instrument, theoretical framework, analysis, what was measured, and significant results/changes.

Table 1 Descriptive study characteristics

Results

Before we report how students’ deep and surface approaches to learning develop during higher education, based on the results of our analyses of the review table in Table 1, we give an overview of the general characteristics of the studies included in our study.

A total of 43 studies met our criteria for inclusion. These studies took very different research perspectives. From those studies, 3 were qualitative, 3 were mixed-method studies, and the remaining 37 were quantitative studies. The duration also varied very much: 24 of the studies explored the development of the approaches to learning over a period of 1 year or more and 19 of the studies were for less than 1 year. A total of six studies explored the changes in approaches to learning at course level. In 25 of the studies, there were only two measurement points, 12 of the studies included three measurement points, three of the studies had four and three had five measurement points. Also, the analysis methods varied across the studies. Richardson (2013) talks about traditional or old ways of analyzing longitudinal data on student learning as methods with just a single dependent variable and encourages more advanced ways of exploring student learning longitudinally by including several dependent variables. By using this division, 27 of the quantitative studies used fairly simple methods with one dependent variable to measure the change (RM ANOVA) and 10 of the studies used more advanced methods (MANOVA, path analysis, latent profile transition analysis, multivariate growth model, structural equation modeling, hierarchical linear modeling).

Development of the Deep Approach

In 18 of the studies, a positive development of the deep approach was reported (Asikainen et al. 2014; Ballantine et al. 2008; Biggs and Rihn 1984; Fryer 2017; Geitz et al. 2016; Gordon and Debus 2002; Hall et al. 2004; Jackling 2005; López et al. 2015; Matthews 2004; Muis and Duffy 2013; Phan 2011a, 2013; Svensson 1977; Vu et al. 1998; Walker et al. 2010; Wilson and Fowler 2005; Zhu et al. 2009). In 18 of the studies, no statistically significant changes in the deep approach were found (Chan and Tang 2006; Chen et al. 2015; De Clercq et al. 2013; Edmunds and Richardson 2009; Geitz et al. 2016; Gijbels et al. 2009; Iputo 1999; Jackling 2005; Muis and Duffy 2013; Nieminen et al. 2004; Ova et al. 2012; Prat-Sala and Redford 2010; Rodriguez and Cano 2007; Svensson 1977; Vu et al. 1998; Wilson and Fowler 2005; Zeegers 2001; Zhu et al. 2009) and eight studies found a decrease in the deep approach (Cleveland-Innes and Emes 2005; English et al. 2004; Lietz and Matthews 2010; Prat-Sala and Redford 2010; Quinnell et al. 2012; Volet et al. 1994; Watkins and Hattie 1985; Wilding and Andrews 2006). Six studies found individual differences between different students (Boulton-Lewis et al. 2004; Fryer 2016; Lindblom-Ylanne et al. 2013; Saravanamuthu and Yap 2014; Ward 2011).

In one study, both a decline and stability of the deep approach was reported depending on the level of self-regulation (Prat-Sala and Redford 2010). Vu et al. (1998) and Muis and Duffy (2013) also found an increase of deep processing or critical processing in students belonging to an intervention group but found no increase in a traditional group. Geitz et al. (2016) found an increase of deep learning with Dutch students but no change with German students. Zhu et al. (2009) found no changes in deep-level learning strategies with Flemish students but an increase with Chinese students. Jackling (2005) found an increase in deep strategy but not in deep motive. Furthermore, six other studies found individual differences between different students (Boulton-Lewis et al. 2004; Fryer 2016; Lindblom-Ylanne et al. 2013; Saravanamuthu and Yap 2014; Ward 2011). For example, Fryer (2016) found different developments depending on students’ initial approaches. Phan (2011b) reported that the initial level of deep processing was positively related to the change in surface processing and the initial level of surface processing positively influenced the change in deep processing. A significant negative correlation was also noted between the initial levels of deep processing and the change in deep processing and between the initial levels of surface processing and the change in surface processing (Phan 2011b), thus indicating that change was more likely with students who initially had lower scores on deep or surface approach than students with high scores. In the studies by Postareff et al. (2014, 2015), students were interviewed based on their changes in means before and after a course. In the study by Lindblom-Ylänne et al., different students had different changes.

The 18 studies, which reported a positive development in the deep approach, varied a lot according to their methodological choices. These studies included one qualitative study (Svensson 1977) who also found individual differences and the remaining 17 were quantitative studies. The quantitative studies, which found a development of the deep approach to learning, varied a lot from one to another. Nine of the studies used Biggs framework and his study process questionnaire or the revised version of it (SPQ or R-SPQ-2F) (Geitz et al. 2016; Gordon and Debus 2002, Hall et al. 2004, Jackling 2005; López et al. 2015; Matthews 2004; Phan 2011b, 2013; Wilson and Fowler 2005). Biggs and Rihn (1984) used an early version of the SPQ. In three of the studies, Entwistle’s framework was used. These studies used his ASSIST, the approaches and study skills inventory for students (Ballantine et al. 2008; Walker et al. 2010), or the Finnish version of his ETLQ, the Experiences of Teaching and Learning Questionnaire (Asikainen et al. 2014). Two studies used Pintrichs MSLQ, the Motivated Strategies for Learning Questionnaire (Muis and Duffy 2013; Zhu et al. 2009), one study used the Inventory of Learning Processes (ILP) (Vu et al. 1998), and one used the Approaches to Study Questionnaire by Trigwell and Aswin (Fryer 2016).

The duration between the measurements also varied a lot in the studies that reported a development in the deep approach. The measurements varied from one course to three academic years. In six other studies, the duration was less than a year: 8 weeks (Geitz et al. 2016), 9 weeks (Biggs and Rihn 1984), 10 weeks (Svensson 1977), 12 weeks (Hall et al. 2004), 13 weeks (Wilson and Fowler 2005), 15 weeks (Muis and Duffy 2013), and one semester which was not further elaborated (Zhu et al. 2009). In some studies, the data collection took a year or longer: 3 years (Gordon and Debus 2002), 2 years (Asikainen et al. 2014; Jackling 2005; Matthews 2004; Vu et al. 1998), 1.5 years (Phan 2011b), 16 months (Phan 2013), and 1 year (Ballantine et al. 2008; Fryer 2016; Walker et al. 2010). In the study by López et al. (2015), the duration was not clearly stated as students answered the questionnaire “when teaching began and when it ended.” In addition, the number of measurements also varied from two to five measurements. Eight of the studies comprised two measurements (Asikainen et al. 2014; Ballantine et al. 2008; Biggs and Rihn 1984; Fryer, accepted; Hall et al. 2004; López et al. 2015; Walker et al. 2010), seven of the studies had three measurements (Geitz et al. 2016; Gordon and Debus 2002; Jackling 2005; Phan 2013; Vu et al. 1998; Wilson and Fowler 2005; Svensson 1977), one had four (Phan 2011b) and two had five measurement points (Matthews 2004; Muis and Duffy 2013). Thus, few similarities could be found in the studies that found the increase in the deep approach.

Development of the Surface Approach

When considering the development of the surface approach to learning, 11 studies reported a decrease (Boulton-Lewis et al. 2004; Biggs and Rihn 1984; Chan and Tang 2006; Gordon and Debus 2002; Hall et al. 2004; Matthews 2004; Rodriguez and Cano 2007; Svensson 1977; Volet et al. 1994; Walker et al. 2010; Zeegers 2001), 9 of the studies found an increase of the surface approach (Ballantine et al. 2008; Fryer 2016; Geitz et al. 2016; Gijbels et al. 2009; Quinnell et al. 2012; Prat-Sala and Redford 2010; Wilding and Andrews 2006; Zeegers 2001; Zhu et al. 2009), and 20 of the studies did not find changes in surface-level learning (Chen et al. 2015; Cleveland-Innes and Emes 2005; De Clercq et al. 2013; Edmunds and Richardson 2009; Geitz et al. 2016; Iputo 1999; Jackling 2005; Lietz and Matthews 2010; López et al. 2015; Matthews 2004; Muis and Duffy 2013; Nieminen et al. 2004; Ova et al. 2012; Prat-Sala and Redford 2010; Watkins and Hattie 1985; Wilson and Fowler 2005; Volet et al. 1994; Vu et al. 1998; Zeegers 2001; Zhu et al. 2009). Zeegers (2001) found both an increase (from measurement one to three) and a decrease (from measurement three to five) of the surface approach. Prat-Sala and Redford (2010) found that students with high levels of self-efficacy demonstrated no change in surface approach but students with low self-efficacy increased in their surface approach. Zhu et al. (2009) found an increase on rehearsal strategies with Chinese but not with Flemish students. Matthews (2004) and Volet et al. (1994) found a decrease on surface motive but no changes on surface strategy. Six of the studies found individual differences in the development of the surface approach (Boulton-Lewis et al. 2004; Fryer 2016; Lindblom-Ylanne et al. 2013; Saravanamuthu and Yap 2014; Svensson 1977; Ward 2011). Five of the studies did not measure the change in surface approach (Asikainen et al. 2014; Phan 2011b, 2013; Vu et al. 1998; Postareff et al. 2014, 2015) and one study did not report the change (English et al. 2004). From the 11 studies that found a decrease in the surface approach, two were qualitative (Boulton-Lewis et al. 2004; Svensson 1977) and used three measurement points. The remaining studies were quantitative of which eight studies used the framework of Biggs and one study, the framework of Entwistle. Six studies had two measurement occasions, one had three and two had five measurement points.

Discussion

The main aim of this study was to review systematically the longitudinal research on how students’ approaches to learning develop during higher education A total of 43 studies were included in our review of which 18 reported an increase in students’ deep approach to learning and eight reported a decrease. On the other hand, 11 studies reported a decrease in students’ surface approaches to learning and eight reported an increase. The conclusion, based on our review, could be that the results remain inconclusive as to the question whether students’ develop deeper approaches during higher education. Probably, it would be more appropriate to conclude that these results give an indication that there is no empirical evidence for the assumption that deep learning should evolve during higher education (Asikainen 2014). However, the present paper also took conceptual and methodological aspects of the studies, included in the review, into account in order to explain differences in the results between the studies. These aspects are discussed below.

Conceptual Considerations

When considering the conceptual factors, it seems that the SAL framework lacks a clear conceptual grounding. There are a lot of different frameworks, evident also in this review, which all conceptualize approaches to learning a bit differently. It has been suggested that the use of different frameworks is one reason for the ambiguity of results concerning the quality of student learning (Dinsmore and Alexander 2012). Most of the studies included in our review were conducted with Biggs’ Study Process Questionnaire; Entwistle’s framework was the second most commonly used. In Biggs’ Study Process Questionnaire, measures of the deep approach included measures of both deep motive, comprising items measuring intrinsic interest and students’ personal satisfaction or excitement in studying, and deep strategy measuring maximizing meaning (Biggs 1978). However, Entwistle’s framework does not necessarily emphasize the deep motive but focuses on the students’ intention to understand the main message of what they are learning, together with thinking processes such as relating ideas and use of evidence which are similar to Pask’s (1976) holistic and serialistic strategies (Entwistle et al. 2003). Thus, the focus can be regarded quite different between these frameworks. However, in our review, consistency even between studies using the same framework was not evident. Most of the studies were conducted with Biggs’ framework and no consistency was found within the results of those studies or for the studies using the framework of Entwistle. These results suggest that conceptual agreement is not enough to get consistent results about the development in the SAL framework.

A likely responsible factor for the inconsistency of the results can be found in the different contexts of the studies. Across the different frameworks, most studies measured the development of the approaches to learning at a very general level. Only five studies measured the change within one specific course. None of the studies explored the development of students’ approaches to learning at the level of a specific task, although Svensson (1977) measured students’ general way of studying between tasks. Most studies (N = 24) measured changes in students’ approaches to learning over a timespan of 1 year or longer. There has clearly been a conceptual shift from the earlier studies and a somewhat surprising development, from studying and measuring task- and context-specific ways, the learning that is referred to as the starting point of the SAL framework (Marton and Säljö 1976a, b), towards studying and measuring more general orientations to learning in longitudinal studies. Although there has been some evidence that approaches to learning can be relatively stable across time and courses (Entwistle 1991; Zeegers 2001), there is the fundamental assumption that approaches to learning are context specific (Biggs 2003; Entwistle and Ramsden 1983; Marton and Säljö 1984) and can vary across different courses (Nieminen et al. 2004). Hence, when exploring the general development, there is the problem that, although the domain of the study remains largely stable, the contextual variables of the course (the topic, the course design, the assessment, the teacher…) probably vary to a great extent on the different measurement moments. Thus, the effect of the teaching-learning environment is not taken into account so much despite the largely accepted theoretical assumption in the SAL tradition that students’ approaches to learning are not stable but change as a result of the interaction between the contextual aspects of the learning environment and the characteristics of the learners (Biggs 2003; Entwistle and Ramsden 1983). Therefore, it can be concluded that the way most studies measure change in students’ approaches to learning are not in line with the theoretical assumptions of the theory under study. Furthermore, studies did not necessarily make it explicit whether the intention was to measure approaches to learning at a general level or in a particular course or context. For example, the SPQ (Biggs 1987) was used in most of the studies, although this instrument had been developed to measure approaches to learning at course level. Some studies did not bring this issue into question and used the questionnaire to measure the development of approaches to learning at a general level within a long time period, across different courses. Possible modifications of the questionnaire were not discussed. This also raises a question about the validity of these studies and brings us to the issue of methodological considerations.

Methodological Considerations

The lack of clarity in our results could also be due to methodological factors. One factor that could impact the diversity of the results is that the analyses to grasp change varied a lot between the different studies in our review. Of the quantitative studies, most (n = 27) used fairly old methods to measure the change (repeated measures methods or ANOVAs) and only a minority (10) of the studies used newer or more advanced methods such as Latent Profile Analyses. In addition, most of the quantitative studies included only two measurement points. With only two measurement points, the development or growth is hard to estimate and more sophisticated analyzing methods that could be more appropriate to capture longitudinal development, such as multi-indicator latent growth analyses, cannot be used. Nevertheless, most of the studies exploring the development have only two measurement points. One of the potential problems with two measurement points is the phenomenon of regression to the mean, which means that the scores in the second measurements tend to be closer to the mean (Barnett et al. 2005). One reason for the high number of studies measuring the change with only two measurement points could be the problem of subject attrition, meaning that the participants of the study do not participate in the next measurement but leave the study (Gustavson et al. 2012). The risk of subject attrition grows with more measurement and researchers may be tempted to use less measurement points, although three or more measurement moments would be more appropriate when the aim is to measure development (Singer and Willett 2003). To conclude, one might question whether the studies with only two measurement points are really longitudinal in nature as methodological studies, which use only two measurement points, may not necessarily grasp the development. However, the studies in our review, which used more advanced measures and had more than two measurement points, did not have consistent results either which could support the claim that deep approach to learning develops during studies.

Related to the kind of analyses that are reported in the studies included in the review, the idea could be raised that lack of change in the whole group level does not necessarily mean that there are no changes in subgroups of students with similar characteristics. Earlier, Haggis (2003) argued that it could be almost impossible to promote the development of the deep approach to learning if it was not already present. On the other hand, McCune and Entwistle (2011) have suggested that some students have a disposition to understand for themselves, which means that these students’ deep approach to learning is high and stable across courses because they have a strong willingness to understand fully the subject they study. According to Richardson (2011), the stability of students’ approaches to learning over time could indicate also the similarities in their learning environments during studies. All in all, this suggestion, that the deep approach to learning is more stable than the surface approach, was not confirmed by the results of our review. On the contrary, more studies found no significant changes in the surface approach than in the deep approach.

Recent studies (e.g., Fryer 2016; Vanthournout et al. 2013), making use of person-oriented rather than variable oriented longitudinal analyses, indicate that the stability or volatility of students’ approaches to learning are associated with the initial subgroup they belong to, suggesting that students who start higher education with a deep approach are more likely to continue to do so. Studies that explored the development from a person-oriented perspective in our review had all somewhat consistent results: different subgroups of students develop differently and the development is individual in nature (Fryer 2016; Lindblom-Ylanne et al. 2013; Postareff et al. 2014, 2015; Svensson 1977; Saravanamuthu and Yap 2014). Thus, the individual nature of the development of approaches to learning seems to be supported by our review. This could also be a reason why the group-level studies have such different results. As the study by Lindblom-Ylanne et al. (2013) showed that when there are no changes at group level, there can be a lot of individual variation at individual level. The early study of Marton and Säljö (1976b) and Svensson (1977) already showed that different students’ levels of learning outcomes develop differently and different students respond to the requirements of the context more easily than others. It can be suggested that, in order to study the development in approaches to learning, it would be worthwhile to look further than the whole group level as indicated by some studies in our review (e.g., Asikainen et al. 2014).

A final methodological issue that is clear from this review is that all 43 studies are based on self-report data and that one way to advance the field would be to invest in other measures or at least to invest in data-triangulation (cf. Catrysse et al. 2016) which has been more common in the Self-Regulated Learning (SRL) tradition in the past years (see the paper by Zusho in this special issue).

The limitations of this review could be at least partly responsible for these results. The literature search was limited to three databases, the ERIC database, PsycINFO, and the Web of Science, and although relevant researchers in the field were consulted and snowballing was used to find relevant papers, it is possible that this widening the scope of databases and/or search terms might have had an impact on the conclusions.

To conclude, we argue that longitudinal research within the SAL tradition, aiming to investigate how students change their approaches to learning at group level during higher education, is a “dead end.” First of all, there is no clear theoretical or empirical foundation in the SAL tradition for the claim that students should develop deeper or lesser surface approaches to learning across different years. In addition, there is a lack of clarity and consistency within the SAL framework and more work should be done to identify the differences and similarities in different traditions. Second, ignoring the contextual differences between different courses and study years within higher education is in contradiction to the theoretical assumption that students’ approaches to learning change as a result of the individual characteristics of the learner and the characteristics of the context. Meaningfully investigating how students change their approach to learning in higher education from the SAL perspective should therefore always take this context into account and is therefore probably best done at course/topic or even at task level. From this perspective, it seems ironic that other theoretical models, such as the model of domain learning (Alexander 2003), are theoretically probably better elaborated compared to the SAL model in order to explain both stability and variability in deep and surface learning within a particular domain over a period of time. For longitudinal research across different courses, domains, and study years, other and more general models such as the learning pattern model (Vermunt and Vermetten 2004) are probably much more appropriate (see also the paper by Vermunt and Donche, this issue). These issues bring us back to the metaphor of the swamp of student (approaches) to learning in higher education and our failed attempt with this review to put some “piles” in it. We might be able to reach the other side of the swamp and make some progress in our understanding of student learning and its development in higher education by going back to the roots of the concept of approaches to learning. On the one hand, this could be achieved through studying approaches to learning within a clear context (e.g., at course or task level) and relying on other theoretical models, on the other hand, to strengthen the theoretical foundations (e.g., on the model of domain learning for the assumption that students develop deeper approaches with increasing domain expertise) and to explore new ways to measure students’ approaches to learning (e.g., inspired by research in SRL).