Personalized word learning for university students: a profile-based method for e-learning systems

Xie, Haoran; Zou, Di; Zhang, Ruofei; Wang, Minhong; Kwan, Reggie

doi:10.1007/s12528-019-09215-0

Personalized word learning for university students: a profile-based method for e-learning systems

Published: 27 March 2019

Volume 31, pages 273–289, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Computing in Higher Education Aims and scope Submit manuscript

Personalized word learning for university students: a profile-based method for e-learning systems

Download PDF

Haoran Xie¹,
Di Zou²,
Ruofei Zhang²,
Minhong Wang^3,4 &
…
Reggie Kwan⁵

1348 Accesses
25 Citations
1 Altmetric
Explore all metrics

Abstract

It is widely acknowledged that the acquisition of vocabulary is the foundation of learning English. With the rapid development of information technologies in recent years, e-learning systems have been widely adopted for English as a Second Language (ESL) Learning. However, a limitation of conventional word learning systems is that the prior vocabulary knowledge of learners is not well captured. Understanding the prior knowledge of learners plays a key role in providing personalized learning, which many studies suggest is a successful learning paradigm for vocabulary acquisition, one that aims to optimize instructional approaches and paces by catering to individual learning needs. A powerful learner profile model which can represent learner’s prior knowledge is therefore important for word learning systems to better facilitate personalized learning. In this article, we investigated various methods to establish learner profiles and attempted to determine the optimal method. To verify the effectiveness of personalized word learning supported by the proposed model, ESL students from several universities participated in this study. The empirical results showed that the proposed learner profile model can better facilitate vocabulary acquisition compared with other baseline methods.

Personalized Word Learning for ESL Students via Integration of Implicit and Explicit Profiles

A Personalized Task Recommendation System for Vocabulary Learning Based on Readability and Diversity

An Explicit Learner Profiling Model for Personalized Word Learning Recommendation

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

It is widely acknowledged among researchers that the acquisition of vocabulary is the foundation of English learning (Paul Nation and Robert Waring 1997; Oxford and Scarcella 1994). English as a Second Language (ESL) learners and linguists have attempted to develop various learning theories and frameworks (Gu 2003; Hu and Nassaji 2016; Hulstijn and Laufer 2001; Keating 2008; Prince 1996; Schmitt 2008) to identify effective learning methods to promote word retention. With the rapid development of information technology in recent years, e-learning systems have been widely adopted in language learning for ESL students (Golonka et al. 2014; Wang et al. 2017). As revealed by several previous studies (Chen and Chung 2008; Chen and Li 2010; Zou and Xie 2018), conventional word learning systems have the limitation that the prior vocabulary knowledge of learners is not well understood and captured by these systems. There are two underlying reasons for this.

First, the user data obtained by learning systems are limited. There are two types of techniques, intrusive and non-intrusive techniques (Kaya and Bicen 2016; Ortigosa et al. 2014; Ramakers et al. 2012), for collecting user data for learning systems. Intrusive techniques refer to explicit data collection involving users, like user input, surveys, or feedback or the attachment of explicit data collection devices like sensors (Bedogni et al. 2012), EEG headsets (Campbell et al. 2010), or eye-trackers (Alemdag and Cagiltay 2018) to users, while non-intrusive techniques refer to implicit data collection like recording learning logs (Friesner and Hart 2005), recording click-through data (Joachims 2002), and exploiting devices without direct contact with the users [e.g., using digital cameras to collect videos of users for emotional analysis (Poria et al. 2015)]. According to the review study conducted by Fu and Hwang (2018), the devices used in technology-enhanced learning for data collection are mainly traditional portable computers. Furthermore, non-intrusive methods (Ortigosa et al. 2014) are harder to apply in technology-enhanced learning studies. Because of the limited kinds of devices and methods employed, the sources of data collected from users in e-learning systems are not adequately diverse and fruitful for in-depth analysis and the deep understanding of learners.

Second, it is challenging to interpret the user data obtained even if the data are diverse and fruitful because of the limited theories of user data interpretation for language learning. For example, it is still difficult to establish accurate connections between the patterns in the data from Magnetic Resonance Imaging (MRI) of the human brain and language learning processes, although some basic and rough patterns have been identified (Rahmani et al. 2017; Barbeau et al. 2017). Similarly, only some shallow relationships between eye gazing data and learning processes have been identified (Koć-Januchta et al. 2017). To sum up, it is very difficult to link the low-level biological data (e.g., EEG, eye gazing, MRI, etc.) of a learner to high-level semantics (e.g., learning status, affective status, etc.) even with deep neural networks (Khosrowabadi et al. 2014). In other words, building an effective model to interpret and represent user data in learning systems is largely constrained by this gap.

Without a good interpretation and understanding of the prior vocabulary knowledge in word learning systems, word learning systems are unable to cater to individual learning needs. In other words, it is difficult facilitate personalized word learning with word learning systems when the issues of limited interpretation and understanding are not addressed. Formally, personalized word learning (Chen and Chung 2008; Chen and Li 2010; Zou and Xie 2018) refers to employing personalized learning strategies in vocabulary learning processes. Specifically, personalized learning is defined as “instruction in which the pace of learning and the instructional approach are optimized for the needs of each learner. Learning objectives, instructional approaches, and instructional content (and its sequencing) may all vary based on learner needs. In addition, learning activities are meaningful and relevant to learners, driven by their interests, and often self-initiated” in the United States National Education Technology Plan 2017 (US Department of Education 2017).

Previous studies (Hsu et al. 2013; Lin et al. 2013; Jeong et al. 2012; Xie et al. 2016) mainly adopt the learner profile, a data-driven model for learner representation and interpretation to address the above research problem, to provide personalized learning. The learner profile can represent user data from various sources and help the e-learning systems to understand factors like learning styles, learning status, and prior knowledge levels. In this study, we focus on learner profiles for the personalized word learning. As mentioned, there are two categories of techniques, intrusive and non-intrusive, for user data collection. In previous studies, the methods of constructing learner profiles were classified as implicit and explicit methods (Zou et al. 2017a; Wang et al. 2018). Specifically, we conduct a further and extensive study on the following two research questions.

What are the system architectures of the personalized word learning systems based on learner profiles for ESL university students?
Which is the optimal method to integrate implicit and explicit learner profiles for personalized word learning systems?

The remainder of this article is organized as follows. In Sect. 2, we will review the relevant research studies of personalized word learning systems. Section 3 will specify each component in the proposed word learning systems. Section 4 introduces the methodologies employed in the experiment. Section 5 will report the empirical results of this study, analyze the results in depth, and discuss the pedagogical implications of this study. The conclusion will be drawn in Sect. 6.

Related work

Along with the irresistible tide of e-learning, recent decades have witnessed a similar flood of development of educational technologies—personalization (Chen and Chung 2008; Martins et al. 2008; Brusilovsky and Millán 2007), a concept that is increasingly expected to change the landscape of learning and teaching and that has attracted a great deal of worldwide scholarly attention, ranging from the investigation of nature and factors of personalization to the establishment of the conceptual framework of personalized learning (Tseng et al. 2008; Brusilovsky and Henze 2007).

According to Wang et al. (2004), personalization is a pedagogical response to the inherent diversity of learners’ knowledge background, skill levels, and preferences. It is believed that ideal personalization is to maximize the compatibility between the learning method and learners’ “particular educational needs and personal characteristics,” and to therefore enable the largest enhancement of their “satisfaction, learning speed and learning effectiveness” (Gómez et al. 2014), in terms of which word learning, of course, is among the beneficiary skills (Chen and Chung 2008). Now such an ideal is being realized thanks to the development of modern technology, especially that of mobile devices and wireless web (Chen et al. 2005). The data tracking systems and wide accessibility of mobile devices (Mobasher 2007) allow m-learning to involve diverse modes and methods, among which personalized learning is one of the most essential (Subramanya 2014), thereby earning itself a significant place as a particular feature of m-learning (Romrell et al. 2014; Cochrane 2010). As an e-learning factor, it is implemented mainly in the design level of interactive learning environments, where it is proposed that such elements be taken into consideration as “locus of control, learning styles, anxiety, tolerance for ambiguity, prior experience, interests, attitudes, and disabilities” (Reeves and Reeves 1997). For all the extensive discussions of personalization factors, they mainly center on two keywords. One is autonomy. Personalization puts learners’ choice in the center (Baker and Clarke-Midura 2013; Bray and McClaskey 2015), ranging from ensuring a learning pace and learning styles catering to their preferences to providing learning content compatible with their needs and interests. In word learning, it is suggested that learners be allowed to choose the words to learn and create vocabulary lists on their own—only then would word knowledge be of greater salience and of longer retention (Swaffar 1988). The other is scaffolding. This is based on the acknowledgment of the crucial position of a learner’s ability and feedback in the learning process and effect (Chen et al. 2005), as well as on the reasonableness of the zone of proximal development (ZPD) (Vygotsky 1978). ZPD theory holds that the learner would be “frustrated” or “presented with no challenge” if the instruction is too difficult or too simple; the ideal level of learning materials should fit into the “zone” between an individual’s highest and lowest ability limits—which is exactly the goal of scaffolding (Hammond and Gibbons 2005). In word learning, personalization is expected to support adaptation and befit scaffolding by addressing the exact problem of a given individual and offering different levels of support according to their different abilities. Learners’ creation of a word list could be counted as an example of two factors: By being allowed to select the target words in personalized word lists according to their interests and abilities, learners are expected to engage in deeper processing and longer retention of the learned knowledge due to their achievement of autonomy, as well as having their problems exactly met and knowledge properly digested because of the scaffolding.

On a larger scale, a variety of conceptual systems for personalized word learning have been proposed from different perspectives. Some focus on personalization in the educational strategy level, represented by theories of curriculum sequencing (Brusilovsky 2003; Chen et al. 2006; Hübscher 2000) and adaptive presentation (Papanikolaou et al. 1999; Wang et al. 2004); others regard Internet techniques as the basic requisite of personalized learning and investigate the realization of personalization by e-commerce (Wu et al. 2003), web searches (Sugiyama et al. 2004), web data mining (Lin et al. 2013), social media (Xie et al. 2014), etc. Chen and Chung (2008) established Item Response Theory (IRT) and learning memory cycles where learners could achieve their highest learning efficiency by having the learning material cater to their vocabulary abilities and memory cycles. Chen and Li (2010) advance personalized context-aware ubiquitous learning systems in an attempt to adapt the learning content to learners’ locations, schedules, and abilities. Similarly, Huang et al. (2012) propose a ubiquitous English vocabulary learning system using video clips to allow learners to experience systematic word learning without time or space restrictions. Loucky (2012) suggests the pre-arrangement of the target vocabulary into bilingual categories with common semantic keywords in order to build a distance vocabulary learning system. Bulger (2016) builds a typology of technologically-enabled personalized learning systems along with five supporting categories: a customized learning interface, learning management, data-driven learning, adaptive learning, and an intelligent tutor. In terms of facilitating personalized word learning, Xie et al. (2016) discuss two kinds of profiling techniques—explicit user profiling and implicit user profiling, mainly focusing on the ownership of learners’ data and control of their vocabulary proficiency.

System architecture

With the development of connectivist pedagogy in recent years, connectivist approaches have been defined and applied to teaching and learning practices (Downes 2010; Siemens 2005). Learning is defined as “the process of building networks of information, contacts, and resources that are applied to real problems,” and “this pedagogical approach focuses on building and maintaining networked connections that are relevant, current and flexible enough to support student-centered learning” (McLoughlin 2013). Unlike teacher-directed curricula, student-centered learning is supported by personalized learning environments that enable “individuals to select, integrate and construct knowledge using various software, services, and options based on their needs and circumstance” (McLoughlin 2013). Ideally, such a model can lead to learning based on individual needs. The word learning process is also driven by individual needs (Chen and Li 2010). Therefore, in this section, we propose a detailed system architecture to facilitate personalized word learning.

As shown in Fig. 1, the generic system architecture of the proposed personalized word learning system can be divided into three components: user data collection, learner profiling, and personalized learning. The details of each component are introduced in the following subsections.

User data collection

Learner profiling adopts the conventional vector form to represent the target words and their corresponding knowledge levels. Formally, a learner profile is denoted as

$$\begin{aligned} L_{i}=(w_{1}:\varepsilon ^{i}_{1};w_{2}:\varepsilon ^{i}_{2};...;w_{n}:\varepsilon ^{i}_{n}), \end{aligned}$$

(1)

where $w_{x}$ is a target word, $\varepsilon ^{i}_{x}$ is the knowledge level of learner i on word $w_{x}$, and a value in the interval [0, 1] is used to represent $\varepsilon ^{i}_{x}$ (Zou et al. 2017a). The value of $\varepsilon ^{i}_{x}$ is calculated on the basis of the linear combination of the values obtained from explicit and implicit data acquisition as follows. In this proposed system, the integration of explicit data acquisition and implicit data acquisition is employed as proposed in an earlier study (Wang et al. 2018). As mentioned, explicit data acquisition relies on user input data to understand learners’ prior knowledge levels. In the context of word learning, the form of user input is to ask learners to indicate their prior word knowledge levels according to vocabulary knowledge scales (VKS) (Folse 2006) for selected words at different difficulty levels. Normally, 3-rating VKS is adopted, as it provides a good balance between accuracy and efficiency (Zou et al. 2017b). These words are organized in the form of word-nested models, as shown in Fig. 2. The nested model for vocabulary is essential to grouping words at different difficulty levels, and a word set $A_{n}$ ($1\le n \le k$) is the set including all words at the difficulty level n. The difficulty levels of words can be obtained by using current software tools like Twinword^{Footnote 1} or Frequent Level Checking (FLC^{Footnote 2}). Note that learners can only input their prior knowledge levels for a limited number of words. For the remaining words, we use an explicit acquisition function (Zou et al. 2017a) to estimate the knowledge levels.

For implicit data acquisition in word learning systems, the typical data sources to be collected are historical learning logs and current learning data. We compared various kinds of historical learning data sources, including reading texts, writing assignments, and test papers, in a previous study (Zou et al. 2015). In this study, we found that test papers are the most accurate data source for constructing learner profiles. In addition, the integration of all three data sources ensures a more accurate construction of learner profiles than exploiting a single data source. Although there are several potential approaches (Maseleno et al. 2018; Dietz-Uhler and Hurn 2013) to exploiting other data sources in learner profiling, we still adopt the hybrid method to integrate the three data sources above for implicit data acquisition in the proposed system, as the focus of this study is to identify the optimal method for integrating implicit and explicit data acquisition. The main idea of implicit data acquisition is to adopt term-frequency and inverse document frequency (TF-IDF) (Jones 1972) to denote the weights of the words in the external documents. For a learning document $d= \{w_{1}, w_{2},..., w_{d}\}$ and a set D of learning documents, the TF-IDF paradigm is adopted to measure the term weighting as follows:

$$\begin{aligned} rel(w_{j})=\frac{f(w_{j},d)}{max\{f(w,d):w\in d\}}\times log\frac{|D|}{|{d\in D:w_{j}\in d}|}, \end{aligned}$$

(2)

where the first component $\frac{f(w_{j},d)}{max\{f(w,d):w\in d\}}$ is the term frequency (TF) and the other component $log\frac{|D|}{|{d\in D:w_{j}\in d}|}$ is the IDF part, which represents the salience of a given word in this document (Wang et al. 2018; Zou et al. 2017a).

Learner profiling

Learner profiling adopts the conventional vector form to represent target words and their corresponding knowledge levels. Formally, a learner profile is denoted as follows:

$$\begin{aligned} L_{i}=(w_{1}:\varepsilon ^{i}_{1};w_{2}:\varepsilon ^{i}_{2};...;w_{n}:\varepsilon ^{i}_{n}), \end{aligned}$$

(3)

where $w_{x}$ is a target word, $\varepsilon ^{i}_{x}$ is the knowledge level of learner i on word $w_{x}$, and a value in the interval [0, 1] is used to represent $\varepsilon ^{i}_{x}$ (Zou et al. 2017a). The value of $\varepsilon ^{i}_{x}$ is calculated on the basis of the linear combination of the values obtained from explicit and implicit data acquisition as follows:

$$\begin{aligned} \varepsilon ^{i}_{x} = \alpha \cdot \varepsilon ^{i}_{x,ex} + (1-\alpha ) \cdot \varepsilon ^{i}_{x,im}, \end{aligned}$$

(4)

where $\varepsilon ^{i}_{x,ex}$ is the knowledge level obtained from explicit data acquisition of learner i, $\varepsilon ^{i}_{x,im}$ is the implicit data acquisition level, and $\alpha$ is a parameter to adjust the weights of these two values. In a previous study (Wang et al. 2018), a weight of 0.5 was used so that the explicit and implicit knowledge levels of the words were equally weighted. In this study, more weights will be tried and verified to identify the optimized combinations.

In addition to the integration of explicit and implicit knowledge levels of words, two kinds of updating methods, time-decayed update and feedback-driven update, are employed in learner profiling (Wang et al. 2018). As the retention of a word will decrease as time elapses, time-decayed update applies the idea of the Ebbinghaus forgetting curve (Wixted and Ebbesen 1997) and exploits a time-decayed function $\varepsilon _{i}^{x}|t=e^{-t/\varepsilon _{x}^{i}}$ (where $\varepsilon _{i}^{x}|t$ is the knowledge level without the review of word $w_(x)$ after time t) (Wang et al. 2018). Meanwhile, feedback-driven update is a mechanism to adjust $\varepsilon _{i}^{x}$ by considering learning achievements during the word learning processes of the proposed system. The main idea is to categorize the feedback results in four different cases and use a piecewise function to deal with all the cases, as introduced in Wang et al. (2018).

Personalized learning

The personalized learning component aims to offer a sequence of learning tasks according to the knowledge level of each learner. As the knowledge levels are reflected in the learner profile, we therefore recommend learning tasks according to the learner profiles obtained in the above subsection. As the focus is to investigate how to optimize the explicit and implicit knowledge levels in the learner profile, we decided to adopt a recommendation algorithm based on word coverage (Xie et al. 2016), which posits that a learning task should contain more target words unfamiliar to the learners. For a task t, the degree of unfamiliarity of this task can be defined as follows.

$$\begin{aligned} \theta (t,i) = \sum _{\forall w_{x}\in t}\varepsilon _{x}^{i}, \end{aligned}$$

(5)

where $\theta (t,i)$ is the degree of unfamiliarity of learner i with the learning task t, and $w_{x}$ is one of the target words in the learning task t. The recommended tasks are to maximize the degree of unfamiliarity as follows:

$$\begin{aligned} t^{*} = \arg \max _{t\in T}{\theta (t,i)}, \end{aligned}$$

(6)

where T is the set of learning tasks available to the word learning systems.

As shown in Fig. 3, the whole learning process can be divided into seven steps as follows:

1.
The initial step is that the learner first inputs the prior knowledge levels for selected words provided by the system.
2.
By incorporating external data sources, the learner profile is established in the system using both explicit and implicit data.
3.
Two learning tasks are suggested by the system using word coverage recommendations, as mentioned in this subsection.
4.
The learner picks and completes one learning task from two suggested tasks.
5.
After completing the learning task, the system examines whether the whole learning process is completed.
6.
If the learning processes are not completed, the system will provide feedback to the learner profile, and then go back to step 2.
7.
If the learning processes are completed, the system will update the learner profile and terminate the learning processes.

Methodologies

In addition to 32 ESL university students who participated in the experiment of the previous study (Wang et al. 2018), 68 more university students were invited to participate in the further experimental study. There were thus a total of 100 ESL university students with English proficiency at the level of IELTS Band 5.0. We randomly sorted the students into five equal groups. Note that we had already conducted an experiment on two groups with 16 participants in each group in the previous study (Wang et al. 2018). The details of each group are introduced and summarized in Table 1.

Table 1 The five groups in the previous and current experiments

Full size table

Control Group The control group employed only explicit learner profiles. In other words, their knowledge levels of vocabulary were obtained from their explicit specifications in the system. The explicit data about their prior knowledge levels were used as the final learner profile. In other words, weights of 1.00 and 0.00 for explicit and implicit learner profiles were used during learner profile integration, as introduced in Eq. (2). Four participants joined the control group, which thus included a total of 20 participants.
Experimental Group 1 Experimental Group 1 received different settings from the control group. Specifically, the weight values were slightly adjusted to 0.25 and 0.75, respectively, for explicit and implicit data. That is, Experimental Group 2 was more heavily weighted for explicit data (i.e., user-input prior knowledge levels) when constructing the learner profile. As this group did not participate in the previous experiment, 20 participants newly joined this group in this study.
Experimental Group 2 Experimental Group 2 was the experimental group in the previous study (Wang et al. 2018). In this group, equal weights, 0.50 and 0.50, were adopted for the explicit and implicit data for learner profiling, respectively (i.e., $\alpha = 1-\alpha = 0.50$). To maintain the equality of participant numbers of each group, four more participants were included in Experimental Group 2, so that 20 participants were included in this group.
Experimental Group 3 The settings for Experimental Group 3 were the converse of those of Experimental Group 1. In other words, the weights were set at 0.25 and 0.75. Thus, more weight was given to implicit data (i.e., the prior knowledge levels learned from learner assignments, exam papers, and so on) when constructing the learner profile. As this group did not participate in the previous experiment, 20 participants newly joined this group in this study.
Experimental Group 4 The settings of Experimental Group 4 were the converse of those for the control group. In other words, the weights were set as 0.00 and 1.00. The learner profile is thus built only on the basis of implicit data, with explicit data neglected. As this group did not participate in the previous experiment, 20 participants newly joined this group in this study.

Turning to the experimental procedures, a pre-test was conducted to ensure that the participants had the least knowledge of the 20 target words before the learning processes. The learners created an account on the word learning system and followed the seven steps of the learning process introduced in Sect. 3.3. The target words, learning tasks and marking criteria followed those of previous studies of vocabulary acquisition (Folse 2006; Zou 2017). The whole learning process lasted for two days, and each participant had to complete 10 learning tasks suggested by the system. Each learning task could be finished in a very short period of time about ten minutes. After completing the learning process, a post-test was conducted to examine learners’ immediate learning of the 20 target words within 30 min.

For both pre-test and post-test, we used the same test to evaluate learning effectiveness; a sample test paper is provided in Table 2 of the “Appendix”. The 3-rating vocabulary knowledge scale (Folse 2006) was adopted for the marking criteria. The target words were adapted from Zou’s research (2017). Specifically, (1) if the learner could not remember the word meaning, no score (0) would be given; (2) if the learner could remember the word meaning without knowing how to use it in context, a half score (0.5) would be given; and (3) if the learner could remember the word meaning and use the word in the correct context, a full score (1) would be given.

Results and discussion

The experimental results are illustrated in Fig. 4. The green curve and orange bars present the same values (i.e., post-test results) in two different ways, while the blue bars show the pre-test results. The pre-test results of four groups are close to each other. We applied a significance test to verify that the differences between each two groups were not significant ($t > 0.1$). We also applied Student’s t-test to examine whether the differences between the two groups were significant, and found that all differences between any two groups in the post-test were significant ($t < 0.05$). Furthermore, we identified Experimental Group 1 as having the best performance on word retention, while Experimental Group 4 had the worst performance. Given that Experimental Group 3 integrated explicit and implicit data in learner profiling and the control group only employed the explicit data, the result we obtained of the control group outperforming Experimental Group 3 indicates that the integration of both data sources cannot always outperform a single explicit data source when establishing a learner profile.

Furthermore, we found that the curve reached its peak value at $\alpha = 0.75$ and decreased with decreasing $\alpha$, taking its minimum value when $\alpha = 0.00$. These results show that the integration of “implicit data” with explicit data can improve the effectiveness of personalized word learning. However, such integration should be dominated by explicit data. In other words, the optimal method of integrating implicit and explicit data is to give more weight to explicit data (i.e., $\alpha > 1-\alpha$), while implicit data serves as a supplementary source during the integration. This result is consistent with the findings of a previous study (Xie et al. 2014) that the explicit data specified by users is more dominant, and a better quality of data can be generated if implicit data are added as supplements.

The implication of the results is that learners actually understand their own vocabulary proficiency better than “their test papers, assignments and so on” would reveal. From the perspective of the system, the design of personalized word learning systems needs explicit data on users’ prior knowledge levels as obtained through user input. However, the requisite manual efforts are time-consuming and infeasible for large amounts of data. Implicit data then serve as an important source of additional data compensating for this drawback. The designer of a personalized word learning system should pay more consideration to the balance of user-input and implicit data. From the perspective of word learning, university students have already shown that they can clearly understand their prior knowledge levels of the vocabulary in the experiments. In addition to personalized word learning systems, university ESL students are suggested to have their own “personalized learning plans,” including picking English readings with a larger vocabulary size than their own and rehearsing unfamiliar target words in a learning task.

Conclusion

In this article, we studied the system architecture of personalized word learning systems based on learner profiles and the optimal method for integrating implicit and explicit data sources to construct learner profiles. We introduced each component of the proposed word learning system and conducted experimental studies on different combinations of explicit and implicit data sources for learner profiling. The experimental results showed that the explicit data dominates, while implicit data sources can serve as supplements. In addition, we discussed the implications of this study from the perspectives of system design and word learning.

The limitations of this study are that the number of participants in each group was not large and the behavioral data during learning were not actually applied to adjust the learning process. In the future, we will continue investigating the research questions of how to minimize the effort of user input for the explicit data and of how to integrate the behavioral data to better facilitate personalized word learning.

Notes

References

Alemdag, E., & Cagiltay, K. (2018). A systematic review of eye tracking research on multimedia learning. Computers & Education, 125, 413–428.
Article Google Scholar
Baker, R. S. J., & Clarke-Midura, J. (2013). Predicting successful inquiry learning in a virtual performance assessment for science. In International conference on user modeling, adaptation, and personalization (pp. 203–214). Springer
Barbeau, E. B., Chai, X. J., Chen, J.-K., Soles, J., Berken, J., Baum, S., et al. (2017). The role of the left inferior parietal lobule in second language learning: An intensive language training fmri study. Neuropsychologia, 98, 169–176.
Article Google Scholar
Bedogni, L., Di Felice, M., & Bononi, L. (2012). By train or by car? detecting the user’s motion type through smartphone sensors data. In 2012 IFIP Wireless days (wd) (pp. 1–6). IEEE.
Bray, B., & McClaskey, K. (2015). Personalization vs. differentiation vs. individualization report (pdi) v3. Viitattu, 16:2015.
Brusilovsky, P., & Henze, N. (2007). Open corpus adaptive educational hypermedia. In The adaptive web (pp. 671–696). Springer.
Brusilovsky, P., & Millán, E. (2007). User models for adaptive hypermedia and adaptive educational systems. In The adaptive web (pp. 3–53). Springer.
Brusilovsky, P. (2003). Adaptive navigation support in educational hypermedia: The role of student knowledge level and the case for meta-adaptation. British Journal of Educational Technology, 34(4), 487–497.
Article Google Scholar
Bulger, M. (2016). Personalized learning: The conversations were not having. Data and Society, 22.
Campbell, A., Choudhury, T., Hu, S., Lu, H., Mukerjee, M. K., Rabbi, M., et al. (2010). Neurophone: Brain–mobile phone interface using a wireless EEG headset. In Proceedings of the second ACM SIGCOMM workshop on networking, systems, and applications on mobile handhelds (pp. 3–8). ACM.
Chen, C.-M., & Chung, C.-J. (2008). Personalized mobile English vocabulary learning system based on item response theory and learning memory cycle. Computers & Education, 51(2), 624–645.
Article Google Scholar
Chen, C.-M., Lee, H.-M., & Chen, Y.-H. (2005). Personalized e-learning system using item response theory. Computers & Education, 44(3), 237–255.
Article Google Scholar
Chen, C.-M., & Li, Y.-L. (2010). Personalised context-aware ubiquitous learning system for supporting effective english vocabulary learning. Interactive Learning Environments, 18(4), 341–364.
Article Google Scholar
Chen, C.-M., Liu, C.-Y., & Chang, M.-H. (2006). Personalized curriculum sequencing utilizing modified item response theory for web-based instruction. Expert Systems with Applications, 30(2), 378–396.
Article Google Scholar
Cochrane, T. D. (2010). Exploring mobile learning success factors. Research in Learning Technology, 18(2), 133–148.
Article Google Scholar
Dietz-Uhler, B., & Hurn, J. E. (2013). Using learning analytics to predict (and improve) student success: A faculty perspective. Journal of Interactive Online Learning, 12(1), 17–26.
Google Scholar
Downes, S. (2010). Learning networks and connective knowledge. In Collective intelligence and e-learning 2.0: Implications of web-based communities and networking (pp. 1–26). IGI Global.
Folse, K. S. (2006). The effect of type of written exercise on l2 vocabulary retention. TESOL Quarterly, 40(2), 273–293.
Article Google Scholar
Friesner, T., & Hart, M. (2005). Learning logs: Assessment or research method. The Electronic Journal of Business Research Methodology, 3(2), 117–122.
Google Scholar
Fu, Q.-K., & Hwang, G.-J. (2018). Trends in mobile technology-supported collaborative learning: A systematic review of journal publications from 2007 to 2016. Computers & Education, 119, 129–143.
Article Google Scholar
Golonka, E. M., Bowles, A. R., Frank, V. M., Richardson, D. L., & Freynik, S. (2014). Technologies for foreign language learning: A review of technology types and their effectiveness. Computer Assisted Language Learning, 27(1), 70–105.
Article Google Scholar
Gómez, S., Zervas, P., Sampson, D. G., & Fabregat, R. (2014). Context-aware adaptive and personalized mobile learning delivery supported by UoLmP. Journal of King Saud University-Computer and Information Sciences, 26(1), 48.
Article Google Scholar
Gu, P. Y. (2003). Vocabulary learning in a second language: Person, task, context and strategies. TESL-EJ, 7(2), 1–25.
Google Scholar
Hammond, J., & Gibbons, P. (2005). What is scaffolding. Teachers Voices, 8, 13.
Google Scholar
Hsu, C.-K., Hwang, G.-J., & Chang, C.-K. (2013). A personalized recommendation-based mobile learning approach to improving the reading performance of EFL students. Computers & Education, 63, 327–336.
Article Google Scholar
Huang, Y.-M., Huang, Y.-M., Huang, S.-H., & Lin, Y.-T. (2012). A ubiquitous English vocabulary learning system: Evidence of active/passive attitudes vs. usefulness/ease-of-use. Computers & Education, 58(1), 273–282.
Article Google Scholar
Hübscher, R. (2000). Logically optimal curriculum sequences for adaptive hypermedia systems. In International conference on adaptive hypermedia and adaptive web-based systems (pp. 121–132). Springer.
Hulstijn, J. H., & Laufer, B. (2001). Some empirical evidence for the involvement load hypothesis in vocabulary acquisition. Language Learning, 51(3), 539–558.
Article Google Scholar
Hu, H. M., & Nassaji, H. (2016). Effective vocabulary learning tasks: Involvement load hypothesis versus technique feature analysis. System, 56, 28–39.
Article Google Scholar
Jeong, H.-Y., Choi, C.-R., & Song, Y.-J. (2012). Personalized learning course planner with e-learning dss using user profile. Expert Systems with Applications, 39(3), 2567–2577.
Article Google Scholar
Joachims, T. (2002). Optimizing search engines using clickthrough data. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 133–142). ACM.
Jones, K. S. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 11–21.
Article Google Scholar
Kaya, T., & Bicen, H. (2016). The effects of social media on students behaviors; Facebook as a case study. Computers in Human Behavior, 59, 374–379.
Article Google Scholar
Keating, G. D. (2008). Task effectiveness and word learning in a second language: The involvement load hypothesis on trial. Language Teaching Research, 12(3), 365–386.
Article Google Scholar
Khosrowabadi, R., Quek, C., Ang, K. K., & Wahab, A. (2014). ERNN: A biologically inspired feedforward neural network to discriminate emotion from EEG signal. IEEE Transactions on Neural Networks and Learning Systems, 25(3), 609–620.
Article Google Scholar
Koć-Januchta, M., Höffler, T., Thoma, G.-B., Prechtl, H., & Leutner, D. (2017). Visualizers versus verbalizers: Effects of cognitive style on learning with texts and pictures-an eye-tracking study. Computers in Human Behavior, 68, 170–179.
Article Google Scholar
Lin, C. F., Yeh, Y., Hung, Y. H., & Chang, R. I. (2013). Data mining for providing a personalized learning path in creativity: An application of decision trees. Computers & Education, 68, 199–210.
Article Google Scholar
Loucky, J. P. (2012). Designing distance learning tasks to help maximize vocabulary development. International Journal of Virtual and Personal Learning Environments (IJVPLE), 3(2), 35–58.
Article Google Scholar
Martins, C., Faria, L., Carvalho, C. V. D., & Carrapatoso, E. (2008). User modeling in adaptive hypermedia educational systems. Educational Technology & Society, 11(1), 194–207.
Google Scholar
Maseleno, A., Sabani, N., Huda, M., Ahmad, R., Jasmi, K. A., & Basiron, B. (2018). Demystifying learning analytics in personalised learning. International Journal of Engineering & Technology, 7(3), 1124–1129.
Article Google Scholar
McLoughlin, C. E. (2013). The pedagogy of personalised learning: exemplars, MOOCS and related learning theories. In EdMedia: World conference on educational media and technology (pp. 266–270). Association for the Advancement of Computing in Education (AACE).
Mobasher, B. (2007). Data mining for web personalization. In The adaptive web (pp. 90–135). Springer.
Ortigosa, A., Martín, J. M., & Carro, R. M. (2014). Sentiment analysis in facebook and its application to e-learning. Computers in Human Behavior, 31, 527–541.
Article Google Scholar
Oxford, R. L., & Scarcella, R. C. (1994). Second language vocabulary learning among adults: State of the art in vocabulary instruction. System, 22(2), 231–243.
Article Google Scholar
Papanikolaou, K. A., Magoulas, G. D., & Grigoriadou, M. (1999). A connectionist approach for adaptive lesson presentation in a distance learning course. In Proceedings of international joint conference on neural networks (Cat. No. 99CH36339), IJCNN’99 (Vol. 5, pp. 3522–3526). IEEE.
Paul Nation and Robert Waring. (1997). Vocabulary size, text coverage and word lists. Vocabulary: Description, Acquisition and Pedagogy, 14, 6–19.
Google Scholar
Poria, S., Cambria, E., Hussain, A., & Huang, G.-B. (2015). Towards an intelligent framework for multimodal affective data analysis. Neural Networks, 63, 104–116.
Article Google Scholar
Prince, P. (1996). Second language vocabulary learning: The role of context versus translations as a function of proficiency. The Modern Language Journal, 80(4), 478–493.
Article Google Scholar
Rahmani, F., Sobhani, S., & Hadi Aarabi, M. (2017). Sequential language learning and language immersion in bilingualism: Diffusion MRI connectometry reveals microstructural evidence. Experimental Brain Research, 235(10), 2935–2945.
Article Google Scholar
Ramakers, R., Vanacken, D., Luyten, K., Coninx, K., & Schöning, J. (2012). Carpus: A non-intrusive user identification technique for interactive surfaces. In Proceedings of the 25th annual ACM symposium on user interface software and technology (pp. 35–44). ACM.
Reeves, T. C., & Reeves, P. M. (1997). Effective dimensions of interactive learning on the world wide web. In Web-based instruction (p. 63).
Romrell, D., Kidder, L., & Wood, E. (2014). The SAMR model as a framework for evaluating mLearning. Online Learning Journal, 18(2), 1–15.
Google Scholar
Schmitt, N. (2008). Instructed second language vocabulary learning. Language Teaching Research, 12(3), 329–363.
Article Google Scholar
Siemens, G. (2005). Connectivism: A learning theory for the digital age. Instructional Technology and Distance Education, 2(1), 3–10.
Google Scholar
Subramanya, S. R. (2014). Mobile apps as supplementary educational resources. International Journal of Advances in Management, Technology & Engineering Sciences, 9, 38–43.
Google Scholar
Sugiyama, K., Hatano, K., & Yoshikawa, M. (2004). Adaptive web search based on user profile constructed without any effort from users. In Proceedings of the 13th international conference on world wide web (pp. 675–684). ACM.
Swaffar, J. K. (1988). Readers, texts, and second languages: The interactive processes. The Modern Language Journal, 72(2), 123–149.
Article Google Scholar
Tseng, J. C., Chu, H. C., Hwang, G. J., & Tsai, C. C. (2008). Development of an adaptive learning system with two sources of personalization information. Computers & Education, 51(2), 776–786.
Article Google Scholar
US Department of Education. (2017). Reimagining the role of technology in education: 2017 national education technology plan update (p. 9).
Vygotsky, L. (1978). Interaction between learning and development. Readings on the Development of Children, 23(3), 34–41.
Google Scholar
Wang, H. C., Li, T. Y., & Chang, C. Y. (2004). Adaptive presentation for effective web-based learning of 3D content. In Proceedings of IEEE international conference on advanced learning technologies (pp. 136–140). IEEE.
Wang, F. L., Zou, D., & Xie, H. (2018). Personalized word learning for ESL students via integration of implicit and explicit profiles. In International conference on blended learning (pp. 301–310). Springer.
Wang, H.-Y., Liu, G.-Z., & Hwang, G.-J. (2017). Integrating socio-cultural contexts and location-based systems for ubiquitous language learning in museums: A state of the art review of 2009–2014. British Journal of Educational Technology, 48(2), 653–671.
Article Google Scholar
Wixted, J. T., & Ebbesen, E. B. (1997). Genuine power curves in forgetting: A quantitative analysis of individual subject forgetting functions. Memory & cognition, 25(5), 731–739.
Article Google Scholar
Wu, D., Im, I., Tremaine, M., Instone, K., & Turoff, M. (2003). A framework for classifying personalization scheme used on e-commerce websites. In Proceedings of the 36th annual Hawaii international conference on system sciences (p. 12). IEEE.
Xie, H., Li, Q., Mao, X., Li, X., Cai, Y., & Rao, Y. (2014). Community-aware user profile enrichment in folksonomy. Neural Networks, 58, 111–121.
Article Google Scholar
Xie, H., Zou, D., Lau, R. Y. K., Wang, F. L., & Wong, T.-L. (2016). Generating incidental word-learning tasks via topic-based and load-based profiles. IEEE Multimedia, 23(1), 60–70.
Article Google Scholar
Zou, D., Xie, H., Wang, F. L., Wong, T.-L., Poon, C. K., & Ho, W.-S. (2015). Comparative study on heterogeneous profiling sources for second language learners. In International conference on technology in education (pp. 209–218). Springer.
Zou, D., Xie, H., Wong, T.-L., Wang, F. L., Kwan, R., & Chan, W. H. (2017a). An explicit learner profiling model for personalized word learning recommendation. In International symposium on emerging technologies for education (pp. 495–499). Springer.
Zou, D. (2017). Vocabulary acquisition through cloze exercises, sentence-writing and composition-writing: Extending the evaluation component of the involvement load hypothesis. Language Teaching Research, 21(1), 54–75.
Article Google Scholar
Zou, D., & Xie, H. (2018). Personalized word-learning based on technique feature analysis and learning analytics. Journal of Educational Technology & Society, 21(2), 233–244.
Google Scholar
Zou, D., Xie, H., Rao, Y., Wong, T.-L., Wang, F. L., & Wu, Q. (2017). A comparative study on various vocabulary knowledge scales for predicting vocabulary pre-knowledge. International Journal of Distance Education Technologies (IJDET), 15(1), 69–81.
Article Google Scholar

Download references

Acknowledgements

The work described in this article was fully supported by the Standing Committee on Language Education and Research (EDB(LE)/P&R/EL/175/2), the Innovation and Technology Fund (Project No. GHP/022/17GD) of the Innovation and Technology Commission of the Government of the Hong Kong Special Administrative Region, and Eastern Scholar Chair Professorship Fund (No. JZ2017005) from Shanghai Municipal Education Commission of China. A preliminary study was published in the International Conference on Blended Learning 2018 (Wang et al. 2018), and this article has been thoroughly re-written after we studied new research questions, conducted extensive experiments, obtained new findings and found new implications. Di Zou is the corresponding author of this article.

Author information

Authors and Affiliations

Department of Mathematics and Information Technology, The Education University of Hong Kong, Hong Kong, China
Haoran Xie
Department of English Language Education, The Education University of Hong Kong, Hong Kong, China
Di Zou & Ruofei Zhang
Faculty of Education, The University of Hong Kong, Hong Kong, China
Minhong Wang
Department of Educational Information Technology, East China Normal University, Shanghai, China
Minhong Wang
The Open University of Hong Kong, Hong Kong, China
Reggie Kwan

Authors

Haoran Xie
View author publications
You can also search for this author in PubMed Google Scholar
Di Zou
View author publications
You can also search for this author in PubMed Google Scholar
Ruofei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Minhong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Reggie Kwan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Di Zou.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

See Table 2.

Table 2 Folse’s modified vocabulary knowledge scale (2006) was applied to measure the participants’ word learning outcomes. The target words were adapted from Zou’s research (2017)

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xie, H., Zou, D., Zhang, R. et al. Personalized word learning for university students: a profile-based method for e-learning systems. J Comput High Educ 31, 273–289 (2019). https://doi.org/10.1007/s12528-019-09215-0

Download citation

Published: 27 March 2019
Issue Date: 15 August 2019
DOI: https://doi.org/10.1007/s12528-019-09215-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Personalized word learning for university students: a profile-based method for e-learning systems

Abstract

Similar content being viewed by others

Personalized Word Learning for ESL Students via Integration of Implicit and Explicit Profiles

A Personalized Task Recommendation System for Vocabulary Learning Based on Readability and Diversity

An Explicit Learner Profiling Model for Personalized Word Learning Recommendation

Introduction

Related work