Abstract
This study examined the effects of instructional multimedia tasks designed based on five principles of reducing extraneous processing on language learners’ listening and reading comprehension development. The study comprises two phases of design and experimentation. In the design phase, twelve sets of multimedia tasks were designed considering two conditions of applying and violating the principles. In the experimentation phase, the tasks were used in two conversation classes, each consisting of 15 students. The participants’ listening and reading comprehension were assessed before and after the study by the International English Language Testing System test. The experimental group received instruction based on condition 1, using multimedia prepared in accord with the principles; and the control group received the same instruction based on condition 2, using traditionally designed videos. The results revealed that condition 1 was significantly effective in the development of both listening and reading comprehension of the participants. More detailed analyses showed that the tasks had a significant impact on improving the comprehension of monologues rather than dialogues. Further, the instruction under condition 1 led to the development of both reading for gist and reading for specific information, but the effect size of the intervention was larger for the former. The findings elucidate the value of theory-grounded and practice-supported principles for improving the design and production of multimedia presentations and the critical role of rigorously designed multimedia in language learners’ input processing and comprehension.
摘要
本研究探討了根據五種降低外在處理原則設計的多媒體教學任務,對語言學習者聽力和閱讀理解發展的影響。本研究包含了設計與實驗兩階段。在設計階段,考慮到應用與違反原則的兩種條件,我們設計了12種多媒體任務。在實驗階段,這些任務被用在兩個會話課的課堂上,每個班由15個學生組成。在研究前後,透過雅思來評量受試者的聽力與閱讀理解能力。實驗組接受了根據條件1的教學,使用按照原則準備的多媒體;對照組則接受了根據條件2的相同教學,教學時使用傳統設計的影片。研究結果顯示,條件1對受試者的聽力與閱讀理解的發展都有顯著的效果。更詳細的分析顯示,這些任務對於改善獨白而非對話的理解力有顯著影響。此外,條件1的教學導致了閱讀大意和特定資訊能力的發展,但條件1對前者的效果量更大。這些結果闡明了以理論、實踐為基礎的原則之價值,以利改善多媒體呈現的設計與製作,以及嚴謹的多媒體設計在語言學習者的資訊輸入處理和理解上扮演的關鍵角色。
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
Improvement of education in the twenty-first century is strongly tied to the establishment and application of novel educational and pedagogical technologies. With the constant arrival of new and arising educational technologies, it is crucial to estimate the degree to which technology employment leads to more effective learning and teaching (Iriti et al., 2016).
In this regard, Computer Assisted Language Learning (CALL) as a techno-centric practice (Stockwell, 2007) has hugely contributed to the development of teaching approaches in language classes. The enormous progress in software and hardware technology in this arena has enabled the assistance of complicated user interfaces and multimedia exhibition manners (Alzahrani & Roberts, 2021) that has massively promoted the learning and teaching of languages in the last decades.
The use of computer-based learning environments provides the opportunity of combining different media (text, audio, graphics, motion, etc.) in teaching that enriches the input by making it multimodal and more comprehensible. The benefits of this type of input for language learning are enormous as multimedia has been found to both affect the development of almost all language skills (Liu et al., 2018; Racicot, 2016; Türk & Erçetin, 2014) and has a far-reaching impact on language learners’ personality traits such as motivation, anxiety, and attitudes (Huang & Mayer, 2016; Leutner, 2014; McDonald, 2004). However, an underlying question concerning multimedia instruction is how to efficiently display verbal and visual information to encourage learning in multimedia environments. One response to this subject originates from the modality principle of cognitive theory of multimedia learning (CTML) (Mayer, 2003, 2005, 2009, ; Mayer & Moreno, 2003) and the modality effect of cognitive load theory (CLT) (Sweller et al., 2011).
Based on CLT, working memory (WM) has a restricted capacity that is associated with the mental effort invested in a learning task. This span restriction should be taken into account in designing well-organized instructional materials (Sweller et al., 1998); otherwise, the cognitive load may increase. It is known that the CL enforcement on a student during learning is due to a mixture of the complications of the instructional design (i.e., germane load) and the information to be learned (i.e., intrinsic load) (Sweller et al., 2011). In this framework, multimedia learning principles have been established to regulate the CL of multimedia tasks. These principles are of three categories, i.e., reducing extraneous processing, managing essential processing, and fostering generative processing. Those principles that target the way instructional materials should be designed to minimize extraneous cognitive load are grouped in reducing the extraneous processing principles of CTML. The main goal of applying these principles in multimedia design is to prevent instances when poor instructional design can drain limited cognitive processing capacity without contributing to learning (Mayer, 2014).
Considering the human brain architecture and the hypothesis that biologically secondary knowledge (oracy and literacy development in L2) needs delicate instructional design, the role of principles of reducing extraneous processing in the foreign language learning domain is highlighted. Besides, sound pedagogical approaches to the improvement of English as a foreign language (EFL) comprehension (via both reading and listening) assume that learners should be imposed to attractive, appropriate, and comprehensive language input (Krashen, 1985). Without a doubt, multimodal learning can create such a learning condition as “oral communication is multimodal, that is, speech is just one component part of the great amount of oral and visual information that is conveyed and perceived when we construct meaning” (Jewitt et al., 2013 as cited in Campoy-Cubillo & Querol-Julian, 2015, p. 195).
Thus, it can be hypothesized that by implementing principles of multimedia task design, CL would decrease, and as a result processing the aural input would be easier. In this way, language learners would face fewer problems during language instruction, can optimize the deployment of cognitive processing strategies, and comprehend the message more easily. This assumption is partially backed up by empirical studies that demand incorporating certain pedagogical and technical principles in designing effective instructional multimedia (Issa et al., 2011; Leutner, 2014; Wang & Li, 2019) . Although previous studies have focused on the role of incorporating selected multimedia design principles in language learning (Dawson et al., 2021; Hung, 2011; Tsai, 2010), there is still a gap in our understanding of the effectiveness of multimedia on the development of comprehension especially when CTML design principles are applied in listening instruction within an EFL context. To fill this gap, the current study has been performed with the aim of applying multimedia learning principles in designing tasks in listening instruction and probing into the development of listening and reading comprehension in the environment of multimodal learning.
Review of Literature
Listening Comprehension and Multimodal Input
Listening is a key component of communication (Nunan, 2002) and “the most widely used language skill in normal daily life” (Martínez-Flor & Usó-Juan, 2006, p. 29). Listening comprehension as a complex and demanding cognitive mechanism involves four overlapping types of processing, including neurological, linguistic, semantic, and pragmatic (Rost, 2011). This demands learners to differentiate between sounds, comprehend the grammatical arrangement and vocabulary, attain acquaintance with the intonation and stress, and contextualize the speech in terms of sociocultural expressions (Vandergrift, 1999).
While the difficulty of listening comprehension has been attributed to factors such as the difficulty of the task or shortage of linguistic knowledge on the part of the listener, a surge of interest in understanding the contribution of various semiotic resources (verbal, visual, aural, and spatial) to how the message is understood is observed in recent years. Based on the multimodality approach, verbal language analysis is insufficient to understand communication and how messages are sent across (Cui, 2019; Ho & Tai, 2020; Valentini et al., 2018). In this framework, the meaning of input in listening comprehension is transformed as “genuine listening input takes a broader perspective to embrace not only oral features but also visual ones” (Campoy-Cubillo & Querol-Julian, 2015, p. 195). From a purely cognitive perspective, this postulation is supported by Paivio’s dual coding theory (Paivio, 1986, 2007), presuming that cognition happens in two separate but related codes: a nonverbal code for cerebral imagery and a verbal code for language (Sadoski, 2005). The activation of these channels fosters learning, in spite of the fact that the two channels have limited capacity for information processing. One focal goal of any listening instruction thus should be providing language learners with authentic materials encompassing real-life events where comprehension develops through processing multimodal input. As Rost argues “using multimedia involving visuals and audio, and with multiple modes of presentation (e.g., video with subtitles), will increase context, reduce cognitive load, and improve comprehension” (2011, p. 152).
Implementing multimodal learning in listening has a relatively long history tied to the advancements of technological devices and the development of theoretical underpinnings of the brain and memory mechanisms. As literature shows, in pioneering studies, some guidelines for using multimedia in listening comprehension have been provided but coincidental with the development of theory and practice, more empirical studies boomed on the use of multimodal learning in listening. Brett (1995), for instance, designed a multimedia application for developing listening skills in Business English. The basic technical requirements and an overview of the application’s essential features, video, tasks, subtitles, and provision of learner choice were introduced. In another study, Meskill (1996) demonstrated, with illustrative scenarios, how multimedia technology could support a pool of 33 micro-skills of listening that Richards (1985) had claimed to be employed by effective listeners when trying to understand aural input. Neither of these studies performed any experimental investigations to verify the effectiveness of their designed instructional content.
With the development of cognitive psychology and the emergence of revolutionary views toward the brain’s architecture, the scope of listening research has broadened in the last two decades. CTML (Mayer, 2003), the model of thinking process (Moreno & Mayer, 2007), the multicomponent model of working memory (Baddeley, 2012), and CLT (Sweller et al., 2011) dramatically contributed to the way the brain processes information and its cognitive enterprises, the importance of the input modality for working memory, how the working and long-term memory should be fed, and the role of instructional content in assisting listeners in having a better listening experience.
Most of the studies within this arena have focused on how multimodal input vs. single mode input may contribute to listening comprehension. İnceçay and Koçoğlu (2017) investigated the effect of one single mode (audio-only) and three dual input delivery modes (audio–video, audio–video with target language subtitles, and audio with PowerPoint presentation) on listening comprehension. The results demonstrated that the audio with PowerPoint presentation group outperformed other groups in listening comprehension. Zarei and Oruji (2019) examined the effect of three types of glossing (textual, pictorial, and textual-pictorial) on listening comprehension and found that textual-pictorial glosses improved listening comprehension significantly in comparison to the other two conditions. The results supported the superiority of integrating multimedia into cognitive instruction over a metacognitive cycle of teaching. In a case study, Sanguino (2020) explored the impact of audio and video materials presented during pre-, while-, and post-listening activities on facilitating L2 listening comprehension in a group of EFL learners. The findings showed that video materials assisted listening comprehension and were beneficial for other aspects of language learning, such as motivation and cultural awareness. In a recent study, Lee, Liu, and Tseng (2021) examined the effects of four various caption modes (control, real-time, full, partial) on listening comprehension and found no significant difference in learners’ listening comprehension when their caption reliance was unkept in view.
The modality unspecific view, indicating that “reading and listening comprehension are two versions of the same comprehension skill” (Wolf et al., 2019, p. 1748), has inspired studies on bimodality of input focusing on the relationship between reading and listening and how proper instruction in one mode would promote the ability of the other. It is reported that listening comprehension can predict 40% of reading comprehension while reading can predict 34% of listening (Wolf et al., 2019), reading-while-listening makes listening tasks easier and more interesting (Chang, 2009), listening comprehension training has a significant impact on strategic listening and reading (Aarnoutse et al., 1998), and the combination of both modalities is more beneficial for vocabulary acquisition compared to when only one mode is involved (Shamir et al., 2012). The effect of multimodal input on reading comprehension has also been examined by a few studies (Naderi Anari et al., 2019; Pellicer Sánchez, et al., 2020), but the role of multimedia-based listening instruction in the development of both listening and reading comprehension needs further clarification.
Multimedia Design Principles
With regard to the profitable learning of knowledge and the improvement of comprehension, the usage of multimedia is discussed to have the possibility to considerably develop instructional effectiveness (Miller et al., 2011); however, concerns remain about the degree to which its arrangement and application have accomplished or optimized such possibility (Massa & Mayer, 2006). To address these concerns, Mayer (2005, 2009, 2014) proposes twelve practice-driven multimedia principles grounded in cognitive theories of learning and instruction to design effective instructional videos. The principles clustered in three categories, namely, reducing extraneous processing principles (coherence principle, signaling principle, redundancy principle, spatial contiguity principle, and temporal contiguity principle), managing essential processing (segmenting principle, pre-training principle, and modality principle), and fostering generative processing (personalization principle, voice principle, embodiment principle, and image principle). The validity of these principles in designing instructional materials and improving learning outcomes of multimedia instruction has been the focus of some studies.
Issa et al. (2011) probed into the effectiveness of the slides prepared based on multimedia principles in a medical college course. The results showed statistically significant improvements in retention and total scores for those students instructed using multimedia design principles compared with those taught using the traditional design. Similarly, Pate and Posey (2016) examined the effect of multimedia design principles on test item performance, student satisfaction, student confidence in potential exam performance, and classroom dynamics in a medical course. The result showed that students retain information better when presented in a multimedia design adherent format and prefer this method to traditional multimedia.
Schwan, Dutz, and Dreger (2018) investigated the effect of various combinations of text with static pictures based on multimedia principles on visitors’ behavior and knowledge acquisition, as well as the average time visitors spent with the artworks in an art exhibition setting. The result supported the validity of the principles of multimedia learning in informal learning settings and elaborated the assumptions of CTML as a theory that specifies the interplay of multimedia learning material, cognition, and motivation. Nagmoti (2017) examined the effect of multimedia principles on students’ learning and feedback on the quality and content of lectures in a medical course. Significant differences were found between the post-test scores of those who received traditional slides and those who received slides modified based on multimedia principles indicating improved short-term memory, long-term memory, and comprehension. Many students appreciated learning through multimedia slides and suggested their continued use.
Kuba, Rahimi, Smith, Shute, and Dai (2022) designed videos based on multimedia principles for a physics educational game to help learners engage in cognitive processing. The results showed that the designed videos significantly predicted the post-test scores and game levels completed. Pantazes (2021) explored the extent to which higher education instructors who created digital instructional videos for online learning had applied multimedia design principles. The results showed that the instructors often implemented the design principles, but they applied certain principles like redundancy less frequently. The instructors’ personal experiences and preferences had more role in applying the principles than their knowledge of the design principles.
A few studies have examined the validation of multimedia principles in language courses. Ayub, Talib, and Siew (2018) explored the users’ perceptions of using seven multimedia principles, generative learning, spatial contiguity, temporal contiguity, coherence, modality, redundancy, and personalization, in mobile-based Japanese language learning. A mixed methods approach was employed. The results showed that most respondents agreed that the multimedia principles were appropriate for the application design except for the personalization and redundancy principles. Beukes (2019) used four multimedia principles, including the redundancy principle, spatial congruity principle, coherence principle, and personalization principle to design a computer program for teaching vocabulary in a foreign language class. The result showed that except redundancy principle, no significant difference was found for applying multimedia principles in designing the game on vocabulary retention. Schrader, Reichelt, and Zander (2018) investigated the effect of the personalization principle in preparing two different language presentation formats of a multimedia presentation on students’ learning outcomes and interest in the learning material. The result showed a positive effect of personalization on both learning and interest. Liu (2019) examined the applicability of the modality and redundancy principles for English as a second language (ESL) students learning. Both knowledge retention and vocabulary test results indicated that input modes did not have an impact on ESL students’ learning, and consequently the modality and redundancy principles had an insignificant role in instruction.
As this brief review reveals, applying multimedia principles in designing language tasks yields mixed findings. Therefore, the examination of incorporating reducing extraneous processing principles in task design and their possible effects on EFL learners’ comprehension development via listening and reading is open to further research. Focusing on this issue, the current study seeks answers to the following research questions:
-
1
Do educational multimedia presentations designed based on reducing extraneous processing principles of CTML have any significant impact on EFL learners’ development of listening comprehension?
-
2
Do educational multimedia presentations designed based on reducing extraneous processing principles of CTML have any significant impact on EFL learners’ development of reading comprehension?
Methods
Design Phase
A group of researchers consisting of one TEFL faculty member, one TEFL research assistant, and two computer science faculty members teamed up and designed 12 sets of multimedia tasks considering two conditions of applying and violating five principles of reducing extraneous processing of CTML (Table 1).
The design phase lasted for around four months. First, the goals and topics of the videos were set, the scripts and storyboards were created, and the materials for making the multimedia (texts, images, sounds) were prepared. Then, Corel Video Studio X10 was used to produce multimedia videos. In the following, the principles and a brief account of how each has been applied or violated in making multimedia videos will be presented (Clark & Mayer, 2016).
Coherence Principle
This principle indicates that adding extra material to multimedia can hurt learning, and thus extraneous materials should be excluded from multimedia presentations. To achieve this goal, words, graphics, or sounds that are not directly related to the instructional goal of the multimedia should be removed. Examples of how the coherence principle and its three sub-principles were applied and violated in making the multimedia videos for this study are depicted in Fig. 1.
Signaling Principle
The main goal of applying the signaling principle is to add visual or verbal cues to the multimedia to highlight the organization of the essential materials and direct the learners’ attention to them. To add verbal cues, the designers can use outlines, headings, vocal emphasis, or pointer words (Clark & Mayer, 2016). To add visual cues, it is recommended to use arrows, distinctive colors, flashing, pointing gestures, and graying out techniques (Clark & Mayer, 2016). Examples of how verbal and visual signaling principles were applied and violated in making the multimedia videos for this study are depicted in Fig. 2.
Redundancy Principle
Based on this principle, people learn better from concurrent graphics and audio than from concurrent graphics, audio, and on-screen text when the on-screen text is the same as the narration. Examples of how the redundancy principle was applied and violated in making the multimedia videos for this study are depicted in Fig. 3.
Contiguity Principles
Based on this principle, people learn better when corresponding words and pictures are presented near rather than far from each other on the page or screen, both spatially and temporarily. According to the spatial contiguity principle, the printed word should be placed as near as the part it describes. Based on temporal contiguity, spoken words should be synchronized with corresponding graphics. Examples of how the spatial and temporal contiguity principles were applied and violated in making the multimedia videos for this study are depicted in Fig. 4.
Two English language teaching (ELT) experts reviewed all multimedia videos by completing the ELT multimedia courseware evaluation questionnaire (Jiang et al., 2017) that assesses the appropriacy of integrating five principles of reducing extraneous processing in designing courseware and multimedia. Both evaluators were experienced language teachers and members of the materials development department of their district education office. Considering the comments and suggestions, the multimedia presentations were revised and finalized for instruction. The duration of instructional multimedia presentations was about 5–7 min.
Experimentation Phase
Participants
Thirty EFL learners participated in this study. They enrolled in two advanced English conversation courses. There were 15 students in each class. The sample included both male (n = 17) and female (n = 13) students. Female students comprised almost half of the control group (n = 6) and the experimental group (n = 7). Considering the size of the sample, gender was not considered to be an intervening variable in the design of the study.
The homogeneity of both groups in terms of English proficiency was assessed by the International English Language Testing System (IETLS) test before the study. The result of the independent samples t-test indicated an insignificant difference [t(28) = − 0.715, p = 0.481 < 0.05] between the groups, verifying the homogeneity of their English proficiency before the experiment. Further, the normal distribution of the sample (as a whole and as two groups) was assessed by normality tests, and no violation of the normal distribution was observed.
The participants ranged in age from 18 to 22 (mean = 19.2) and 18 to 23 (mean = 18.8) in the control and experimental group, respectively. The results of the independent samples t-test showed that no significant difference existed between the groups in terms of their age [t(28) = − 0.983, p = 0.334 < 0.05].
The Instrumentation
International English Language Testing System Test
The International English Language Testing System (IELTS) test assesses the English language proficiency of people who want to study or work in English-speaking environments. It provides a fair, accurate, and relevant assessment of language skills, based on well-established standards, and covers the full range of proficiency levels, from non-user to expert user. The IELTS test has four sections, assessing the four language skills, i.e., listening, reading, writing, and speaking. The candidates receive individual scores for each section.
For this study, listening and reading papers of the IELTS were given to both groups prior to and after the study to determine their listening and reading comprehension levels before and after the experiment. The details of the reading and listening tests are summarized in Table 2.
The reliability coefficients of the listening section of IELTS for this study for the pre-test and post-test were estimated to be 0.71 and 0.82, respectively. The reliability coefficients of the reading section of IELTS for this study for the pre-test and post-test were estimated to be 0.78 and 0.79, respectively.
The Textbook
The main objective of the course was to improve the oracy skills of the participants in an advanced conversation course. The main textbook of the course was Open Forum 3 (Parker & Duncan, 2008) whose focus is on academic listening and speaking. The themes feature academic content areas such as ecology, business, and astronomy.
Open Forum 3 includes authentic listening materials and a wide variety of texts-including lectures, radio interviews, news reports, and informal conversations. Students’ awareness of features of spoken English is raised through working on various types of exercises with different speakers of English (Parker & Duncan, 2008) and listening to different English accents that may be encountered in lectures, discussions, or on the radio (Zou, 2007) .
All 12 units of the book were worked on throughout one semester that lasted for four months. The materials for both experimental and control groups were the same. Each unit consisted of eight sections with a variety of activities. A summary of the sections, their goals, and activities is depicted in Table 3.
The Procedure
Both classes took part in IELTS listening and reading papers before the study. The listening section of the unit was taught based on a comprehension approach by applying a three-cycle of pre-listening, listening, and post-listening.
In the pre-listening phase, the students were familiarized with the theme of the listening tasks and possibly some language forms (grammatical points, new words, etc.). This part of the instruction was the same for both groups. In the while-listening phase, both groups watched the multimedia presentations. The experimental group watched multimedia prepared in accord with the principles, and the control group watched traditionally designed videos. In the post-listening phase, both groups’ comprehension was assessed by various activities, including questions and answers, summary writing, and fill-in-the-blanks.
At the end of the experiment that lasted for 16 weeks, both groups took part in the IELTS listening and reading posttests again to examine the development of their listening and reading comprehension.
Results
The Development of Listening Comprehension
To examine the effect of the intervention on participants’ development of listening comprehension, the multivariate analysis of variance (MANOVA) was used. In this analysis, IELTS listening served as the dependent variable, and the type of instruction (instruction with multimedia designed with CTML principles vs. instruction with multimedia designed without CTML principles) was the independent variable.
The results from the Multivariate Tests suggested a statistically significant difference between the post-test scores of the two groups on the combined dependent variables (Wilks’ Lambda = 0.531, F = 11.927; p = 0.001 < 0.05; ηp2 = 0.469). As Box’s Test of Equality of Covariance Matrices and Levene’s Test of Equality of Error Variances were not significant at p = 0.001, the results of Tests of Between-Subjects Effects were examined.
The results for considering the dependent variables separately (Table 4) showed that the difference between the groups’ post-test scores reached statistical significance just when the comprehension of the monologues was involved. A new alpha level was selected based on Bonferroni adjustment (0.05/2 = 0.025) to avoid error Type I.
Based on Cohen’s guideline (Cohen, 1988), the effect size for the intervention (ηp2 = 0.469 > 0.14) was large. The descriptive statistics showed that the experimental group outperformed the control group in the IELTS listening post-test (Table 5).
The Development of Reading Comprehension
To examine the effect of the intervention on participants’ development of reading comprehension, MANOVA was used. In this analysis, the IELTS reading section served as the dependent variable, and the type of instruction (instruction with multimedia designed with CLMT principles vs. instruction with multimedia designed without CLMT principles) was the independent variable.
The results from the Multivariate Tests suggested a statistically significant difference between the post-test scores of the two groups on the combined dependent variables (Wilks’ Lambda = 0.583, F = 6.207; p = 0.003 < 0.05; ηp2 = 0.417). As Box’s Test of Equality of Covariance Matrices and Levene’s Test of Equality of Error Variances were not significant at p = 0.001, the results of Tests of Between-Subjects Effects were examined.
The results for considering the dependent variables separately (Table 6) showed that the difference between the groups’ post-test scores reached statistical significance when both understanding the gist of meaning and specific information were in focus. A new alpha level was selected based on Bonferroni adjustment (0.05/3 = 0.017) to avoid error Type I.
The result showed that the effect size for the intervention (ηp2 = 0.583 > 0.14) was large. It was revealed that the effect size for general comprehension (ηp2 = 0.396 > 0.14) was larger than that of understanding the detailed information (ηp2 = 0.194 > 0.14). Examining the descriptive statistics showed that the experimental group outperformed the control group in IELTS reading posttest (Table 7).
Discussion
The potential of multimodal input for increasing the capacity of working memory and assisting comprehension has given rise to research efforts on identifying effective ways to simultaneously present verbal and visual materials. The practice-driven evidence would have profound implications for instructional designers in producing and delivering educational multimedia. Taking this into account, the present study aimed to evaluate the effect of incorporating reducing extraneous processing principles in multimedia task design on comprehension development in listening instruction among EFL learners.
The results first and foremost revealed that incorporating the reducing extraneous processing principles into task design contributed to developing both listening and reading comprehension. This finding gives credence to CTML (Mayer, 2014), as based on this theory, optimum learning occurs when both the auditory and the visual channels in WM are used to a comparable extent. The significant role of multimedia in listening instruction is evident in previous research as it encourages more engagement in performing the task, particularly among low-achievers (Lee & Mayer, 2015), increases comprehension of the aural input (Yang, 2014), and lowers the cognitive load of the listening task (Rahimi & Sayyadi, 2019). In agreement with the literature, this study depicts that multimodal input is effective in promoting listening comprehension; but what it adds to the previous storehouse of knowledge is that multimedia can be even more beneficial in listening courses when it is designed based on practice-driven design principles and human brain architecture. As the findings showed, both groups’ listening comprehension developed as a result of multimedia instruction, but the change was more profound among those who worked on tasks designed based on five principles of reducing extraneous processing.
A more detailed analysis showed that the experimental group better understood the monologues than the dialogues at the end of the experiment. In other words, the multimedia presentations prepared based on design principles helped language learners’ understanding when a single speaker was narrating rather than two or more speakers were conversing. One reason for this finding is that most types of multimedia, such as videos, digital stories, and animated explainer videos, have one narrator and they rarely involve people conversing. This is done to increase the concentration of the viewers and help them focus on a single speaker narrating. Therefore, to improve dialogic ability with digital technology more delicate design procedures and attention to conversations are required. Online sessions combined with multimedia rather than offline multimedia with no chance of interaction and cooperation outside the classroom milieu (Mercer et al., 2019; Park & Kim, 2011) can be a more appropriate multimodal input for listening instruction. More studies concerning Mayer’s multimedia principles in second language learning are also required, as the incorporation of the personalization principle that deals with the formal and informal style of narration has been investigated in multimedia research (Bol et al., 2015; Schrader et al., 2018), but the impact of dialogue vs. monologue voice-overs on learning gains and cognitive load of multimedia tasks is open to further examination.
Focusing on the second goal of the study, it was revealed that listening multimedia tasks prepared based on multimedia principles improved the learners’ reading comprehension. It is suggested that perception is a domain-general competence that is not connected to the modality of the input (Wolf et al., 2019), and based on this, listening and reading are two forms of the same perception competence. The modality of primary data does not affect the generation of the situation pattern, instead, the effect of input modality on perception is “a general comprehension skill that transcends modality” (Gernsbacher et al., 1990, p. 430). This finding is in agreement with what CLT proposes that incorporating an additional mode of input into reading materials such as combining texts with pictures, reading-while listening, and reading captions and/or subtitles in multimedia can manage the cognitive load of reading comprehension (Hannon, 2014; Schaffner & Schiefele, 2013; Schaars et al., 2019). The use of illustrated texts has long been thought to assist comprehension and encourage young L1 readers to read by making reading a more joyous activity. Review studies support this issue and reveal the positive effects of the combination of illustrations and written texts on comprehension and memory in comparison to text-only input (Carney & Levin, 2002; Choi, 2011). An increase in vocabulary learning (Chang, 2009), reading speed (Chang & Millett, 2015), and learners’ satisfaction (Brown et al., 2008) are also among the positive effects of the bi-modal input of reading.
It was also found that the effect size of the intervention for promoting understanding gist of meaning was larger than the effect size for comprehending specific information. The use of multimedia that combines text, audio, and video in reading comprehension shows that multimedia facilitates reading comprehension as students can produce a mental portrait from oral or written language, and their sensory system rapidly transfers fragments to the whole by the image (Wang & Li, 2019). One reason for this can be related to applying the signaling principle to attract the attention of the learners to “the important material in the lesson and how it is organized” (Mayer, 2014, p. 5) by including general ideas as printed words for giving outlines, headings and highlighting these texts. Applying this principle has eased understanding of the text, that in comparison to elementary reading, needs higher stages of cognitive and linguistic expertise for the reader as they need to be competent to comprehend both the literal and inferential meanings of the content (Sun et al., 2013).
Conclusions
Instructional multimedia design and its incorporation into the teaching of different subject matters have been practiced and researched in the last two decades. Generally, considerable potential of multimedia for learning is realized; however, mixed findings with respect to applying design principles in making effective multimedia presentations for second and foreign language classes are reported. To address this issue, the current study probed into the effect of multimedia instruction designed based on CTML principles on EFL learners’ listening and reading comprehension.
The findings of the study, consistent with the results of a few works, underscore the key role of applying practice-driven design principles in making instructional multimedia to help language learners benefit from the instruction. This draws the attention of language materials developers to the role of multimedia in promoting comprehension and how these contents should be produced more carefully and meticulously. Also, it shows how essential it is to train teachers in materials development and evaluation and make them aware of the importance of CTML principles during multimedia instruction and how it should be designed and used. This matter emphasizes the role of teacher trainers in familiarizing teachers with technological advancement and its implication in theories and approaches of language teaching. The study offers valuable insights into how instructional multimedia can affect the understanding of both oral and written input in a listening instruction and how the application of certain principles leads to optimum cognitive processing involved in both listening and reading and the interplay between these processes.
The findings of the present study should be interpreted considering its limitations. Due to practicality issues and the limitations of the seats in the language lab where the classes were held, the study was performed with small sample size. Further, because of time and budget limitations, out of 12 principles of multimedia, the first five principles were considered in designing the multimedia tasks. Follow-up studies are recommended by incorporating language proficiency and gender as intervening variables. The effects of other types of multimedia, such as digital storytelling or animated explainer videos designed based on multimedia principles, can be examined. Also, due to the scarcity of research, investigating the impact of multimedia on the development of productive language skills (wiring and speaking) is recommended.
Data Availability
The data are available from the corresponding author upon reasonable request.
References
Aarnoutse, J. C. A., van den Bos, K. P., & Brand-Gruwel, S. (1998). Effects of listening comprehension training on listening and reading. The Journal of Special Education, 32(2), 115–126. https://doi.org/10.1177/002246699803200206
Alzahrani, S., & Roberts, L. (2021). The effect of visuospatial designing elements of zoomable user interfaces on second language vocabulary acquisition. System, 96, 102396. https://doi.org/10.1016/j.system.2020.102396
Ayub, M. S. M., Talib, O., & Siew, N. M. (2018). The perceptions of users regarding multimedia principles in mobile-based Japanese language learning. Turkish Online Journal of Educational Technology, 17(3), 113–124.
Baddeley, A. (2012). Working memory: Theories, models, and controversies. Annual Review of Psychology, 63, 1–29. https://doi.org/10.1146/annurev-psych-120710-100422
Beukes, V. (2019). The effect of four of Richard Mayer’s design principles on vocabulary retention in an Afrikaans computer programme. Computer Assisted Language Learning, 32(1–2), 118–131. https://doi.org/10.1080/09588221.2018.1488737
Bol, N., van Weert, J.C., de Haes, H.C., Loos, E.F., & Smets, E.M. (2015). The effect of modality and narration style on recall of online health information: Results from a web-based experiment. Journal of Medical Internet Research, 17(4), Article e104. https://doi.org/10.2196/jmir.4164
Brett, P. (1995). Multimedia for listening comprehension: The design of a multimedia-based resource for developing listening skills. System, 23(1), 77–85. https://doi.org/10.1016/0346-251X(94)00054-A
Brown, R., Waring, R., & Donkaewbua, S. (2008). Incidental vocabulary acquisition from reading, reading-while-listening, and listening to stories. Reading in a Foreign Language, 20(2), 136–163. https://doi.org/10.10125/66816
Campoy-Cubillo, M. C., & Querol-Julian, M. (2015). Assessing multimodal listening. In B. C Camiciottoli and I. Fortanet-Gomez, Multimodal analysis in academic settings (pp. 213–238). Routledge.
Carney, R. N., & Levin, J. R. (2002). Pictorial illustrations still improve students’ learning from text. Educational Psychology Review, 14(1), 5–26. https://doi.org/10.1023/A:1013176309260
Chang, A.C.-S. (2009). Gains to L2 listeners from reading while listening vs. listening only in comprehending short stories. System, 37, 652–663. https://doi.org/10.1016/j.system.2009.09.009
Chang, A.C.-S., & Millett, S. (2015). Improving reading rates and comprehension through audio-assisted extensive reading for beginner learners. System, 52, 91–102. https://doi.org/10.1016/j.system.2015.05.003
Choi, J. (2011). Literature review: Using pictographs in discharge instructions for older adults with low-literacy skills. Journal of Clinical Nursing, 20(21–22), 2984–2996. https://doi.org/10.1111/j.1365-2702.2011.03814.x
Clark, R., C., & Mayer, R. (2016). e-Learning and the science of instruction proven guidelines for consumers and designers of multimedia learning (4th ed.), John Wiley & Sons Inc.
Cohen, J.W. (1988). Statistical power analysis for the behavioral sciences (2nd ed.), Lawrence Erlbaum Associates.
Cui, W. (2019). Rhetorical listening pedagogy: Promoting communication across cultural and societal groups with video narrative. Computers and Composition, 54, Article 102517. https://doi.org/10.1016/j.compcom.2019.102517
Dawson, K., Zhu, J., Ritzhaupt, A. D., Antonenko, P., Saunders, K., Wang, J., & Lombardino, L. (2021). The influence of the multimedia and modality principles on the learning outcomes, satisfaction, and mental effort of college students with and without dyslexia. Annals of Dyslexia, 71(1), 188–210. https://doi.org/10.1007/s11881-021-00219-z
Gernsbacher, M. A., Varner, K. R., & Faust, M. E. (1990). Investigating differences in general comprehension skill. Journal of Experimental Psychology: Learning, Memory, and Cognition, 16(3), 430–445. https://doi.org/10.1037/0278-7393.16.3.430
Hannon, B. (2014). Are there gender differences in the cognitive components of adult reading comprehension? Learning and Individual Differences, 32, 69–79. https://doi.org/10.1016/j.lindif.2014.03.017
Ho, W. Y. J., & Tai, K. W. H. (2020). Doing expertise multilingually and multimodally in online English teaching videos. System, 94, 1–12. https://doi.org/10.1016/j.system.2020.102340
Huang, X., & Mayer, R. E. (2016). Benefits of adding anxiety-reducing features to a computer-based multimedia lesson on statistics. Computers in Human Behavior, 63, 293–303. https://doi.org/10.1016/j.chb.2016.05.034
Hung, H. T. (2011). Design-based research: Designing a multimedia environment to support language learning. Innovations in Education and Teaching International, 48(2), 159–169. https://doi.org/10.1080/14703297.2011.564011
IELTS. Cambridgeenglish.org
İnceçay, V., & Koçoğlu, Z. (2017). Investigating the effects of multimedia input modality on L2 listening skills of Turkish EFL learners. Education and Information Technologies, 22(3), 901–916. https://doi.org/10.1007/s10639-016-9463-3
Iriti, J., Bickel, W., Schunn, C., & Stein, M. K. (2016). Maximizing research and development resources: Identifying and testing “load-bearing conditions” for educational technology innovations. Educational Technology Research and Development, 64(2), 245–262. https://doi.org/10.1007/s11423-015-9409-2
Issa, N., Schuller, M., Santacaterina, S., Shapiro, M., Wang, E., Mayer, R. E., & DaRosa, D. A. (2011). Applying multimedia design principles enhances learning in medical education. Medical Education, 45(8), 818–826. https://doi.org/10.1111/j.1365-2923.2011.03988.x
Jewitt, C. (2013). Multimodal methods for researching digital technologies. In S. Price, C. Jewitt, & B. Brown (Eds.), The Sage handbook of digital technology research (pp. 250–266). Sage.
Jiang, D., Renandya, W. A., & Zhang, L. J. (2017). Evaluating ELT multimedia courseware from the perspective of cognitive theory of multimedia learning. Computer Assisted Language Learning, 30(7), 726–744. https://doi.org/10.1080/09588221.2017.1359187
Krashen, S. D. (1985). The input hypothesis: Issues and implications. Longman.
Kuba, R., Rahimi, S., Smith, G., Shute, V., & Dai, C. P. (2022). Using the first principles of instruction and multimedia learning principles to design and develop in-game learning support videos. Educational Technology Research and Development, 69(2), 1201–1220. https://doi.org/10.1007/s11423-022-10125-9
Lee, H., & Mayer, R. E. (2015). Visual aids to learning in a second language: Adding redundant video to an audio lecture. Applied Cognitive Psychology, 29(3), 445–454. https://doi.org/10.1002/acp.3123
Lee, P.-J., Liu, Y.-T., & Tseng, W.-T. (2021). One size fits all? In search of the desirable caption display for second language learners with different caption reliance in listening comprehension. Language Teaching Research, 25(3), 400–430. https://doi.org/10.1177/1362168819856451
Leutner, D. (2014). Motivation and emotion as mediators in multimedia learning. Learning and Instruction, 29, 174–175. https://doi.org/10.1016/j.learninstruc.2013.05.004
Liu, Y., Jang, B. G., & Roy-Campbell, Z. (2018). Optimum input mode in the modality and redundancy principles for university ESL students’ MM learning. Computers & Education, 127(3), 190–200. https://doi.org/10.1016/j.compedu.2018.08.025
Liu, Y. (2019). Multimedia input modes, the modality principle, and the redundancy principle for university ESL students’ learning [unpublished doctoral dissertation]. Syracuse University.
Martínez-Flor, A., & Usó-Juan, E. (2006). Towards acquiring communicative competence through listening. In A. Martínez-Flor & E. Usó-Juan (Eds.), Studies on language acquisition: Current trends in the development and teaching of the four language skills (pp. 29–46). Walter de Gruyter.
Massa, L. J., & Mayer, R. E. (2006). Testing the ATI hypothesis: Should MM instruction accommodate verbalizer-visualizer cognitive style? Learning and Individual Differences, 16, 321–335. https://doi.org/10.1016/j.lindif.2006.10.001
Mayer, R. E. (2003). The promise of multimedia learning: Using the same instructional design methods across different media. Learning and Instruction, 13(2), 125–139. https://doi.org/10.1016/S0959-4752(02)00016-6
Mayer, R. E. (2005). Cognitive theory of multimedia learning. In R. E. Mayer (Ed.), The Cambridge handbook of multimedia learning (pp. 31–48). Cambridge University Press.
Mayer, R. E., & Moreno, R. (2003). Nine ways to reduce cognitive load in multimedia learning. Educational Psychologist, 38(1), 43–52. https://doi.org/10.1207/S15326985EP3801_6
Mayer, R. E. (2009). Multimedia learning (2nd ed), Cambridge University Press.
Mayer, R. E. (2014). Cognitive theory of multimedia learning. In R. E. Mayer (Ed.), Cambridge handbook of multimedia learning (2nd ed., pp. 43–71). Cambridge University Press
McDonald, D. S. (2004). The influence of multimedia training on users’ attitudes: Lessons learned. Computers & Education, 42(2), 195–214. https://doi.org/10.1016/j.compedu.2003.07.003
Mercer, N., Hennessy, S., & Warwick, P. (2019). Dialogue, thinking together and digital technology in the classroom: Some educational implications of a continuing line of inquiry. International Journal of Educational Research, 97, 187–199. https://doi.org/10.1016/j.ijer.2017.08.007
Meskill, C. (1996). Listening skills development through multimedia. Journal of Educational Multimedia and Hypermedia, 5(2), 179–201.
Miller, L. M., Chang, C. I., Wang, S., Beier, M. E., & Klisch, Y. (2011). Learning and motivational impacts of a multimedia science game. Computers & Education, 57, 1425–1433. https://doi.org/10.1016/j.compedu.2011.01.016
Moreno, R., & Mayer, R. E. (2007). Interactive multimodal learning environments. Educational Psychology Review, 19, 309–326. https://doi.org/10.1007/s10648-007-9047-2
Naderi Anari, N., Saeedi, R. A., & A. A., & Shariati, M. (2019). The effects of multimodality on reading comprehension and vocabulary retention among Iranian EFL learners. Iranian Journal of English for Academic Purposes, 8(4), 86–101.
Nagmoti, J. M. (2017). Departing from PowerPoint default mode: Applying Mayer’s multimedia principles for enhanced learning of Parasitology. Indian Journal of Medical Microbiology, 35, 199–203. https://doi.org/10.4103/ijmm.IJMM_16_251
Nunan, D. (2002). Listening in language learning. Methodology in language teaching: An anthology of current practice. Cambridge University Press.
Paivio, A. (1986). Mental representations: A dual coding approach. Oxford University Press.
Paivio, A. (2007). Mind and its evolution: A dual coding theoretical approach. Erlbaum.
Pantazes, T. C. (2021). Online instructors’ use of the cognitive theory of multimedia learning design principles: A mixed methods investigation [unpublished doctoral dissertation]. West Chester University.
Park, H. R., & Kim, D. (2011). Reading-strategy use by English as a second language learners in online reading tasks. Computers & Education, 57(3), 2156–2166. https://doi.org/10.1016/j.compedu.2011.05.014
Parker, A., & Duncan, J. (2008). Open Forum 3: Academic listening and speaking. Oxford University Press.
Pate, A., & Posey, S. (2016). Effects of applying multimedia design principles in PowerPoint lecture redesign. Currents in Pharmacy Teaching and Learning, 8(2), 235–239. https://doi.org/10.1016/j.cptl.2015.12.014
Pellicer Sánchez, A., Tragant, E., Conklin, K., Rodgers, M., Serrano, R., & Llanes, À. (2020). Young learners’ processing of multimodal input and its impact on reading comprehension: An eye-tracking study. Studies in Second Language Acquisition, 42(3), 577–598. https://doi.org/10.1017/s0272263120000091
Racicot, R. (2016). The effect of multimedia writing support software on written productivity. Journal of Occupational Therapy, Schools, & Early Intervention, 9(1), 99–123. https://doi.org/10.1080/19411243.2016.1162000
Rahimi, M., & Sayyadi, M. (2019). The cognitive load of listening activities of a cognitive-based listening instruction. Indonesian Journal of Applied Linguistics, 9(2), 382–394. https://doi.org/10.17509/ijal.v9i2.20236
Richards, J. E. (1985). The development of sustained visual attention in infants from 14 to 26 weeks of age. Psychophysiology, 22(4), 409–416. https://doi.org/10.1111/j.1469-8986.1985.tb01625.x
Rost, M. (2011). Teaching and researching listening (2nd ed). Longman.
Sadoski, M. (2005). A Dual Coding view of vocabulary learning. Reading and Writing Quarterly, 21, 221–238. https://doi.org/10.1080/10573560590949359
Sanguino, D. M. G. (2020) Using video materials to help EFL learners facilitate their listening comprehension skill [unpublished master’s thesis]. Pontifical Bolivarian University.
Schaars, M. M. H., Segers, E., & Verhoeven, L. (2019). Cognitive and linguistic precursors of early first and second language reading development. Learning and Individual Differences, 72, 1–14. https://doi.org/10.1016/j.lindif.2019.03.008
Schaffner, E., & Schiefele, U. (2013). The prediction of reading comprehension by cognitive and motivational factors: Does text accessibility during comprehension testing make a difference? Learning and Individual Differences, 26, 42–54. https://doi.org/10.1016/j.lindif.2013.04.003
Schrader, C., Reichelt, M., & Zander, S. (2018). The effect of the personalization principle on multimedia learning: The role of student individual interests as a predictor. Educational Technology Research and Development, 66(6), 1387–1397. https://doi.org/10.1007/s11423-018-9588-8
Schwan, S., Dutz, S., & Dreger, F. (2018). Multimedia in the wild: Testing the validity of multimedia learning principles in an art exhibition. Learning and Instruction, 55(1), 148–157. https://doi.org/10.1016/j.learninstruc.2017.10.004
Shamir, A., Korat, O., & Fellah, R. (2012). Promoting vocabulary, phonological awareness and concept about print among children at risk for learning disability: Can e-books help? Reading and Writing, 25, 45–69. https://doi.org/10.1007/s11145-010-9247-x
Stockwell, G. (2007). A review of technology choice for teaching language skills and areas in the CALL literature. ReCALL, 19(2), 105–120. https://doi.org/10.1017/S0958344007000225
Sun, S. Y., Chich-Jen, S., & Kai-Ping, H. (2013). A research on comprehension differences between print and screen reading. South African Journal of Economic and Management Sciences, 16, 87–101. https://doi.org/10.4102/sajems.v16i5.640
Sweller, J., van Merriënboer, J. J. G., & Paas, F. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251–296. https://doi.org/10.1023/A:1022193728205
Sweller, J., Ayres, P., & Kalyuga, S. (2011). Cognitive load theory. Springer.
Tsai, S. C. (2010). Developing and integrating courseware for oral presentations into ESP learning contexts. Computers & Education, 55(3), 1245–1258. https://doi.org/10.1016/j.compedu.2010.05.021
Türk, E., & Erçetin, G. (2014). Effects of interactive versus simultaneous display of multimedia glosses on L2 reading comprehension and incidental vocabulary learning. Computer Assisted Language Learning, 27(1), 1–25. https://doi.org/10.1080/09588221.2012.692384
Valentini, A., Ricketts, J., Pye, R. E., & Houston-Price, C. (2018). Listening while reading promotes word learning from stories. Journal of Experimental Child Psychology, 167, 10–31. https://doi.org/10.1016/j.jecp.2017.09.022
Vandergrift, L. (1999). Facilitating second language listening comprehension: Acquiring successful strategies. ELT Journal, 54(4), 168–176. https://doi.org/10.1093/elt/53.3.168
Wang, L., & Li, J. (2019). Development of an innovative dual-coded multimedia application to improve reading comprehension of students with imagery deficit. Journal of Educational Computing Research, 57(1), 170–200. https://doi.org/10.1177/0735633117746748
Wolf, M., Muijselaar, M. M. L., Boonstra, M., & de Bree, E. H. (2019). The relationship between reading and listening comprehension: Shared and modality-specific components. Reading and Writing, 32, 1747–1767. https://doi.org/10.1007/s11145-018-9924-8
Yang, H. Y. (2014). Does multimedia support individual differences? – EFL learners’ listening comprehension and cognitive load. Australasian Journal of Educational Technology, 30(6), 699–713. https://doi.org/10.14742/ajet.639
Zarei, A. A., & Oruji, M. (2019). The effect of multimedia glosses on L2 listening comprehension. Iranian Journal of Applied Language Studies, 11(1), 201–220. https://doi.org/10.22111/IJALS.2019.4932
Zou, B. (2007). A review of Angela Blackwell and Therese Naber (2006) Open Forum: Academic Listening and Speaking, Book 2. The Electronic Journal for English as a Second Language, 11(1). http://tesl-ej.org/wordpress/issues/volume11/ej41/ej41r1/?wscr
Author information
Authors and Affiliations
Contributions
Author 1 carried out the study, gathered the data, and helped in writing the manuscript. Author 2 conceptualized, designed, and supervised the research; and drafted, wrote, reviewed, and edited the manuscript. Authors 3 and 4 gave technical advice on how to design multimedia videos. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethical Approval
The study has been carried out based on research guidelines of the Graduate Office at Shahid Rajaee Teacher Training University.
Competing Interests
The authors declare no competing interests.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sayyadi, M., Rahimi, M., Ebrahimpour, R. et al. Applying Multimedia Learning Principles in Task Design: Examination of Comprehension Development in L2 Listening Instruction. English Teaching & Learning 48, 73–96 (2024). https://doi.org/10.1007/s42321-022-00132-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s42321-022-00132-7