Introduction

Two experiments investigated the extent to which the concreteness of titles affects metacognitive and motivational aspects of learning from expository texts. Expository texts are central components of most educational settings, and highly relevant to children’s learning, adult education, vocational education, self-regulated learning, and life-long learning (Wiley et al. 2005; Coiro 2003; Lin and Zabrucky 1998). However, it is challenging for students to understand expository texts (Graesser 2007; Leopold and Mayer 2015; Best et al. 2005), and students often have difficulties monitoring their own learning process (Weaver 1990; Glenberg et al. 1982; Maki and Berry 1984). To empower students to effectively learn from expository texts, it is important to identify and understand text components that promote (or inhibit) text comprehension. The first text component that students encounter when they approach an expository text is its title—and titles can be more or less concrete. Imagine you were a student and your task was to study a text about airplanes, to learn how they function and work. Would it matter to you if the text you were about to study was entitled ‘Getting Airplanes off the Ground’ or ‘Aerodynamic Laws of Ascension’? Would the concreteness of the title affect your metacognitive estimate of how easily you will understand the text? Would it affect how interesting you expect the text to be, and how motivated you are to study it? And would your expectations and your motivation affect your learning? To our knowledge, there are no investigations that have addressed the metacognitive and motivational effects of titles by examining their level of concreteness relative to students’ text expectations, study motivation, and learning outcomes. The aim of the present investigation is thus to determine how the concreteness of titles affects expected ease-of-comprehension and interestingness, reported study motivation, and comprehension test performance—with the overarching goal of deriving practical implications for the design of titles across a variety of educational contexts.

Cognitive functions of titles

Traditionally, titles have been investigated with regard to their effects on the cognitive processing of text information (for an overview see Filippatou and Pumfrey 1995). Titles were typically varied either in terms of their absence or presence (Ausubel 1968; Bransford and Johnson 1972; Dooling and Mullet 1973; Schallert 1976; Schwartz and Flammer 1981), their location (before vs. after a text; Bransford and Johnson 1972; Dooling and Mullet 1973), and/or their level of correspondence with information in a text (integrative vs. non-integrative; Arnold and Brooks 1976; Schallert 1976; Kozminsky 1977; Schwartz and Flammer 1981; Yuill and Joscelyn 1988; Shepherd 1990; Filippatou and Pumfrey 1995; Lorch and Lorch 1996; Ritchey et al. 2008). Typical dependent measures included the recall and/or comprehension of entire passages or particular information within a passage. The results of these studies showed that titles which are presented prior to a text (a) activate relevant context information (Bransford and Johnson 1972), (b) direct attention and affect which information learners focus on (Lorch and Lorch 1996; Ritchey et al. 2008), (c) bias comprehension towards certain themes in a text (Schallert 1976; Kozminsky 1977), and (d) promote overall comprehension—particularly if the titles refer to the main ideas of their corresponding texts (Arnold and Brooks 1976; Yuill and Joscelyn 1988).

The concreteness of titles

The underlying rationale for the cognitive functions of titles is schema-theoretic in nature and proposes that titles activate relevant prior knowledge in the context of which the following text information is interpreted and remembered (for an overview see Filippatou and Pumfrey 1995). Whenever that activated context information elicits mental imagery, titles can be viewed not only as advance organizers but also as conceptual pegs, which conjure up mental images to which the following information then gets attached (Anderson et al. 1977; Sadoski et al. 2000). This view on titles stems from theories of text comprehension rooted in dual coding theory (Paivio et al. 1968; Sadoski and Paivio 2004, 2013; Schnotz and Bannert 2003; Schnotz 2005). Dual-coding oriented theories of text comprehension suggest that learners process written text in two code systems: the verbal/descriptive, and the non-verbal/depictive system (Sadoski and Paivio 2004, 2013; Schnotz and Bannert 2003; Schnotz 2005). Concrete language, as opposed to abstract, refers to tangible entities that can be perceived with the senses (Glanzer and Bowles 1976; Gernsbacher 1984; Hoffman 2016). Because concrete language relates to sensory, non-verbal experience, it is richer in referential connections between the verbal and non-verbal systems, and therefore more likely to elicit mental imagery than abstract language (Sadoski and Paivio 2004, 2013; Sadoski et al. 1993, 2000). As an example, conjuring up a mental image for the concrete phrase “seeing a green apple” is much easier than for the abstract phrase “knowing a common law”.

From this point of view, Sadoski et al. (2000) proposed that concrete titles would promote text comprehension and recall by encouraging learners to encode the following information both verbally and non-verbally. The researchers tested this hypothesis in a set of experiments in which learners studied twenty-four short texts which were accompanied with either concrete or abstract titles. The results showed that concrete titles fostered the recall of literary stories and narratives, but not that of persuasive or expository texts (Sadoski et al. 2000). From our perspective, these mixed findings are difficult to interpret because the experiments focused solely on the cognitive effects of titles—as was the case in previous studies (Filippatou and Pumfrey 1995; Lorch and Lorch 1996; Ritchey et al. 2008).

Metacognitive and motivational functions of titles

More recent research has paid attention not only to the cognitive but also to the metacognitive and motivational functions of titles. Shimada (2016), for example, showed that titles which are perceived as easy to comprehend promote students’ motivation to study expository text materials on disaster prevention. Because concrete titles are easy to envision, they are typically perceived to be easier to comprehend than abstract titles– even if both types of titles are equally familiar. The title pairs “How a Plane Flies” (concrete) versus “The Laws of Lift” (abstract), “Buffeting Jolts” (concrete) versus “Force Transfer” (abstract), and “Countertop Gadgets (concrete) versus “Domestic Devices” (abstract), for example, are perceived as equally familiar by students, but the concrete titles are perceived to be easier to comprehend (Sadoski et al. 2000).

Concrete titles are also perceived as more interesting than abstract titles, and this effect is partially mediated by ease-of-comprehension (Sadoski et al. 2000). On the one hand, concrete titles are interesting because the mental imagery they elicit is engaging and involving (Sadoski et al. 2000). On the other hand, concrete titles are easier to comprehend than abstract titles, and comprehensible information is typically perceived as more interesting than information which is difficult to understand (Schraw et al. 1995; Schraw and Lehman 2001). The perceived interestingness of learning material is an important source of study motivation—particularly during difficult learning activities, such as studying expository texts (Fulmer et al. 2015; Krapp 1999; Hidi et al. 2004).

Concreteness in linguistic and cultural context

Concrete titles are likely to affect learners from varying linguistic and cultural backgrounds in similar ways because the processing of language in verbal and non-verbal systems is considered to be a fundamental and universal aspect of human cognition (Sadoski and Paivio 2013). Binder et al. (2009), for example, demonstrated in a large meta-analysis that the processing of concrete words includes sensory non-verbal experience (i.e. mental imagery), while the processing of abstract words depends primarily on verbal information. These results were supported by neuropsychological findings from different countries, including the US, the UK, and Germany (Binder et al. 2009; Wang et al. 2010; Hoffman 2016). Following up on these notions, the present investigation was conducted first in the US, and then again in Germany, with the intent to explore the generalizability of potential concreteness-effects of titles across two linguistically and culturally similar contexts. The linguistic and cultural contexts of the US and Germany were selected for two reasons. First, American English and German both belong to the Anglo-Frisian subgroup of West Germanic languages (Durrell 2006). As such, they share similar syntactic and semantic features (König and van der Auwera 1994). The second reason pertains to the close relationship between the academic cultures in the US and Germany. In both academic cultures, expository texts are a main medium for conveying information within educational contexts, and students are used to studying expository texts (Clyne 1987).

Research questions

Despite the ample empirical evidence for the potential metacognitive and motivational benefits of concrete titles, there are no studies that have systematically investigated the extent to which concrete versus abstract titles affect expected ease-of-comprehension, interestingness, study motivation, and learning outcomes. Based on the theoretical and empirical background outlined above, we raise the following research questions: (1) How does the concreteness of titles (Concrete vs. Abstract) affect learners’ expectations of how easily they will understand expository texts?; (2) How does the concreteness of titles (Concrete vs. Abstract) affect learners’ expectations of how interesting expository texts will be?, (3) How does the concreteness of titles (Concrete vs. Abstract) affect learners’ motivation to study expository texts?; (4) How does the concreteness of titles (Concrete vs. Abstract) affect learning outcomes?; (5) What are the relationships between expected ease-of-comprehension, interestingness, reported study motivation, and learning outcomes? (6) Are effects of titles comparable in US American and German learners?

Hypotheses

Concrete titles are perceived as easier to comprehend and more interesting than abstract titles (Sadoski et al. 2000). We therefore hypothesize that learners will expect expository texts with concrete (as opposed to abstract) titles to be easier to comprehend (Hypothesis 1) and more interesting (Hypothesis 2). In line with dual-coding oriented theories of text comprehension (e.g. Schnotz 2005; Schnotz and Bannert 2003; Sadoski and Paivio 2013) and the empirical findings by Sadoski et al. (2000), we expect that expected ease-of-comprehension partially mediates the effects of titles on expected interestingness (Hypothesis 3). Ease-of-comprehension and interestingness are typically positively correlated to study motivation (e.g. Shimada 2016; Fulmer et al. 2015; Hidi et al. 2004; Krapp 1999), and concrete language is engaging and involving (e.g. Sadoski et al. 2000). We therefore expect learners to be more motivated to study expository texts with concrete (as opposed to abstract) titles (Hypothesis 4), partially mediated by expected interestingness (Hypothesis 5), and ease-of-comprehension—directly (Hypothesis 6a) and indirectly (Hypothesis 6b). Study motivation is a predictor for learning outcomes (e.g. Krapp 1999; Hidi et al. 2004). In addition, concrete titles foster text comprehension by functioning as conceptual pegs (Anderson et al. 1977; Sadoski et al. 2000). We therefore expect learners who study expository texts with concrete (as opposed to abstract) titles to achieve higher comprehension test scores (Hypothesis 7), partially mediated by study motivation (Hypothesis 8).

Methods

The hypotheses were investigated in two separate experiments—one conducted in the United States, the other one in Germany. The materials for the American experiment were produced first, and then systematically translated from English into German. Previous to the investigations, all materials were reviewed and approved by the Boards of Ethics at the universities at which the experiments were conducted. All data was collected anonymously, under consideration of the participants’ privacy. All participants were informed about the terms of the data collection, and provided informed consent. After the pilot studies and the experiment, the participants received a detailed debriefing.

Participants

American participants

Sixty-three undergraduate students of a midsized university in the Western United States participated. Forty-three of the students were female, 19 male, and 1 student chose not to declare gender. The participants’ ages ranged between 18 and 40 years (M = 21.7 years; SD = 4.42). All participants were native speakers of English, with a mean GPA of 3.0 (SD = 0.5). Forty-two percent of the participants studied psychology, 33% were enrolled in social studies, and 25% reported other majors, such as business, kinesiology, and theatre. The ethnic composition of the participant sample was: 1.6% Pacific Islander, 4.8% African American; 12.7% Asian American; 12.7% Hispanic, 61.9% Caucasian, and 3.2% of the participants chose not to declare ethnicity.

German participants

Sixty-one undergraduate psychology students of a large university in the East of Germany participated. Forty-two students were female, 19 male. The participants’ ages ranged between 18 and 41 years (M = 23.05; SD = 4.96). All participants were native speakers of German, with a mean German GPA of 1.76 (SD = 0.66), which resembles an American GPA of 3.6. Sixty-five percent of the participants studied psychology, the remaining 35 percent were education majors.

Design

One factor (Concreteness of Titles) was varied on two levels (Concrete vs. Abstract) in a between-subjects-design.

Materials

The experimental materials were comprised of three expository texts (plus one practice text), each accompanied with one of two titles (Concrete vs. Abstract); a set of Likert-type rating scales; and three separate comprehension tests, each consisting of eight Meaning-Identification-Technique test sentences (MIT; Marchant III et al. 1988). All experimental materials were incorporated into a computer-based learning environment that was designed with the Study-2000-technology (Narciss et al. 2013).

Experimental texts

Four expository texts on scientific topics were derived from wiki pages and modified to suit the purpose of this study. The texts were examined by two science experts (i.e. one university professor of learning and instruction in science education and one university professor of physics) who approved of the text contents, and confirmed that domain-specific terminology was used correctly. Each text explained the design and functions of a different technological device used to overcome human boundaries (i.e. scuba regulators/airplanes/thermal imaging cameras/hearing aids). Each text was constructed using the revision techniques by Britton and Gülgöz (1991) as guiding principles to control for text coherence. Topics were chosen from different domains to avoid argument overlap, and texts were equated for word count (range 220–284) and readability (Flesh-Kincaid-grade level range 7.2–10.1). The readability range was chosen so that all texts could be well-understood by university undergraduates (i.e. Flesh-Kincaid-grade level ≤ 13).

The texts (without titles) were piloted on 18 undergraduate American students and 22 undergraduate German students of the universities at which the main studies were going to be conducted. Volunteers that took part in one of the pilot studies could neither take part in another pilot study, nor the main study. Participants rated each text for mental imagery, comprehensibility, interestingness, and content familiarity (as an indicator of prior knowledge) using a separate Likert-type rating scale for each rating, with scales ranging from 1 (very hard to form a mental image of this; very hard for me to understand; not interesting to me; not familiar to me in content) to 7 (very easy to form a mental image of this; very easy for me to understand; very interesting to me; very familiar to me in content). The rating scales were the same as used by Sadoski et al. (1993, 2000) and Sadoski (2001). The order of text presentation was randomized for each participant to prevent sequence effects. Cronbach’s α was at 0.5 for all ratings, which was low, but considered sufficient considering the small participant sample sizes (N = 18; 22) and the small number of texts (N = 4). Because of the small sample sizes in the pilot studies, the texts were compared with non-parametric Friedman tests of differences among repeated measures.

In both samples (American and German), the Friedman tests revealed no differences between the texts on scuba regulators, airplanes, and thermal imaging cameras with regard to the ratings (all p > 0.05). The texts elicited medium levels of mental imagery (M: 4.50–5.41; SD: 1.05–2.01), comprehensibility (M: 4.64–5.36; SD: 1.28–1.78), perceived interestingness (M: 3.73–4.18; SD: 1.47–2.13), and evoked low content familiarity (M: 2.50–3.45; SD: 1.14–1.89).

Low content familiarity was important to ensure that participants had the opportunity to learn something from the texts, and thus, to prevent ceiling effects in terms of text comprehension. In both samples, the text on hearing aids was rated slightly more familiar in content than the other three texts, and was therefore selected to function as the practice text.

Titles

Each of the three experimental texts was accompanied by a semantically concrete versus abstract title (Getting Airplanes off the Ground vs. Aerodynamic Laws of Ascension; Breathing Underwater vs. Subaquatic Regulator Systems; Seeing Objects in the Dark vs. Thermal Variability Imaging). All of the titles were integrative in that they referred to the main ideas, concepts, or themes of their corresponding texts (Filippatou and Pumfrey 1995). The concrete titles were based on phrases that directly referred to specific sensory experience (i.e. breathing, seeing, and getting off the ground). The abstract titles were based on intangible terms (systems, laws, and variability) in combination with general technical terms (subaquatic, regulator, aerodynamics, and thermal), and were mostly unrelated to sensory experience.

The practice text was presented without a specific title (i.e. the text was simply called ‘Practice Passage’) to avoid priming effects. According to Glanzer and Bowles (1976) and Paivio et al. (1968), concrete words have a higher probability to occur in printed text (i.e. a higher word frequency) than abstract words, and high-frequent words tend to be shorter than low-frequent words (Ozuru et al. 2012). The abstract titles are therefore on average three syllables longer than the concrete titles, while having an almost identical word count. Both title versions for each text were anchored to the first text sentence by a two- or three-content-word overlap in order to establish direct verbal associations between text and titles (Sadoski et al. 2000). This was also important because learners expect the most important information to be mentioned first in a text (Kieras 1981), and initially-mentioned information influences learners’ perceptions of whole texts (Kieras 1980). A content word that was used in one title was not used in any other title. Pilot studies were conducted to determine whether the manipulation of semantic concreteness produced differences in mental imagery, ease-of-comprehension, and interestingness, as proposed by Sadoski et al. (1993, 2000) and Sadoski and Paivio (2004, 2013).

The titles (without texts) were piloted on 24 other undergraduate American and 22 other undergraduate German students from the same universities as before, using the same 7-point Likert-type rating scales that were used for piloting the texts. The order of title presentation was randomized for each participant. Cronbach’s α ranged between 0.6 and 0.7 for the ratings in both samples (American and German). In order to determine whether the title-pairs (Concrete vs. Abstract) for each text differed significantly, we compared the titles with non-parametric Wilcoxon tests because of the small sample sizes (N = 24; 22). The results showed significant differences between all of the title pairs (all p ≤ 0.01). The concrete titles were easy to imagine (M: 6.21–6.42; Mdn: 7.00; SD: 0.97–1.56), highly comprehensible (M: 6.63–6.71; Mdn: 7.00; SD: 0.46–1.02), interesting (M: 5.04–6.00; Mdn: 5.00–7.00; SD: 1.10–1.53), and rather familiar in content (M: 5.41–6.37; Mdn: 6.00–7.00; SD: 0.96–1.87). The abstract titles ranged within a medium level of mental imagery (M: 3.92–4.21; Mdn: 4.00–4.50; SD: 1.74–2.19), comprehensibility (M: 3.92–4.58; Mdn: 4.00–5.00; SD: 1.69–1.84); interestingness (M: 4.13–4.46; Mdn: 4.00–5.00; SD: 1.84–1.92); and were not as familiar in content (M: 2.37–2.83; Mdn: 2.00; SD: 1.48–1.78).

Self-report measures

For the main experiments, ratings on expected ease-of-comprehension (EOC), interestingness (INT), and motivation to engage in a text (MOT) were assessed using separate 7-point Likert-type rating scales for each rating, and each text. The scales for each of the ratings ranged from 1 (= very hard to understand/not at all interesting/not at all motivated) to 7 (= very easy to understand/highly interesting/highly motivated). Participants were specifically asked: How easy do you think it will be for you to understand a passage titled…? How interesting do you expect a passage titled…to be? How motivated are you to study a passage titled…? We determined the reliabilities of the ratings, and obtained the following values (Cronbach’s α): EOC α = 0.84; INT α = 0.71; MOT α = 0.77 (American sample), and EOC α = 0.91; INT α = 0.71; MOT α = 0.84 (German sample).

Comprehension tests

To assess conceptual text understanding we constructed three separate comprehension tests (one for each text), using the Meaning-Identification-Technique (MIT; Marchant III et al. 1988). In MIT, comprehension is assessed at the level of the propositional text base (for levels of text comprehension see Kintsch 2004) by exposing participants to test sentences to which they respond with either Yes or No, based on whether they think the sentences accurately describe information they encountered in a previously studied text. There are two types of test sentences in MIT: Paraphrases and meaning changes of paraphrases. Paraphrases carry the same meaning as statements in the text, and the correct response to these test sentences would therefore be YES. Meaning changes of paraphrases do not carry the same meaning as statements in the text, and should therefore be responded to with NO. To construct a comprehension test with the MIT, all original sentences from a text are first paraphrased by changing as many words as possible, without making the sentences artificially difficult or incoherent (Marchant III et al. 1988). Then, half of these paraphrased sentences are slightly changed in their meaning so that they no longer corresponded with the original statements from the texts (Marchant III et al. 1988). These meaning changes typically include the substitution of concepts from the text with other, seemingly similar concepts, as shown in the following example from the text on thermal imaging cameras:

  • Original sentence from text Infrared detector elements create a detailed temperature pattern.

  • Paraphrase The generated heat pattern is produced by infrared light sensors.

  • Meaning change The generated heat pattern is based on ultraviolet emissions.

Based on the item-analysis of a pilot study in which another 38 American undergraduate students took part, we selected eight test-sentences per text—half of them paraphrases, half of them meaning changes. The resulting three comprehension tests (Cronbach’s alphas were 0.7 for each of the tests) only included items that ranged within a medium level of item difficulty (range: 0.55–0.84), and that contributed to the internal consistency of the respective comprehension test. We further ensured that all major concepts which were stated in the texts were also addressed in the comprehension tests, and that each comprehension test addressed a comparable amount of information stated in the first and the second half of the text on which the comprehension test was based. The finalized American MIT test was then systematically translated into German.

Procedure

After providing informed consent for the investigation, the participants in both samples (American and German) were randomly assigned to one of the title-conditions (Concrete vs. Abstract), and completed one practice trial to become familiar with the computer-based learning environment, and the demands of the learning tasks (i.e. the participants studied the practice text and completed a practice MIT-test pertaining to that text). For the three experimental texts, participants first only saw the title of the text they were about to encounter and, based on that title, rated their expected ease-of-comprehension (EOC), and the expected interestingness (INT) of the following text, as well as the motivation to study that text (MOT). Participants studied the text which followed the title, and then completed the MIT-test pertaining to that text. This procedure was repeated for the remaining two experimental texts. All tasks were self-paced. The order of text appearance was counterbalanced. In a last step, participants completed a demographics questionnaire, and then received a detailed debriefing about the experiment.

Results

For each participant, the means of the ratings and comprehension test outcomes were computed. The data were analyzed using SPSS 24. The ANOVAs were Bonferroni-corrected at p < 0.0125 for testing 4 main-effect-hypotheses on one sample. The SEM was conducted using AMOS for SPSS, Version 24.

Descriptive statistics

Table 1 shows the means, standard deviations, skewness, and kurtosis for the ratings of ease-of-comprehension (EOC), interestingness (INT), motivation (MOT), and for comprehension test performance (COMP), for both samples (American and German).

Table 1 Descriptive statistics

All skewness and kurtosis values were less than an absolute value of 3, indicating that normality for subsequent analysis could be assumed (Tabachnick and Fidell 1996). No outliers were identified in the data.

Correlations

Zero-order correlations were conducted in both samples (American and German) to determine whether the proposed relationships between the variables were present (see Table 2). Except for the relationships between comprehension test performance and most other variables, all proposed relationships were found (see Table 2).

Table 2 Pearson correlations main experiments (American and German sample)

Comparability of US and German data sets

The descriptive statistics (Table 1) and correlational patterns (Table 2) for the American and German data sets indicate a high degree of comparability between the two samples. To statistically test for potential differences, the two data sets were compared with independent samples t-tests for each of the four outcome measures. The confidence interval was adjusted to 99.98% for testing multiple hypotheses. There were no significant differences between the US American and the German sample in terms of expected ease-of-comprehension [t (122) = 0.05, p = 0.90], expected interestingness [t (122) = 1.69, p = 0.09], and motivation [t (122) = − 1.53, p = 0.12). The samples only differed with regard to comprehension test performance [t (122) = − 5.56, p < 0.01), with German learners solving on average one more test item correctly (M = 6.66; SD = 0.63) than American learners (M = 5.71; SD = 1.17). Because of the theoretically proposed and empirically evidenced comparability of both samples, the data sets were combined to increase power for further analysis. Follow-up analyses for performance data was conducted separately for the two samples.

Main effects

Expected ease-of-comprehension (EOC)

The ANOVA to test Hypothesis 1 revealed a large effect of Titles on EOC [F (1, 122) = 78.25; MSerror = 1.30; p < 0.001; partial ŋ2 = 0.39]. Students who were presented with concrete titles expected the subsequent texts to be easier to comprehend (M = 4.73; SD = 1.13) than students who were presented with abstract titles (M = 2.92; SD = 1.15).

Expected interestingness (INT)

The ANOVA to test Hypothesis 2 revealed a large effect of Titles on INT [F (1, 122) = 47.12; MSerror = 1.44; p < 0.001; partial ŋ2 = 0.27]: Students who were presented with concrete titles expected the following texts to be more interesting (M = 4.68; SD = 1.12) than students who were presented with abstract titles (M = 3.20; SD = 1.27).

Reported study motivation (MOT)

The ANOVAs to test Hypothesis 4 revealed a large effect of Titles on MOT [F (1, 122) = 25.14; MSerror = 1.72; p < 0.001; partial ŋ2 = 0.17]: Students who were presented with concrete titles reported to be more motivated to study the following texts (M = 4.43; SD = 1.21) than students presented with abstract titles (M = 3.25; SD = 1.41).

Comprehension test performance (COMP)

The ANOVAs to test Hypothesis 7 revealed no effect of Titles on COMP [F (1, 122) = 1.48; MSerror = 1.11; p = 0.22; partial ŋ2 = 0.12]. Learners in both title groups solved about the same amount of test sentences correctly (Mconcrete = 6.26; SDconcrete = 1.03; Mabstract = 6.06; SDabstract = 1.07). Because the two samples originally differed in comprehension test performance, the analysis was repeated separately for each sample, with the same main result: There was no effects of titles in either sample (American: [F (1, 61) = 0.92; MSerror = 1.38; p = 0.34; partial ŋ2 = 0.02]; German: [F (1, 59) = 0.77; MSerror = 0.40; p = 0.38; partial ŋ2 = 0.01]). Learners in both title groups solved about the same amount of test sentences correctly in either sample (American: Mconcrete = 5.85; SDconcrete = 1.22; Mabstract = 5.57; SDabstract = 1.13; German: Mconcrete = 6.72; SDconcrete = 0.53; Mabstract = 6.58; SDabstract = 0.72).

Mediation analyses with structural equation modelling (SEM)

The more complex mediating relationships between titles, ease-of-comprehension (EOC), interestingness (INT), motivation (MOT), and comprehension test performance (COMP)—as proposed in Hypotheses 3, 5, 6, and 8—were investigated with Structural Equation Modelling (SEM). SEM was used because it allows to create and confirm specific path models with more flexibility than traditional methods, such as path analyses via regressions (Kline 2011).

While regression analyses imply statistical relationships between variables based on conditional expected values, SEM assumes functional relationships grounded in conceptual models (Bollen and Pearl 2012). The paths for the conceptual model in this investigation were defined based on the theoretically meaningful relationships described in the introduction. The resulting empirical model is displayed in Fig. 1. For ease of interpretation, all paths are displayed as standardized. Paths marked with ** are significant at p < 0.01, paths marked with * are significant at p < 0.05. The overall model fits the data well: χ2 (N = 124) = 9.18; p = 0.01; GFI = 0.97; NFI = 0.97; RMSEA = 0.17). In line with the ANOVAs, the paths of the proposed main effects of titles on EOC and INT were significant at p < 0.01.

Fig. 1
figure 1

Mediation analyses with structural equation modelling (SEM)

Partial mediation of expected interestingness (INT) by (title) through expected ease-of-comprehension (EOC)

To test Hypothesis 3, we determined the standardized indirect effect of Titles on INT at (0.63) (0.61) = 0.38. We tested the significance of this indirect effect using bootstrapping procedures. Unstandardized indirect effects were computed for each of 2000 bootstrapped samples, and the 95% confidence interval was computed by determining the indirect effects at the 2.5th and 97.5th percentiles. The bootstrapped unstandardized indirect effect was 1.06, and the 95% confidence interval ranged from 0.76, 1.38. The indirect effect of Titles on INT through EOC was statistically significant (p < 0.01), thereby supporting Hypothesis 3.

Partial mediation of study motivation (MOT) by (Title) through expected interestingness (INT) and expected ease-of-comprehension (EOC)

To test Hypotheses 5 and 6, we first determined the standardized indirect effect of Titles on MOT through INT and EOC at (0.63)(0.61)(− 0.03)(0.76) = 0.38. Unstandardized indirect effects were computed for each of 2000 bootstrapped samples at a 95% confidence interval. The bootstrapped unstandardized effect was 1.09, and the 95% confidence interval ranged from 0.75, 1.40. The indirect effect of Titles on MOT was significant at p < 0.01. In a next step, we examined the relative contributions of EOC and INT on the mediation effect by determining the direct effect of INT on MOT, and the direct effect of EOC on MOT. The standardized direct effect of INT on MOT was 0.76. The bootstrapped unstandardized direct effect was 0.77, the 95% confidence interval ranged from 0.59, 0.90, and was significant at p < 0.01, thereby supporting Hypothesis 5. The standardized direct effect of EOC on MOT was − 0.03. The bootstrapped unstandardized direct effect was −0.03, the 95% confidence interval ranged from − 0.21, 0.15, and was not significant at p = 0.76, thereby rejecting Hypothesis 6a. For Hypothesis 6b, we determined the standardized indirect effect of EOC on MOT at (0.61)(0.76) = 0.46. The bootstrapped unstandardized effect was 0.45, at an interval range of 0.31–0.60. The indirect effect of EOC on MOT was significant at p < 0.01, thereby supporting Hypothesis 6 b.

Partial mediation of comprehension test performance (COMP) by title through study motivation (MOT)

The American and German samples differed significantly in comprehension test performance. In addition, comprehension test performance (COMP) did not correlate with any of the other variables in the American sample, and did not yield significant differences between the experimental groups in neither the American nor the German sample. We therefore computed the mediation analysis for Hypothesis 8 only for explorative purposes, and separately for the two samples. In the American sample, the standardized indirect effect of Titles on COMP was determined at (0.06)(0.15) = − 0.02. The bootstrapped unstandardized effect was 0.15, at an interval range of − 0.37 to 0.28. The indirect effect was not significant at p = 0.64, thereby rejecting Hypothesis 8. In the German sample, COMP correlated with MOT and EOC, and the path from MOT to COMP in the empirical SEM model reached significance at p < 0.05. The standardized indirect effect of Titles on COMP through MOT in the German sample was determined at (− 0.01)(0.26) = 0.09. The bootstrapped unstandardized effect was 0.11, at an interval range of − 0.01 to 0.29. The indirect effect of Titles on MOT did not reach significance (p = 0.07), thereby rejecting Hypothesis 8.

Discussion

The cognitive functions of titles are well-researched (e.g. Bransford and Johnson 1972; Filippatou and Pumfrey 1995; Lorch and Lorch 1996; Ritchey et al. 2008), yet little is known about the metacognitive and motivational effects of titles in instructional contexts (Shimada 2016). The primary goals of the present two experiments were thus to a) determine the extent to which concrete versus abstract titles affect metacognitive and motivational expectations students generate with regard to expository texts, with a focus on expected ease-of-comprehension and expected interestingness; b) gain insight into the relationships between expected ease-of-comprehension, interestingness, and study motivation, c) determine if, and how, text expectations and study motivation are related to comprehension test performance; d) explore the comparability and generalizability of effects across two similar linguistic and cultural contexts (American and German), and d) derive practical implications for the design of titles for expository texts.

In line with our main hypotheses, the results of revealed that both American and German students expected texts with concrete (as opposed to abstract) titles to be easier to comprehend and more interesting, and reported to be more motivated to study those texts. Contrary to our hypotheses, the increased motivation to study texts with concrete titles did not result in increased comprehension test performance for these texts. Structural Equation Modelling (SEM) was applied to test a conceptual model of the relationships between expected ease-of-comprehension, interestingness, reported study motivation, and comprehension test performance. In line with our hypotheses, the concreteness of titles affected students’ motivation to study expository texts, mediated by how interesting the students expected the texts to be. Expected interestingness and study motivation were partially mediated by the ease with which students expected to understand the texts (see Fig. 1).

Theoretical considerations, limitations, and implications

Dual-coding oriented theories of text comprehension and neuropsychological evidence (e.g. Sadoski and Paivio 2013; Schnotz 2005; and Bannert 2003; Binder et al. 2009; Hoffman 2016) suggest that concrete language is richer in referential connections between the verbal and non-verbal system, and therefore easier to mentally picture and comprehend, and perceived as more interesting, than abstract language (Sadoski and Paivio 2004, 2013; Sadoski et al. 2000). Theoretical approaches to motivation in text comprehension (e.g. Krapp 1999; Hidi et al. 2004) propose that the interestingness of learning materials is a driving force for study motivation (Hidi et al. 2004; Krapp 1999; Fulmer et al. 2015). In line with both theoretical perspectives, the students in our pilot studies found concrete titles relating to sensory experiences (such as Breathing Underwater or Seeing Objects in the Dark) easier to mentally picture and comprehend, and more interesting, than abstract titles based on intangible, rather technical terms (such as Aerodynamic Laws of Ascension or Subaquatic Regulator Systems). In line with empirical evidence for the universal role of language concreteness across linguistic contexts (Binder et al. 2009; Wang et al. 2010; Hoffman 2016), the same results were obtained for American and German participants.

For the main experiments, we proposed a conceptual model for the relationships between the concreteness of titles and students’ expected ease-of comprehension, interestingness, study motivation, and comprehension test performance. Overall, this conceptual theoretical model was strongly supported by the data (Fig. 1)—particularly with regard to the mediating relationship between the concreteness of titles, expected text interestingness, and study motivation. This finding aligns with previous research showing that concrete information is often perceived as more interesting and motivating than information that is difficult to envision and understand (e.g. Sadoski et al. 2000; Schraw et al. 1995; Schraw and Lehman 2001).

With regard to the role of prior knowledge, it is important to note that effects may vary for learners with different levels of expertise. The pilot studies for the present experiments revealed low levels of prior knowledge in American and German participants, and there were no outliers identified in terms of comprehension test performance in the main experiments. However, it is certainly possible that learners with high levels of prior knowledge might experience a trade-off for title concreteness and interestingness/motivation. That is, learners with a lot of prior knowledge might expect texts with concrete titles to be too easy and therefore not at all interesting, which could result in a decreased motivation to study texts with concrete as compared to abstract titles; an interesting hypothesis for future research.

A rather puzzling finding was the lack of association between comprehension test performance and the other variables. We attribute this lack of effects primarily to the employed comprehension measure, which was likely not sensitive enough to detect potential differences in learning outcomes between the experimental groups. In the Meaning-Identification-Technique (MIT; Marchant III et al. 1988), students identify whether a presented statement aligns with a statement they had encountered in a previously studied text. The presented statement is either a paraphrase (i.e. a correct statement) or a meaning-change of an original statement from the text (i.e. an incorrect statement). Even though we altered as many words as possible to create the paraphrases and meaning changes from the original statements, the resulting items still provided contexts which could be utilized by students to access their mental model of the texts (Bransford and Johnson 1972; Anderson et al. 1977; 2005). In free recall tasks—which are more typically employed as outcome measures in studies on titles (e.g. Dooling and Mullet 1973; Kozminsky 1977; Schwartz and Flammer 1981; Yuill and Joscelyn 1988), students have fewer cues available to access their mental text models. Free recall tasks may thus be more suitable to assess differences in text comprehension, based on titles—particularly when titles serve as the retrieval cues for the free recall tasks. A second consideration regarding the lack of effects on comprehension test performance pertains to the experimental setting and procedure. In both title-groups (Concrete vs. Abstract), students were aware that they were going to be tested on their knowledge after reading. In addition, the experiments were conducted in highly controlled lab settings, in presence of the experimenters. These two factors may have induced high levels of socially desirable study behavior in the participants, resulting in proper study behavior despite low levels of motivation in the abstract title-group (Richman et al. 1999). An interesting way to approach this issue would be an online replication of the present studies in which the participants are removed from the highly controlled lab environment, and less inclined to behave in socially desirable ways (Richman et al. 1999). A third consideration pertains to the experimental texts, which were comparatively short (220–284 words) and of medium concreteness, comprehensibility, and interestingness (see Tables 1 and 2). Because the texts did not fully align with either title condition (i.e. they were less concrete, comprehensible, and interesting than their titles in the concrete-title-condition, and more concrete, comprehensible, and interesting than their titles in the abstract-title-condition), students in either title-group may have adjusted their motivation levels in either direction during reading, based on the text characteristics. A way to address this issue in future research could include a slightly more complex 2 × 2-design in which both, titles and texts, are varied systematically in terms of their concreteness. However, in spite of the limitations in terms of comprehension test performance, the results of the present investigation substantially extend the current literature by helping us understand the processes by which titles affect student’s text expectations in terms of ease-of-comprehension and interestingness, and how these expectations relate to study motivation.

Practical implications

Based on the mediating relationships between the concreteness of titles and students’ text expectations and study motivation, the present investigation offer a number of practical implications for the design of titles in instructional contexts. The first implication pertains to the relationship between the concreteness of titles and expected ease-of-comprehension. In both samples (American and German), students expected texts with concrete titles to be easier to comprehend than texts with abstract titles. This is important because many students are in fact afraid of approaching expository texts, particularly if they know they are going to be tested on the text contents (Lin et al. 2000). Employing concrete as opposed to abstract titles will help to counteract that initial fear by increasing the students’ expected ease-of-comprehension.

The second practical implication pertains to the relationships between the concreteness of titles and expected interestingness and study motivation. Students were more motivated to study expository texts with concrete (as opposed to abstract) titles, and this was in part due to the expectation that those texts would be more interesting. Studies over the past three decades have shown that students’ academic motivation declines over the course of their educational paths (e.g. Hidi and Harackiewicz 2000; Anderman and Maehr 1994; Harter 1981). More specifically, students’ interests and attitudes towards school decline over time, particularly with regard to scientific topics (e.g. Hidi and Harackiewicz 2000; Eccles and Wigfield 1992; Eccles et al. 1998). It is therefore imperative for educators, text book editors, publishers, and designers of online learning environments to employ instructional strategies that promote and cultivate study motivation—particularly with regard to topics that rely heavily on learning from expository texts as is the case in science education (e.g. Coiro 2003; Leopold and Mayer 2015). As the results of the present two investigations show, employing concrete as opposed to abstract titles has a strong potential to promote and maintain study motivation by increasing the expected interestingness of expository texts.