Abstract
People from Western societies generally find it difficult to name odors. In trying to explain this, the olfactory literature has proposed several theories that focus heavily on properties of the odor itself but rarely discuss properties of the label used to describe it. However, recent studies show speakers of languages with dedicated smell lexicons can name odors with relative ease. Has the role of the lexicon been overlooked in the olfactory literature? Word production studies show properties of the label, such as word frequency and semantic context, influence naming; but this field of research focuses heavily on the visual domain. The current study combines methods from both fields to investigate word production for olfaction in two experiments. In the first experiment, participants named odors whose veridical labels were either high-frequency or low-frequency words in Dutch, and we found that odors with high-frequency labels were named correctly more often. In the second experiment, edibility was used for manipulating semantic context in search of a semantic interference effect, presenting the odors in blocks of edible and inedible odor source objects to half of the participants. While no evidence was found for a semantic interference effect, an effect of word frequency was again present. Our results demonstrate psycholinguistic variables—such as word frequency—are relevant for olfactory naming, and may, in part, explain why it is difficult to name odors in certain languages. Olfactory researchers cannot afford to ignore properties of an odor’s label.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
We all recognize the smell of freshly baked bread when walking past a bakery, and so you might mistakenly think you can also easily name that odor. However, naming smells in the absence of contextual cues appears to be difficult: on average, people only name 25% to 50% of odors correctly (Cain, 1979, 1982; Desor & Beauchamp, 1974; Distel & Hudson, 2001; Yeshurun & Sobel, 2010), which stands in sharp contrast to the near-ceiling performance reported in visual naming studies (Damian, Vigliocco, & Levelt, 2001; Jescheniak & Levelt, 1994). So, why are people so bad at naming odors?
Various proposals abound (see Jönsson & Stevenson, 2014, for an overview). Perhaps we are not very good smellers because of the loss of functional olfactory receptor genes in modern humans (Gilad, Man, Pääbo, & Lancet, 2003), for example. However, behavioral studies paint a different picture (Majid, Speed, Croijmans, & Arshamian, 2017). Cross-species comparisons show that humans have comparable olfactory sensitivity to nonhuman primates (Laska, Seibt, & Weber, 2000; see also Shepherd, 2004; McGann, 2017). Others argue the problem lies in the link between olfactory and verbal areas of the brain: These links are either inherently weak (e.g., Engen, 1987), interfere with each other (Lorig, 1999), or are too direct (Olofsson & Gottfried, 2015). However, it is unclear whether the nature of this link is the cause of poor odor naming or whether it merely reflects learning history (Majid, 2015). Recently, Majid and Burenhult (2014) showed that Jahai speakers from the Malay Peninsula were able to name odors with more ease than English speakers. Unlike English speakers, Jahai speakers have a dedicated vocabulary to describe different qualities of smell (Burenhult & Majid, 2011). This raises the question of whether poor odor naming could be the result of properties of the lexicon itself; and more broadly, what role psycholinguistic variables play in odor naming.
Prior research examining odor naming has overlooked the role of the lexicon, and focused almost exclusively on properties of the odor instead. It has been shown that an odor’s familiarity is an important factor in free naming (Lawless, 1978; Lawless & Cain, 1975), as well as in recognition (Rabin & Cain, 1984) and discrimination (Rabin, 1988). Odor pleasantness and intensity also influence naming (Distel & Hudson, 2001), with identified odors rated as more intense and pleasant. “Real” odors (i.e., taken from a natural source; e.g., actual chocolate) are named correctly more often than synthetic odors (e.g., microencapsulated odors; cf. Engen, 1987).
In contrast, there is little examination of whether properties of the odor label might impact odor naming, even though some words are simply easier to produce than others. One property shown to influence word production is how often a word is used, that is, its frequency. Pictures with high-frequency labels are named faster (e.g., Oldfield & Wingfield, 1965), and more accurately (e.g., Jescheniak & Levelt, 1994), than pictures with low-frequency labels. Odor-naming studies never control for this factor. It is simply assumed—without ever checking any corpora for the linguistic facts—that because odors used in naming studies are familiar, they must have high-frequency names (cf. Jönsson, 2005). When word frequency has been discussed, it has been used only as a proxy for odor frequency (e.g., Cain et al., 1995; Wijk & Cain, 1994), the idea being that frequently encountered odors are probably referred to with high-frequency words. Again, this assumption has never been explicitly tested. To date, there is no evidence that word frequency of the odor label, rather than familiarity or frequency of occurrence of the odor itself affects odor naming.
In the current study, we investigated to what extent a psycholinguistic variable such as odor label frequency influences odor naming. Based on previous research, we hypothesized odors with high-frequency labels to be named correctly more often than odors with low-frequency labels, even when taking other known factors into account.
Experiment 1
Method
Participants
Participants were 42 native speakers of Dutch (Mage = 22.8 years old, SD = 3.7, range: 18–37 years, 32 female), who were recruited through the Radboud University participant system. They all gave written consent before the experiment began and were paid €7.50 for their participation.
Stimuli
Twenty-four odors (see Appendix Table 10) were selected such that their veridical labels were either high or low frequency. We were restricted in our selection of odors such that the concrete objects were easily administrated as odors as well as differing substantially in lexical frequency. Odors were presented in 30-mL dark glass jars with cotton wool covering the objects so participants could not see them. Log-label frequency was determined by combined occurrences in Dutch CELEX (Burnage, 1990), Dutch SUBTLEX (Keuleers, Brysbaert, & New, 2010), Spoken Dutch (Oostdijk, 2000), and OpenSoNaR corpora (Oostdijk, Reynaert, Hoste, & Schuurman, 2013). The number of occurrences in each corpus was summed and then divided by the total size of the four corpora combined. High-frequency (Mlog(F) = 1.56 per million, SD = 0.21) and low-frequency labels (Mlog(F) = 0.14 per million, SD = 0.32) differed significantly from each other, t(22) = 12.881, p < .001, d = 5.36.
In addition to word frequency, factors that have been shown to influence word production are word length (Klapp, Anderson, & Berrian, 1973; Meyer, Roelofs, & Levelt, 2003), age of acquisition (Barry, Morrison, & Ellis, 1997; Carroll & White, 1973), and image ability/concreteness (Strain, Patterson, & Seidenberg, 1995). We examined each of these variables too. Word length was operationalized as the number of characters in Dutch spelling. Age of acquisition and concreteness values were taken from data provided by Brysbaert, Stevens, De Deyne, Voorspoels, and Storms (2014). There were no differences in word length t(22) = 1.290, p = .210, d = 0.53, and concreteness t(13.511), p = .233, d = 0.51, between the high-frequency and low-frequency conditions, but age of acquisition differed significantly, t(18.795) = 4.89, p < .001, d = 2.00. However, as all these factors have been shown to be strongly related (Morrison, Chappell, & Ellis, 1997), we focused on the effect of label frequency in the analyses to avoid issues with collinearity (following Baayen, 2008).
Procedure
The experiment consisted of two parts: a naming and rating task, followed by a questionnaire, and took approximately 45 minutes to complete.
In the naming/rating task, participants were asked to name the odors by verbally answering the question Welke geur is dit? (“Which odor is this?”). Participants were allowed to smell each odor as often as they liked, and were permitted multiple responses. After naming each odor, participants rated it on five different 7-point Likert scales, using Qualtrics Survey Software on a desktop computer. The order of rating scales was identical for all trials and all participants: (1) intensity (how strong the odor smelled), (2) familiarity (how familiar the odor was), (3) pleasantness (how pleasant the odor was), (4) edibility (how edible an object with this odor would be) and, (5) odor frequency (how often the participant personally encountered the odor). High values on the scale stood for high intensity, familiarity, and so forth. Odor presentation order was randomized between participants.
After the main experimental task, participants were asked to complete a demographic questionnaire about their background, including information pertinent to the experiment (e.g., smoking, allergies, illness, cooking experience). All participants were instructed not to eat or smoke at least 1 hour before the experiment.
Results
Odor ratings
Before examining odor naming, participants’ rating scores for odors (see Table 1) were compared to assess potential covariates. Participant fatigue to odors was also assessed by correlating rated intensity with the order in which odors were presented. In addition, we checked correlations between label frequency, familiarity rating, and odor frequency rating to assess whether familiarity and frequency are related and whether familiar odors are indeed described with high-frequency labels.
Odors with low-frequency labels were rated as more intense than those with high-frequency labels, t(42) = 7.99, p < .001, d = 1.23. However, there was no significant correlation between intensity rating and the order in which odors were presented, r(22) = .129, p = .550, indicating that participants showed no signs of fatigue. Odors with low-frequency labels were rated as more pleasant than odors with high-frequency labels, t(42) = 4.42, p < .001, d = 0.68. There was no difference in edibility ratings between odors with low-frequency or high-frequency labels, t(42) = 1.33, p = .189, d = 0.21.
Odors with low-frequency labels were rated as more familiar than those with high-frequency labels, t(42) = 4.42, p < .001, d = 0.68, and as more frequently occurring, t(42) = 2.71, p = .010, d = 0.41. There was a strong correlation between odor frequency rating and odor familiarity rating, r(22) = .945, p < .001, indicating these two factors are related. However, there were no significant correlations between familiarity ratings and log-frequency of odor labels, r(22) = −.137, p = .524, or odor frequency rating and log-frequency of odor label, r(22) = −.021, p = .923. While participants’ subjective ratings might not reflect true odor frequency in the environment, previous assumptions in the literature that familiar and frequently occurring odors are also described with high frequency words (e.g., Jönsson, 2005) is not necessarily true.
As we found significant differences between odors with high-frequency and low-frequency labels, rating scores were also included in the modelling process to assess the contribution of these factors to odor naming.
Odor naming
Audio-recordings of participants’ responses on the naming task were transcribed, after which main responses were extracted. Main responses were defined as content responses (without modifiers; e.g., a bit like beer was coded as “beer”), excluding hedonic judgments, intensity judgments, and descriptions of elusive sensations (e.g., “familiar”; “recognizable”). For the critical analysis of naming accuracy, only the 24 predefined veridical labels were counted as target answers since their word frequencies and other psycholinguistic variables were considered the point of reference. Results from the naming task were analyzed using mixed logit models (Jaeger, 2008), appropriate for binomially distributed outcomes described as a combination of fixed and random effects. The analyses were done in R (R Core Team, 2013), using the lme4 package (Bates, Maechler, Bolker, & Walker, 2015). In the final model,Footnote 1 log-label frequency was added as fixed factor, with odor familiarity, odor frequency and odor object edibility ratings as covariates.Footnote 2 Including interactions did not significantly improve the model. The dependent variable was naming accuracy (correct; incorrect). We controlled for random participant and item effects. As participants were permitted to give more than one response, separate analyses were conducted for first responses and all responses, to see if effects were stable across time, as Jescheniak and Levelt (1994) found that frequency effects are only detectable in the immediate time frame and disappear soon after. As such, we would expect to find frequency effects in first responses but not necessarily in all responses. For the all-responses analysis, if any of the responses participants gave for a particular odor was the veridical label, this was counted as correct.
For participants’ first responses (see Table 2), properties of the label influenced odor naming, as did properties of the odor itself. Label frequency significantly predicted naming accuracy, β = 1.01, SE = 0.51, z = 2.01, p = .044. Odors with high-frequency labels (25.2%) were named correctly more often than those with low-frequency labels (17.0%); see Fig. 1. In addition, odor familiarity predicted naming accuracy: β = 0.43, SE = 0.12, z = 3.50, p = .001; as did edibility: β = 0.34, SE = 0.08, z = 4.15, p = .001; and rated odor frequency: β = 0.29, SE = 0.10, z = 2.78, p = .005.
For all responses (see Table 3), naming accuracy was again predicted by odor familiarity: β = 0.43, SE = 0.11, z = 3.90, p = .001; odor edibility: β = 0.31, SE = 0.08, z = 4.36, p = .001; and odor frequency: β = 0.33, SE = 0.10, z = 3.33, p = .001. However, the effect of label frequency disappeared, β = 0.65, SE = 0.40, z = 1.63, p = .103, even though odors with high-frequency labels (30.0%) were still named correctly more often than those with low-frequency labels (22.7%) numerically; see Fig. 1.
Nonveridical responses
As most responses were nonveridical, it is interesting to look at these in more detail and see whether there are psycholinguistic factors involved in these responses as well. That is, when people are unsure of how to name a smell and have to choose from a set of similar alternative responses, they may resort to higher frequency options. Following Cain (1979), incorrect responses were divided into near misses (the label given was similar to the odor object, e.g., fennel for “anise”), and far misses (generic terms, e.g., spice for “anise”); and clearly incorrect responses (e.g., petrol for “anise”). Two independent judges classified participant’s responses (Cohen’s κ = .938). Disagreements were resolved by discussion, which led to a final list of terms counted as near misses used in the analysis. We expected more near misses for odors with low-frequency labels, as these would undergo more competition from similar responses in the word production process.
We compared the distribution of response types (hit; near miss; far miss) by label frequency (high; low)—see Table 4. There was a significant association between label frequency and response type, both for first responses, χ2(2) = 22.42, p < .001, and all responses, χ2(2) = 25.40, p < .001. In both cases, the odds of participants responding with a near miss (e.g., fennel for “anise”) was 2.73 times higher for odors with low-frequency labels than for odors with high-frequency labels.
Discussion
Overall, results from the naming task were in line with findings from previous olfaction studies: People seem to perform rather poorly when naming odors. Even when taking all responses into account, participants were, on average, able to correctly name only 26.3% of odors in total. However, the analyses also showed that certain odors were named correctly more easily than others. Factors related to the odor mattered: Odors with higher familiarity, odor frequency, and perceived edibility all contributed to correct naming. Even though the stimuli were chosen based on their widespread availability in the Netherlands—and therefore presumably familiar to Dutch native speakers—familiarity and odor frequency ratings still spanned the entire scale, demonstrating variation, nevertheless. Edibility was not used as a selection criterion in the study design, and most of the odor objects (19 out of 24) were edible. But it seems that edibility plays a role in identifying odors, and subsequently naming them. Some food objects were not recognized as edible: participants gave a food object the lowest edibility rating around 13% of the time. This resulted in misidentification, making perceived edibility—rather than generally accepted object edibility—the relevant factor in naming odors.
As predicted, psycholinguistic variables also contributed to correctly naming odors. Most importantly, odors with high-frequency labels were named correctly more often than odors with low-frequency labels upon first response. This confirms the hypothesis that there is a frequency effect at work in odor naming. This study is the first, to our knowledge, to demonstrate a label frequency effect for odor naming. The frequency effect only appeared for first responses, but not for responses produced thereafter, in line with results from picture-naming studies that show the frequency effect is short lived (e.g., Jescheniak & Levelt, 1994). However, this could be considered surprising, as odor perception itself has a longer time course than visual processing of pictures (Keetels & Vroomen, 2012; Khan & Sobel, 2004), which could mean that we would be less likely to uncover frequency effects for odor naming. But this was not the case. Characteristics of the odor label did influence odor naming. We matched odors for the length of their labels as well as concreteness ratings. However, there was a difference in the age of acquisition between conditions that was confounded with frequency. So the results from Experiment 1 could be explained as the result of the frequency of odor labels or the age at which the labels were learned. We come back to this after Experiment 2.
Based on the familiarity and frequency ratings of the two sets of odors (high vs. low label frequency), we believe the differences we find can indeed be attributed to properties of the odor label and not to characteristics of the odors themselves. As far as can be determined, the odors in our study were equally discriminable across frequency conditions (cf. Chrea, Valentin, Sulmont-Rossé, Hoang Nguyen, & Abdi, 2005). Some of the odors used in the current study appear in previous tests of odor identification and naming too (e.g., Doty, Shaman, & Dann, 1984; Hummel, Sekinger, Wolf, Pauli, & Kobal, 1997; Kobayashi, Saito, Kobayakawa, Deguchi, & Costanzo, 2006; Cho, Jeong, Lee, Hong, Yoon, & Kim, 2009)—this includes odors with high-frequency labels (coffee, mint, fish) as well as odors with low-frequency labels (anise, cinnamon, turpentine). Importantly, odors with low-frequency labels in our study were rated as both more familiar and more frequently occurring, so properties of the odor are not confounded with odor label properties.
If odor naming is influenced by the odor label (i.e., its frequency), do other psycholinguistic variables also play a role? Aside from word frequency, another factor that influences word production is semantic context. When semantically related pictures (e.g., animals or fruits) are presented consecutively (versus intermingled) for multiple trials, they are named slower (Damian et al., 2001; Kroll & Stewart, 1994), and with increased error rates (Vitkovitch, Humphreys, & Lloyd-Jones, 1993). While this effect has been shown for visual stimuli, we do not know whether it plays a role in other modalities, such as olfaction. It is possible there would be a higher likelihood of interference and problems in naming odors, since the difference between oranges and lemons, for example, is generally perceived to be smaller in smell than in vision (see Schab & Cain, 1991). This is likely to increase competition between related candidates, the cause of semantic interference (Levelt, Roelofs, & Meyer, 1999; Xavier-Alario, Segui, & Ferrand, 2000).
We therefore investigated whether semantic context also influences odor naming. We conducted a second experiment to investigate this issue. The dimensions along which odors are perceived are poorly understood. Even so, edibility is often shown to be important (Ayabe-Kanamura, Kikuchi, & Saito, 1997; Schiffman, Reynolds, & Young, 1981; and Experiment 1 above), so the semantic context we manipulated was edibility. Most semantic interference studies in the visual domain include categories like “food,” “fruit,” or “vegetables” (e.g., Costa, Alario, & Caramazza, 2005; Damian et al., 2001; Jescheniak, Matushanskaya, Mädebach & Müller, 2014; Kroll & Stewart, 1994). Interfering distractors in these studies are always other food items, whereas distractors from other categories are inedible objects. Based on these previous studies, we predicted consecutive odors would be named correctly more often when presented in a semantically unrelated context (i.e., an edible odor followed by an inedible one) than in a semantically related context (e.g., a series of edible odors).
Experiment 2
Method
Participants
Participants were 40 native speakers of Dutch (Mage = 24.3 years old, SD = 7.3, range: 18–65 years), recruited through the Radboud University participant system. There were 20 participants in each experimental condition, with equal numbers of males and females. Participants gave written consent before the experiment and were paid €15.
Stimuli
Twenty-four odors (see Appendix Table 11) were selected such that their veridical labels were either high or low frequency, but also that the words denoted either edible or inedible objects. The odors were presented in 500-mL opaque white plastic squeezy bottles. Participants could not see the odor object but could smell the odor by squeezing the bottle.
As in Experiment 1, log-label frequency was determined by combined occurrences in Dutch CELEX (Burnage, 1990), Dutch SUBTLEX (Keuleers et al., 2010), Spoken Dutch (Oostdijk, 2000), and OpenSoNaR (Oostdijk, Reynaert, Hoste, & Schuurman, 2013) corpora. High-frequency (Mlog(F) = 1.30 per million, SD = 0.38) and low-frequency labels (Mlog(F) = −0.23 per million, SD = 0.43) differed significantly from each other t(22) = 9.29, p < .001, d = 3.78. Veridical label frequency for edible and inedible odor objects did not differ significantly t(22) = 0.92, p = .368, d = 0.37. Word length (number of characters in Dutch spelling) did not differ significantly for high-frequency versus low-frequency labels t(22) = 1.957, p = .063, d = 0.80, nor for edible versus inedible odor objects t(22) = 0.258, p = .799, d = 0.11. There were also no significant differences in concreteness (Brysbaert et al., 2014) for high-frequency versus low-frequency labels t(22) = 2.052, p = .052, d = 0.84, nor edible versus inedible odor objects, t(22) = 1.625, p = .118, d = 0.66. Age of acquisition of the odor labels (Brysbaert et al., 2014) differed for high-frequency versus low-frequency labels t(22) = 5.287, p < .001, d = 5.37 (as in Experiment 1), but not for edible versus inedible odor objects t(22) = 0.168, p = .868, d = 0.07.
Procedure
The experiment consisted of three parts: a naming task, a rating task, and a demographic questionnaire. Participants rated the odors in a separate task this time, increasing the duration of the experiment as well as possible fatigue. We therefore had short breaks of approximately 10 minutes between tasks. The experiment took approximately 75 minutes in total.
In the naming task, participants were asked to name odors as in Experiment 1, with multiple responses permitted. There were two presentation conditions: odors were presented in either a random order or in two blocks of 12 odors based on their edibility. Odor presentation order was randomized between participants.
In the rating task, participants smelled the odors a second time—also in either random or blocked order—and rated the odors on intensity, familiarity, pleasantness, edibility, and odor frequency, as in Experiment 1. The order of rating scales was identical across trials and participants.
After the two main experimental tasks, participants were asked to complete a questionnaire about their personal and linguistic background. Participants were again instructed not to eat or smoke at least 1 hour before the experiment.
Results
Odor ratings
Participants’ rating scores for odors (see Table 5) were compared using a 2 × 2 within-participants ANOVA, with label frequency (high; low) and odor type (edible; inedible) as independent variables, and rating scores as the dependent variables. We also checked correlations between intensity and the order in which odors were presented, and between label frequency, familiarity rating, and odor frequency rating.
There was no difference in intensity ratings between food and nonfood odors, F(1,39) < 1, p = .461, nor between odors with high-frequency labels and low-frequency labels, F(1, 39) < 1, p = 927. There was also no significant correlation between intensity and the order in which the odor was presented r(24) = .200, p = .348, meaning participants did not show signs of olfactory fatigue. Food odors were rated as more edible than nonfood odors, F(1, 39) = 395.53, p < .001, ηp2 = .91, confirming the manipulation was successful. Food odors were also rated as more pleasant than nonfood odors, F(1, 39) = 25.22, p < .001, ηp2 = .39. Odors with low-frequency labels were rated as more pleasant, F(1, 39) = 146.99, p < .001, ηp2 = .79, and as more edible than those with high-frequency labels, F(1, 39) = 5.03, p = .031, ηp2 = .11.
Food odors were rated as more familiar, F(1, 39) = 7.51, p = .009, ηp2 = .16, and more frequently occurring than nonfood odors, F(1, 39) = 8.47, p = .006, ηp2 = .18. Odors with low-frequency labels were rated as more familiar than those with high-frequency labels, F(1, 39) = 5.23, p = .028, ηp2 = 12, but there was no difference in their odor frequency rating, F(1, 39) < 1, p = .939. There were no significant correlations between familiarity rating and the log-frequency of the odor label, r(22) = .06, p = .794, or between odor frequency rating and the log-frequency of the odor label, r(22) = .28, p = .188. However, there was a strong correlation between odor frequency rating and odor familiarity rating, r(22) = .75, p < .001.
As Experiment 2 used different stimuli, some of the rating scores (intensity, odor frequency) differed compared to Experiment 1. Nevertheless, findings from the rating task were largely in line with what we found in the first experiment, and as such, rating scores were again included in the modelling process.
Odor naming
The data were coded and analyzed as in Experiment 1. In the final model,Footnote 3 log-label frequency and semantic context (random; blocked) were added as fixed factors, with familiarity and edibility ratings as covariates. Including interactions did not significantly improve the model. The dependent variable was naming accuracy (correct; incorrect). We controlled for random participant and item effects.
For participants’ first responses (see Table 6), label frequency significantly predicted naming accuracy, β = 0.71, SE = 0.29, z = 2.44, p = .014. Odors with high-frequency labels (24.3%) were named correctly more often than those with low-frequency labels (17.6%). However, there was no difference in naming accuracy between the random (22.5%) and blocked (19.3%) semantic context conditions, β = −0.08, SE = 0.27, z = 0.30, p = .761; see Fig. 2. In addition, familiarity predicted naming accuracy: β = 0.57, SE = 0.11, z = 5.47, p < .001, as did edibility: β = 0.24, SE = 0.06, z = 3.86, p < .001.
For all responses (see Table 7), odors with high-frequency labels (28.9%) were also named correctly more often than those with low-frequency labels (24.4%) in all responses, β = 0.51, SE = 0.24, z = 2.14, p = .033. But again, there was no difference in naming accuracy between random (28.2%) and blocked (25.2%) conditions, β = −0.06, SE = 0.26, z = 0.24, p = .81; see also Fig. 2. There was also an effect of familiarity: β = 0.47, SE = 0.08, z = 5.60, p < .001, and edibility: β = 0.26, SE = 0.05, z = 4.84, p < .001.
Nonveridical responses
As in Experiment 1, incorrect responses were divided into hits, near misses (e.g., fennel for “anise”) and far misses (generic terms and clearly incorrect responses) by two independent judges (Cohen’s κ = .931). Disagreements were resolved by discussion.
We compared the distribution of response types (hit; near miss; far miss) by label frequency (high; low). In the random-order condition (see Table 8), there was a significant association between label frequency and response type, χ2(2) = 13.46, p < .005. Upon first response, the odds of participants responding with a near miss (e.g., fennel for “anise”) were 2.29 times higher for odors with low-frequency labels than for odors with high-frequency labels. The same pattern appeared when taking all responses into consideration, but this was not significant at the conventional level of significance χ2(2) = 5.46, p = .065.
In the blocked condition (see Table 9), however, there was no significant association between label frequency and the type of response examining first responses, χ2(2) = 3.26, p = .2, or all responses, χ2(2) = 1.36, p = .51.
General discussion
In both Experiment 1 and 2, we found a robust frequency effect demonstrating properties of the label matter for odor naming. At the same time, there was little evidence of semantic interference for odor naming. Blocking odors by edibility led to numerically lower correct naming (28% for random order vs. 25% for blocked order), but this was not statistically significant. As mentioned in the discussion of Experiment 1, the dimensions of odor perception are poorly understood, which is why we chose to investigate fairly broad semantic categories. This is in line with previous picture-naming studies that also used superordinate categories, such as animals. However, it is possible stimuli from such broad semantic categories are too diverse to adequately capture semantic interference in odor naming. Even so, semantic categories that have been used in picture naming and word naming studies (e.g., animals, tools, professions) are usually not suitable for odor experiments, as most of the items in these groupings do not typically have an odor, so this remains a challenge for future work along these lines.
We do not believe the lack of a semantic interference effect is due to a failure of odor identification. Although we only test naming ability, interference effects are predicted to come from edibility characteristics of the odor object, and the rating data from Experiment 2 show that participants have no problem judging edibility. This is in line with previous research (Fusari & Ballesteros, 2008) and means that, in principle, the experimental manipulation in itself should have worked.
There are differences in production between semantically related words (e.g., lemon and orange) and associatively related words (e.g., lemon and juice). Whereas semantically related words suffer from interference in production, associatively related words enjoy facilitation (e.g., Xavier-Alario et al., 2000). As there was no difference between conditions, it might be the case that some labels were both semantically and associatively related (e.g., coffee and tea), thereby neutralizing interference and facilitation effects. If there are interference or facilitation effects at work in odor naming, they might be revealed with a more limited set of odors specifically selected for either semantic relatedness or associative relatedness, with label frequency and other psycholinguistic variables balanced.
Analysis of nonveridical responses also revealed some interesting patterns. In some cases, responses were actually more specific than the predetermined veridical labels (e.g., green tea instead of just tea), which might reflect the speaker’s intention to be as informative as possible (Grice, 1975; Levelt, 1996). In others, a particular response classified as a near miss (erroneous, yet appropriate) was used by a large number of participants. Overall, odors with low-frequency labels were named with a near miss label more often than odors with high-frequency labels. For example, we used bleach to represent chloor (chlorine) in Experiment 1, a label with a higher word frequency (log(F) = 0.13 per million) than the actual source term bleekmiddel (log(F) = −0.65 per million; veridical in Experiment 2). In the two experiments combined, chloor was used by 53 out of 83 participants, whereas bleekmiddel was used by only three participants. This suggests predetermined target labels (e.g., based on the odor source) might not always be considered the conventional label by the speech community (cf. Dubois, 2000), who instead might opt for the an alternative, frequently occurring label, if it describes the odor adequately.
In these studies, we focused on the role of odor label frequency on odor naming. However, a number of psycholinguistic variables correlate strongly with one another: high-frequency words tend to be shorter, are acquired earlier, and are higher in concreteness (e.g., Brysbaert et al., 2014). Indeed, in both Experiment 1 and Experiment 2, high-frequency and low-frequency conditions also differed in age of acquisition, although not in length or concreteness. So, the current studies leave open the possibility that the psycholinguistic effects demonstrated herein are related to age of acquisition of words as well as frequency. Ultimately, both are likely to be of importance (cf. Brysbaert, Lange, & Van Wijnendaele, 2010).
Overall, then, the results of this study demonstrate that it is important to consider properties of the lexicon, alongside properties of odors, when investigating olfactory language and cognition. These results have broader implications too. As mentioned, Majid and Burenhult (2014) showed there is a difference in the odor-naming ability between speakers of Jahai and English. The current study sheds possible new light on why this difference exists; Perhaps Jahai speakers talk about odors more frequently than English speakers do. There is no direct evidence for this proposal, but San Roque et al. (2015) compared the frequency of perception verbs (e.g., look, hear, touch, taste, smell) in 13 diverse languages and found smell verbs were more frequent in Semai (a language closely related to Jahai) than in any other language, suggesting that smell is talked about more often. So, in principle, word frequency is a possible proximate explanation for why smells are easier to name for the Jahai, and, conversely, more difficult to name for speakers of Standard Average European languages (cf. Köster, Møller, & Mojet, 2014).
Moreover, various olfaction tests have been used in clinical settings—for example, the University of Pennsylvania Smell Identification Test (UPSIT]; Doty et al., 1984) and the Sniffin’ Sticks Identification Test (Hummel et al., 1997), and efforts have been made to create culturally appropriate tests as well (e.g., the Odor Stick Identification Test for Japanese [OSIT-J]: Saito et al., 2006; Barcelona Smell Test–24 [BAST-24]: Cardesín et al., 2006; Italian Olfactory Identification Test [IOIT]: Maremmani et al., 2012). Such tests often use a forced-choice format, where the participant has to choose which of, for example, four different labels applies to an odor. Much consideration has been given to how the test set is constructed because it is known that people make more errors when the alternate choices come from a related rather than an unrelated semantic category (e.g., Engen, 1987; Goubet, McCall, Ducz, & Bingham, 2014), and it has been postulated such errors might even have a chemical basis, since related entities may share chemical compounds (Fjaeldstad, Peterson, & Oversen, 2017). Now that we have shown that properties of the lexicon play a role in odor naming, such tests can be further improved by taking these results into consideration so as to enable more control over inadvertent factors that influence test performance. For example, high-frequency labels may become inadvertent false lures in such tests; and when tests are translated from language to language, descriptors ought to be matched on psycholinguistic variables so as to avoid inadvertent confounds (cf. Fjaeldstad et al., 2017).
To conclude, we show that odor naming is influenced by word frequency—a factor previously ignored in the olfactory literature—and, at the same time, demonstrate that frequency effects are relevant beyond picture naming, the mainstay of the language production literature. So properties of the odor label are just as important to consider as properties of the odor itself in olfaction research; and psycholinguists should consider how language interfaces with all perceptual modalities, not just vision (Levinson & Majid, 2014).
Notes
The initial model also included odor intensity and odor pleasantness. However, these factors did not significantly improve the model, all ps > .05.
Odor familiarity and odor frequency were strongly correlated, but both factors were included following Wurm & Fisicaro (2014).
The initial model also included odor intensity, odor pleasantness, and odor frequency. However, these factors did not significantly improve the model, all ps > .05.
References
Ayabe-Kanamura, S., Kikuchi, T., & Saito, S. (1997). Effect of verbal cues on recognition memory and pleasantness evaluation of unfamiliar odors. Perceptual and Motor Skills, 85(1), 275–285.
Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.
Barry, C., Morrison, C.M., & Ellis, A.W. (1997). Naming the Snodgrass and Vanderwart pictures: Effects of age of acquisition, frequency and name agreement. Quarterly Journal of Experimental Psychology, 50(3), 560-585
Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics using R. Cambridge: Cambridge University Press.
Brysbaert, M., Lange, M., & van Wijnendaele, I. (2010). The effects of age-of-acquisition and frequency-of-occurrence in visual word recognition: Further evidence from the Dutch language. European Journal of Cognitive Psychology, 12(1), 65–85.
Brysbaert, M., Stevens, M., De Deyne, S., Voorspoels, W., & Storms, G. (2014). Norms of age of acquisition and concreteness for 30,000 Dutch words. Acta Psychologica, 150, 80–84.
Burenhult, N., & Majid, A. (2011). Olfaction in Aslian ideology and language. The Senses & Society, 6(1), 19–29.
Burnage, G. (1990). CELEX: A guide for users. Nijmegen: Centre for Lexical Information.
Cain, S. W., Stevens, J. C., Nickou, C. M., Giles, A., Johnston, I., & Garcia-Medina, M. R. (1995). Life-span development of odor identification, learning, and olfactory sensitivity. Perception, 24(12), 1457–1472.
Cain, W. S. (1979). To know with the nose: Keys to odor identification. Science, 203(4379), 467–470.
Cain, W. S. (1982). Odour identification by males and females: Predictions vs. performance. Chemical Senses, 7(2), 129–142.
Cardesín, A., Alobid, I., Benítez, P., Sierra, E., de Haro, J., Bernal-Sprekelsen, M., … Mullol, J. (2006). Barcelona Smell Test–24 (BAST-24): Validation and smell characteristics in the healthy Spanish population. Rhinology, 44(1), 83–89.
Carroll, J.B. & White, M.N. (1973) Word frequency and age of acquisition as determiners of picturenaming latency. Quarterly Journal of Experimental Psychology, 25(1), 85-95.
Cho, J. H., Jeong, Y. S., Lee, Y. J., Hong, S. C., Yoon, J. H., & Kim, J. K. (2009). The Korean version of the Sniffin’ stick (KVSS) test and its validity in comparison with the cross-cultural smell identification test (CC-SIT). Auris Nasus Larynx, 36(3), 280–286.
Chrea, C., Valentin, D., Sulmont-Rossé, C., Nguyen, D. H., & Abdi, H. (2005). Semantic, typicality and odor representation: A cross-cultural study. Chemical Senses, 30(1), 37–49.
Costa, A., Alario, F. X., & Caramazza, A. (2005). On the categorical nature of the semantic interference effect in the picture-word interference paradigm. Psychonomic Bulletin & Review, 12(1), 125–131.
Damian, M. F., Vigliocco, G., & Levelt, W. J. M. (2001). Effects of semantic context in the naming of pictures and words. Cognition, 81(3), B77–B86.
Desor, J. A., & Beauchamp, G. K. (1974). The human capacity to transmit olfactory information. Perception & Psychophysics, 16(3), 551–556.
Distel, H., & Hudson, R. (2001). Judgement of odor intensity is influenced by subjects’ knowledge of the odor source. Chemical Senses, 26(3), 247–251.
Doty, R. L., Shaman, P., & Dann, M. (1984). Development of the University of Pennsylvania Smell Identification Test: A standardized microencapsulated test of olfactory function. Physiology & Behavior, 32(3), 489–502.
Dubois, D. (2000). Categories as acts of meaning: The case of categories in olfaction and audition. Cognitive Science Quarterly, 1, 35–68.
Engen, T. (1987). Remembering odors and their names. American Scientist, 75(5), 497–503.
Fjaeldstad, A., Petersen, M. A., & Ovesen, T. (2017). Considering chemical resemblance: A possible confounder in olfactory identification tests. Chemosensory Perception, 10(1/2), 42–48.
Fusari, A., & Ballesteros, S. (2008). Identification of odors of edible and nonedible stimuli as affected by age and gender. Behavior Research Methods, 40(3), 752–759.
Gilad, Y., Man, O., Pääbo, S., & Lancet, D. (2003). Human specific loss of olfactory receptor genes. Proceedings of the National Academy of Sciences of the United States of America, 100(6), 3324–3327.
Grice, P. (1975). Logic and conversation. In Cole, P. & J.L. Morgan (Editors), Syntax and semantics Speech acts. New York: Academic Press.
Goubet, N., McCall, D. D., Ducz, J. K., & Bingham, M. L. (2014). Semantic context facilitates odor identification in children and adults. Developmental Psychobiology, 56(3), 592–598.
Hummel, T., Sekinger, B., Wolf, S. R., Pauli, E., & Kobal, G. (1997). ‘Sniffin’ Sticks’: Olfactory performance assessed by the combined testing of odor identification, odor discrimination and olfactory threshold. Chemical Senses, 22(1), 39–52.
Jaeger, T. F. (2008). Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models. Journal of Memory and Language, 59(4), 434–446.
Jescheniak, J. D., & Levelt, W. J. M. (1994). Word frequency effects in speech production: Retrieval of syntactic information and of phonological form. Journal of Experimental Psychology: Learning, Memory, and Cognition, 20(4), 824–843.
Jescheniak, J. D., Matushanskaya, A., Mädebach, A., & Müller, M. M. (2014). Semantic interference from distractor pictures in single-picture naming: Evidence for competitive lexical selection. Psychonomic Bulletin & Review, 21(5), 1294–1300.
Jönsson, F. (2005). Olfactory metacognition: A metamemory perspective on odor naming. Acta Universitatis Upsaliensis. Uppsala, Sweden: Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences 6.
Jönsson, F. U., & Stevenson, R. J. (2014). Odor knowledge, odor naming and the “tip of the nose” experience. In B. W. Schwartz & A. S. Brown (Eds.), Tip of the tongue states and related phenomena (pp. 306–326). Cambridge: Cambridge University Press.
Keuleers, E., Brysbaert, M., & New, B. (2010). SUBTLEX-NL: A new frequency measure for Dutch words based on film subtitles. Behavior Research Methods, 42(3), 643–650.
Keetels M., & Vroomen J. (2012). Perception of synchrony between the senses. In M.M. Murray & M.T. Wallace (Eds.), Frontiers in the neural basis of multisensory processes, pp. 147-177. London: Taylor & Francis.
Khan, R. M., & Sobel, N. (2004). Neural processing at the speed of smell. Neuron, 44(5), 744–747.
Klapp, S. T., Anderson, W. G., & Benian, R. W. (1973). Implicit speech in reading, reconsidered. Journal of Experimental Psychology, 100(2), 368-37.
Kobayashi, M., Saito, S., Kobayakawa, T., Deguchi, Y., & Costanzo, R. M. (2006). Cross-cultural comparison of data using the odor stick identification test for Japanese (OSIT-J). Chemical Senses, 31(4), 335–342.
Köster, E. P., Møller, P., & Mojet, J. (2014). A “Misfit” Theory of Spontaneous Conscious Odor Perception (MITSCOP): Reflections on the role and function of odor memory in everyday life. Frontiers in Psychology, 5(64), 1–12.
Kroll, J. F., & Stewart, E. (1994). Category interference in translation and picture naming: Evidence for asymmetric connections between bilingual memory representations. Journal of Memory and Language, 33(2), 149–174.
Laska, M., Seibt, A., & Weber, A. (2000). ‘Microsmatic’ primates revisited: Olfactory sensitivity in the squirrel monkey. Chemical Senses, 25(1), 47–53.
Lawless, H. T. (1978). Recognition of common odors, pictures, and simple shapes. Perception & Psychophysics, 24(6), 493–495.
Lawless, H. T., & Cain, W. S. (1975). Recognition memory for odors. Chemical Senses & Flavour, 1(3), 331–337.
Levelt, W. J., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences, 22(1), 1–38.
Levelt, W. J. M. (1996). A theory of lexical access in speech production [Abstract]. In COLING-Proceedings (Vol. 1, pp. 3). Copenhagen: Center for Speechtechnologie.
Levelt, W. J. M. (1999). Models of word production. Trends in Cognitive Sciences, 3(6), 223–232.
Levinson, S. C., & Majid, A. (2014). Differential ineffability and the senses. Mind & Language, 29(4), 407–427.
Lorig, T. S. (1999). On the similarity of odor and language perception. Neuroscience & Biobehavioral Reviews, 23(3), 391–398.
Majid, A. (2015). Cultural factors shape olfactory language. Trends in Cognitive Sciences, 19(11), 629–630.
Majid, A., & Burenhult, N. (2014). Odors are expressible in language, as long as you speak the right language. Cognition, 130(2), 266–270.
Majid, A., Speed, L., Croijmans, I., & Arshamian, A. (2017). What makes a better smeller? Perception, 46(3-4), 406-430.
Maremmani, C., Rossi, G., Tambasco, N., Fattori, B., Pieroni, A., Ramat, S., … Meco, G. (2012). The validity and reliability of the Italian Olfactory Identification Test (IOIT) in healthy subjects and in Parkinson’s disease patients. Parkinsonism & Related Disorders, 18(6), 788–793.
McGann, J. P. (2017). Poor human olfaction is a 19th-century myth. Science, 356(6338), eaam7263. https://doi.org/10.1126/science.aam7263
Meyer, A. S., Roelofs, A., & Levelt, W. J. M. (2003). Word length effects in object naming: The role of a response criterion. Journal of Memory and Language, 48(1), 131-147.
Morrison, C. M., Chappell, T. D., & Ellis, A. W. (1997). Age of acquisition norms for a large set of object names and their relation to adult estimates and other variables. Quarterly Journal of Experimental Psychology, 50A, 528–559.
Oldfield, R. C., & Wingfield, A. (1965). Response latencies in naming objects. Quarterly Journal of Experimental Psychology, 17(4), 273–281.
Olofsson, J. K., & Gottfried, J. A. (2015). The muted sense: Neurocognitive limitations of olfactory language. Trends in Cognitive Sciences, 19(6), 314–321.
Oostdijk, N. (2000). Het Corpus Gesproken Nederlands [Spoken Dutch Corpus]. Nederlandse Taalkunde, 5(3), 280–284.
Oostdijk, N., Reynaert, M., Hoste, V., Schuurman, I. (2013) The construction of a 500 million word reference corpus of contemporary written Dutch. In P. Spyns & J. Odijk (Eds.), Essential speech and language technology for Dutch: Results by the STEVIN-programme. Berlin: Springer Verlag.
R Core Team. (2013). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. Retrieved from http://www.R-project.org
Rabin, M. D. (1988). Experience facilitates olfactory quality discrimination. Perception and Psychophysics, 44(6), 532–540.
Rabin, M. D., & Cain, W. S. (1984). Odor recognition: Familiarity, identifiability, and encoding consistency. Journal of Experimental Psychology: Learning, Memory, and Cognition, 10(2), 316–325.
Saito, S., Ayabe-Kanamura, S., Takashima, Y., Gotow, N., Naito, N., Nozawa, T., Mise, M., Deguchi, Y., & Kobayakawa, T. (2006). Development of a smell identification test using a novel stick-type odor presentation kit. Chemical Senses, 31(4), 379–391.
San Roque, L., Kendrick, K. H., Norcliffe, E. J., Brown, P., Defina, R., Dingemanse, M. … Majid, A. (2015). Vision verbs dominate in conversation across cultures, but the ranking of non-visual verbs varies. Cognitive Linguistics, 26(1), 31–60.
Schab, F. R., & Cain, W. S. (1991). Memory for odors. In D. G. Laing, R. L. Doty, & W. Breipohl (Eds.), The human sense of smell (pp. 217–240). Berlin: Springer.
Schiffman, S. S., Reynolds, M. L., & Young, F. W. (1981). Introduction to multidimensional scaling. Orlando: Academic Press.
Shepherd, G. M. (2004). The human sense of smell: Are we better than we think? PLoS Biology, 2(5), 572–575.
Strain, E., Patterson, K., & Seidenberg, M. S. (1995). Semantic effects in single-word naming. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21(5), 1140–1154.
Vitkovitch, M., Humphreys, G. W., & Lloyd-Jones, T. J. (1993). On naming a giraffe a zebra: Picture naming errors across different object categories. Journal of Experimental Psychology: Learning, Memory, and Cognition, 19(2), 243–259.
Wijk, R. A., & Cain, W. S. (1994). Odor identification by name and by edibility: Life-span development and safety. Human Factors, 36(1), 182–187.
Wurm, L.H. & Fisicaro, S.A. (2014). What residualizing predictors in regression analyses does (and what it does not do). Journal of Memory and Language, 72, 37-48
Xavier-Alario, R., Segui, J., & Ferrand, L. (2000). Semantic and associative priming in picture naming. The Quarterly Journal of Experimental Psychology Section A: Human Experimental Psychology, 53(3), 741–764.
Yeshurun, Y., & Sobel, N. (2010). An odor is not worth a thousand words: From multidimensional odors to unidimensional odor objects. Annual Review of Psychology, 61(1), 219–241.
Acknowledgements
This work was funded by The Netherlands Organization for Scientific Research: NWO VICI grant Human Olfaction at the Intersection of Language, Vulture and Biology. Special thanks to Patricia Manko for help with data collection, to Ilona Plug for help with data processing, and to Artin Arshamian, Ilja Croijmans, Carolyn O’Meara, Laura Speed, and Josje de Valk for comments on an earlier version of this paper.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1
Appendix 2
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Huisman, J.L.A., Majid, A. Psycholinguistic variables matter in odor naming. Mem Cogn 46, 577–588 (2018). https://doi.org/10.3758/s13421-017-0785-1
Published:
Issue Date:
DOI: https://doi.org/10.3758/s13421-017-0785-1