Keywords

1 Introduction

Our long-term knowledge of language requires us to store information about the words of our language—what they sound like (phonology) , what they mean (semantics), and how they are combined to construct utterances (syntax). This array of information is made available to us when we hear or see words. When native speakers of English, for example, hear the word walk, they will recognize it as a phonological form they have encountered before, and will recover both its meaning and its syntactic characteristics (as a verb or a noun), where walk as a noun behaves differently from walk as a verb. But many English surface forms—and even more so in other languages—are more complex than the simple form walk, as in the inflectional variants walks, walking, walked. The question therefore arises about the kind of representation used and accessed when such complex forms are read. Does each of these complex forms have its own stored representation, or are they analyzed into their stems (e.g., walk) and their respective suffixes (e.g., ~s, ~ing and ~ed)? Similar questions arise with regard to the representation of complex derivational forms, such as walker or walkable, as well as the issue of whether semantic factors play a role in the choice of decomposed or full-form representations.

These questions become particularly relevant when considering languages like Arabic where at least two views contend to account for how complex forms are represented in the lexicon and processed on-line . The first is the traditional root and pattern view, originating with the medieval Arabic lexicographers, taken on board with some important modifications by structuralist linguists (Cantineau, 1950a, b; Cohen 1961; Cohen 1951) , and ultimately formalized in terms of autosegmental phonology (McCarthy 1979, 1981, 1982) . The second is the stem-based or word-based view, motivated by certain theoretical hurdlesfaced by the root and pattern model on the one hand, and on the other by the pressure felt by theorists to account for Semitic morphology using the same set of universal constraints or rules applied across the world languages (Benmamoun 1998, 2003; Gafos 2003; Heath 1987, 1997, 2003; Ratcliffe 1998, 2004) .

1.1 The Root and Pattern Model

There are at least three distinct versions of the root and pattern model which differ either in terms of the number of morphemic units they posit and/or in terms of the way surface word forms are thought to be built. According to the oldest version of the model which we owe to the medieval Arab grammarians, the workings of the morphological system hinge on two morphemes: a consonantal root which conveys a broad semantic meaning, and a vocalic word pattern which conveys non-referential aspects of meaning such as perfective, or active. These two units are interleaved to build a deverbal noun stem (called ma dar).Footnote 1 However, the derivation of all the other surface forms does not involve root and pattern combination, but proceeds on the bases of the ma dar using different morpho-phonological procedures such as prefixation, infixation, and vowel deletion or insertion. To illustrate, the root QTL ‘kill’ for example is initially combined with the pattern faʕ l (CaCC) to form the stem QaTL ‘killing’. However, to build the place noun maQTaL, the root QTL is not mapped onto the pattern maCCaC; instead the perfective verb QaTaL ‘he killed’ is derived from the stem QaTL ‘killing’ by vowel stem modification. The imperfective form yaQTuL is then derived from the perfective QaTaL, and is in turn used to derive the form maQTaL (Bohas and Guillaume 1984) . Thus, the relationship between the root QTL and a complex form like maQTaL is remote and mediated by three surface forms. In contrast to this, the second version of the root and pattern view, put forward by structuralistlinguists such as Cantineau (1950a, b) and Cohen (1951) , and advocated more recently by lexicographers like Hilaal (1990) , conceives of every surface form as a combination of a root and a word pattern , and of the lexicon as a repository of roots and patterns with a set of rules to associate them. On this view, the difference between QaTL and maQTaL is that the root QTL is mapped onto the pattern CaCC in the first, but onto maCCaC in the second.

The third instantiation of the root and pattern model is developed within the framework of autosegmental phonology (McCarthy 1979, 1981, 1982) . Here Arabic morphology is thought to operate with three morphemes : a consonantal root still believed to convey the core semantic meaning, a vocalic melody conveying morpho-syntactic meaning such as active-passive, and a CV-Skeleton that contributes morpho-syntactic information as well as determining the phonological structure of the surface form . According to this model, a form like KaTaM ‘remained silent’ is comprised of the root KTM, the vocalic melody a, and the CV-Skeleton CVCVC. A more complex form like maKTaB ‘office’ is analyzed into a locative prefix ma~, a consonantal root KTB, a vocalic melody a, and a CV-Skeleton CVCCVCVC. So much like the structuralist approach, McCarthy’s model entails that the root, the vocalic melody and the CV-Skeleton are combined to derive every surface form, although in later developments of his theory some word formation processes like broken plural and diminutive formation do not operate on roots, CV-Skeletons, and vocalic melodies, but on prosodically defined portion of the input (McCarthy and Prince 1990) .

Despite the differences between the three versions of the root and pattern model, there is an interesting unity underlying their apparent diversity. Specifically, they all assign a morphemic status to the root and the pattern, whether the latter is viewed as a unitary construct, as in the Arab grammarians’ and the structuralists’ approaches, or as a composite construct consisting of a vocalic melody and a CV-Skeleton as in McCarthy’s approach.The morphemic status of roots and patterns hinges on two kinds of argument. The first is distributional, based on the observation that consonantal roots and word patterns appear in many words with overlapping meanings. For example, the root KTB surfaces in 31 forms all of which, save one form KaTɩ ˥ Bah ‘squadron’, revolve around the general meaning of writing inherent in this root, while the word pattern maCCaC appears in hundreds of surface forms, most of which are place nouns (e.g., maDXaL ‘inlet’, maKTaB ‘office’, maSBaĦ ‘swimming pool’. A second type of argument derives from the patterning of certain co-occurrence restrictions, which apply to the consonants of the root but not to affixal consonants. For example, the first and second consonant of the root are generally neither identical nor homorganic such that roots like *SSM or *BMS are very rare (Frisch and Adnan Zawaydeh 2001) . The same constraint applies, if less stringently, to first and third consonants with very few roots like *KBK or *MTM. The second and third consonant can be identical (e.g. MDD), but not homorganic (e.g., *MDT). Since these co-occurrence restrictions can be stated in terms of the root consonants, while intervening vowels and affixal consonants are ignored, this argues for a level of representation at which the consonantal root functions as an independent entity (Greenberg 1950; McCarthy 1981) .

The morphemic status assigned to roots and patterns sets this class of models in sharp contrast with the more recent stem-based approach. We will not be juggling with the three versions of the root and pattern approach in the remaining part of this paper. Instead we will use the term root and pattern model to encompass only the structuralist version (Cantineau 1950a, b; Cohen 1961; Cohen 1951) and McCarthy’s version (McCarthy 1979, 1981, 1982) . We do this for a number of reasons: first because these two versions suffer from similar problems relating to the derivation of certain forms such as the broken plurals, diminutive nouns, or place nouns, which have either directly (McCarthy’s version) or indirectly (structuralist version) motivated the emergence of stem-based or word-based accounts. Second, the most significant difference between the structuralists’ view and McCarthy’s view relates to McCarthy’s fractionation of the word pattern into a CV-Skeleton and vocalic melody. However, in previous psycholinguistic research we have found only partial evidence for the parsing of the word pattern into further components (Boudelaa and Marslen-Wilson 2004) . Therefore, we take the two versions to be cognitively equivalent. Third, both versions provide similar predictions regarding the cognitive relevance of the root and the pattern and these can be clearly pitted against those derived from a stem-based or word-based approach.

1.2 The Stem-Based/Word-Based Model

Like the root and pattern approach, the stem-based model is not a homogenous approach, but has a number of different versions. For instance, Heath (1987, 1997, 2003) draws a distinction between lexical representations , morphological derivation, and lexical processing and argues, on the basis of observations such as the above, that as far as lexical representation and morphological derivation are concerned, the consonantal root is best “consigned to oblivion” (Heath 2003, p. 115) . There is no principled way, according to Heath (2003) , to segregate consonants and vowels and assign them to different levels of representation. This is because the word patterns , or ablaut templates as he refers to them, cannot be said to contribute any grammatical information in many cases. Stems such as XuBZ ‘bread’, KaLB ‘dog’ and SiLM ‘peace’ abound in the language, yet their respective word patterns CuCC, CaCC and CiCC do not convey any grammatical information. On this view the stem is taken to be the singular form for nouns (e.g., KaLB ‘dog’, QaMaR ‘moon’, Ba:B ‘door’) and the imperfective form for verbs (e.g., KTuB, XRuJ). Where lexical processing is concerned however, Heath acknowledges that “root-like strings are extracted from input representations […] but these extracted consonantal sequences do not correspond exactly to the traditionally recognized roots, particularly where vowels and semi-vowels are concerned” (Heath 2003, p. 126–128) . Accordingly he speculates that an input like KaWwiN ‘bring into being, causative’ is initially decomposed into the causative word pattern CaCCɩC and the root KWN; subsequently this root is identifiedas the stem Ku N ‘be, imperfective’, based on the fact that medial geminated /w/ in causative is usually the result of mapping a vowel /u/ onto a CC slot (Heath 2003, p. 128) .

Other stem-based accounts dispense with roots and patterns altogether (Benmamoun 1998, 2003; Ratcliffe 1998, 2004) . For instance, Ratcliffe (2004) suggests a sonority-based mechanism serving to strip off affixes and recover the stem, and although he is not explicit about the tripartite distinction between derivation, lexical representation and processing, his analysis carries unmistakable overtones that the stem is the pivotal element governing all three domains. Similarly, Benmamoun (1998, 2003) defines the primitive of Arabic word formation processes -and by extension of Arabic word processing- as the imperfective stem . He argues that a form like muʕ aLLim ‘teacher’ is built not by mapping the root ʕLM onto the pattern muCaCCiC as the root and pattern theory would argue, but by appending the prefix mu~ to the imperfective stem ʕaLLim .Footnote 2

The emphasis on the stem as the unit of morphological representation and processing allies these accounts with the Generalized Template Theory (GTT), which suggests that constraints dictating minimal and maximal prosodic word length guide word formation in Semitic languages (McCarthy and Prince 1990; Ussishkin 2000, 2005) . On this account word formation processes operate on existing words to derive new words and this is achieved by allowing word formation rules to adjust the structure of existing stems as necessary to produce the desired output. For instance, the Hebrew form GiDeL ‘he raised’ is thought to be derived by over writing the vowels -a-a- of the lexically stored stem GaDaL ‘he grew up’ without the root GDL ever being accessed as an independent element (Ussishkin 2005) . Thus words with the CaCeC pattern are the primitive of morphological processing and representation in Hebrew, and presumably, those with the CaCaC pattern would serve as such a primitive for Arabic (Ussishkin 2005) .

The foregoing paragraphs underline the heterogeneity of the stem-based approach. Not only are different stems posited by different theorists, but some of these theorists roundly reject the root and pattern as relevant morphological units (Benmamoun 2003; Ratcliffe 2004; Ussishkin 2005) , while others concede a role for the root or a root-like unit in language processing (Heath 2003) . In order to be able to adjudicate between the stem-based account and the root and pattern account, we will focus on the imperfective stem-based version as developed by Benmamoun (1998, 2003) . There are three reasons for this: first, the imperfective stem version provides a unified treatment of morphological derivation arguing that both verbs and nouns can be built from the appropriate imperfective stem. Second, this account emphatically rules out any functions for the root or the pattern, and so stands in sharp contrast with the classic root-pattern model. Third, it shares with other instantiations of the stem-based account the goal of aligning Semitic languages with the rest of the world languages. Consequently, if the imperfective-stem account is strained by psycholinguistic data, this will to a large extent apply to other instantiations of the stem view .

1.3 Empirical Questions

Against this linguistic backdrop, the question we ask here, using experimental psycholinguistic and cognitive neuroscience techniques, concerns the nature of the lexical representation used and accessed by native Arabic speakers as they read or hear Arabic words . Is a form like maSBaĦ ‘swimming pool’ processed as a prefix ma~ and an imperfective stem SBaĦ, or as a word pattern maCCaC and a root SBH? To evaluate the predictions of these two approaches we bring to bear data from behavioral experiments , pathological data from aphasic patients, and recent imaging data using event related potentials. Because most of the available behavioral data are based on the priming task, we start by giving a brief description of this technique and its rationale.

1.3.1 What is Priming?

In a typical priming experiment , words are presented in pairs. The first member of the pair is called the prime, the second the target, and participants are usually instructed to make a lexical decision about the target (i.e., decide whether it is a word of the language or not). The relationship between the prime and target can be varied depending on the goals of the study. Prime and target may, for example, share morphological elements (e.g., happiness/DARKNESS), orthographic (e.g., mile/MILL) and/or phonological properties (e.g. quay/KEY), or simply be semantically related (e.g., pledge/OATH). Priming is said to occur when the timed response to the target (e.g., DARKNESS) is affected—either speeded up or slowed down—as a consequence of having previously encountered a related prime (e.g., happiness), relative to responses following an unrelated prime (e.g., faithful). The most common interpretation of priming is that the mental representations of the prime and target are interconnected or overlap in such a way that activating the representation of a prime word either activates the representation of the target word (Forster 1999; Neely 1991) , or activates the representations of lexical or morphemic competitors.

In research investigating whether the mental lexicon is organized in terms of the phonetic word or the morpheme , two versions of the priming technique—cross-modal and masked priming—have been used extensively.

Cross-modal priming: here the prime and target are distinct perceptual events, with a visual target presented immediately at the offset of an auditory prime (Marslen-Wilson et al. 1994) . Since the prime and target are in different sensory modalities, priming is thought to occur at the more abstract level of the lexical entry, since it is here that prime and target overlap, rather than at lower, more modality-specific levels of representation. Cross-modal morphological priming in Indo-European languages like English has been found to be contingent on the prime and target sharing either a stem and a transparent semantic link (e.g., darkness/DARK) or a productive affix (e.g., happiness/DARKNESS). Words that are related purely on historical linguistic grounds (e.g., department/DEPART) fail to prime (Marslen-Wilson et al. 1994) . This result is theoretically captured by hypothesizing that priming between pairs like happiness/DARKNESS and government/GOVERN reflects repeated access to the same underlying lexical entry, while absence of priming among pairs of words like department/DEPART reflects access to different and unrelated entries, as illustrated in Fig. 2.1 Footnote 3 (Marslen-Wilson and Zhou 1999) .

Fig. 2.1
figure 1

Illustration of how input forms sharing a morpheme map onto the same underlying representations, both for cases of a free stem as in darkness/dark and happiness/happy and a bound suffix as in darkness/happiness. Forms sharing a stem but no semantics map onto different representations. (Adapted from Marslen-Wilson and Zhou 1999)

On this view of priming, words sharing only form (e.g., electrode/ELECT; tinsel/TIN) do not show facilitatory priming effects, but instead compete with each other, leading to slower responses relative to the baseline. The prime electrode, for example, does not contain the morpheme {elect}, so hearing it as a prime will activate its own lexical entry. This will generate interference with the subsequent recognition of the target ELECT (Gaskell and Marslen-Wilson 2002) . For pairs that are only semantically related (e.g., pledge/OATH; curve/BEND), cross-modal priming seems to be an elusive phenomenon (Gaskell and Marslen-Wilson 2002; Marslen-Wilson et al. 1996) . It is only observed when the prime is unambiguous, and even so it decays quickly, suggesting that sharing facilitatory links between distinct lexical entries is not as effective a mechanism for priming as accessing the same shared entry.

Masked priming: in this version of the priming paradigm both the prime and the target are visually presented. However, participants are typically not aware that a prime is present at all, since it is presented very briefly (50 ms), and is sandwiched between a forward pattern mask (often a series of hash marks) and a backward mask, the target itself. Recent masked priming research suggests that morphological decomposability alone determines priming in this task. For example, the word corner, which is morphologically simple but which is potentially parsable into corn + ~er, is found to facilitate the processing of the stem CORN, even though the two words are not in fact either morphologically or semantically related (Rastle et al. 2000) . This suggests that in languages like English, masked priming does not tap into processes occurring at central levels of representation, but into early processes of word segmentation that apply to any potentially morphologically decomposable input regardless of meaning.

Cross-modal and masked priming can be seen as complementary techniques that track on-line processing at different stages, with cross-modal priming tapping into stored long-term representations, and masked priming providing a window into the early stages of lexical processing when visual input is segmented into morphemes. Should one take this view as a starting point for thinking about how the Arabic mental lexicon is organized, then a number of questions arise . First, do we see priming between Arabic words that just share a root (e.g., maKTaB/KiTa:B ‘office’/‘book’)? Do words sharing a word pattern (e.g., FaRraQ/KaSsaR ‘scatter’/‘smash’) also show priming? Is root priming modulated by the transparency of the semantic relationship between prime and target such that transparent pairs (e.g., maKTaB/KiTa:B ‘office’/‘book’) prime, but opaque ones (e.g., KaTi:Bah/KiTa:B ‘squadron’/‘book’) do not? Are the effects of priming likely to vary depending on whether we use masked or unmasked primes? As discussed below, stem-based and root-based approaches make different predictions here.

1.4 Priming Evidence for Roots and Word Patterns

Cross-modal priming with roots: the following illustrates a cross-modal investigation of the potential effects of the root morpheme in Arabic by using a within-word design (Table 2.1).

Table 2.1 Sample stimuli used to probe for root effects in cross-modal priming. +/−R indicates whether the prime/target pair share a root or not, +/−S whether they share semantics. Unrelated refers to the baseline condition. (For further details see Boudelaa and Marslen-Wilson 2000)

If Arabic roots are stored at a central level of representation and play a role similar to that played by stems in Indo-European languages, then hearing the prime word maDXaL ‘inlet’ should have two immediate processing consequences. It should activate the morpheme DXL , and at the same time inhibit other morphemes which are similar to it only in a form like DXN ‘smoke’, or DJL ‘dupe’. The prior activation of DXL should facilitate the response to the target DuXu:L ‘entering’ when it is subsequently displayed. But what happens when the root has different interpretations across prime and target, as illustrated by muDa:XaLah/DuXu:L ‘participation’/ ‘entering’ in Condition 2? Does the morpheme DXL in the prime map onto the same underlying representation as the morpheme DXL in the target in spite of their different meanings? Or are there two different entries, one for DXL meaning participate, and one for DXL meaning enter?

Having a separate entry for every meaning variation of a root, although potentially costly in terms of cognitive storage, may nonetheless lead to priming via facilitatory links between the two morphemes . A more parsimonious alternative would be to posit a unique entry to all the words featuring the same root and to associate that entry with all the shades of meaning variations that the root can have. On this scenario, hearing muDa:XaLah ‘participation’ should, as in condition 1, activate the root DXL, and inhibit phonologically similar roots. This activation of the root by the priming word should lead to subsequent facilitation of the target DuXu:L ‘entering’.

In condition 3, where the prime ʔ i:La:J ‘insertion’ and target DuXu:L share meaning but feature different roots, significantly weaker priming is expected. Activation of the entry for the root WLJ ‘enter’ can only affect the entry for DXL through interlexical links, and these do not support priming as effectively as when the same linguistic entity (such as the root) is shared between prime and target (Gaskell and Marslen-Wilson 2002; Marslen-Wilson et al. 1996) .

This set of predictions contrasts sharply with the predictions made by a stem-based approach where the root is thought to play little or no role in morphological processes, and the lexical entry is the imperfective stem, or the full surface form itself (Benmamoun 1998, 2003) . It is not clear, on a cognitive interpretation of such an analysis, how forms sharing a root can be psycholinguistically linked to each other except on the basis of possible semantic relationships, either between their full forms or between the imperfective stems underlying primes and targets. This predicts no difference in priming between Conditions 1 and 3. In both cases there should be facilitatory activation between lexical representations of the prime and the target because of their semantic similarities, but in neither case should the strength of these effects be affected by morphological structure. Priming between words sharing a root and a transparent semantic relationship (+R+S) should be of the same magnitude as priming between pairs that are only semantically related (−R+S). For Condition 2, since the meaning of the full-form muDaXaLah is not related to the meaning of the target DuXu:L—and similarly for their imperfective stems—there is no cognitive basis for any facilitatory priming effects, although interference may be generated given the phonological similarity between them.

The results, displayed in Fig. 2.2, show strong priming effects between words sharing a root. This effect is not modulated by semantic transparency . The target DuXu:L ‘entering’ is primed equally well by the semantically related maDXaL ‘inlet’, and the semantically opaque muDa:XaLah ‘participation’ (Boudelaa and Marslen-Wilson 2000) , suggesting that surface forms featuring the same root map onto the same underlying lexical representation corresponding to the root morpheme . Similar results have been found in Hebrew, again using a cross-modal priming task and varying morphological and semantic relatedness (Frost et al. 2000) . This finding, of overt root priming effects in the absence of a transparent semantic relationship between prime and target, seems to be specific to Semitic languages. In the Indo-European languages tested to date (e.g., English, French, Polish, Dutch), except German (Smolka et al. 2009), overt morphological priming is contingent on the transparency of the semantic relationship between prime and target , such that casually primes casual, but casualty does not (Marslen-Wilson et al. 1994) . This cross-linguistic difference may reflect the specific structural role of Semitic morphology, especially on a root and word-pattern account, compared to a language like English, where morphological operations do not play the same fundamental role in generating the surface word-forms of the language (Boudelaa and Marslen-Wilson 2005; Marslen-Wilson 2001) .

Fig. 2.2
figure 2

Cross-modal priming effects for words sharing a root with and without semantics (+R+S an +R−S), and words sharing semantics without sharing a root (−R+S)

Priming effects in Condition 3, for words that are only semantically related (R+S), are much weaker and more variable. This is consistent with the view that morphological priming and semantic priming are subserved by different mechanisms: repeated access of the same underlying lexical entity in the case of morphological facilitation, and facilitatory links between different lexical entries in the case of semantic priming.

This pattern of results, which has been replicated in various forms of overt priming (including cross-modal, and auditory-auditory in Standard as well as Dialectal Arabic, where no formal teaching of roots and patterns is ever received), follows directly from a root-based account of lexical representation and processing in Semitic languages like Arabic or Hebrew . It is clearly inconsistent with a strong stem-based view on which the root morpheme would play no role in processing (Benmamoun 1998, 2003) and the only relevant unit of linguistic analysis would be the imperfective verb stem. The regularities provided by the root morpheme are picked up by the language learner and used as an organizing principle of lexical space.

Cross-modal priming with word patterns: similar sets of issues and contrasts arise when we turn to the second class of morphemes distinguished on a root-based approach—namely the word pattern morpheme . To probe for priming effects among words sharing a word pattern , Boudelaa and Marslen-Wilson (2000) adopted the same factorial approach as with the roots described above. The same target word was paired with four priming words as illustrated in Table 2.2.

Table 2.2 Example of stimuli used to probe for word pattern effects in cross-modal priming. +WP+M stands for pairs sharing the form of the word pattern and its meaning, while +WP−M refers to pairs sharing the form of the word pattern but not its meaning. (Boudelaa and Marslen-Wilson 2000)

If lexical processing in Arabic requires access to the root morpheme , as the priming data suggest, then the other component of the surface form, the word pattern , must also be accessed at some point during processing. Accordingly, upon hearing an Arabic prime word such as TiJa:Rah ‘trade’, a number of processing operations are triggered. These involve not only the activation of the root TJR ‘trade’, and the suppression of its cohort competitors on the one hand, but also the activation of the word pattern CiCa:Cah ‘profession noun, singular’ (and possibly the suppression of its cohort competitors as well) on the other. Since residual activation of the root morpheme generates priming, so should residual activation of the pattern. Therefore priming is expected in Condition 1 (+WP+M) among words sharing the phonological structure and the morpho-syntactic meaning of the word pattern as a consequence of the same underlying unit being accessed in prime and target .

By contrast, in Condition 2, the prime and the target share the phonological structure of the word pattern but not its morpho-syntactic meaning (+WP−M). The pattern CiCa:Cah has a ‘profession noun’ reading in the target ṭiBa:ʕ ah ‘ art of typography’, but a ‘deverbal noun’ reading in the primeĦiKa:Yah ‘story’. If word pattern priming depends on the prime and target sharing the same morpheme , then there should be no facilitation for these pairs since their word patterns are homophonic. Alternatively, if priming through shared word patterns is much more a function of shared phonological similarity, independent of its possible linguistic interpretation, then there would be no reason not to see priming for these pairs. In Condition 3, word pairs sharing phonology like muṭa:ʕ /ṭiBa:ʕ ah ‘obeyed’/ ‘art of typography’ should show signs of competition rather than priming.

On a stem-based approach, word patterns are not thought to be relevant linguistic units at all . So even if we relax our assumption to the point of accepting that ṭiBa:ʕ ah ‘art of typography’ and TiJa:Rah ‘trade’ are derived by modification of the imperfective stems tBaʕ and Ta:JiRFootnote 4 respectively, no relationship can be established between these two items since they have different semantics and they relate to different stems. Therefore there is no basis for facilitatory priming between such items, which should behave like the phonologically related words in Condition 3 and show interference irrespective of whether they share the same morpho-syntactic interpretation of the pattern or not .

Figure 2.3 plots the net priming in the +WP+M, the +WP−M, and the + Phonology conditions. Priming between words sharing a word pattern is strongly significant, but only if the word pattern in prime and target is the same underlying morpheme , as in Condition 1 (Boudelaa and Marslen-Wilson 2000) . In Condition 2, where the word patterns in prime and target are phonologically but not morphologically identical, we see no priming. This seems to be compelling evidence that the effect seen here is based on the abstract linguistic relationship between prime and target, and not on overlap in terms of their phonological or orthographic properties. Consistent with this, there is no priming in the + Phonology condition, and even some signs of interference . Taken together these results cast further doubts on the stem-based approach to Arabic in particular and Semitic in general, and provide support for the view that word pattern priming is driven by repeated activation of shared morphemes at a central level .

Fig. 2.3
figure 3

Net cross-modal priming effects for words sharing a word pattern (+WP), where their morphemic identity (+/−M), and words sharing phonology only (+Phonology)

It is interesting to note that reliable overt word pattern priming was also found in Hebrew among verbs sharing the same word pattern (Frost et al. 2000) . In Arabic, word pattern priming was found not only for verbs but also for nouns provided that the nominal word pattern occurs in the context of a productive root. Overall however, overt cross-modal priming in the two major Semitic languages, Arabic and Hebrew, provides compelling evidence for roots and word patterns as lexical units governing the process of spoken word recognition .

Masked priming with roots and word patterns: a growing body of visual word recognition research on Indo-European languages suggests that masked priming reflects early processes of segmentation into stems and suffixes, rather than the properties of central lexical representations (Longtin et al. 2003; Marslen-Wilson et al. 2008) . This segmentation process applies automatically and blindly, so words like corner, which have no actual internal morphological structure, are nonetheless initially decomposed into a stem {corn} and a suffix ~er, generating masked priming to the pseudo-stem corn. On a root-based approach where every Arabic surface form is morphologically structured , consisting of a root and a word pattern , priming by roots and word patterns should again be found, and should not be modulated by the semantic transparency of the relationship between prime and target. We tested this prediction using the design illustrated in Table 2.3 .

Table 2.3 Example of stimuli used to probe for word pattern and root effects in masked priming

In Condition 1, the target DuXu:L ‘entering’ is paired with the prime ŠuRu:ʕ ‘starting’, with which it shares the nominal word pattern CuCu:C . On the traditional root and pattern approach, both DuXu:L and Š uRu:ʕ are morphologically structured and should be subject to the early decomposition process picked up in masked priming. This will activate the component morphemes of these words at the level of access representation and should provide a basis for priming based on repeated access to the same component CuCu:C at this level. In Conditions 2 and 3, priming is also expected based on the activation of the same morpho-orthographic component shared by the prime and target. Priming in these two conditions should be of the same magnitude although the prime and target are +R+S in Condition 2, but +R−S in Condition 3. This is because semantics does not affect the early decomposition process on which masked priming seems to be based. For the same reason no priming is expected in Condition 4 (−R+S), where prime and target are semantically but not morphologically related. In Condition 5, the prime and target overlap orthographically, which should lead to inhibition at the level of processing tapped onto by masked priming.

On a stem-based approach, the morphological structure provided by roots and patterns presumably has no role to play. The overlap in form between words sharing a root and words sharing a pattern should, if anything, give rise to competition between forms sharing such units at the level of access representations. This predicts that two of the three morphological conditions (+WP, +R−S) should pattern with the orthographic conditions (+Orthography), possibly showing inhibitory effects. The third morphological condition (+R+S) should pattern with the semantic condition (−R+S), although purely semantic priming is generally either weak or non-existent in masked priming at SOA’s of 60 ms or less.

Figure 2.4 shows the masked priming outcome for the first five conditions relative to the Unrelated condition (Boudelaa and Marslen-Wilson 2005) .

Fig. 2.4
figure 4

Masked priming effects for words sharing a word pattern (+WP), words sharing a root and semantics (+R+S), a root but no sema;ntics (+R−S), semantics but not a root (−R+S), and words sharing orthography (+Orth) at an SOA of 48 ms

As predicted by the decompositional root and pattern model, there is a strong and statistically significant masked priming effect for both word patterns and roots (Boudelaa and Marslen-Wilson 2005) . Words overlapping only in form (Condition 5: Orthography) fail to prime at all, even though they share more letters on average than words related just by a word pattern . This suggests that masked priming in Arabic hinges on the activation of potential morphemic units at the access level rather than on simple orthographic overlap between prime and target . And again there is no difference in magnitude of priming between the +R+S condition and the +R−S on the one hand and no priming in the purely semantic condition −R+S on the other. This is consistent with the claim that masked priming primarily reflects early segmentation processes—although it should not be forgotten that in Arabic, unlike English and related languages, we see no semantic interaction with morphological effects in overt priming either. Similar masked priming effects for word patterns and roots have also been reported in Hebrew (Frost et al. 1997) . It is difficult to see how a stem-based approach can be modified to accommodate these findings.

1.4.1 Neuropsychological Evidence

Neuropsychological research focuses on the study of damaged brain systems in order to understand normal cognitive functions. The rationale underlying research in this area is that if a specific cognitive problem can be found after an injury to a specific area of the brain, it is likely that this part of the brain in some way supports the cognitive function in question. Although the inference from patterns of breakdown to normal function is notoriously difficult and depends on the theory of normal cognition (Bullinaria and Chater 1995; Caramazza 1986; Shallice 1988) , looking at the way in which the cognitive function breaks down in patients with brain damage may nonetheless be informative about the organization of the normal system. Competing morpheme-based and stem-based views of Arabic morphology seem to make different predictions about the patterns of dysfunction likely to be observed in Arabic following brain damage . Because it assigns a cognitive status to roots and patterns, the morphemic approach predicts that damage to brain areas supporting language understanding and production may affect these morphemes differentially and selectively. By contrast, on a stem-based approach neither the root nor the word pattern is referenced by cognitive processes, since the stem is the basic unit for such processes, and should be implicated in any lexically-related deficit following damage to the brain .

The Semitic neuropsychological literature offers two reports addressing the issue of whether roots and patterns can be selectively impaired. Prunet et al. (2000) assess the extent to which metathesis errors (where the ordering of elements is compromised) target root consonants as opposed to word pattern consonants. They examined the speech of an Arabic aphasic patient , ZT, suffering from stroke damage to left hemisphere territories known to be important for normal language function. When prompted to read a word like maMLaKah ‘kingdom’ from the root MLK, ZT would produce the non-word *maLMaKah or *maKMaLah where the order of the root consonants is swapped around, but he would almost never produce something like *KaMLamah where a root consonant swaps positions with a word pattern consonant . A second report focused on the selective impairment of the word pattern vowels of a Hebrew speaking patient, Dudu (Barkai 1980) . This patient exhibited severe problems in producing the vowels of the word pattern while his uses of the consonants of the root were preserved. For example in response to a form like GaZaZ ‘cut’, Dudu would produce the nonce form *GiZeZ, where the order and identity of the root consonants are intact, while the vocalic pattern [a-a] is realized as [i-e].

Taken together, the cases of ZT and Dudu suggest that unless the root and the word pattern have a special cognitive status, as embodied in the morphemic approach, it would be hard to explain how errors can selectively target root consonants in ZT’s case, and the vowels of the word pattern in Dudu’s case (Barkai 1980; Prunet et al. 2000) . Like the priming evidence reviewed earlier, the neuropsychological evidence is at variance with the stem-based approach .

1.4.2 Neuro-Imaging Evidence

Neuro-imaging techniques fall into two broad classes: haemodynamic methods, such as PET and fMRI, and electro-physiological methods such as EEG and MEG). Haemodynamic methods are predicated on the close coupling between changes in the level of activity of a neuronal population and changes in its blood supply (Raichle 1987) , while electro-physiological methods take advantage of the fact that some classes of neurons act like electrical dipoles which create an electromagnetic field that can be detected and recorded from outside the head (Wood 1987) . Neuro-imaging provides a means to separate and identify different cognitive operations in terms of their neuro-physiological correlates. It assumes that if two experimental conditions generate qualitatively distinct patterns of neural activity, they are more likely than not to engage functionally distinct cognitive operations (Rugg 1999) .

Within this framework, Boudelaa et al. (2010) conducted an Event Related Potential (ERP) experiment to look at how the brain responds to Arabic words differing either by a consonant belonging to the root, or a vowel belonging to the word pattern . We used the Mismatch Negativity (MMN) technique which relies on electroencephalography (EEG) to measure the brain’s electrical activity and make inferences about regional cortical activities (Näätänen and Alho 1997; Pulvermüller and Shtyrov 2003) .

Participants were presented with two pairs of auditory stimuli while watching a silent movie. The first pair formed the root condition and was made up of the two words ʕaRi:S/ʕ aRi:F ‘bride’/ ‘corporal’, which feature the same word pattern CαCi:C, but different roots ʕRS and ʕRF. The second pair represented the word pattern condition and consisted of the words ʕaRi:S/ʕaRu:S ‘bride’/ ‘bridegroom’. These are made up of the same root ʕRS, but use the different word patterns CαCi:C and CαCu:C respectively. In both cases the word ʕaRi:S was used as standard and presented 85 % of the time, while the words ʕaRi:F and ʕaRu:S served as deviants in the root and pattern conditions respectively, being presented only 15 % of the time. Performance on these word-word pairs was compared to performance on closely matched non-word-non-word pairs which differed either by a consonant (e.g., *NiRi:S/*NiRi:F), or a vowel (e.g., *NiRi:S/*NiRu:S).

On a morpheme-based approach, words differing by a root consonant should elicit a different brain response than words differing by a word pattern vowel, because the diverging segment belongs to functionally distinct morphemes. The stem-based approach, in contrast, seems to predict no difference between the root and the word pattern conditions. A word like ʕaRi:F, represented either as a full form or in terms of an imperfective stem, is as different from ʕaRi:S as it is from ʕaRu:S.

The results are in keeping with the predictions of a morpheme-based approach, showing that at 160 ms after the deviation point, the word deviant ʕaRi:F ‘corporal’ elicits a larger MMN than its matched non-word deviant at fronto-central recording sites. There is no significant lateralization. This pattern of activation is typical of responses to content words which exhibit no inter-hemispheric differences. In the word pattern condition there was a significantly larger MMN response evoked by the word deviant ʕ aRu:S relative to the matched non-word deviant *NiRu:S. This effect was seen in left inferior temporal regions at 250 ms after the word recognition point . These inferior temporal regions are typically associated with the processing of grammatical morphemes in earlier studies (Caplan 1992; Chapman 1999; Mohr et al. 1994; Pulvermüller 1999; Pulvermüller et al.1995) . These results provide a demonstration of a neural dissociation in the processing of roots and word patterns in a Semitic language: roots, like content words in Indo-European languages, are subserved by neural assemblies equally distributed over both hemispheres, while word patterns are similar to function words, lateralizing strongly to the left. It is not clear how stem-based or full-form approaches can provide a basis for explaining these contrasts.

1.5 Implications

The across-the-board morphemic effects described above have far reaching implications for how Arabic words are recognized from script and speech . Not only do they strongly suggest that access representations are organized in terms of roots and patterns, they also indicate that modality-free central representations of lexical form and meaning are structured in terms of the same units. Furthermore, the same units seem to govern both the early decomposition processes and the central processes of access to meaning from speech and script with the stem itself playing no role in the lexical access process per se (see Berent et al. 2007; Vaknin and Shimron 2011 for a different opinion) . So, what kind of cognitive architecture do we need to model this? In what follows we sketch out a tentative account of morphological effects in spoken and written Arabic word recognition. We then compare this suggestion to the dual route model of Frost et al. (1997) , and to the distributed connectionist account of Plaut and Gonnerman (2000) ; two models designed to account for similar phenomena in Hebrew.

1.5.1 The Obligatory Morphological Decomposition Account

The above data strongly suggest that lexical processing in Arabic evolves around roots and word patterns , and that the extraction of these units during spoken language comprehension and reading is subserved by an obligatory decomposition mechanism as schematically depicted in Fig. 2.5.

Fig. 2.5
figure 5

Schematic representation of the Obligatory Decomposition view of Arabic spoken and written word recognition

This idea is similar to Taft’s (2004) and can be instantiated as an interactive activation network with localist representations corresponding to roots and word patterns . According to this view, all content words in Arabic undergo a process of obligatory morphological decomposition (OMD) whereby their roots and word patterns are accessed as lexical entries . A lexical entry will feature the morpho-syntactic, phonological, semantic and functional information associated with the component morphemes of a given word. As long as the input word has an identifiable morphological structure it will undergo decomposition whether its meaning is combinatorial or not. This obligatory decomposition naturally accounts for the fact that transparent forms like Ka:TiB ‘writer’ and non-transparent forms like KaTi:Bah ‘squadron’ generate comparable amounts of priming in overt (i.e., cross-modal) and covert (i.e., masked) priming.

Why would a non-transparent form like KaTi:Bah be decomposed when its meaning is not combinatorial? The answer to this lies in the distributional properties of the language. Most of the Arabic words sharing a root have significant overlaps in meaning . As mentioned earlier, the root KTB for instance is encountered in 31 distinct derived forms in Modern Standard Arabic , all of which, save one –namely KaTi:Bah- evolve around the general meaning of ‘writing’ (Boudelaa and Marslen-Wilson 2010) . The overall consistency of the mapping between the form of a root—in this case KTB- and its semantic interpretation—‘writing’- provides a valuable island of reliability in the otherwise arbitrary task of learning to relate a given spoken or written form to its meaning. This promotes the development of a parsing strategy to extract the linguistic component that helps attenuate the severity of the arbitrariness of the form-to-meaning mapping; and this unit happens to be the morpheme in Arabic .

The initial obligatory decomposition applied to non-transparent KaTi:Bah-like forms needs to be subsequently followed by a recombination stage so that their idiosyncratic interpretation can be established. So the claim here is not so much that there are no full form representations in the Arabic mental lexicon, but that such representations are available only for opaque forms, and that most importantly the storage of such forms does not circumvent obligatory decomposition. So, unlike the dual route view of Frost et al. (1997) described below, the OMD posits a single parsing route to the lexicon and a post-access recombination stage in the case of opaque forms. An interesting consequence of this is that surface form frequency effects are expected to affect processing at the recombination stage, whereas root frequency and word pattern frequency effects should affect the earlier stages of the lexical access process itself. This prediction is being tested by the present author in a lexical decision EEG experiment where variables such as surface frequency, root frequency and word patterns frequency, among others, are correlated with the Event Related Potential on a millisecond by millisecond basis.

1.5.2 Frost et al.’s Dual Route Model

In Frost et al.’s (1997) model, lexical units (words) and sub-lexical units (morphemes) are both represented. Processing of Hebrew printed stimuli consists of a lexical retrieval process in which lexical units are located at the word level and a morphological parsing process in which morphemic units are extracted and located at the sub-lexical level. One of the critical features of this model is that the morphological level of representation encodes only the orthographic form -and by extension to the auditory domain only the phonology - of the root. This allows the model to account for priming among words sharing a root but an opaque semantics (e.g.,KaTi:Bah/ KiTa:B ‘squadron’/ ‘book’).

What is not clear however is whether the morphological representation of the word pattern encodes only the phonology/orthography of this unit or whether it also encodes aspects of its morpho-syntactic meaning. If the morphological representation of a word pattern like CuCu:C or CiCa:Cah is assumed to encode only the phonological -and orthographic- attributes of this unit, by analogy to the morphological representation of the root, then the model would predict facilitation among primes and targets that share the phonological structure of a word pattern but not its morpho-syntactic meaning such as TiJa:Rah/QiLa:Dah ‘trade’/ ‘necklace’, where the pattern CiCa:Cah has a profession noun interpretation only in the prime.This is not the case however; significant priming in nounsFootnote 5 at least is observed only when the word pattern occurs in the context of a productive root and when the prime-pattern and target-pattern have the same phonological structure and the same morpho-syntactic interpretation as in TiJa:Rah/ṭiBa:ʕah ‘trade’/ ‘art of topography’ where the pattern encodes the meaning of profession noun in both the prime and the target (Boudelaa and Marslen-Wilson 2011) .

A similar problem for the dual route model is raised by the Hebrew cross-modal priming data which suggests that +R+S prime-target pairs show evidence of stronger priming than matched +R−S pairs (Frost et al. 2000) . This suggests that semantics modulates root priming in overt tasks in Hebrew and consequently that the morphological level of representation of the root cannot be claimed to represent only the form of this unit. In addition to this, the dual route model is not clear about the time taken to recognize a word via the lexical route or the sub-lexical route. Do the two routes race with each other? What variables affect the race? Or is the race between the lexical and the sub-lexical routes fixed such that processing via the two pathways is instantiated simultaneously and systematically delivers an output?

1.5.3 Plaut and Gonnerman’s (2000) Connectionist Network Model

Plaut and Gonnerman (2000) used a simple feed-forward network, where orthographic input is mapped onto semantic output via a set of hidden units, to demonstrate that non-semantic morphological effects are not incompatible with a distributed connectionist account. Like many other connectionist accounts (e.g., Rueckl et al. 1997; Joanisse and Seidenberg 1999; Seidenberg and Gonnerman 2000) , Plaut and Gonnerman’s model assumes morphology to be a characterization of the learned mapping between the surface form of words, that is their orthography or phonology , and their meanings. Since morphologically related words necessarily share form, and this is mapped onto largely overlapping aspects of meaning, the internal representation of a connectionist network should pick up on this quasi-regular mapping and treat morphological structure in a combinatorial way. Priming between morphologically related but semantically unrelated words was simulated using a set of morphologically related words varying in semantic transparency. These were embedded either in a morphologically rich or a morphologically poor language corresponding respectively to English and Hebrew. Morphological priming was found to increase with the degree of semantic overlap in both languages; and morphological priming occurred between words that share morphology without semantics in the morphologically rich language (Hebrew) but not in the morphologically impoverished language (English).

When lexical knowledge is represented distributedly, words that share parts of their spelling (e.g., muMTɩ ʕ ‘enjoyable’, MuTʕ ah ‘pleasure’), and map that spelling onto similar meanings ‘enjoyment/pleasure’, have similar effects on some of the weights; therefore exposure to one word improves performance on the other. By contrast, words that share their spelling but map onto differential meanings (e.g. MaTa:ʕ ‘commodities’, MuTʕ ah ‘pleasure’) push the weights in competing directions, and exposure to one word does not benefit processing of another. Consequently the fact that the network exhibits priming among morphologically related semantically opaque forms is in itself a success, and a clear demonstration that the way opaque items are represented and processed depends on the overall linguistic environment to which the network is exposed. If most derivative forms featuring a particular root have similar semantic interpretations and only few of them deviate from the general semantics of the root, the semantically transparent items will ally themselves to exert a coherent influence on the opaque ones such that every member of the morphological family is represented more or less componentially. Plaut & Gonnerman’s model is however at odds with the data summarized above.These data do not reveal a graded morphological effect as a function of semantic transparency. Instead, what we see is comparable amounts of root facilitation among semantically transparent pairs and semantically opaque ones.

2 Conclusion

We have evaluated the predictions of two views of Arabic morphology , a morpheme-based approach and a stem-based approach in the light of the available data from cognitive psychology, neuro psychology, and cognitive neuro-science. The data from these three areas of research converge to support the view that Arabic surface forms are cognitively represented on a morphemic basis, with entities such as roots and word patterns playing a crucial role in processes of lexical access and in the structure of lexical representation (Saiegh-Haddad in press). The stem-based approach does not seem able to accommodate this kind of data, and is further strained by much other behavioral data, such as slips of the tongue (Berg and Abd-Al-Jawad 1996) , and novel word acceptability judgments (Frisch and Adnan Zawaydeh 2001) . Even on a purely formal linguistic level, this model arguably suffers from significant in adequacies (Prunet 2004; Tucker 2009) . On a more general level, the stem-based approach fails to strike the right balance between the aim of accounting for Semitic languages using the same set of formal apparatus used with other languages, and the aim of capturing the specificity of each individual language.

The OMD sketched above is a root and pattern based account and provides a good fit to the existing data. It provides a better fit to the data than the dual route model (Frost et al. 1997) , and the distributed connectionist model (Plaut and Gonnerman 2000) . Future development of the OMD will need to be informed more significantly by neural consideration in order to build a neuro-cognitively viable account of speech and reading comprehension in Arabic in particular and Semitic in general .