Introduction

Reading is an essential skill that relies on visual, orthographic, phonological, and semantic information (Perfetti, 2007). It involves two basic information processing activities; the first is decoding, where children learn to read based on the Grapho-Phonemic Correspondence (GPC) by translating written letters (graphemes) to the corresponding sound (phonemes). However, to more advanced reading skills, children need to move from this initial and primitive stage to an efficient and fluent one. To do so they need to read bigger orthographic units such as syllables and words, which contribute to more efficient word recognition (Gough & Tunmer, 1986). Based on the dual route view (Coltheart et al., 1993), this latter stage relies more on lexical knowledge that includes the expressive and receptive vocabulary, which are used to reflect processing related to the lexical route. In this case, the lexical route makes activation forward to both whole-word orthographic representations and higher-level semantic information (vocabulary). Therefore, readers identify the different letters of the word (knowledge of the alphabet and orthographic lexicon), this orthographic information makes direct contact with semantic and spoken vocabulary (connections between the printed word and the decoded meaning) (Gough & Tunmer, 1986). The first stage however, relies mainly on phonological awareness (Perry et al., 2019), and thus reflect the non-lexical (phonological) route.

The involvement of both lexical and non-lexical routes in reading may differ between languages depending on the orthographic transparency of the language, which is defined as the correspondence between a word’s orthographic (written) units and their phonologic (sounds) forms (Frost, 1998; Seymour et al., 2003). While the phonological route plays a critical role in more transparent orthographies, the involvement of the lexical route seems to be highlighted in more deep orthographies where the GPC is not applicable. Yet, the contribution and availability of the phonological and lexical information may differ between first and second languages due to orthographic-specific influences (specific writing system type of each language: transparent vs. deep orthography) and word exposure and use of oral language in each language (Lallier et al., 2014) and in particular languages that use dialects in their spoken form that are relatively different between the standard and written language, which might be particularly relevant to the diglossic situation in Arabic. Very few studies have shown that exposure to two or more languages concurrently and, as a result, having varying amount of input in each language does not impede phonological development, thereby improving phonological production, when the amount of input, the phonological saliency of the target system, and the degree of relatedness of language systems are all taken into account (Montanari, 2011; Yang & Hua, 2010). Thus, in this study we will test the relevant contribution of lexical and non-lexical routes to reading in transparent and deep orthographies of Arabic and English.

Overall, reading is a cognitive, interactive and important process, in which readers extract meaning and knowledge (Bojovic, 2010). The development of reading may differ between languages because of their orthographic depth. For example, Share (2008) emphasized that characteristics of orthographic depth have always been centered on English; yet, English may be an irregular orthography and orthographic uniqueness affects reading models. In different language scripts, the transparency of orthographic depth vary (e.g., Aro, 2004; Caravolas et al., 2013; Protopapas & Vlahou, 2009; Ziegler et al., 2010). In languages that have a transparent orthography, such as Indonesian, Italian, and Finnish, surface phonology is reflected in the consistent pronunciation of a given letter in different words (Winskel & Lee, 2014; Ziegler et al., 2010). However, in deep orthographies, such as English, grapheme-phoneme relationships can be ambiguous with a high degree of inconsistent and irregular spellings (Share, 2008; Seymour et al., 2003). A given letter is often pronounced inconstantly in different words, such as the ‘a’ in saw and cake. Some letters have no corresponding sound (e.g., ‘w’ in answer), whereas the same sound can have various spellings (e.g., /k/ in cat, king, opaque, and track).

In the case of orthographic depth, the literature has addressed two approaches to theorizing basic reading components. The first is the traditional dual-route approach of reading, which distinguishes between two independent lexical-orthographic and non-lexical phonological processes for the conversion of letters to sounds (Coltheart et al., 1993, 2001; Zorzi, 2010). The lexical route, which is also called the direct route, involves direct contact between sublexical orthographic information and whole-word orthographic representations. This gives access to both whole-word phonology and higher-level semantic information. This may be applicable to reading languages with deeper orthographies, where the word reading process requires a visual graphic route in order to achieve meaningful comprehension (Forster et al., 1987; Hoosain, 1991). On the non-lexical route, which is also called the indirect route, sublexical orthographic information is transformed into a sublexical phonological code before making contact with phonological output units, whole-word phonological representations, and semantics. Therefore, according to the traditional dual-route model, if the language writing system accurately represents the sounds of the language, i.e. has high orthographic transparency, readers are more likely to rely on non-lexical phonological coding (Forster et al., 1987). In the second approach, which emerged from the modern research of connectionists that has increasingly advocated for more flexible interactions between the two routes, phonological and orthographic processes are more dependent and interconnected (Plaut, 2005; Plaut et al., 1996; Seidenberg & McClelland, 1989; Van Orden et al., 1990). This latter theory claims that there is a single, rather than dual, mechanism for processing words, as well as distributed representations and weighted connections between units, rather than symbolic rules for mapping letters and sounds, although the contributions of different parts may differ. To sum, the dual-route theory would suggest only phonological decoding for transparent orthographies, while connectionist models might show differences, it means that in transparent orthography, phonological and lexical processing are both involved, with stronger phonological processing.

In the case of reading acquisition in languages that vary in their orthographic depth, reading research suggests that reading acquisition is easier in certain orthographies (e.g., Aro & Wimmer, 2003; Seymour et al., 2003). The variation in the orthographic depth of different scripts was found to impact reading acquisition in different languages. The orthographic depth hypothesis (ODH; Katz & Frost, 1992) and the grain size theory (Ziegler & Goswami, 2005), as well as other studies (e.g., Aro & Wimmer, 2003; Caravolas et al., 2013; Seymour et al., 2003) have proposed that transparent or shallow orthographies are easier to read than deep orthographies with highly inconsistent grapheme-phoneme coding. For example, Seymour et al. (2003) examined the differences in the rates of acquisition of literacy components in European languages that vary in their orthographic depth. They tested letter knowledge, familiar word reading, and simple nonword reading in English and 12 other European orthographies (Finnish, Greek, Italian, Spanish, German, Norwegian, Icelandic, Portuguese, Dutch, Swedish, French, and Danish). The findings revealed that Scottish English-speaking children (aged 5 to 7 years) performed poorly in accuracy tests by the end of the first school year, and read only 29% of English pseudowords correctly. Aro & Wimmer (2003) examined numeral reading in English, number-word reading, and pseudoword reading among English-speaking children and compared them to German-, Dutch-, Finnish-, French-, Spanish-, and Swedish-speaking children in grades 1 to 4. By the end of the first year, the Dutch-, German-, Finnish-, and Spanish-speaking children demonstrated 85% reading accuracy for pseudowords, whereas the Swedish-speaking children demonstrated 90% reading accuracy. However, the English-speaking children demonstrated only 50% accuracy. They did not close the gap in accuracy levels when compared to their peers until grade 4. This shows that the ability to translate graphemes into acceptable pronunciations was easily demonstrated in all the orthographies tested, except English.

Overall, similar to monolinguals, research has indicated the existence of orthographic-specific influences on reading in bilinguals in two orthographies simultaneously. Lallier et al. (2014) examined reading words among Spanish-French balanced bilingual children (aged 9 to 12 years) in both their deep (French) and shallow (Spanish) orthographies. The findings indicated that bilingual children were more accurate at reading in their shallow orthographies than in their deep ones. This shows that orthographic depth impacts lexicality effects, indicating that bilinguals rely more on the lexical process when they read in their deep than in their shallow or transparent orthographies. In the context of relations between first and second languages in reading acquisition among bilinguals, it was found that L1 has a positive impact on L2 reading skills (Jiang & Kuehn, 2001; Perfetti et al., 2002). Based on the interdependence hypothesis of Cummins (1979), which suggests that Cognitive Academic Language Proficiency (CALP) is transferred between languages, researchers have proposed that good linguistic skill in L1 predicts similar skill in the second language (Leikin et al., 2005). Therefore, bilingual students can transfer reading skills, such as phonological skills and word identification skills between their languages (Abu-Rabia et al., 2012).

The impact of orthographic depth on reading was also found in the predictors of reading accuracy between orthographies. for example, Ziegler et al. (2010), investigated the role of phonological awareness, memory, vocabulary, rapid naming, and nonverbal intelligence on reading performance across five different languages that varied in their orthographic transparency (Finnish, Hungarian, Dutch, Portuguese, and French). The sample included 1,265 children from the second grade. The findings revealed that phonological awareness was the main predictor of reading fluency in all five languages, but its impact was modulated by the transparency of the orthography, being stronger in less transparent orthographies. Also, the importance of phonological awareness, as one of the predictors of reading performance, was relatively universal across the alphabetic languages studied, though its precise weight varied systematically as a function of script transparency. The influences of phonological awareness and vocabulary were modulated such that phonological awareness was a more important component in opaque orthographies compare to vocabulary which was a more important component in transparent orthographies. As a result, a reader will read familiar words using more the lexical-orthographic processing alongside non-lexical processing, whereas new and unfamiliar lexical items will prompt them to use more non-lexical routes involving grapheme-phoneme coding alongside lexical processing (Seidenberg, 1985). This is particularly relevant in the context of diglossic reality in Arabic, which also has its unique orthographic system.

Arabic: orthography and diglossia

Reading acquisition in Arabic is a challenging process for children (Eviatar et al., 2004). It is affected by a number of factors. The first is the complexity of Arabic orthography (Eviatar & Ibrahim, 2014), which slows reading acquisition (Saiegh-Haddad, 2003). The Arabic alphabet consists of 28 letters representing consonants. All letters are written from right to left. In some cases, the letters are connected together from both sides, in other cases they are not connected. It also includes three vowel signs (short and long) (Abu Chacra, 2017). The vowelized written words represent the shallow or transparent orthography that is presented to beginner readers. Around the third grade, a non-vowelized written system is presented to them in various written texts and materials. So, unlike Roman orthographies, where each language has a transparent or deep writing system, based on how orthography represents phonology, Arabic orthographic systems have shallow and deep representations (Katz & Frost, 1992). The unvowelized writing system in Arabic challenges readers with its high number of homographs. Thus, it requires relying on general contextual clues in order to identify the correct pronunciation and meaning of specific words. For example, (أكِل/ʔakel /’food’, أكَل/ʔakal/’ate’) have similar letters when vowelization diacritics are omitted. Therefore, readers have to rely on general context to infer pronunciation and meaning of those words.

Some researchers have indicated that vowelization contributes to word recognition in Arabic because of the phonological contribution of vowel diacritics while reading (Abu-Rabia, 1998, 1999, 2007). In contrast, Ibrahim (2013) revealed that unvowelized words were read more fluently than full vowelized words among native Arab eighth graders. These results can be explained by the visual load produced by vowelization diacritics while reading familiar words. Asadi (2017) examined the effect of diacritics for short vowels on the accuracy and fluency of reading in Arabic among children from the first to sixth grades using unvowelized and fully vowelized words. The findings showed that children read unvowelized words more fluently and accurately across all grades, except for those in the first and second grades, who showed equal reading accuracies.

Notably, the involvement of lexical and non-lexical components in predicting reading was examined in orthographic depth context in the Arabic language. Asadi and Khateb (2017) examined the involvement of lexical (vocabulary) and non-lexical (phonological awareness) components in predicting reading based on orthographic transparency of Arabic among first and second readers. Findings revealed that the prediction of phonological awareness to reading was strong and similar both in the vowelized/transparent and unvowelized/deep orthographies and decreased in the second grade in both versions. The contribution of vocabulary to reading was slightly higher in the unvowelized orthography compared to the vowelized version, and it increased with age. It seems that at the beginning of reading, the lexical route is not available to children and as they learn the alphabet they move on to processing words as a whole. Thus, when it comes to reading in Arabic, the relative strength of lexical and non-lexical processing seems to be a narrow framework.

As a result, the contribution and availability of phonological and lexical information may differ between languages, particularly in diglossic languages such as Arabic, where the spoken form differs significantly from the standard written form. In Arabic, diglossia refers to the existence of two forms of the same language: spoken Arabic (SA), which is acquired at home from birth and is dominantly used prior to schooling, and modern standard Arabic (MSA), which is formally learned and used around the age of 5–6 when children begin formal schooling (Ferguson, 1959; Omar, 1973), implying that MSA is not the first language of children who speak Arabic. Given the large linguistic distance between MSA and SA, diglossia has had a negative impact on Arabic reading acquisition and literacy development (Abu Ahmad et al., 2014; Saiegh-Haddad, 2007; Saiegh-Haddad et al., 2011). According to Abu-Rabia (2000), diglossia is a significant factor influencing low levels of reading among Arab speakers. Similarly, Saiegh-Haddad and Schiff (2016) discovered differences in reading MSA versus SA words in elementary school, supporting the effect of linguistic distance on the development of word reading in Arabic. However, it appears that lexical and non-lexical processing in Arabic are speculative and need to be empirically tested due to its diglossic nature (along with orthographic depth and complexity), which may result in low efficiency in automatic lexical processing, since children are not exposed sufficiently to unfamiliar MSA words, which may lead to involving stronger non-lexical processing while reading. When compared to other languages (such as English), which have unified spoken and written forms, phonological representations in Arabic written form differ from those in spoken form.

The present study

Studying reading among children in both transparent and deep orthographies simultaneously is a powerful approach that can provide useful insights on reading development in different orthographies and languages, by examining whether the reading system in two orthographies (languages) is flexible to the orthography that represented. Apart from the grain size accommodation hypothesis that postulates that bilinguals have similar reading mechanisms that are influenced by their reading experience in both their languages, few studies suggest that they adapt depending on the depth of the language (Egan we al., 2019). Accordingly, few studies in language reading research have investigated the lexical and non-lexical processes in bilingual (or trilingual) contexts in general, or among native Arabic speakers in particular. If the balance between lexical and non-lexical processing is language-specific (Frost et al., 1987; Katz & Feldman, 1983; Katz & Frost, 1992), this assumption would have important consequences for understanding reading in different languages.

Given the co-occurrence of diglossia and orthographic transparency in Arabic, especially Arabic uniqueness, and the difference between predictors of reading acquisition in Arabic vs. English, based on the assumption that there is differnce in the orthographic depth of Arabic (its two versions: vowelized/transparent and unvowelized/deep) vs. English, the current study compares the involvement of lexical vs. phonological routes to reading in these languages. It will be informative to compare the involvement of lexical vs. phonological routes to reading between vowelized and unvowelized Arabic (L1) and English (L3). The logical hypotheses in this case is that in the deeper English orthography as well as unvowelized Arabic script, variables related to lexical reading would be more clearly linked to reading skills, while variables linked to phonological decoding would be related to reading vowelized Arabic.

Taking into account the different orthographic depths of Arabic (and its unique features) and English, examining word reading in Arabic can determine whether the traditional dual-route model or connectionist model are applicable to it. Thus, it is interesting to see whether young Arabic sequential trilinguals have functionally separate lexicons for each form of their languages (L1 and L3). Accordingly, our research question is as follows: Are there differences between the relative contributions of lexical and non-lexical processing in word reading according to the the existed assumption of orthographic depth in transparent vs. deep orthographies in Arabic (L1, vowelized and unvowelized) and deep English (L3)?

Methods

Participants

A total of 532 trilingual Israeli Arabic-speaking children (289 girls) participated in this study. The sample comprised 267 and 265 participants from the fourth (Mage in month = 117.2; SD = 6.1) and fifth (Mage in month = 129.3; SD = 5.8) grades, respectively. They were recruited from 33 Arabic-speaking elementary schools from the north, center, and south of Israel, and came from various socioeconomic backgrounds. The children were native speakers of different dialects from the north, and the Triangle region in Israel. The group consisted of sequential trilingual children who spoke Arabic as their native language; it was the main instructional language in their schools, while they studied Hebrew (second language starting from second grade) and English (as a third language from third grade) as additional required subjects. Because English was learned beginning in third grade, the best opportunity to examine reading acquisition of Arabic and English was from fourth and fifth grade.

The teachers in the target programs used the specified curriculum provided by the Israeli Ministry of Education. Children had studied Hebrew (L2) for two hours a week from second grade (and five hours a week from third grade), and English (L3) for five hours a week from third grade. When children start learning to read vowelized Arabic, the teaching instructions focus on graphemic-phonemic recoding and decoding strategies. Children start learning to read in English in the beginning of grade three. The inconsistent orthography forces them to rely on larger orthographic units more than on phonemes and grapho-phonemic recoding strategies alone. Decoding is also taught when learning to read in English, however, the focus is on whole-word reading.

The tester selected every fourth name from the class list (approximately eight children per class). The study was approved by the college ethical committee. Only children who presented filled parental consent forms participated in the study. All participants were in regular classes – that is, none of them were in special education classes. None of them had visual, hearing, language, or learning difficulties of any kind.

Measures

Children were assessed using measures of isolated word reading in Arabic and English, phonological awareness and memory, and vocabulary. The measures used here were presented to five teachers who worked with children in this age group, and all of them thought that the items were appropriate for students of the fourth and fifth grade. All measures, except for word reading in English, were used in previous studies (Asadi & Khateb, 2017; Asadi, 2020).

Arabic word reading

This test examined the reading accuracy of Arabic vowelized words. The task comprised 30 vowelized words from children’s third- and fourth-grade textbooks. The words represented several morphological patterns in Arabic and varied in terms of length (two to four syllables) and the children’s familiarity (low, medium, and high). An identical list of non-vowelized (without short vowels), but not homographic words was also presented. Non-homographic words were chosen to avoid the probability of the words being read correctly while being pronounced differently. A third list of pseudowords comprising the same words as those of the vowelized version (after changing two phonemes such that the words became meaningless in Arabic) was also included. Each participant was required to read the words out loud as accurately as possible at a rate that suited them. The participants’ scores were based on the total number of correctly read items, with a maximum score of 30 in each version. The reliability (Cronbach’s α) was 0.93, 0.88, and 0.91 for the vowelized, non-vowelized, and pseudowords versions, respectively.

English word reading

This test examined the reading accuracy of the isolated words in English. The task comprised 30 words from children’s third- and fourth-grade textbooks. Most of the words were monosyllabic, whereas a few were bi-syllabic and disyllabic. The words varied in terms of children’s familiarity (low, medium, and high). To assess words familiarity, five English teachers were required to point to a list of stimuli to indicate children degree of familiarity to words (to what degree the word is familiar to children) (Noble, 1952).

Like the Arabic reading task, each participant was required to read the words out loud as accurately as possible at a rate that suited them. The participants’ scores were based on the total number of correctly read items, with a maximum score of 30. The reliability of the test (Cronbach’s α) was 0.92.

Phonemic deletion (in Arabic)

This test examined the ability to delete phonemes at the beginning (40) and at the end (40) of words. The items were mono- and disyllabic and included three to six phonemes. They were generated from the SA (20 items), MSA (20 items), sharedFootnote 1 (20 items) and pseudo words (20 items). During the test, each word was read out to the participant, who had to repeat it after the examiner and then say it again after deleting a specific phoneme. One point was assigned for every successfully deleted final or initial phoneme, with a maximum score of 80. The reliability of the test (α) was 0.92.

Phonemic segmentation (in Arabic)

This test examined the ability to repeat and segment words into their basic sounds. The items used in this task were the same as those used in the phonemic deletion task. The participant had to repeat each word after the examiner and segment it into separate sounds. One point was assigned to every successfully segmented word with a maximum score of 80. The reliability of the test (α) was 0.90.

Receptive vocabulary (in Arabic)

This test evaluated semantic knowledge at the perceptual level using a list of 30 literary Arabic words including verbs, nouns, and adjectives. The participant heard a target word followed by three other words and was required to choose the one word that had a similar meaning (synonym) to that of the target word. Each correct answer earned one point. The maximum score was 30. The reliability of the test (α) was 0.82.

Expressive vocabulary (in Arabic)

This test examined semantic knowledge at the production level using a list of 32 literary Arabic words including verbs, expressions of time, quantity, and adjectives. Each word was presented auditorily to the participant, who was asked to give its opposite. Each answer was compared to the possible list of correct answers. Each correct answer earned one point. The maximum score was 32. The reliability of the test (α) was 0.83.

Procedure

The participants were tested individually by the examiner in a quiet room in two short testing sessions in the third trimester (between April and June). To prevent the effects of fatigue and to avoid fluctuations in concentration among the participants, there were short breaks after each test. In one session, the children took the Arabic word reading test and one test on phonological awareness and vocabulary. In the other session, which took place about one or two weeks later, the children took the English word reading test and phonological awareness and vocabulary. All examiners were students in the field of education and had received specific and detailed training on administering the different tasks.

Results

Table 1 presents the descriptive statistics of the observed variables of the participants’ scores. The means reflect the raw scores of all variables. The performance was acceptable for all variables. There were no indications of floor or ceiling effects. Bivariate correlation.

Table 1 Descriptive statistics of raw scores mean and SD

among all the observed variables are presented in Table 2. As shown in Table 2, there were significant correlations between all variables ranging from 0.26 to 0.64 (p < .001). Arabic Pseudo word reding correlated positively with Arabic-vowelized/transparent (r = .64, p < .001), and Arabic-unvowelized/deep (r = .47, p < .001). English word reading also correlated positively with phonemic segmentation (r = .40, p < .001), phonemic deletion (r = .42, p < .001), receptive vocabulary (r = .45, p < .001), and expressive vocabulary (r = .43, p < .001).

Table 2 Correlation analyses* for all observed variables

Structural equation modeling (SEM) analysis was used to assess the relative contribution of the basic components of the phonological and lexical routes to reading accuracy. Maximum likelihood estimation procedures were used to analyze the explained variance of the predictors (latent variables) using AMOS 20.0 (Arbuckle, 2011), simultaneously in reading Arabic (pseudowords, vowelized/transparent and unvowelized/deep versions) and in English. The goodness of fit indices used were the root mean square error of approximation (RMSEA), the comparative fit index (CFI), and the Tucker-Lewis Index (TLI). As indicated by the fit indices, RMSEA = 0.02, CFI = 0.99, and TLI = 0.99, the SEM model had a good fit to the data.

The results of the SEM analysis showed that the latent variables of both the phonological (represented by phonological deletion and phonemic segmentation) and lexical (represented by receptive and expressive vocabulary) routes explained 26% of the variance in pseudowords, 31% in vowelized/transparent Arabic, 33% in the unvowelized/deep Arabic orthographies, and 40% in English (see Fig. 1). The contributions of both latent independent variables to reading were significant in all versions, except for that of the lexical component to pseudowords, which was not significant. In Arabic real word reading, the contribution of the latent variables was similar in the vowelized/transparent and unvowelized/deep versions, with a higher contribution of the lexical route in the unvowelized/deep version. Compared with English, the contribution of the independent latent variables, shown by the standardized coefficients (Beta-β), provided opposite patterns between the vowelized/transparent Arabic and the deep English models. Whereas the contribution of the lexical component was more dominant in English (β = 0.49; p < .001) than in vowelized/transparent Arabic (β = 0.21; p < .05), the phonological contribution was more dominant in vowelized/transparent Arabic (β = 0.38; p < .001) than in English (β = 0.20; p < .05). This pattern was also present, but more moderately, when comparing English with unvowelized/deep Arabic.

Fig. 1
figure 1

Structural equation model for predicting word reading accuracy in Arabic (pseudo-words, vowelized/transparent and unvowelized/deep orthographies) and in English languages. Note: All contributions (Beta-β) of the latent variables were significant at p < .001, otherwise specified. n.s = non-significant; *p < .05

Discussion

This study examined the differences between the relative contributions of lexical and non-lexical processing among Arabic-speaking children in Arabic (L1) and English (L3) word reading. The results showed that the contribution of the latent independent variables (lexical and phonological) to reading gradually changed based on the orthographic depth. They also seemed to be dependent components.

The contribution of the lexical route was minimal in Arabic pseudowords, increased in Arabic vowelized and unvowelized words, and increased significantly in English word reading. However, the contribution of the phonological variable was maximal in Arabic pseudowords, and decreased in Arabic vowelized and unvowelized words, until it decreased significantly in English word reading. This finding supports the approach in which lexical-phonological route involvement is thought to vary from one language to another, depending on the orthographic depth of the language (Share, 2008). As for Arabic real word reading, the contribution of the latent variables was similar in the vowelized/transparent and unvowelized deep versions, with a higher contribution of the lexical route in the latter. While the contribution of the lexical component was more dominant in English than in vowelized/transparent Arabic, the phonological contribution was more dominant in vowelized/transparent Arabic. This pattern was also present, but more moderately, while comparing English with unvowelized/deep Arabic.

These findings align with those of earlier studies on monolinguals and bilinguals (Forster et al., 1987; Goswami et al., 2001; Hoosain, 1991), indicating that the different involvement of lexical or non-lexical routes is based on the orthographic depth of different languages. Readers of Arabic, a more transparent orthography compared to English, were more likely to rely on non-lexical phonological coding. However, a direct lexical process was more applicable to reading in deeper orthography (English).Based on the dual-route model, the phonological route alone contributed to Arabic pseudoword reading. The contribution of the lexical route was almost non-existent. As expected, according to the dual-route model, pseudowords in Arabic represent the sounds of the language with high orthographic transparency. As readers were not exposed to non-words (they had no supporting visual information), they were more inclined to rely on non-lexical phonological coding (Goswami et al., 2001; Hoosain, 1991). This finding imply that at the beginning of reading (when the vowelized script is introduced first), the lexical route is not available to children and as they learn the alphabet they move on to whole-word orthographic representations and processing.

Despite the fact that this difference is minor, the conclusion that raises the orthographic depth question in Arabic is ambiguous, implying that we cannot assess Arabic orthographic transparency solely on vowelization. This contradicts the Arabic transparency assumption for vowelized words. Asadi and Khateb (2017) conducted a previous large-scale cross-sectional study on vowelized Arabic reading among first to sixth graders. The findings showed that vowelized Arabic differed from other transparent orthographies, with consistent contributions of phonological awareness to reading transparent orthographies through sixth grade. These findings are consistent with previous research on the long-term impact of phonological awareness on Arabic reading (Abu-Rabia et al., 2003; Mannai & Everatt, 2005; Taibah & Haynes, 2011). Moreover, the Asadi and Khateb (2017) investigated the role of phonological awareness and vocabulary in first- and second-grade Arabic children reading vowelized/transparent and unvowelized/deep orthographies. The results revealed that the contribution of phonological awareness to reading was strong and similar in both the vowelized/transparent and unvowelized/deep orthographies, and decreased in both versions in the second grade. The contribution of vocabulary to reading, on the other hand, increased with grade and was greater in the unvowelized orthography. As a result, the minimal difference between vowelized/transparent and unvowelized/deep orthographies can be attributed to Arabic’s vowelization feature, which requires a reader to decode all available phonological information (Asadi & Khateb, 2017).However, the similar involvement of phonological and lexical routes in reading in both vowelized and unvowelized versions (observed in this study) may imply that vowelization, the solely criterion for transparency/depth in Arabic, does not improve reading performance as previously suggested (Saiegh-Haddad & Schiff, 2016; Taha, 2016).

The difference in contribution of the lexical route was minimal in both vowelized and unvowelized orthographies, with a slight advantage for the latter. Children in the fourth and fifth grades seemed rather dependent on the phonological route, and had not developed word patterns efficiently. This insufficient lexical contribution to reading both vowelized and unvowelized orthographies in Arabic may be related to diglossia, which may lead to less efficient automatic lexical processes and has a negative effect on reading acquisition (Abu Ahmad et al., 2014; Saiegh-Haddad, 2007; Saiegh-Haddad et al., 2011). These results have implications for lexical processes in reading in Arabic, where words must be retrieved from a complex words in the lexicon including both MSA and SA words (Nevat et al., 2014). Some linguistic skills in standard Arabic are not well promoted in children’s oral language. This can negatively affect the relationship between these standard linguistic skills in the oral language and their correspondents in the written language. The lexical quality hypothesis (Perfetti, 2007) posits that different features and characteristics of words (including phonological, morphological, semantic, and syntactic characteristics), represent the properties of the mental lexicon. The quality of these representations is thought to be influenced by the experiences of children in using their oral language. Researchers have claimed that Arabic-speaking children arrive at school with some immaturity in different aspects of their literary oral language that are less represented in their mental lexicon (Saiegh-Haddad & Joshi, 2014) and are considered, along with the complexity of the orthographic system (Ibrahim et al., 2002), one of the main reasons for the difficulties that Arabic-speaking children face in their reading acquisition (Saiegh-Haddad, 2003).

To conclude, lexical vs. phonological route involvement varies between languages, based on their depth. The shallow/deep orthography in the same language, Arabic, didn’t yield differences, thus, there is a need for reconsideration of vowelization as the sole criterion in determining the orthographic transparency/depth continuum of Arabic. Based on the dual-route model, only the phonological route contributed to Arabic pseudoword reading, whereas the lexical route did not. However, varied contibution of lexical and phonological routes was observed in English and Arabic word reading, English requires more lexical skills alongside phonological ones, while Arabic requires more phonological skills alongside lexical ones. The findings of this study have important theoretical implications. They support the connectionist model and the varies contribution of the latent variables (lexical and phonological) to reading Arabic real words and English words. They corroborate the view that based on the assumption that this difference in orthographic depth exists, lexical-phonological route involvement varies from one language to another which contributed differently to reading in the two languages as was argued by the connectionist model, rather than traditional dual route model which indicated that the two routes are independent. However, they support the traditional dual route model in reading Arabic pseudoword, while separate independent phonological contribution was observed. Thus, our findings may impact the validity and suitability of the dual route model to Arabic (Share, 2008) since the contribution of lexical vs. phonological routes to reading did not differ according to writing system (vowelized/transparent vs. non-vowelized/deep) while reading real words. Thus, we have to postulate the dual route model carefully in case of Arabic because its orthographic depth did not differ on the two different writing systems (vowelized and non-vowelized), this calls into question the issue of transparency in Arabic.

Unlike previous work that has highlighted how Arabic and English is read, mostly separately, the current study demonstrates Arabic and English reading processes simultaneously, showing that the same reader may involve different lexical and phonological processing based on the orthographic depth of the language, necessitating different instructional approach for each of them.

The current research implies that the focus of Arabic language approach among Arabic speaking children is different from English. From first grade, children become used to analytic-based reading instruction when reading Arabic (and later reading Hebrew as a similar Semitic language); however, this approach may not be applicable to English reading, which requires high lexical alongside phonological skills in the instruction as reflected in our findings. Therefore, the current study recommends English teachers to focus on lexical and whole language reading in their instruction alongside phonological decoding, which is different from the analytic-based reading used in Arabic and Hebrew. The current study encourages policymakers to bring the research and applied field together to help Arabic speaking children to overcome obstacles in reading English in particular, and other readers who may encounter similar situations in reading and learning different and distant orthographies. It calls for language teachers to implement adequate strategies and instruction to help Arabic pupils become fluent readers in English, and proposes English instruction or exposure to especially lexical English words (alongside phonological decoding) begin at an earlier age before access to written English. Moreover, this study demonstrates the contribution of Arabic phonological and lexical skills to reading in English, it shows the contribution of L1 skills to L3 reading. The results recommend policymakers to not reconsidering vowelization as the sole criterion for determining orthographic transparency/depth continuum in Arabic.

This study also highlights the centrality of enriching children in MSA word patterns to minimize the gap between written and spoken language and to develop a lexical route for words. Future research should test reading speed rather than accuracy. It should include sentences and texts rather than isolated words to better understand how the two orthographic versions of Arabic are read when compared to English. Moreover, it should examine the relative strength of lexical and non-lexical processing in Arabic, Hebrew (two similar Semitic languages) and English orthographies. It should be noted that English among our population is L3, which may impact and impede their knowledge of English, future studies should examine our research question when English is L2 among Arabic speakers. Researchers should also examine this question among sixth or seventh grade students, when they are more proficient in English, and consider controlling for factors such as non-verbal intelligence, and also to test orthographic knowledge.