Introduction

Lexical databases not only provide a rich array of lexical properties from orthographic, phonological, and semantic perspectives for researchers to access and retrieve, but also facilitate research in lexical processing by providing high-quality and reliable normative data. Despite the fact that several databases on Chinese language, a logographic writing system in which a written character usually represents a word or a semantic unit (De Francis, 1989; Yang et al., 2009), have been constructed over the past two decades (Liu et al., 2007; Cai & Brysbaert, 2010; Sze et al., 2014; Chang et al., 2016; Tse et al., 2017; Tsang et al., 2018; Sun et al., 2018; Wang et al., 2020), resources based on reading materials for school-age children in Mainland China are surprisingly limited (Shu et al., 2003; Xing et al., 2004; Cai et al., 2021; Li et al., 2022). Most recently, Li et al., (2022) has developed a lexical database that incorporates Chinese characters and words sampled from elementary school children’s curricular and extracurricular books and provides frequency statistics. Nonetheless, it remains to be explored how various lexical properties of these characters and words change with grade levels, and how these factors affect lexical processing among readers of different age.

To fill these gaps, in the present study we aim to establish a database for Chinese characters and words collected from the official textbooks recently issued by the Ministry of Education of the People's Republic of China (MOE) for most elementary schools in Mainland China, and further identify the trajectory of lexical factors and its effects on Chinese literacy acquisition. In the following sections of the introduction, we first present an overall picture of Chinese lexical units followed by a summary of existing databases. We then focus on the source of our database – the current textbooks used in most elementary schools in Mainland China. Finally, we present a detailed description of the lexical variables involved in our database.

Chinese lexical units

Most words in Mandarin Chinese are represented in disyllabic forms, i.e., with two characters (He & Li, 1987). Chinese characters are fundamental building blocks of a Chinese lexicon. Taking a close look at the internal structure of each Chinese character, one can find that it is composed of various radicals, which are also formed by different strokes arranged in a stipulated order (DeFrancis, 1989). For example, the character “江”, meaning river, consists of two radicals: the left radical “氵” and the right radical “工”. “工” is also composed of two basic strokes: the horizontal “一” and the vertical “|”. Therefore, the writing system of Chinese is organized in a hierarchical way: from words to characters, to radicals, to strokes (McBride, 2016).

It has been shown in previous studies that learning Chinese characters is affected by the combination and distribution of different sub-lexical levels, such as radicals or strokes. For example, Tong and McBride (2014) found that children had already developed the positional awareness (i.e., the ability to know the positional constraints of radicals or stroke patterns) as early as in kindergarten and were able to name pseudo-characters after simple instruction. Moreover, the diverse and flexible combination of radicals in Chinese is partly due to the fact that approximately 80% of characters are compound characters, which are composed of two or more radicals, according to DeFrancis (1989) and Zhu (1988). Most compound characters include two critical radicals: a semantic radical that provides lexical-semantic cues for a character, and a phonetic radical that shares pronunciation information with a character (Li et al., 2022; Sun et al., 2018; Wang et al., 2020). Take “洋” (yáng, meaning ocean) as an example. The semantic radical, ‘氵’ indicates that this character is related to water, while the phonetic radical, ‘羊’ on the right side, shares the same pronunciation with ‘洋’. These compound characters are termed phonograms in the literature. Nevertheless, Fan et al. (1984) suggested that only a limited number (about 26%) of phonograms could bear phonological similarity with their phonetic radicals. Furthermore, semantic radicals have varied transparency (referring to the extent to which the lexical meaning of the character can be derived from its semantic radical) and combinability (also known as neighborhood density, referring to the number of compound characters that share the same semantic radicals; Chen et al., 2006; Feldman & Siok, 1999; McBride, 2016; Shu et al., 2003; Tzeng et al., 1995). Knowing the regularity or pattern embedded in phonograms facilitates lexical recognition and literacy acquisition (Ho et al., 2003; Packard et al., 2006).

Existing Chinese lexical databases

As more researchers have begun to realize the importance of lexical databases for psycholinguistic research, several large-scale Chinese lexical databases have been published. Table 1 summarizes 12 Chinese lexical databases in the past two decades. The materials of four databases on written language were extracted from various corpora published previously and three databases were based on sampled characters and/or words from Cai & Brysbaert (2010), a frequency database for spoken language collected from film and television subtitles. Although they differ in sources and materials, most of them included normative data from adults and most relevant factors that affect Chinese character and word processing. To achieve this, these databases included a wide range of lexical variables such as phonology, orthography, semantics, and frequency to examine their effects in lexical processing tasks. Note that some databases not only have expanded the number of Chinese characters and words but have also delved into more precise classifications while taking into account the relationships among characters. Chang et al. (2016), for example, examined the consistency effect (i.e., the number of characters sharing the same phonetic radicals and have the same sound) on naming RTs and found that this effect was modulated by character frequency. Furthermore, Tsang et al. (2018) and Sun et al. (2018) extended their investigations by including three-character words or even four-character words, which basically encompass all constructive units in Chinese, offering the possibility to explore the relationship between words and characters.

Table 1 Published Chinese lexical databases over the last 20 years

Among these studies, lexical decision and word-naming tasks are most frequently used to test the effects of various lexical characteristics in word identification (Katz, et al., 2012). Most recently, Wang et al. (2020) have adopted dictation task to study the contribution of 14 lexical variables on 1600 characters to writing measurements, including writing latency, duration, and accuracy, which provides a valuable resource for studies on Chinese handwriting. In the present study, the normative data were collected through the word-naming task as Liu et al. (2007) and Chang et al. (2016).

It should be noted that most existing norms of the above-mentioned Chinese databases are exclusively obtained from materials for adults, so they may not be suitable for studies on school-aged children. On the other hand, children-centered databases have been well established in most alphabetical languages, such as the Children’s Printed Word Database (Masterson et al., 2010; Stuart et al., 2003) for British English; NOVLEX (Lambert & Chesnet, 2001) and MANULEX (Lété et al., 2004) databases for French; LEXIN(Corral et al., 2009) and ONESC (Martín & Pérez, 2008) databases for Spanish; Lessico Elementare (Marconi et al., 1993) for Italian; childLex (Schroeder et al., 2015) for German; and ESCOLEX (Soares et al., 2014) for Portuguese.

Shu et al.’s (2003) was the first study to investigate various properties of Chinese characters and their distributions at each grade level based on elementary school textbooks (1996) issued by the Ministry of Education used in Beijing and other regions at that time. Specifically, they focused on the development of certain features, e.g., visual complexity and the proportion of regular character (i.e., characters containing phonetic radicals that can provide salient clues to the pronunciation of the character), across different grades. This database indeed has been a useful tool for studies on Chinese language acquisition in children (e.g., Chen et al., 2009a, b; Li et al., 2012, 2020; Song et al., 2015; Tong et al., 2009). Xing et al. (2004) also constructed a corpus of Chinese characters from the textbooks issued more than two decades ago (Beijing Academy of Educational Sciences, 1998). They examined the role of regularity, consistency, and frequency in Chinese character processing, and confirmed the effects of these lexical variables in the acquisition of Chinese characters. Other databases established on textbooks can be found in Cai et al. (2021), which include new age of acquisition (AoA) norms for 3300+ simplified Chinese characters based on Chinese textbooks of the 2001 and the 2011 editions, both of which were published by the People’s Education Press and were once widely used in elementary schools in mainland China. Moreover, they found that the objective AoA norms from these textbooks can well explain accuracy and reaction times in lexical access tasks used in previous databases. Given the fact that textbooks for elementary schools in mainland China have been undergoing changes in recent decades, it is necessary to develop new databases based on newly issued textbooks. Therefore, in the present study, we aim to establish a more up-to-date Chinese lexical database from children’s learning materials. Such an attempt can help to uncover how the properties of Chinese lexical units evolve across different developmental stages and provide an effective tool for studies on Chinese language learning.

The instructional materials

In mainland China, the instructional materials used in elementary schools (i.e., Chinese language textbooks) are the primary source for children to receive formal language education. The series of Chinese language textbooks have undergone more than 70 years of revisions, from the initial Mandarin Chinese Textbook, which was revised and adapted by the North China Textbook Editorial Committee in 1949, to the most recent Chinese Compulsory Education Textbook, which was validated by the MOE in 2017 (Li, 2018; Wang & Chen, 2019). Textbooks have varied greatly from version to version over the past several decades in terms of (1) requirements and (2) selections for texts and words. Issued in the fall of 2017, the new edition of the textbooks, which places more emphasis on the scientific arrangement of the content, was introduced for use nationwide. These textbooks are the first version that begins with learning Chinese characters rather than Pinyin (the official romanization scheme for Mandarin used in mainland China and some other Chinese-speaking regions) in the first volume, demonstrating that this set of textbooks attaches great importance to character learning. Moreover, there are a total of 12 volumes of the textbooks, ranging from the first to the sixth grade, with two volumes for each grade level. There are nearly 20 pieces of texts distributed across eight units for each volume. The additional section Learning Activity summarizes and extends basic information for each unit to help students enhance learning.

According to Wen (2016, 2017), the editor-in-chief of this edition, the textbooks were compiled based on the principle of "a separation of recognition and writing, more recognition and less writing". This principle foregrounds the distinction of ‘learning to read’ and ‘learning to write’ in that they set different goals for children; ‘learn to read’ is less demanding than ‘learn to write’ because the latter one requires children to write Chinese characters in stipulated sequences, which calls for a higher level of memory and motor schema (Zhang et al., 2021). Furthermore, the textbooks emphasize the need to learn Chinese characters through a combination of Pinyin, structure, and components. It may be worth noting that Chinese orthography places different demands on school-age children at different stages to acquire the knowledge of orthography, phonology, and the meaning of characters. For example, children in early grades are provided characters of simple structure and high frequency. As their orthographic awareness becomes mature in later grades, they begin to consolidate those already-learned characters and learn new characters and words with relatively low frequency. Accordingly, the 300 fundamental characters which are necessary for reading are provided in the first volume, reflecting the influence of phonological mediation, orthographic rules, and frequency effects on lexical access from a psycholinguistic perspective. Li (2018) also pointed out that Chinese characters that children need to write in the early years are generally simple characters with strong productivity so as to form other compound characters or words. In this regard, the learning materials can reasonably corroborate the evidence of children’s learning trajectory during their literacy development. Therefore, the present study aimed to examine the objective linguistic attributes of Chinese lexical units based on this set of textbooks. This attempt will provide more evidence to support the development and improvement of learning materials.

The current database

The current database contains a total of 2999 unique Chinese characters and 2182 words from textbooks for Grades 1 to 6. We incorporated multilevel lexical features regarding orthography, phonology, meaning, and frequency. The norming data collected from the word-naming task (primarily reaction times and accuracy) among school-aged children and adults were also included in order to examine the influential factors of reading in Chinese. Compared with previous databases, this study extends the current body of database in at least three aspects.

First, the current database systematically investigated the new version of instructional materials for elementary school children. Notably, in order to make suggestions for teaching Chinese language, we tested whether the distribution of characters and words across all grades was truly based on changes in some important lexical attributes. Also, we can use the data from the word-naming task to examine the contribution of these lexical variables to word recognition and production.

Second, the database is important for studies on the effect of age of acquisition (AoA). Numerous studies have demonstrated that words acquired at an earlier age have a significant advantage in word recognition and production (Bonin et al., 2001; Carroll & White, 1973; Gilhooly & Logie, 1980; Johnston & Barry, 2006; Pérez, 2007). However, most previous studies on AoA have relied on subjective measures, which require adult subjects to determine the age when they were able to speak or read a certain word. Such subjective recalling is easily confounded with other factors such as word frequency, familiarity, and/or complexity (Morrison et al., 1997; Morrison & Ellis, 2000) and therefore raises concerns about its validity (Stadthagen-Gonzalez & Davis, 2006). Due to the fact that most Chinese characters and words are acquired in school at early ages, a more objective way to define the AoA is the grade in which a character or a word first appears in textbooks (Cai et al., 2021; Shu et al., 2003; Wang et al., 2020). It should be noted that the textbook-based estimation of AoA centers on written characters and words, which differ from spoken forms. As Cai et al., (2021) indicated, children may have been exposed to spoken forms before school age. Moreover, the acquisition of spoken forms does not entail the acquisition of written forms of characters and words. Therefore, it bears reiterating that the AoA collected in the current database (represented by Volume) reflects the age of learning written forms of Chinese characters and words.

Third, patterns generalized from the current database can help children learn Chinese characters more efficiently. Previous studies have suggested that children are sensitive to statistical structures in a given language environment and are capable of extracting embedded abstract regularities to acquire languages (known as statistical learning, see Perruchet & Pacton, 2006; Romberg & Saffran, 2010; Saffran et al., 1996). Such ability has also been demonstrated to have associations with children’s literacy outcomes (Arciuli & Simpson, 2012; Spencer et al., 2015). In character learning, statistical learning is manifest in many aspects, such as knowing the pronunciations of phonograms by consulting their phonetic radicals and developing awareness of visual orthographic regularities. The current database has provided information regarding to what extent the phonetic radical can inform the sound of character by categorizing characters into different types.

Lexical variables

Previous studies have shown that visual complexity, the number of different pronunciations and meanings, and character frequency play important roles in Chinese word recognition and production (Chang et al., 2016; Hsu et al., 2011; Liu et al., 2007; Peng et al., 2003; Peng & Wang, 1997; Tan & Perfetti, 1997; Wang et al., 2020). The multilevel lexical variables presented in the current database focus on four main aspects of Chinese characters and words: orthography, phonology, meaning, and frequency.

Strokes and radicals

As the word length effect found in alphabetic languages, the stroke number effect detected in considerable studies on Chinese word recognition also suggests that response latency increases with the number of constituent strokes (Chen et al., 1996; Fang, 1994; Leong et al., 1987; Su & Samuels, 2010; Tan & Peng, 1990). The number of strokes in the current database was extracted from Xinhua Dictionary (新华字典, Linguistics Institute of Chinese Academy of Social Sciences, 2020), and the information about character radicals was obtained from Dictionary of Chinese character properties (汉字属性字典, Fu, 1989). The number of radicals is also a common proxy to measure visual complexity of Chinese characters, as indicated in previous studies (e.g., Liu et al., 2007; Wang & Dong, 2013; Xing et al., 2004). To ensure the number of radicals could also reflect the visual complexity of characters, we dissected the characters into basic radicals and then calculated the total number of radicals (Wang et al., 2020). Take ‘别’ (bié, other) as an example: The basic radicals are ‘口’, ‘力’ and ‘刂’ rather than ‘另’and ‘刂’, so the number of radicals is 3.

Meaning and pronunciation

Some characters such as ‘水’ (shuǐ, water) and ‘人’ (rén, human beings) convey specific meanings. Alternatively, many characters have multiple meanings and thus are semantically vague and highly context-dependent (Perfetti & Tan, 1998; Tan et al., 1995). For example, the character ‘打’ (dǎ) means ‘to get or obtain’ when it combines with the character ‘水’ as ‘打水’(dǎshuǐ); but when it precedes the character ‘球’ (qiú, ball), it means ‘to play’. Many studies have demonstrated that semantic ambiguity has an effect on word recognition (Borowsky & Masson, 1996; Kellas et al., 1988; Rodd et al., 2002).

In addition, some Chinese characters may have different pronunciations with different combinations of other characters (known as heteronyms). We also incorporated this attribute into the database to see how the variability of pronunciations affects lexical access. For example, the character ‘好’ has two pronunciations that differ in the lexical tone: hǎo and hào, representing different syntactic categories. When it is pronounced as hǎo, it is an adjective that has six different meanings including ‘good’, ‘friendly or harmony’, and so on. When it is hào, it is a verb meaning ‘to love or like’. We retrieved the meaning and pronunciation information for all characters from Xinhua Dictionary (新华字典, Linguistics Institute of Chinese Academy of Social Sciences, 2020). For words, we resorted to Modern Chinese Dictionary (现代汉语词典, Chinese Academy of Social Sciences, 2012), The Great Chinese Word Dictionary (汉语大词典; Luo, 1986) and Xinhua Idiom Dictionary (新华成语词典, Lexicographical Center of Commercial Press, 2002). However, 87 words were not found in these dictionaries and were treated as missing values in our analysis. Note that only literal meanings of characters and words were used in the current study. We did not consider the sense of characters and words that goes beyond the dictionary definitions, such as figurative, sarcastic, or ironic usages because children mainly learn the most basic meanings of these characters in elementary school stages.

Structure

The relative positions of constituent radicals within characters also affect character recognition (Feldman & Siok, 1999; Li et al., 2012, 2000; Yang et al., 2019). In the current database, we identified six primary structures to capture the relationship of main radicals within a particular character: single, right-left, top-down, full-surrounded, semi-surrounded, and others (including two special structures, one of which is arranging three identical components, like ‘品’ or ‘晶’; the other one is ‘embedded structure’, like ‘爽’ or ‘巫’). Considering the vast majority (80%) of characters in our database are left-right (1744) and top-down (748) structure, the character structure was reclassified into three categories (i.e., left-right, top-down, and other) in the analysis below.

Type

Traditionally, all Chinese characters can be classified into six basic types based on the manners in which they are formed and derived according to Xu Shen’s Shuowen, also known as liùshū (‘six writings’). Characters within the type of Xiàngxíng (‘pictographs’) are physically similar to the entities or objects that they represent, like ‘日’ (roughly resembling the shape of the sun). Zhǐshì is the other type that denotes ideas by indicating things in a metaphorical way; for example, ‘上’ with the focus on the upper part, is later used to indicate a higher level or upward. Apparently, they have a high degree of iconicity as they are either depicting or indicating entities or parts. Another two categories, Xíngshéng (phono-semantic compounds, also referred to as ‘picto-phonetic characters’ or ‘phonograms’) and Huìyì (associative compounds or compound ideographs), involve the combination of character radicals. The former one is composed of semantic and phonetic radicals (Li, 1993; Yin & Rohsenow, 1994), whereas the latter one is composed of pictographic or ideographic characters to convey the meaning. The last two types are actually associated with character etymology, Zhuǎnzhù (derivative cognates) and Jiǎjiè (rebus or phonetic loan characters; Myers, 2019), so they were excluded from the current database. The criterion for type of characters in the current database is based on online dictionaries ChaZiWang (http://www.chaziwang.com/) and Hanzi Quanxi Ziyuan Yingyong Xitong (https://qxk.bnu.edu.cn/). Considering the vast majority of compound characters existing in the system, we only distinguished whether a character belongs to the type of phonogram in our statistical analysis.

Regularity

The ability of mapping letters to sounds is essential in reading acquisition (Brady & Shankweiler, 1991; Byrne, 1992; Goswami & Bryant, 1990). Regular words, which are in accordance with the GPC rules in alphabetic languages, have been demonstrated to be processed faster and more accurately compared to irregular words (Baron & Strawson, 1976; Parkin, 1982; Seidenberg et al., 1984). The correspondence between the written form and the sound in Chinese mostly manifests in phonograms, as we outlined earlier. Thus, we made a further distinction regarding the degree to which phonetic radicals inform the pronunciation of a character in order to obtain the patterns of regularity for these characters. Table 2 displays six types of phonological relations between compound characters and their phonetic radicals. The classification was adapted from Zhou and Marslen-Wilson (1999), with an additional type ‘unpronounceable’ added. For characters or phonetic radicals with multiple pronunciations, all pronunciations were considered. For example, the radical ‘隹’ has three pronunciations: zhuī, cuī and wéi. The character ‘准’ (zhǔn) shares the initial with one of the pronunciations of ‘隹’, thus ‘准’ is categorized as Alliteration. In addition, some radicals are presented in a different way when they serve as constituent radicals and independent characters; for instance, the upper part of the character ‘党’ (dǎng, party) is also a character ‘尚’(shàng, still), but it has been distorted and less likely to be recognized. Therefore, we treated those characters with distorted constituent radicals as “unpronounceable”. In our analysis below, three major types were investigated: regular, semiregular (including semi-regular, rhyming, and alliteration), and irregular (including both irregular and unpronounceable).

Table 2 Phonological relations between the compound character and its phonetic radical

logCHR-CD and logW-CD

Frequency effect, which has been observed consistently in a wide range of tasks across all languages, is a reliable and fundamental predictor in lexical access. In Chinese, Sze et al. (2014) summarized seven major Chinese character frequency norms established in the literature and suggested that the character frequency based on contextual diversity (CHR-CD, referring to the number of films in which a character occurs) from Cai & Brysbaert (2010) could account for the most variance (nearly 31%) in response times for the lexical decision in Chinese. The effect of character and word contextual diversity (CD) was also found in lexical processing in fourth-grade children (Huang, et al., 2021). Tse et al. (2017) compared six word-frequency measures in lexical decision performances and also observed the same patterns. Several subsequent studies used the subtitle frequencies from Cai and Brysbaert (2010) in their analyses (Sun et al., 2018; Tsang et al., 2018; Tse et al., 2017; Wang et al., 2020). In the current database, we included the logarithmic CHR-CD for all characters (represented as logCHR-CD) and logW-CD (Cai & Brysbaert, 2010) for all words to represent lexical frequency. Moreover, we included the contextual diversity (log-transformed) from CCLOWW in Li et al., (2022) for all characters and words to represent children's frequencies, which contrasts with adult frequencies from Cai and Brysbaert (2010). This variable was excluded from analysis (only reported in the descriptive statistics) due to the high collinearity with other frequency measures.

Count_Sum

In addition to the frequency measures obtained from the external resources, we also included the frequency of occurrence in the textbook. Here, Count_Sum refers to the number of times a character or a word appears in the texts throughout all 12 textbooks. The textbooks contain 162,177 character tokens (Grade 1: 5445; Grade 2: 14,434; Grade 3: 25,548; Grade 4: 35,211; Grade 5: 39,223; Grade 6: 42,316) and 110,226 word tokens (Grade 1: 3,803; Grade 2: 9,863; Grade 3: 17,534; Grade 4: 23,724; Grade 5: 26,902; Grade 6: 28,400), which were derived using Stanford CoreNLP with Chinese model in version 4.3.2 (Manning et al., 2014). This density information reflects how frequently children are exposed to these lexical items during Chinese learning at school. Forty-eight characters, which are absent in the main texts, but present in the Learning Activity section, are regarded as missing values in the analysis below.

Method

Participants

One hundred and fifty adults and 66 3rd–6th graders from a Chinese elementary school were compensated for their participation in this study. All of them were native Chinese speakers. Written informed consent was obtained from all adult participants and from the parents of all children before the experiment. All participants had normal or corrected-to-normal vision and reported no reading disorder or a history of neurological or psychiatric disorder. Information of participants is shown in Table 3.

Table 3 Sample size and personal characteristics in three groups

Materials

All Chinese characters and words were collected from the Appendices of the textbooks. The Appendix in each textbook includes one character list for reading, one character list for writing, and one list of words. For characters in the Reading list, children are supposed to be able to recognize them (mapping the orthographies onto their phonological forms and meanings). For those in the Writing list, children are required to write these characters correctly. Note that there is an overlap between characters in the two lists. Specifically, 2491 characters in the Writing list (which totals 2500 characters) are also present in the Reading list (which totals 3172 characters). It is often the case that a character in the Reading list will appear in the Writing list at a later grade. For example, the character “蛛” (zhū, spider) first appears in the Reading list in Grade 1 and then later in the Writing list in Grade 3. Such arrangement helps children consolidate their knowledge of characters and further achieve proficient reading and writing. There is no Reading list in Grade 6 and no Word list in Grade 1. Therefore, our database covers all characters that are required to be recognized from Grade 1 to Grade 5 (ten volumes), characters required for writing from Grade 1 to Grade 6 (12 volumes), and all words from Grade 2 to Grade 6 (ten volumes). All words were taken into analysis regardless of their number of syllables. There are 1966 disyllabic, 82 trisyllabic, and 142 tetrasyllabic words. Only nine disyllabic words are repeated in different grades. In the word-naming task, a total number of 2999 characters (Grade 1: 700; Grade 2: 899; Grade 3: 501; Grade 4: 479; Grade 5: 399; Grade 6: 3) and 2182 words (Grade 2: 496; Grade 3: 498; Grade 4: 426; Grade 5: 374; Grade 6: 388) were used as stimuli. They were randomly split into five sub-lists for the adult group. Four of the lists consisted of 600 characters and 436 words (1036 items), and one list consisted of 599 characters and 438 words (1037 items). For child participants, the number of lists was expanded to ten in order to reduce the experimental time for each of their visits. As a result, children were required to name 518 or 519 characters and words every time they performed the task.

Procedure

The present study was approved by the Ethical Committee of the State Key Laboratory of Cognitive Neuroscience and Learning at Beijing Normal University. Prior to the word-naming task, all participants were asked to complete a non-verbal Raven’s Standard Progressive Matrices (SPM, Raven et al., 1983) to ensure that they exhibit typical cognitive abilities. This test is crucial for child participants as it enables us to ascertain their typical developing status. Child participants were additionally required to complete two extra individual differences tasks: A Chinese character recognition task adopted from Li et al. (2012); Lei et al., (2011), and a word reading task (Zhang et al., 2012), both of which are used to measure children’s vocabulary knowledge and reading fluency. In the Chinese character recognition task, children read out aloud 150 characters listed in the order of increasing difficulty level; the procedure stopped if they failed to recognize 15 consecutive items. In the word reading task, children were asked to read 180 disyllabic words as quickly and accurately as possible; the score was calculated by dividing the number of correct responses by the time spent on this task. Children’s performance on two reading ability tasks was used as supplementary evidence to substantiate the differentiation between younger (3rd and 4th grader) and older children (5th and 6th grader) group in the analysis.

In the word-naming task, adult participants were randomly assigned to one of the five sub-lists, with each participant only naming one list. Each child participant completed three individual difference tasks on the first day and the naming task in the following few days. On average, each 3rd and 4th grader completed four or five sub-lists, while each 5th and 6th grader completed five or six sub-lists on separate days. As a result, each character or word received 30 responses from adult participants, 15 responses from children in Grade 3 and 4, and 20 responses from children in Grade 5 and 6. During the experiment, each sub-list was further divided into ten blocks with an equal number of trials. Participants were allowed to take a short break between every two consecutive blocks. All stimuli were presented in black with the SimSun font of size 30 at the center of a white screen. Participants sat in front of a computer at a distance of approximately 80 cm from the screen. Each trial began with a fixation presented at the center of the screen when participants were required to press the space bar to continue. Then, a target character or word appeared after a blank screen of 300 ms. Participants were asked to name the character or word as quickly and accurately as possible. The target disappeared upon a naming response or after 3000 ms without a naming response. Participants took a practice session with 20 trials, which were not included in experimental lists, before the formal experiment. The entire experiment lasted 30–40 minutes.

Data analyses

Naming accuracy (ACC) was coded manually by the experimenter. Incorrect responses (accounting for 2.53% of all responses) and absolute outliers (RTs that were faster than 200 ms or slower than 2500 ms) were excluded from further analysis. All participants achieved an overall ACC higher than 64%. We also rejected RTs over 2.5 standard deviations away from the mean for every participant. These trimming steps led to an exclusion of 5.95% of the data. To minimize individual differences in timing, we transformed each participant’s RTs into z scores. The average z scores across participants yielded zRTs for each item. ACC across participants for each item was also calculated. Numerical lexical variables, including the number of strokes, number of radicals, number of meanings, number of pronunciations, and Count_Sum, were log-transformed to better approximate a normal distribution as they are highly positively skewed (Hair et al., 2009). Categorical variables (i.e., Structure and Regularity) with three levels were independently coded into two new variables based on the dummy coding system (Hair et al., 2009) in line with previous studies (Chang et al., 2016; Liu et al., 2007; Wang et al., 2020). The structure was divided into a left-right structure and top-down structure. For the left-right structure variable, characters with a left-right structured were coded as 1, while others were coded as 0. The same criteria were applied to the top-down structure variable, with top-down structured characters coded as 1 and others coded as 0. Regularity was separated into regular and semi-regular variables, and a similar coding system was employed. For the regular variable, characters with regular pronunciation were coded as 1, and others were coded as 0. For the semi-regular variable, characters with semi-regular pronunciations were coded as 1, and all others were coded as 0.

The goal of our data analyses was threefold. First, to trace the gradual and subtle development in characters (in both Reading and Writing list) and words that children need to learn at elementary schools, we attempt to capture potential differences in those variables that may reflect essential features across six grades through Poisson regression and simple regression analyses. Due to the fact that most variables were count data, such as nStroke, nRadical, nPronunciation, nMeaning, Count_Sum, the Poisson regression analyses were performed to model these measures as a function of Grade; to compensate for the overdispersion identified in some variables, negative binomial models were used instead (Ismail & Jemain, 2007). Simple regression was used for the continuous variable logCHR-CD.

Second, to identify significant predictors for naming RTs and ACC among these lexical variables and investigate the relative importance of those predictors, we used a stepwise multiple regression analysis (MRA). For the word-level analysis, we included word-level variables (volume, length, nPronunciation, nMeaning, Count_Sum and logW-CD), character-level variables of the first (C1) and the second (C2) character of the word (lexical variables of the third (C3) and forth (C4) character were not included because three- and four-character words accounted for only 10% of the current database), and the sum value of C1-C4 (the log-transformed values across characters were summed, Sum_nStroke, Sum_nRadical, Sum_nPronunciation, Sum_nMeaning, Sum_Count-Sum, and Sum_logCHR_CD; see Table 9). It should be pointed out that averaged character-level factors were used in word-level analysis in previous studies such as Tsang et al., (2018). However, we used the summed value on the character-level to better capture the overall features of words. Items with zRTs longer or shorter than 2.5 standard deviations from the mean, as well as items with low ACC (< 50%) were removed before running the MRA (Sun et al., 2018). The application of the criterion resulted in the inclusion of 2677 characters and 2146 words in the younger children group, 2881 characters and 2182 words in the older children group, and 2941 characters and 2164 words in the adults group. All of the reported models below were constructed using a stepwise multiple regression and excluded cases listwise. Furthermore, the relative importance analysis was used to reveal how each variable contributes to the naming task performance. We calculated the lmg metric for variables entered the final model using the relaimpo package (Grömping, 2006) to examine the relative contribution of any particular variable.

Third, to understand how lexical effects would vary across three different aged-groups, we conducted a between-group analysis to compare the magnitude of those effects. Considering the abundance of predictors in current study, we selected those predictors which were revealed to be significant in all three groups in all analyses (naming RTs and ACC, on the character and word level) to determine the trajectory of reading performance among different age groups.

Results

Responses were divided into three groups: the younger children group, the older children group, and the adult group. The younger children group (mean age was 10.65, mean Raven’s SPM score was 42.2, mean character and word recognition scores were 112 and 86.9, respectively) included 3rd and 4th graders, and the older children group (mean age was 12.4, mean Raven’s SPM score was 43.05, mean character and word recognition scores were 124.8 and 97.65, respectively) included 5th and 6th graders. The result of independent t tests showed that the reading performance in the older children group was significantly better than that in the younger children group (see the Note in Table 3).

The character level

Table 4 shows the descriptive statistics of lexical variables and naming responses for all Chinese characters. Note that the frequency-related variables (logCHR-CD) of seven characters were not found in Cai and Brysbaert (2010). Table 5 presents the correlation (Pearson’s r) matrix among the log-transformed numerical variables and the character naming performance for the three groups. It shows that naming latency and accuracy for of all the three groups were highly correlated to all the variables except for the number of pronunciations.

Table 4 Descriptive statistics of 2999 characters
Table 5 Correlations of lexical variables and naming performance (zRT and ACC) for characters

Differences across the six grade levels

In the Reading list (see Fig. 1), it is obvious that nStroke and nRadical increase with grade level. Difference on nStroke is not attested in the comparison between Grades 2 and 3 (p = 0.51) and between Grades 4 and 5 (p = 0.16) according to the Tukey’s tests. nRadical in Grade 1 is significantly smaller than other grade levels. nMeaning, on the other hand, decreases with grades, but further analysis reveals no significant difference between Grades 2 and 3 (p = 0.12) and between Grades 4 and 5 (p = 0.06). There is no significant difference in nPronunciation across grades. Comparisons across all grades on Count_Sum and logCHR-CD indicate that these two frequency-related variables significantly decrease with grades (all ps < .05).

Fig. 1
figure 1

Critical predictors across five grades in the Reading list in Chinese character database. The error bar represents standard errors. Variables such as nStroke, nRadical, nPronunciation, nMeaning, and Count_Sum are counted as numbers; logCHR-CD is log-transformed

Similar to the Reading list (see Fig. 2), differences in nPronunciation across the six grades in the Writing list are not significant (all ps > 0.77). nStroke shows an increasing trend as grade advances, but exceptions are found in the comparison between Grades 3 and 5 and between Grades 4 and 5 (ps > 0.09). Differences across grades in nRadical converge on the comparison between Grade 1 and other grades (all ps < .01). Grades significantly differ on nMeaning and Count_Sum, except that the differences between Grade 5 and Grade 6 on nMeaning (p = 0.64) and on Count_Sum (p = 0.05) are not significant.

Fig. 2
figure 2

Critical predictors across five grades in the Writing list in Chinese character database. The error bar represents standard errors. Variables such as nStroke, nRadical, nPronunciation, nMeaning, and Count_Sum are counted as numbers; logCHR.CD is log-transformed

The effects of the lexical variables

In the multiple regression analysis for naming zRTs and ACC, the numerical variables, Volume, logCHR-CD, Count_Sum, nStrokes, nRadicals, nMeaning, and nPronunciation were entered in Block 1. Next, categorical variables, character phonograms, regularity (regular, semi-regular) and Structure (left-right, top-down) were entered in Block 2.

For the naming zRTs, the result showed that Volume, logCHR-CD, Count_Sum, and nStroke in Block 1, and left-right structure, top-down structure, and regular in Block 2 had significant effects on zRTs in the younger children group (overall R2 = .351, F (7, 2894) = 223.89, p < 0.001) (see Table 6). In the older children group, the effects of Volume, logCHR-CD, Count_Sum, nMeaning, and nPronunciation in Block 1, and left-right structure, top-down structure, phonograms and regular in Block 2 were significant (R2 = .443, F (9, 2930) = 248.95, p < 0.001), see Table 7 for details. For the adult group, results showed that Volume, logCHR-CD, nMeaning, and nPronunciation in Block 1 and left-right structure and semi-regular in Block 2 were significant contributors to the naming zRTs (R2 = .384, F (7, 2929) = 256.14, p < 0.001). The effect of nStroke was significant (p = 0.02) in Block 1 before the factors in Block 2 were entered, see Table 8 for details. Given that Volume, logCHR-CD, and left-right structure were significant predictors in all three groups, we analyzed how the effects of these predictors might vary across groups. Results showed that the magnitudes of Volumn and logCHR-CD effects in the younger and older children groups were significantly greater than that in the adult group (ps < 0.001); however, the differences between the younger children and the older children group were not significant (ps = 0.188). For the left-right structure, group comparisons only revealed a significant difference between the younger children group and the adult group (p < 0.001). Details are shown in Table 9.

Table 6 Results of stepwise MRA on zRT and ACC in the younger children group on the character level
Table 7 Results of stepwise MRA on zRT and ACC in the older children group on the character level
Table 8 Results of stepwise MRA on zRT and ACC in the adult group on the character level
Table 9 Results of regression coefficient comparisons on zRT among three groups on the character level

For ACC, we found that Volume, logCHR-CD, Count_Sum, nStroke and nMeaning in Block1, and top-down structure and regular in Block 2 were entered into the final model (R2 = .410, F (7, 2936) = 291.17, p < 0.001) (see Table 6) in the younger children group. In the older children group, Volume, logCHR-CD, Count_Sum, nStroke and nPronunciation in Block 1, and top-down structure and regular in Block 2 was found to be significant (R2 = .274, F (7, 2936) = 158.32, p < 0.001) (see Table 7). In the adult group, logCHR-CD, Volume and nPronunciation in Block 1 were entered into the final model (R2 = .161, F (3, 2933) = 188.65, p < 0.001) (see Table 8). The comparisons between three groups on the effects of Volume and logCHR-CD showed a gradient trend, with the younger children group showing the largest magnitude, followed by the older children group, and finally the adult group with the lowest magnitude (ps < 0.001). See Table 10 for details.

Table 10 Results of regression coefficient comparisons on ACC among three groups on the character level

Results of relative importance in this analysis showed that logCHR-CD and Volume are the two strongest predictors for both naming latencies and accuracy across three groups. In addition, variance inflation factors (VIFs) were less than 3 for all variables that were added into the final models, excluding the possibility of collinearity (Hair, 2011; O’Brien, 2007). As noted by Plonsky & Ghanbar (2018), R2 equal to or less than 0.2 indicates a small effect size. All models in the current analysis provided good fits to the character naming data except for the ACC in the adult group (R2 = 0.161).

The word level

Table 11 shows the descriptive statistics of a total of 2182 Chinese words on lexical variables and responses from the word-naming task. Note that the frequency-related variables (logW-CD) of 110 characters were not found in the database of Cai and Brysbaert (2010). Table 12 demonstrates the correlation (Pearson’s r) matrix among the log-transformed numerical variables and the word-naming performance (zRT and ACC) in three groups. It shows that zRT in three groups were highly correlated with all the word-level variables, character-level variables of C1 and C2, and total number of strokes and radicals of the whole-word except the number of pronunciations. In addition, naming accuracy was highly correlated with Volume, Count_Sum and logW-CD at the word level, and logCHR-CD, Count_Sum of C1, C2 at the character level.

Table 11 Descriptive statistics of 2182 words
Table 12 Correlations coefficient matrix of variables for words

Differences across the six grade levels

As shown in Fig. 3, the comparisons across all grades at the word level reveal no significant differences on Length or nPronunciation. This pattern is also shown in nMeaning, except that the comparison between Grade 2 and Grade 5 is significant (p < 0.05). Sum_nStroke increases as the grade progresses, but this trend is not statistically attested in the comparison between Grade 5 and Grade 6 (p = 0.88). Analysis of Count_Sum reveals that comparisons between any grades are significant (ps < 0.05). Grades differ in logW-CD except for the comparisons between Grade 3 and 4 (p = 0.64) and between Grade 5 and 6 (p = 0.78).

Fig. 3
figure 3

Critical predictors across five grades (no first grade) in the Chinese word database. The error bar represents standard errors. Variables such as Length, nPronunciation, nMeaning, Sum_nStroke, and Count_Sum are counted as numbers; logW is log-transformed

The effects of the lexical variables

For naming zRTs, we found that logW-CD, Volume, Sum_nStroke, and C1_Count_Sum were strong predictors in the younger children group, along with other variables (R2 = .304, F (8, 2059) = 112.27, p < 0.001), see Table 13 for details. In the older children group, strong predictors were logW-CD, C1_CS, C1_logCHR-CD and Sum_nStr; the final model accounts for 32.8% of the variance (F (9, 2062) = 112.03, p < 0.001) (see Table 14). In the adult group, logW-CD and C1_logCHD-CD, along with other variables, were found to significantly affect the naming RTs (R2 = .206, F (9, 2051) = 59.17, p < 0.001) (see Table 15). Among these predictors, Volumn, logW-CD, C1_Count_Sum, C2_Count_Sum, and Sum_nStrokes were shown to significantly affect the naming RTs in all three groups. Results from the group comparisons indicate that the effects of Volume, C2_Count_Sum, and Sum_nStrokes are particularly pronounced in the younger children group than other two groups. The two child groups also exhibited greater logW-CD and C1_Count_Sum effects than the adult group, though there was no significant difference on logW-CD between the two child groups (p = 0.065). See Table 16 for details.

Table 13 Results of stepwise MRA on zRT and ACC in the younger children group on the word level
Table 14 Results of stepwise MRA on zRT and ACC in the older children group on the word level
Table 15 Results of stepwise MRA on zRT and ACC in the adult group on the word level
Table 16 Results of regression coefficient comparisons on zRT among three groups on the word level

On the other hand, we found that Volume, logW-CD, C1_logCHR-CD, and C1_Count_Sum and C2_Count_Sum exerted significant effects on naming ACC for the younger children group. Along with other variables, the final model for this group accounts for 15.9% of the variance (F (8, 2063) = 48.66, p < 0.001) (see Table 13). For the older children group, it should be noted that Sum_nPro also made significant contribution to account for the variance except for logW-CD and Volume (R2 = .090, F (6, 2065) = 33.95, p < 0.001) (see Table 14). For the adult group, the result showed that Sum_nPro, logW-CD, C1_CS, and C2_logCHR-CD were significant contributors to the naming accuracy (R2 = .050, F (4, 2056) = 27.27, p < 0.001) (see Table 15). The comparisons between three groups on the effects of logW-CD and C2_Count_Sum showed a gradient trend, with the younger children group showing the largest magnitude, followed by the older children group, and finally the adult group with the lowest magnitude (ps < 0.05). See Table 17 for details.

Table 17 Results of regression coefficient comparisons on ACC among three groups on the word level

Results of relative importance in this analysis showed that logW-CD is the strongest predictor for both word-naming latencies and accuracy across three groups. The VIF results for all variables were less than 4, excluding strong collinear relationships between them. In addition, naming accuracy for words was lower than that for characters in all three groups. All models provided good fits to the zRTs (R2 > 0.2), but not the word-naming accuracies (R2 < 0.2).

Discussion

In this study, we present a large-scale database of Chinese characters and words based on the contemporary elementary school textbooks issued by the Ministry of Education in mainland China. The current database incorporates key lexical variables collected from 2999 Chinese characters and 2182 words, as well as norming data in a word-naming task from both school-age children and adults. We found that, first, visual complexity of characters and words learned in elementary schools increases with grades, whereas the semantic richness and frequency tend to decrease. Second, frequency, visual complexity, semantic and phonological attributes, structural type, and phonetic regularity are significant contributors in lexical processing. However, the effects of these predictors in lexical processing vary across children and adults. Third, the factor of age of acquisition, objectively indexed by the Volume in this study, had a significant effect in the naming task, but different patterns were attested in character and word naming.

Changes of lexical variables across six grades

The distribution of character-level variables in the Reading list from Grade 1 to Grade 6 indicates that visual complexity of character gradually increases with grade level, whereas semantic richness, frequency, and naming accuracy decrease with grade level. Similar patterns can be found in the Writing list, given that there is a considerable overlap between the two lists. With regard to the development of variables in the word list, it can be clearly seen that three or four-characters are clustered in later grades, which explains the increase of visual complexity across grades. Another notable trend is that the percentage of polysemes (i.e., words with multiple lexical meanings) and high-frequency words decrease steadily. All these findings reflect a fundamental principle in literacy acquisition: characters and words to be learned tend to become more structurally complicated, less frequent, and semantically unambiguous.

These findings are consistent with those of Shu et al. (2003). Moreover, we expanded Shu et al., (2003)’s database in several aspects. First, the current database explicitly indicates the requirement (i.e., reading or writing) for characters learned at a certain level, which is referenced in the Reading and Writing lists in the new edition of the textbooks. Second, the current database covers a wider range of words (up to 2182 words) than Shu et al. (2003), in light of the fact that both characters and words are building blocks of the Chinese lexicon, and most language units or concepts are represented in words as they can convey relatively complete meanings. Third, we collected naming data from both children at different grades and adults, which allows us to examine the developmental trajectory of the effects of lexical variables in visual word processing. Therefore, the word cluster in the current database arranged by grade provides a useful resource to study literacy acquisition among school-aged children.

Influential factors for character and word processing

We found that factors such as frequency, volume, visual complexity, structural type, semantic and pronunciation richness, and phonetic regularity contributed significantly to naming latency and accuracy for both characters and words in three groups of participants. Differences between children and adults were also observed in the current study. However, it should be noted that regression models did not fit well for character naming accuracy in the adult group and word-naming accuracy in all three groups. We speculated that this could be due to relatively high naming accuracy and less variances for items with different attributes (see Table 4 for character and Table 9 for word). As a result, the overall models for accuracy failed to fit well, and thus we mainly discussed the results of naming latency. These results were consistent with previous databases based on adult materials (Liu et al., 2007; Sun et al., 2018; Sze et al., 2014; Tsang et al., 2018). Firstly, character frequency retrieved from Cai and Brysbaert (2010) was found to be the strongest predictor in both naming latency and accuracy across three groups. Specifically, words composed of more frequent characters led to faster responses and higher accuracy, compared with words composed of less frequent characters. Consistent with the results of Sun et al. (2018) and Li et al. (2022), this finding demonstrates significant frequency effect for both the first and the second characters in the word-naming task. Moreover, the frequency effect was notably stronger in the younger children group than in the other two groups, reflecting a significant impact of frequency on lexical processing at an early stage of reading acquisition. However, the number of times that characters occur in the textbooks (Count_Sum) only showed a significant effect on character naming latency and accuracy in children but not in adults. In the word level, Count_Sum of the constituting characters, mostly the first two characters, made a significant contribution to word processing in all three groups, but accounted for less variance in adults than in children. This is probably due to the effect of recent exposure (Kaschak, 2007). Specifically, compared to adult participants, children who have more recent exposure to these characters and words are more sensitive to this frequency factor solely based on textbooks. Furthermore, our results showed that the factor of Count_Sum explained less variance in the naming data for both characters and words than the factor of frequency from Cai and Brybaert (2010). One possible reason is that the Count_Sum was extracted from a limited size of texts. According to the CCLOWW database which incorporates 34,671,424 character tokens and 22,427,010 word tokens (Li et al., 2022), robust frequency effect was detected in lexical processing tasks in both adults and children. Therefore, it is possible that character frequency obtained from the textbooks for elementary schools in the current study is not adequate to capture the overall distributional information.

Secondly, we found that the number of strokes had a significant effect on naming latency of characters and words in all three groups, but the number of radicals did not. As expected, characters and words with more strokes were named more slowly than those with less strokes, which replicated the stroke-number effect observed in many previous studies (for example, Just & Carpenter, 1987; Leong et al., 1987; Su & Samuels, 2010). Resembling the frequency effect, a more pronounced stroke effect was attested in the younger children group than in the other groups, showing that it was more difficult for younger children to name characters or words with more visually complex. One possibility is that these characters or words require additional cognitive resources for processing. Similarly, naming latency increased with the length of words. That is, it took longer to retrieve encoding information of words that contain two or more characters. However, it was shown in the current analysis that all three groups responded faster to words than to characters, regardless of the number of constituent characters (see Table 5 and Table 12). These findings are consistent with the U-shaped relationship between word length and response times in Tsang et al., (2018). They found that two- and three-character words induced longer RTs than single and four-character words, and that naming latency for four-character words were slightly longer than for single-character words. These findings together indicate that word processing is not necessarily more difficult than character processing, and words processing might be affected by several factors such as distributional frequency. On the other hand, the number of radicals did not show a significant effect on character naming latency in all three groups. Some studies have used both the number of strokes and radicals as indicators of the visual complexity of Chinese characters and found that both of them play significant roles in lexical processing (Liu et al., 2007; Wang & Dong, 2013; Xing et al., 2004). However, some other studies did not replicate this effect. For example, Su and Samuels (2010) examined the effect of the number of strokes, the number of radicals and word length on Chinese character recognition among children and adults. They ascribed the absence of the radical effect to the notion that characters with high-frequency and moderate visual complexity may not follow a radical-by-radical processing basis. In the current database, the number of strokes and radicals of characters was highly correlated (r = .73); therefore, it is possible that the number of strokes takes the role of the number of radicals in naming latency (Wang et al., 2020).

In addition to the effect of visual complexity on Chinese lexical processing, the structure type of Chinese characters also showed a significant effect on naming latency and accuracy in both younger and older children. In adults, however, this effect was only significant in naming latency. Specifically, naming latency for characters with top-down structure were named faster than those with left-right structure or other structures. Several studies have examined the structural effects in Chinese character processing and learning, but found inconsistent results (e.g., Li et al., 2000, 2005; Wang et al., 2020; Yu & Cao, 1992). For example, Li et al. (2000) investigated whether character recognition was affected by the character structure among school-age children (Grades 1, 3, and 5 in primary school) and adults. The structural effect was only found in Grade 1, such that characters with left-right structures showed a processing advantage relative to those with top-down or semi-enclosed structures. Tong and McBride (2014) also found that children had more errors in producing left-right structured characters and were more inclined to produce top-down structured characters. They argued that such production asymmetry is due to children’s asymmetrical exposure to these two types of structures. However, in the study by Wang et al., (2020), writing latency and duration for left-right characters in adults were found to be shorter than characters with other structures, and latency and duration of top-down characters were longer than those for characters of other structures. Despite the inconsistent results, all the aforementioned studies indicate an effect of character structure on character processing. These divergent findings may be attributable to differences in tasks and reading experience among participants, and thus call for further empirical evidence.

Thirdly, as the essential role of grapheme-to-phoneme correspondence (GPC) rules in word recognition and reading acquisition observed in languages with regular and consistent orthographies (e.g. Seidenberg et al., 1984; Ziegler et al., 2003, 2010), the print-to-sound mapping at the sublexical level in Chinese is also important in the early stages of reading (Li et al., 2018; Shu et al., 2000). The result of the current study showed that characters with regular pronunciation were named faster than those irregular, or semi-regular phonograms in both younger and older children. As for adults, characters with semi-regular pronunciation (including semi-regular, rhyming, and alliteration), were named more slowly relative to regular and irregular phonograms. This was consistent with previous findings (Chang et al., 2016; Liu et al., 2007; Wang et al., 2020), and further supported that phonological activation of phonetic radicals in phonograms would interfere character naming. Shu and Meng (1996) found that children performed significantly better for regular characters than for irregular characters and those with unpronounceable radicals. They claimed that children were able to make use of phonological cues of character radicals in naming tasks, especially when these characters were unfamiliar to them. Liu et al. (2006) also suggested that lower graders (Grades 1 and 2) with an earlier awareness of character regularity were able to develop better literacy. Note that the knowledge of how phonological cues can be used to access character pronunciations is not explicitly taught in school. Children typically acquire orthographic and phonological knowledge about characters based on reading experience. In addition, we found that approximately 31% of all characters in the current database were phonograms with reliable phonological cues (lexical tones are not considered) provided by phonetic radicals, compared to 39% in Shu et al. (2003). Adults are more likely to reduce their reliance on phonological clues provided by phonetic radicals when naming phonograms as their reading experience increases. This might also explain the reason why the advantage of naming regular characters was absent in adults.

Last but not least, we found that both the number of pronunciations and meanings of characters and words had significant effects on lexical processing, but in the opposite way. Specifically, consistent with Tsang et al. (2018), characters with multiple pronunciations were named more slowly and less accurately, indicating a phonological interference in character naming (Tan & Peng, 1990). In addition, this effect was more pronounced in adults than in older children but was absent in younger children. One possible reason is that different pronunciations (often with different meanings) of the same character are often distributed in different grades, and younger children who have not been exposed to all pronunciations or meanings are less likely to be affected by the interference of character pronunciations in the naming task. By contrast, characters with multiple meanings and words with semantically rich characters (especially when those characters appear in the initial position) were named much faster. Many previous studies have also demonstrated that characters with more than one meaning were processed faster than semantically unambiguous characters, known as ‘ambiguity advantage’ (Lee et al., 2015; Chang et al., 2016; Lin & Ahrens, 2010). Note that the ambiguity advantage in our analysis of naming latency was only found in older children and adults. For naming accuracy, this effect was only significant for younger children. A possible reason is that younger children have not yet acquired multiple meanings for some common characters. On the other hand, significant ambiguity advantage in naming accuracy for the younger children group might be attributed to other factors such as frequency (Count_Sum and logCHR-CD), given that multi-meaning characters are more likely to have high frequency (r = .465 for Count_Sum and .522 for logCHR-CD, see Table 5).

Taken together, the current results indicate that different lexical variables may have different effects for children and adults. Specifically, the orthography-related factors, such as visual complexity and structural types of characters, play more significant roles for children who are at the early stages of learning. The phonological and semantic effects in character processing might come into play as literacy and reading experience increase.

Age of acquisition

The current database also extended a number of studies concerned with the age of acquisition of Chinese lexical units (Cai et al., 2021; Chang et al., 2016; Chang & Lee, 2020; Sze et al., 2014; Wang et al., 2020; Xu et al., 2020) by using objective measures for characters and words based on textbooks used in elementary schools. The volume information was used to represent the time when these characters and words were typically learned, same as Cai et al (2021) and Liu et al. (2007). The current results showed that the factor Volume significantly affected character and word naming across all three groups. Moreover, the contribution of Volume in character naming performance was second only to frequency in all three groups. These results were consistent with existing evidence across different languages, showing that AoA plays a crucial role in lexical decision (Bylund et al., 2019; Chang & Lee, 2020; Chen et al., 2009a, b), word-naming (Bonin et al., 2001; Bylund et al., 2019; Chang & Lee, 2020; Chen et al., 2004; Liu et al., 2007), and word-writing tasks (Bonin et al., 2001; Wang et al., 2020), indicating that early acquired characters and words can also be recognized, produced or written more quickly and accurately (Brysbaert & Cortese, 2011; Chang & Lee, 2020; Ferrand, 2011; Sze et al., 2014).

Moreover, AoA effect was consistently observed in character and word naming in both adults and children. However, the AoA effect in lexical processing demonstrated different patterns among the three groups. The overall pattern was that the effect decreased with age; we found that the AoA effect on word naming was less significant in older children and adults than in younger children. Cumulative-frequency hypothesis postulates that the difference of cumulative frequency between early-acquired words and late-acquired words narrows with age, so younger children are more sensitive to the AoA effect (Ghyselinck et al., 2004; Lewis et al., 2001). Chen et al. (2004) also found that both AoA and frequency had significant effects on naming Chinese disyllabic words; they claimed that AoA was related to semantic processing because stronger and more reliable AoA effect was observed in semantic-related tasks, such as word association and categorization (Brysbaert et al., 2000; Van Loon-Vervoorn & Willemsen, 1989). Therefore, we speculate that word processing may also be affected by the interaction between AoA and semantics. Furthermore, written AoA, rather than spoken AoA (Cai et al., 2021; Xu et al., 2020), was investigated in the current study. For Chinese children, individual characters are not commonly used in spoken language and usually acquired by classroom-learning at school age (McBride-Chang & Ho, 2000), while they are more likely to have acquired words’ spoken forms prior to the school education. Therefore, the spoken AoA might be more influential for word processing. From this perspective, it is reasonable why the current written AoA had different effects on naming characters and words.

Conclusion

In conclusion, the present study extends the current body of studies by establishing a database of Chinese characters and words based on the newly issued elementary school textbooks used in mainland China. For all characters and words to be learned in elementary schools, we extracted key lexical variables from either the textbooks or external resources and obtained naming latency and accuracy from children and adults to measure character and word processing. Our database offers a nuanced view of the development of lexical variables as grade increases and provides an empirical basis for a better understanding of the effects of those variables in lexical processing among school-age children and adults. This attempt will facilitate studies in Chinese language acquisition by showing normative data and distributional information of lexical units in terms of orthography, phonology, semantics, and other aspects.