A sentence is more easily identified in writing than in speech; yet most speakers of a language appear to know what a sentence is, even though grammarians and linguists continue to grapple with its definition. (Harris & Hodges, 1995, p. 231).

Teachers encourage children to write in complete sentences. Yet little is known about developing writers’ conceptions of a complete sentence. A sentence is traditionally defined as a group of words that expresses a complete thought, but what is a complete thought? Do beginning writers in first grade and thereafter understand what a complete sentence is? When instructed to write just one sentence, will they write one and only one sentence? When they are asked to combine two sentences into one good sentence without altering the meaning, will they understand how a complete thought can be expressed within one sentence unit? When freely translating their thoughts into written language, what kinds of syntactic constructions will they produce in a sentence unit and in a text unit? Are these syntax structures simple or complex clause constructions and do they vary across genre (narrative or essay), transcription mode (pen or keyboard), and length of writing task (text or single sentence)? Collectively, the three initial, exploratory studies employing descriptive linguistic analyses reported in this article addressed these questions.

The studies did so within a theoretical framework grounded in the cognitive processes of writing, with focus on translation of ideas into the sentences of written language. According to this cognitive theory of the developing writing process (e.g., Fayol, 1991, 199; 2004; Hayes, 1996; Hayes & Flower, 1980), the translation process in child writers draws on transcription processes (handwriting and spelling) (Berninger, Yates, Cartwright, Rutberg, Remy, & Abbott, 1992) and multiple levels of language involved in text generation—word choice, syntax underlying sentence construction, and text composing (Whitaker, Berninger, Johnston, & Swanson, 1994). However, compared to text-level composing, relatively little research has focused on the syntactic level in children’s writing and the relationship between syntax construction and transcription in text generation at both the sentence and text level, as the current study did. For a review of studies that have examined oral or written syntax at some time in writing development, ranging from early childhood to adolescence, see Myhill (2009).

Several levels of language are involved in translation of cognitions into writing. Transcription involves the subword level (handwriting) and word level (spelling) of written language, whereas syntax is a level of language that provides structure for organizing multiple words. Although more is probably known about how the multiple levels of language contribute to oral language development than to written language development, progress is being made. Writing research in the cognitive tradition has generated substantial evidence that internally coded and, subsequently, externally produced letter forms and word spellings play a fundamental role in translating thoughts and ideas into written language (Berninger & Swanson, 1994; Fayol, 1991, 1999, 2004; Hayes, 1996; Hayes & Chenoweth, 2006; Richards, Berninger, & Fayol, 2009). Less is know about whether these processes contribute independently of or jointly with the syntactic level of language for ordering multiple words in child writers. Although word-level phonological, orthographic, and morphological skills emerge early and continue to evolve during writing acquisition (for review of evidence, see Berninger, Abbott, Nagy, & Carlisle, 2010a), less is known about when and how syntactic skills, which are learned during the preschool years for oral language, contribute to the sentence construction of school age writers.

However, the syntactic construction process of writing may not be exactly the same as the syntactic construction process of oral language. Conversation is comprised of utterances during turns-at-talk rather than sentences (e.g., Halliday, 1987). On the one hand, child writers may write the way they often talk—that is, produce many complex clausal constructions that have independent plus dependent clauses rather than only a simple, single independent clause. Because the concept of ‘sentence’ is defined in terms specific to written language, in which idea units are marked by capitalization and punctuation (Fayol, 1997), and because oral language is organized differently than written language is, T-units have been used to study oral language. Hunt (1970) defined a T-unit as the minimal terminable unit equivalent to “one main clause plus any subordinate clause or nonclausal structure that is attached to or embedded in it” (page 4).

However, T-units may not be the only syntactic measures in either oral or written discourse. Neither oral or written discourse consists of only complex syntactic constructions with both an independent and dependent clause. Oral discourse has syntactic (as a minimum, subject and predicate) and non-syntactic units (e.g., contingent queries, which are used for ensuring conversational clarity, and exclamations); these units perform many discourse functions, and occur in conversational turns, which are separated by pauses that regulate the turn-taking process (Garvey & Berninger, 1980). Child writers may also acquire an academic register for written discourse that differs from the oral register of conversation (e.g., Beers & Nagy, 2008; Chafe & Danielewicz, 1987; Schleppegrell, 2004; Silliman & Scott, 2009). Alternatively, developing writers may draw on both sources of syntactic knowledge as they sometimes write as they talk, but other times use the new written language register they are learning.

Prior research had shown that young children produce both simple and complex clausal constructions in their writing and that complex clausal constructions equivalent to T-units are common in first graders’ written composing (e.g., Traweek, 1993). However, relatively little research has examined the variation in syntactic structures produced by developing writers to express ideas either in single-sentence writing or multi-sentence text writing. Thus, in the current study we examined not only when ability to write just one sentence is first evident, but also the kinds of syntactic constructions observed in both single-sentence and multi-sentence composing and the frequency of occurrence of each kind of syntactic construction. Of interest were changes in the frequency of specific kinds of clausal constructions and the relative mix of the various kinds of clausal constructions across grade level from second to seventh grade.

The target population for the current research was normally developing writers. The emergence of syntactic awareness for written sentence and text construction in normally developing writers from the beginning to middle years of formal schooling is an important research topic because the research results may inform future research on what is and what is not deviation from normal developmental patterns. For example, research on understanding writing skills of children with oral language disabilities is underway (e.g., Dockrell & Connelly, 2009; Nelson, Roth, & Van Meter, 2009).

Two studies were conducted with a longitudinal sample, but analyzed cross-sectionally, to test the hypothesis that the relationship between word-level transcription and syntax-level sentence construction during translation of ideas into written language changes from the primary grades (ages 6 to 8) to the intermediate grades (age 9 and above). The first study used an experimenter-designed measure of sentence construction, referred to as the sentence integrity task, which required the child to write just one complete, syntactically acceptable sentence about a provided topic prompt that was kept constant across children. Successful completion of this task requires knowledge of what constitutes a complete sentence. Lack of this syntactic knowledge results in a fragment, which is less than a complete sentence, or a run-on sentence, which contains more than a complete sentence. The second task was a sentence combining task, which requires the child to rewrite two sentences as one sentence without changing meaning. Sentence combining tasks, which are increasingly used in both instruction and assessment research (e.g. Nelson et al., 2009; Saddler & Graham, 2005; Shanahan, 2009), require child writers to re-construct syntactic structures in a flexible manner to express the same meaning with less words and new syntactic constructions.

Two predictions derived from this hypothesis were tested. First, early in the writing acquisition process many children can construct a single sentence when asked to do so, thus reflecting their syntactic awareness, but the relationship between syntactic awareness and transcription may change across writing development. Second, early in writing development many children may have knowledge of what constitutes a complete sentence, but their syntactic knowledge may not yet contribute uniquely to sentence writing that requires both knowledge of syntactic structure and related transcription processes. Only with writing experience do the transcription processes become automatized (typically during the early grades) and then contribute to sentence-construction independently of the syntactic-level processes underlying sentence re-construction during sentence combining (typically during the later grades). Thus, during the first three grades, the sentence integrity task may not be correlated with the sentence combining task, but beginning in fourth grade the sentence integrity task may explain unique variance in sentence combining, along with sub-word and/or word-level transcription.

To test the first prediction, the percentage of children successfully completing the write-one-sentence task was computed at each grade level one to four. Developmental trends in these percentages were examined across grade levels to evaluate if any beginning writers might have syntactic knowledge for what constitutes a complete sentence. Then this measure was correlated with sentence combining. To test the second prediction, at each grade level correlations were computed between the sentence integrity task, two measures of transcription, one morphology measure, and one reading comprehension measure, and the sentence combining task; then multiple regressions were performed in which all the measures correlated with sentence combining were used as predictors and the sentence combining task was the outcome. The sentence combining task was scored for acceptable sentence-level sytax, quality of sentence construction, and word-level transcription, punctuation, and capitalization, The other predictors in the multiple regressions were chosen based on past research showing that (a) handwriting automaticity is a unique predictor of composing length and quality during the first four grades (for review, see Berninger & Amtmann, 2003); (b) spelling has the most consistent longitudinal relationships to written composition across grade levels one to seven (Abbott, Berninger, & Fayol, 2010); (c) morphology that marks grammar (derivational suffixes) may contribute to composing (Green, McCutchen, Schwiebert, Quinlan, Eva-Wood, & Juelis, 2003); and (d) silent sentence reading comprehension fluency, which requires integration of word identification and comprehension of sentence syntax (Berninger, Abbott, & Alsdorf, 1996a), may be related to written sentence composing.

The morphological measure provided an interesting control in that derivational suffixes mark word-level grammar (part of speech) but not syntactic-level clausal structures. As such, morphological skill may contribute grammatically at the word-level, which in turn contributes to the syntactic-level skill for constructing a single sentence (sentence integrity) or re-constructing two sentences into a single sentence (sentence combining). The sentence combining task requires transforming two complete clausal constructions into one complete, but newly constructed, clausal construction for translating the same idea(s). The silent sentence-reading fluency task provided another potentially informative control in that it requires sentence-level syntactic processing but in reading rather than writing.

A third study was conducted with two cohorts, one beginning in first grade and one in third grade, in the longitudinal study that provided the sample for the first two studies. The goal was to test the hypothesis that the translation process is influenced by the amount of syntactic construction, transcription mode, and/or genre of extended text. Three predictions, related to the size and nature of constructed written language units and transcription processes, were tested: (a) Single independent clauses are more common in single-sentence constructions, whereas complex clauses (single independent clause plus dependent clause) are more common in multi-sentence constructions (levels-of-language contrast); (b) more clauses are produced by pen than keyboard when writing multi-sentence texts but not single sentences (transcription mode contrast); and (c) the nature of syntax in the written sentences varies by narrative and expository texts (genre contrast). Levels-of-language in composing was of interest because syntactic construction may vary with the job size in writing, that is, whether a single sentence or many sentences have to be constructed. Transcription mode was of interest because past results showed advantages for writing by pen for text-level writing, but advantages for writing by keyboard for letter-level writing (Berninger, Abbott, Augsburger, & Garcia, 2009). Genre was of interest because relatively little research attention has focused on how syntactic-level construction might interact with genre-specific discourse-level construction (but see Beers & Nagy, 2008, 2009, and this special issue).

General method

Participants

Personnel in a large urban school system in the Pacific Northwest, within driving distance of the university where the study was conducted, mailed letters on district letterhead announcing the opportunity to participate in the research to all parents of all prospective first and third graders who attended one of the district’s 51 elementary schools. Interested parents contacted the research coordinator who screened children to make sure they met research inclusion criteria (i.e., no developmental, neurological, or psychiatric disorder prior to or during school age years and no language learning disability LLD, also known as selective language impairment SLI, during the preschool years). After informed consent was obtained, parents completed questionnaires about developmental, medical, family, and educational history, including other languages spoken in the home. Although other languages were spoken in some of the homes, none of the participating children were English-language learners; all had adequate English for completing the administered tasks. The children were diverse in terms of race (more than one-third non-white) and parent’s level of education (less than high school to high school to college to postgraduate work). For additional information about the participants in the entire sample across five years, beyond that provided in the text that follows, see Abbott et al. (2010).

Cohort 1

At the start of the study, the 128 first graders had a mean age of 82.72 months (SD = 3.80). Attrition rate was low across years 2 (n = 124), 3 (n = 122), 4 (n = 119), and 5 (n = 114) and was due to families moving out of the area or other reasons unrelated to the study. The sample reflected the diversity of the local school population: European American (64.8%), Asian-American (23.4%), African-American (6.3%), Hispanic (1.6%), Native American (1.6%), and other (2.3%). Parental level of education ranged from less than a high school education or graduated from high school (7% mothers and 12.5% fathers) to more than a high school education but less than a college education (11.7% mothers and 7.8% fathers) to an undergraduate education (45.3% mothers and 39.8% fathers) to graduate degrees (33.6% mothers and 32.0% fathers); no information was reported for 2.4% of the mothers and 7.9% of the fathers.

Cohort 2

At the start of the study the 113 third graders had a mean age of 106.01 months (SD = 3.70). Attrition rate was also low across year 2 (n = 110), year 3 (n = 106), year 4 (n = 106), and year 5 (n = 99). Self-reported ethnicity and parents’ level of education also exhibited diversity: European American (65.5%), Asian-American (21.2%), African-American (9.7%), Hispanic (0.9%), and other (2.7%). Parental level of education ranged from less than a high school education or graduated from high school (7.1% mothers and 7.1% fathers) to more than a high school education but less than a college education (11.5% mothers and 14.2% fathers), to an undergraduate education (50.4% mothers and 36.3% fathers), to completed graduate degrees (30.1% mothers and 35.4% fathers); no information was reported for 0.9% of the mothers and 7.2% of the fathers.

Enrolled children were tested annually for five years at the university where they completed writing and writing-related tasks in the second, third, or fourth month of the school year. Half began in first grade (Cohort 1) and half began in third grade (Cohort 2).

For Studies 1 and 2, we describe tasks and results for both cohorts only during the first two years of a longitudinal study when measures of sentence integrity and sentence combining were available for both cohorts. Children in cohort 1 were in grade 1 (71 girls and 57 boys) in year 1 and grade 2 (69 girls and 55 boys) in year 2; children in cohort 2 were in grade 3 (57 girls and 56 boys) in year 1 and in grade 4 (57 girls and 53 boys) in year 2.

For Study 3, we report results for measures of narratives (grades 2 or 4, cohort 1, or 4 or 6, cohort 2), essays (grades 3 or 5, cohort 1, or 5 or 7, cohort 2), and single sentences (grades 2 to 5, cohort 1; grades 4 to 7, cohort 2), with the text-level and sentence-level writing both by pen and by keyboard for each cohort and grade level. Across the final four years of the longitudinal study, for which attrition was low, children in cohort 1 were in grades 2 (69 girls and 55 boys), 3 (68 girls and 54 boys), 4 (67 girls and 52 boys), and 5 (62 girls and 52 boys); children in cohort 2 were in grades 4 (57 girls and 53 boys), 5 (54 girls and 52 boys), 6 (53 girls and 53 boys), and 7 (50 girls and 49 boys).

Procedures

All writing tasks and test measures were administered by highly trained and supervised graduate students. Specific writing tasks or writing-related measures, procedures, data analyses, and results are reported for each of the three studies that follow. For additional details about the reliability and validity of these measures and test administration procedures, see Abbott et al. (2010).

Study 1

Writing tasks

In year 1 when Cohort 1 was in first grade and Cohort 2 was in third grade, children were asked to write one good sentence about “My favorite thing at school.” In year 2 when Cohort 1 was in second grade and Cohort 2 was in fourth grade, children were asked to write one good sentence about “Writing.” Their written productions were coded on two measures: (a) sentence integrity (complete sentences, fragments— less than a complete sentence, or run-ons—more with than a complete sentence), and (b) number of grammar errors.

Procedures

For sentence integrity, fragments were constructions that were less than a main clause and run-ons were defined as (a) two or more main clauses that were not separated from each other with appropriate punctuation (period, question mark, exclamation point) or a conjunction, or (b) a main clause plus a non-clausal construction. Otherwise, if only a main clause (single independent clause or independent clause plus subordinate clause) was produced, it was coded in Study 1 as a complete sentence even if it was not capitalized at the beginning and punctuated at the end. The rationale was that children had only been instructed to write just one sentence; they had not been asked to capitalize or punctuate it and may have thought that was not important because they did not write any other sentences from which to differentiate it.

Of interest in Study 1 was whether what the child produced was a complete sentence. The sentence integrity measure was thought to reflect the developing writers’ syntactic knowledge of what a complete sentence is.

Data analyses

Two research team members were trained by the first author in coding the three possible outcomes on sentence integrity and in identifying grammar errors (e.g., use of singular noun with plural verb or vice versa; tense errors such as he runned). The two coders practiced coding until they achieved 100 percent agreement in scoring both sentence integrity and grammar errors. Then each coded all the sentence writing tasks and compared their results; at weekly research meetings they discussed any discrepancies with the first author until they were resolved.

Chi-square was used to assess whether within each grade level the frequency of occurrence of complete sentences, fragments, or run-ons occurred with comparable frequency or varied in frequency across the three categories. Frequencies were also converted to percentages of each of the three kinds of constructions at each grade level.

In an exploratory analysis, the mean number of grammatical errors and percentage of children who made grammar errors on the sentence writing task were computed for each grade. Then correlations were computed between sentence integrity (rescaled quantitatively with a score of 1 for complete sentences and 0 for either fragments or run-ons) and number of grammatical errors.

Results and discussion

For sentence integrity, of the 127 first graders who completed the year 1 sentence-writing task, 88 wrote a complete sentence, 3 wrote a sentence fragment, and 36 wrote a run-on sentence; χ 2(2) = 86.76, p < 0.001. Of the 123 second graders who completed the year 2 sentence writing task, 105 wrote a complete sentence, 9 wrote a fragment, and 9 wrote a run-on sentence; χ 2(2) = 149.85, p < 0.001. Of the 113 third graders who completed the year 1 sentence writing task, 105 wrote a complete sentence, none wrote a fragment, and 8 wrote a run-on; χ 2(2) = 83.27, p < 0.001. Of the 110 fourth graders who completed the year 2 sentence writing task, 98 wrote a complete sentence, 2 wrote a fragment, and 10 wrote a run-on sentence; χ 2(2) = 154.76, p < 0.001. All chi-square tests confirmed that frequency of the three possible outcomes on the sentence writing task differed significantly from the null distribution in which all categories occur equally often. Thus, the most frequently occurring construction was a complete sentence, which occurred significantly more often than did fragments or run-ons.

For purposes of summarizing and explaining the results of the descriptive statistics, the frequencies were converted to percentages of children at each grade level who produced each of the three kinds of sentence or non-sentence constructions on the sentence integrity task. Children were more likely to write complete sentences (69% first graders, 85% second graders, 92% third graders, and 89% fourth graders) than run-on sentences (28%, first graders, 7% second graders, 7%, third graders, and 9%, fourth graders) or sentence fragments (3%, first graders, 8%, second graders, 0% third graders, and 2% fourth graders).

Inspection of these summary descriptive conclusions supported three conclusions. First, the majority of child writers in first, second, third, and fourth grade could write a complete sentence when instructed to do so. Second, the absolute number of children (frequency) who could do so increased from first to second grade and then to third and fourth grade, when it tended to asymptote on the sentence integrity task. Third, those child writers who could not write a complete sentence accounted for a non-trivial percentage of the first graders (about 30%), second graders (15%), and third and fourth graders (range 8% to 11%); both less than complete sentences (fragments) and more than complete sentences (run-on sentences) occurred.

For grammar usage errors, which are not the same as syntactic structure for the complete clausal or sentence construction (see Chomsky, 1965), two levels of descriptive statistics were computed: (a) total proportion of sentences at each grade level with errors in grammar usage rules; and (b) percent of child writers who made grammar errors. Regarding the first, the proportion of sentences with observed grammatical usage errors was on average 0.52 (SD = 0.72) in first grade and 0.23 (SD = 0.50) in third grade in year 1 and 0.23 (SD = 0.52) in second grade and 0.24 (SD = 0.58) in fourth grade in year 2. However, the majority of the child writers did not make grammar errors. Only 28% of the first graders and 19% of the third graders in year 1 and 17% of the second graders and 19% of the fourth graders in year 2 made grammatical usage errors.

Correlations between sentence integrity (rescaled quantitatively with 1 for complete sentence and 0 for fragment or run-on) and number of grammatical usage errors each child produced were significant: at first grade, r(1,126) = 0.72, p < 0.001; at second grade, r(1, 122) = 0.82, p < 0.001; at third grade, r(1, 112) = 0.52, p < 0.001; and at fourth grade, r(1,109) = 0.63, p < 0.001. Thus, in an unreferred sample in which most normally developing writers in first through fourth grade can write a complete sentence when requested to do so, the percent variance shared between the number of complete sentences and the number of grammar errors ranged from a low of 27% in third grade to a high of 67% in second grade. Thus, some individuals may lack syntactic awareness for a sentence unit and knowledge of grammar usage but others are impaired in only one or neither of these. Future research might explore whether an impaired language learning mechanism, supported by a working memory architecture (Berninger, Abbott, Swanson, Lovitt, Trivedi, Lin et al., 2010b), may explain both syntactic and grammar usage errors in some individuals. In educational settings, children who make frequent sentence fragments or run-on sentences and grammar usage errors in writing should be referred for screening or diagnostic assessment to identify possible language learning needs across oral and written language; and, if identified, plan language intervention (see Silliman & Scott, 2009). Also, addressing these language learning needs early in writing development may enhance children’s motivation and self-efficacy resulting in better writing achievement throughout schooling (Hidi & Boscolo, 2006).

Study 2

Measures

Each of the following measures was administered according to instructions in the test manual. For sentence combining, which is one of three tasks in the Wechsler Individual Achievement Test, 2 nd Edition (WIAT II) (The Psychological Corporation, 2002) Written Expression subtest, children were given two sentences and asked to combine them into one sentence with the same meaning. Children’s written productions were then scored according to criteria in the test manual for awarding points: 2 (preserves the meaning, is syntactically complete and well-written, and makes no transcription errors), 1 (syntactically acceptable but not as well written or makes a transcription error), or 0 (syntactically unacceptable).

Five measures were given to use as predictors in multiple regressions at each grade level in years 1 and 2: automatic alphabet letter writing, dictated spelling, morphology (choosing word whose suffix marks grammar that fits a specific sentence context), silent sentence-reading comprehension fluency, and sentence integrity (1 point for complete sentence and 0 points for fragment or run-on). The first two predictor measures assessed transcription skills (subword-level handwriting and word-level spelling) in cognitive models of writing. For alphabet writing (Berninger, 2001), the child printed from memory the letters of the alphabet in order as accurately (legibly) and quickly as possible. The score was the number of legible, correctly formed letters in alphabetic order in the first 15 s, an index of automatic letter retrieval and production. For spelling, the child spelled in writing the dictated words of increasing difficulty on the WIAT II. Words were pronounced once, then used in a sentence to provide context clues for meaning, and then pronounced a final time.

The last three predictor measures assess different kinds of syntactic skills. For morphological—signals (Nagy, Berninger, Abbott, Vaughan, & Vermeulen, 2003, an adaptation of Singson, Mahoney, & Mason, 2000; and Tyler & Nagy, 1989, 1990) that assesses awareness of the role of suffixes in marking grammar (tense, plurality, part of speech) at the word-level, the child selected one of four choices (pseudomorphs—pseudowords with a real inflectional or derivational suffix that fits a sentence context). For example, of these choices–wibbled, wibbling, wibbler, wibbly—only wibbler fits in the sentence context: The _____________ really enjoys sharing his sport with others. Such grammatical awareness at the word-level may serve as a bridge to syntactic awareness at the sentence-level in reading comprehension and written composing. Performance on the Signals test assesses morphological knowledge independent of reading ability because the examiner read each sentence and the choices to the child who could read or look at them silently while the examiner read them orally. For silent sentence-reading comprehension fluency, which assesses dual attention to both word-level word identification and syntax-level reading comprehension (Berninger, Abbott, & Alsdorf, 1997), the child chose the one sentence in a set of three that was meaningful (Berninger, 2001). Although the three sentences contain only real words, two contain one word (a foil) that renders the whole sentence meaningless. Successful performance requires syntactic processing, but not necessarily syntactic knowledge of what constitutes a complete sentence. For administration of sentence integrity, see Study 1 Methods.

Data analyses

Correlations were performed to evaluate whether measures chosen, for theoretical reasons and evidence from past research regarding their relationship to composing, were correlated with the outcome of interest—sentence combining, at each grade level. If they were, measures were entered as predictors into the multiple regression. The purpose of the multiple regression was to identify which predictors in a set, which were shown to be correlated with the outcome, explained unique variance in the outcome over and beyond shared covariance with other predictors.

Results and discussion

The correlations between sentence combining and the other five predictors are reported in Table 1. For the first, second, and third grades, the first four predictors were significantly correlated with sentence combining (see Table 1) and thus were entered into the multiple regression with sentence combining as the outcome. In the fourth grade, when the sentence integrity measure was also correlated significantly with the outcome (Table 1), it was also entered, along with the other predictors, which were still correlated with the outcome, into the multiple regression.

Table 1 Correlations of predictors with WIAT II Sentence combining outcome by grade level

Developmental changes

Results of the multiple regressions are reported in Table 2 by grade. All regressions accounted for significant variance at each grade level. In interpreting results, keep in mind that both syntax and transcription are taken into account in scoring sentence combining. In first grade, only morphological signals, a word-level task, explained unique variance in sentence combining. Thus, endings on words that mark grammar functions (Nunes, Bryant, & Bindman, 1997, 2006) are related to sentence combining. Morphology serves a unique scaffolding function within and across levels of language, beginning in first-grade writers: It is both (a) a bridge across spoken and written words at the word-level where morphemes correspond to both phonology and orthography, and (b) a bridge across the word- and syntax- levels for relating word-level suffixes marking grammatical function to sentence syntax. In second grade, only spelling, which shares common variance with morphological knowledge (Carlisle, 1994; Carlisle & Nomanbhoy, 1993; Garcia, 2007; Garcia, Abbott, & Berninger, 2010) and marks parts of speech (grammar) (Nunes & Bryant, 2006; Nunes, et al., 1997, 2006; Tyler & Nagy, 1989, 1990), explained unique variance in sentence combining. In third grade, subword- (handwriting automaticity) and word- (spelling and morphological signals) levels explained unique variance in sentence combining. This result is consistent with prior findings that both transcription skills (letter writing and word spelling) (Berninger & Amtmann, 2003; Berninger & Swanson, 1994) and morphological knowledge, as already explained, are related to writing in grades one to four. In fourth grade, both word-spelling and sentence integrity explained unique variance in sentence combining.

Table 2 Multiple regression for constant outcome (WIAT II sentence combining) grades 1–4 and predictors (automatic handwriting, spelling, morphological signals, and silent reading comprehension fluency grades 1–4; plus sentence integrity in grade 4)

Two findings observed only at the fourth grade level provide converging evidence for the hypothesis that the relationship between syntax knowledge and transcription during translation changes from the first three grades to the fourth grade. First, sentence integrity, which reflects knowledge of what constitutes a complete sentence, was not correlated with sentence combining, which reflects syntactic knowledge enabling flexible transformation of syntactic structures for more concise idea expression, in the first three grades but was in fourth grade (Table 1). Second, in fourth grade both word-level transcription (spelling) and syntactic level (sentence integrity) skills explained unique variance in sentence combining, whereas earlier only word-level skills did. Thus, the first two hypothesis-driven predictions were confirmed.

Differences across reading and writing modes at the syntactic level

That the silent sentence-reading comprehension fluency measure did not explain any unique variance in the sentence-combining measure, even though it was correlated with it, is of some interest, especially because children have to read each of the sentences to be combined and the new sentence they are constructing on this task. One explanation for this finding is that reading comprehension and written expression may draw on syntactic knowledge in different ways. Also, the silent reading measure used may have focused readers’ attention more on the details within words, whereas the sentence combining task requires (a) focusing attention on word meaning or relationships among words across the syntax unit, (b) omitting or reordering words, and (c) introducing new words or phrases to express the same idea(s) in a more concise way. Thus, multiple kinds of knowledge and processing may contribute to syntactic knowledge underlying sentence re-construction.

Study 3

Writing tasks

In years 2 and 4, when children in cohort 1 were in second or fourth grade and children in cohort 2 were in fourth or sixth grade, they wrote essays, first by pen and then by keyboard. In years 3 and 5, when children in cohort 1 were in third or fifth grade and children in cohort 2 were in fifth or seventh grade, they wrote narratives, first by pen and then by keyboard. For the essay-by-pen task, children were given this topic, “Explain what a computer is and what it does to someone who has never seen one or used one. “They were instructed to write sentences to complete the essay. For the essay-by-keyboard task, children were given this topic, “Explain what a robot is and can do to someone who has never seen one or used one.” They were instructed to add sentences to complete the essay. For the narrative-by-pen task, children were given this topic, “One day at school a funny or surprising thing happened…” and were asked to choose funny or surprising and then add sentences to complete the story. For the narrative-by-keyboard task, they were given this topic, “One weekend at home a funny or surprising thing happened…” and were asked to choose funny or surprising and then add sentences to complete the story. For both the essay and the narrative, children had ten minutes to complete their composition, and, if they stopped writing before the ten minutes had elapsed, they were prompted up to two times to continue their writing with, “What else can you think of?”

Before writing the essays or narratives, children completed a sentence integrity task by pen and then by keyboard. In years 2 and 4, the topic for the sentence-by-pen task was “Writing,” and for the sentence-by-keyboard task was “Reading”. In Years 3 and 5, for the sentence-by-pen task the topic was to write one sentence about “My favorite thing at school …”, and for the sentence-by-keyboard task, the topic was to write one sentence about “My favorite thing at home …” There was no time limit for writing the single good sentence. For additional details about the tasks, administration procedures, and lack of effects due to pen tasks being administered earlier in the session than the keyboard tasks, see Berninger et al. (2009).

Coding syntactic categories

In the first phase of data analyses, a linguistic coding system was developed to account for all observed syntactic constructions in the extended text writing (essays or narratives) and sentence writing of both cohorts from the second to seventh grades. In the second phase of data analyses, the coding scheme, which identified eleven kinds of syntactic structures in the children’s writing, was used to code the frequency of occurrence of each of these categories across all writing samples that were codable (the child produced writing that met research criteria established for reliable scoring). Results were used for making comparisons by the three contrasts of interest: different levels of language (text or sentence), transcription mode (pen or keyboard), or genre (essay or narrative).

In the process of reading the writing samples to develop the coding scheme during the first phase, we noted that the child writers generally, but not always, used both an initial capital letter and final punctuation to mark sentence boundaries. Because the goal of the third study was to identify frequency of occurrence of kinds of syntactic constructions in children’s writing and not to study their reliable application of capitalization and punctuation, the following coding rules were adopted:

  1. (a)

    Units of writing were coded for syntax if (a-1) The child had capitalized and punctuated (clearly marked the unit as a sentence); or (a-2) The child had either capitalized at the beginning or punctuated at the end and the unit did not seem to be part of either a preceding sentence clearly punctuated or succeeding sentence clearly capitalized. The lack of either a capital letter or punctuation mark was likely due to momentary inattention rather than lack of intention to construct a syntactic unit. Considerable inspection showed that use of one marker (capital letter or punctuation mark), but not both markers, occurred occasionally in protocols in which sentence units were generally marked by capital letters and punctuation marks. Rarely, except in first grade, did a child produce written text without any capital letters or punctuation marks, but in such cases written productions were not coded for kinds of syntax in children’s writing, which is why Table 3 indicates the N for the number of writing samples available for coding each level of language, transcription mode, and genre for each cohort and year.

    Table 3 Frequency of each of 11 coded written syntactic structures for level of language (text or sentence), transcription mode (pen or keyboard), and genre (essay or narrative) by grade level. N number of codable writing samples; fewer were available at first grade than other grades because some children did not capitalize or punctuate so that sentences could be differentiated. See “Appendix” for coding criteria with examples
  2. (b)

    Clauses correctly joined by a semicolon or conjunction were coded for nature of complex clause construction.

  3. (c)

    Fragments were constructions that were less than a main clause.

The final coding scheme had eleven categories, each of which is illustrated in the “Appendix”. The coding scheme differentiates among (a) a single independent main clause, (b) an immature, noncordinate main clause construction, in which a coordinating conjunction (e.g., And) is placed at the beginning of an independent clause (McCutchen, 1987), (c) coordinate main clauses in which two independent clauses are combined with either a coordinating (and) or correlative (either …. or) conjunction, (d) four kinds of independent clause plus dependent clause construction (relative, complement, subordinate, and adverbial), and (e) other (quotations, non-clausal constructions, and fragments). Of considerable research interest was how often child writers use primarily single independent clauses and how often they use a complex construction with an independent and a dependent clause and, when the latter, what kind of complex construction is used.

To calculate inter-rater reliability for the coding scheme developed in the first phase, a research coordinator and a graduate research assistant randomly selected 20 writing samples for each of four tasks from different years, cohorts, and writing ability levels: narratives by pen, the sentence integrity task by pen, essays by keyboard, and the sentence integrity task by keyboard. One rater served as the anchor for computing percent agreement between coders. The inter-rater reliability was generally very good for each coded category. Except for two categories, initial percent agreement ranged from 84.6% to 100%. For the only two categories of low interrater reliability, subordinate and adverbial clauses, additional training and reliability checks improved inter-rater reliability for these to acceptable levels of over 0.80.

Table 3 reports the frequency of each of the eleven coded syntactic categories for two genre of extended text for each transcription mode—essays in second, fourth, and sixth grade and narratives in third, fifth, and seventh grade, and sentence writing in grades 2 to 7.

Data analyses

Descriptive analyses were used to identify the frequency with which various kinds of syntactic structures were observed in children’s writing at different levels of language, by different transcription modes, and across genre. Inferential statistical analyses were then used to examine these contrasts from the perspective of drawing conclusions about the syntactic structures that are likely to appear in writing produced by second through seventh graders.

Results and discussion

Descriptive analyses of coded syntactic categories

Observed frequencies for each of the eleven coded syntactic categories are displayed in Table 3, which is organized by writing task (essays or narratives or single sentence), transcription mode (pen or keyboard), and grade levels (second to seventh). The coded categories did account for the variation in syntactic structure observed. Inspection of the frequencies of the coded syntactic structures across levels of written language, transcription mode, and genre, and grade levels identified the following patterns in the data.

For extended text writing (essays or narratives), consistently across genre and transcription modes from grade one to seven, the most frequent syntactic construction was the single independent clause. Single independent clauses introduced with a coordinating conjunction, an immature form of writing, occurred far less often. Constructions involving two independent clauses were connected with coordinating conjunctions far more often than with correlative conjunctions. Of the four kinds of syntactic constructions involving an independent clause and a dependent clause, relative clauses occurred the most often in essays. Although subordinate clauses occurred the most often in narratives, adverbial clauses occurred relatively infrequently, but more often in narratives than in essays. Quotations and nonclausal constructions were more likely to occur in narrative than essay texts. Fragments occurred at a relatively low frequency in both genres.

However, for sentence writing tasks, both single independent clause and independent clause plus dependent clause constructions occurred, but more complex T-units (independent clause with dependent clause) were produced than single independent clauses at grade three for sentence writing by pen (cohort 1 only), and at grade five (both cohorts) and grade seven (cohort 2) for sentence writing by pen and by keyboard.

Thus, whether children were asked to write an extended piece of writing (essay or narrative) or just a single good sentence appears to be related to whether a single independent clause (more likely for multi-sentence construction) or an independent clause plus a dependent clause (more likely for a single-sentence construction) was produced. This finding was contrary to the prediction that single independent clauses would be more likely to occur in single sentences (sentence integrity task), but more complex clausal constructions were more likely in extended text of essays and narratives.

Developmental trends were also evident. Although many single clauses were written across the grades and genre, the number of complex syntactic constructions increased for the older compared to the younger writers (see Table 3). These results are consistent with other reports that the syntactic complexity becomes more advanced over the grades (Hunt, 1965, 1966, 1970). However, the exact trajectory of this development depends on a number of factors, including the specific measure of syntactic complexity used and genre (Beers & Nagy, 2008), as the analyses for genre later in this section also show.

Inferential statistical analyses: contrasts for levels of language and transcription modes

Table 4 summarizes the results of repeated analyses of variance that compared the clausal constructions for extended text by transcription modes at different grade levels. Table 5 summarizes comparable findings for the sentence writing tasks. These analyses were performed only on frequently occurring syntactic categories of theoretical interest: single independent main clauses and three kinds of independent plus dependent clause constructions—relative, complement, and subordinate. For these analyses, extended text writing and single sentence writing were analyzed separately because of differences in their lengths and thus opportunity to use the different kinds of syntactic constructions.

Table 4 Differences between transcription modes by pen and keyboard for specific clausal productions in extended text (essays and narratives) in grades 2–7
Table 5 Differences between transcription modes by pen and keyboard for specific clausal productions in sentence writing in grades 2–7

As shown in Table 4, in grades 2 to 7 for one independent clause, the main effect for transcription mode was always significant for text writing. The means showed that more single independent clauses were produced when writing by pen than by keyboard. These findings converge with others showing that for essays children wrote more words and wrote words faster (Berninger et al., 2009) and expressed more ideas (Hayes & Berninger, 2010) by pen than by keyboard.

The findings were mixed for the syntactic constructions involving an independent clause plus a dependent clause. For independent plus relative clause, transcription mode was significant in only three of the eight analyses, for which independent plus relative clause constructions were produced more often by pen. Likewise, the transcription mode was significant for only one of the eight analyses for independent plus complement clause and two of the eight analyses for independent plus subordinate clause. In all these cases more independent plus complement clause constructions were produced by pen than by keyboard.

Length of text written appears to matter for transcription mode effects, as also found by Berninger et al. (2009). Results for single-sentence composing were different from what was found for multi-sentence composing and in general were mixed, as Berninger et al. (2009) had also found. As shown in Table 5, the results for the same eight outcomes on each of the sentence writing tasks given in years 2, 3, 4, and 5 were not clear cut, but were certainly different from those observed for extended text writing of either essays or narratives. For the four significant effects for a single independent clause, three favored more clauses on average when writing by pen than by keyboard. However, the one significant transcription mode effect for independent plus relative clause favored more of these clause constructions by keyboard than by pen. Of the four significant mode effects for independent plus subordinate clause, three favored more of these clause constructions by keyboard than pen. No transcription mode effects were found for independent plus complement clause constructions.

Thus, converging evidence, based on multiple dependent measures—number of words, rate of word production, and spelling (Berninger et al., 2009), ideas expressed as information (Hayes & Berninger, 2010), and now clausal structure in Study 3, supports the conclusion that in early and middle childhood the advantages of writing by pen are observed only on writing tasks requiring extended writing to construct multi-sentence texts (essays or narratives). That is, they are most often observed when the translation process is sustained over time in working memory to construct multiple sentences rather than executed relatively briefly to construct a single sentence or produce a single letter (Berninger et al., 2009). One possible explanation for this finding, which requires further research to substantiate, is that the act of forming letters by pen rather than selecting them on keyboard is more able to engage the orthographic loop of working memory, that is, the orthographic letter or spelling representations in internal memory linked to sequential finger and hand movements during the act of writing. Consequently, when writers more readily engage the orthographic loop to form letters they can sustain composing in working memory over time longer stretches of time to create extended text. When the task requires only letter- or single-sentence construction, the effects of transcription mode on orthographic loop may not be as robust because the demands on working memory are not as great.

Future research might also address how any benefits found for writing by pen when translating ideas into multiple sentences may (a) enhance efficiency of temporally constrained working memory (e.g., Fayol, 1991, 1999, 2004; Hayes & Chenoweth, 2006) in ways other than engaging orthographic loop, or (b) depend on the complexity of the clausal construction and other cognitive constraints beyond the syntax level. Of some interest, sentence integrity first explained unique variance in sentence combining in fourth grade, which is an important transition time in writing development when writing requirements of the curriculum increase. For example, only word-level working memory contributed uniquely to writing and reading achievement in the first three grades in normally developing students, but both word-level and syntax-level working memory contributed uniquely beginning in fourth grade (Berninger et al., 2010b).

Future research might apply the Eye and Pen device developed by Alamargot and colleagues (e.g., Alamargot, Chesnet, Dansac, & Ros, 2006) to study the timing parameters in language bursts and pauses between them during writing (Hayes & Chenoweth, 2006) for constructing single independent clauses versus independent plus dependent clauses by pen and by keyboard, to determine if the current results replicate and, if so, rule between alternative explanations for them. These initial descriptive studies need to be extended with on-line experiments (see Fayol, in press, for overview) to understand more fully the translation process as it unfolds in real time.

Inferential analyses: genre contrasts

Table 6 summarizes the results showing that, contrary to prediction, the most frequent syntactic category for multi-sentence constructions was the single independent clause. However, complex clause constructions involving an independent clause and dependent clause did occur in multi-sentence text, even if less frequently, but the nature of the complex causal construction varied by genre. For essays in grades two and four (year 2) and grades four and six (year 4), the relative ordering of frequency after single independent clause from most to least was (a) independent plus relative clause, (b) independent plus subordinate clause, and (c) independent plus complement clause. For both narratives in grades three and five (year 3) and five and seven (year 5), the relative ordering after single independent clause from most to least was (a) independent plus subordinate clause, (b) independent plus complement clause, and (c) independent plus relative clause. In sum, relative clauses were more prevalent in essay writing than in narrative writing, and subordinate clauses were more prevalent in narrative writing than essay writing.

Table 6 Relative production of the four most prevalent clause constructions by genre

These results are consistent with research reports showing that early in writing development children (a) differentiate between different genres (Berman & Nir, 2004; Purcell-Gates, 1988), and (b) use genre for different communication purposes (Hudson & Shapiro, 1991). They are also consistent with findings that syntax and genre interactions continue to show interesting relationships in adolescence (Beers & Nagy, 2009).

General discussion

Individual and developmental differences in writing complete sentences

As shown in Study 1, most, but not all, beginning and developing writers could write complete sentences, and the percentage who could increased from grade one to grade three and stayed about the same in grade four. For those who could write a complete sentence, they had probably learned during the preschool and early school years what a sentence is from (a) producing clausal constructions in their oral conversations, (b) looking at the printed text while listening to adults and older children reading books to them, and (c) early experience in translating their ideas into speech and then to writing on paper.

However, two to nine percent of the child writers produced after thoughts (run-ons) and partial thoughts (fragments), showing that thoughts are not always translated into complete syntactic structures during the writing process. These children may require specialized instruction that heightens their syntactic construction abilities across oral and written language.

Developmental changes in combining sentences

Ability to write just one sentence does not mean that child writers have acquired full understanding of what a complete thought or what a well written sentence is. Nevertheless it is encouraging and interesting that complete-sentence construction emerges as early as first grade for many child writers, as reflected on the sentence integrity task—writing a complete thought rather than a partial thought or after thought. Although scoring on the sentence combining task always took into account both syntax (acceptability and quality) and transcription skills, only in fourth grade did both the sentence integrity task and word-level transcription contribute uniquely to it. This developmental pattern in the results suggests that the relationship between syntax and transcription during translation may change from the first three grades to the fourth grade. Future research might explore via on-line experiments the nature of the syntactic awareness that contributes to emerging sentence construction ability and how syntactic awareness might evolve over writing development. Just as study of linguistic awareness (Mattingly, 1972), including phonological awareness (Liberman, Shankweiler, Fischer, & Carter, 1974) and word awareness (Ehri, 1987), has proved fruitful in reading acquisition research, so might it in writing acquisition research.

Contributions of morphology and syntax to translation

In first grade, the morphological signals test, which assesses knowledge of how suffixes mark grammar information related to sentence syntax, contributed uniquely to sentence combining, even though syntax (sentence integrity task) did not. Thus suffixes which mark tense, number, and part of speech may play an early role in learning how to use grammar to construct syntactic structures.

In the current study, syntax was conceptualized as a structural unit for translating ideas into language. We did not code for part of speech (grammatical roles of certain kinds of words within the syntactic structure), but do not minimize the importance of parts of speech for understanding how ideas are translated into written language via (a) content words (nouns and verbs and the adjectives and adverbs that qualify them, respectively), (b) function words (conjunctions, prepositions, pronouns, and articles that glue content words to each other within the syntactic envelope), and (c) phrases that describe actions, states of being, attributes, and relationships. Rather, we focused on the number and nature of the syntactic structures that may influence how complex ideas are expressed in syntactic units the writer constructs based on language-specific word order. These multi-word, syntactic units contain as a minimum a subject and a predicate, which together are analogous to the topic/comment units observed in children’s early language development. The topics become more complex across language development—from simple objects or actions to complex phrases—and so do the comments, which are packaged within the larger syntactic envelope.

We used two kinds of syntactic construction measures to allow for the possibility that developing writers are learning to translate ideas into an ordered series of words with underlying structure (sentence integrity) as well as to re-translate ideas flexibly by altering syntactic structures (sentence combining). Undoubtedly many other aspects of syntax are involved in mastering the translation process whereby ideas are expressed via written language.

Although sentences are believed to represent complete thoughts, much theoretical and empirical work remains to be done to understand fully what thought is, let alone a complete thought. The idea of a complete sentence may be but one way thought can be expressed in language. Moreover, sentence construction draws not only on the word- and syntactic-levels of language during translation, but also in a multi-sentence text on non-linear discourse-level structures for deciding what to write for the next sentence in reference to an earlier topic and in reference to the emergent non-linear discourse schema (Berninger, Fuller, & Whitaker, 1996b).

At the word level, written words index the conceptual structures of the cognitive representation system which range from categorical (with prototypical and non-prototypical features) to non-categorical spreading networks of associations. The challenge for the developing writer is to find a single good word (a single lexical item) to translate what may be a multi-dimensional, underlying cognitive representation. For further discussion of these issues, see Stahl and Nagy (2005).

At the multi-word, syntactic level, words are sequenced within the linear syntactic envelope to express ideas via (a) the predicate and its arguments (Kintsch & Van Dijk, 1978), grammatical relationships (e.g., for singular and plural forms of subjects and predicates, Fayol, Totereau, & Barrouillet, 2006), and cohesive ties including but not restricted to pronominal references (Halliday, 1987).

Discourse level

However, at the discourse level of written composition, the syntactic envelope also functions in a non-linear fashion to mark local relevance to a topic in previous text and global relevance to emergent structures that transcend linear structures (Berninger et al., 1996b). Thus, the deep structures may not be in language alone (Chomsky, 1965) or cognition alone, but rather in the complex cognitive-language mappings for translation that brings the unconscious into consciousness. For analogous arguments at the word-level regarding vocabulary instruction, see Stahl and Nagy (2005).

Role of syntax construction in writing instruction

In recent years the importance of teaching sentence grammar in the instructional program in writing has been dismissed based on the meta-analyses showing that intensive diagramming of parts of speech in sentences did not transfer to improved quality of composing (Hillocks, 1986). However, these studies confused levels of cognition-language mapping in their analyses. Instruction focused on the syntactic level, but its effectiveness was assessed for quality of content and organization at the discourse level. The more appropriate questions are whether instruction (a) at the syntactic level transfers to improved sentence construction and awareness, and (b) at the discourse level transfers to improved extended text writing quality.

Sentence diagramming is not the only way to teach syntactic and sentence awareness in first through seventh grade. Evidence-based options include (a) language cueing to find words performing specified sentence functions and sorting games to develop awareness of linguistic cues for guiding children to develop both syntactic and morphological awareness (Berninger & Wolf, 2009); and (b) explicit strategies such as sentence combining (e.g., Graham & Perin, 2007; Nelson et al., 2009; Saddler & Graham, 2005), which is essentially re-translating and may provide a developmental foundation for learning to revise to improve translation. However, additional research on syntactic construction is needed in order to optimize the writing achievement of all students at all levels of language, including word choice, syntax construction, and creation of discourse schema.

In this study we applied the methods of descriptive linguistics to the writing samples of developing children from first through seventh grade. What was learned about early and emerging syntactic construction may hopefully inform future instructional as well as developmental research on writing. Just as the eminent writer, Mark Twain, complained that reports of his death were premature, so may be the reports of the death of teaching syntax. Developing syntactic awareness in child writers may enhance their ability to translate their thoughts into carefully crafted written sentences, alone or in text.