Introduction

Reciprocal behaviors, one of the most important social interaction skills, are defined as mutual and symmetrical exchanges between individuals (Gernsbacher 2006). Reciprocal behaviors are pivotal to the minute process of collaborative behaviors between individuals who perform activities to reach shared goals, such as when children are talking, working, or playing together (Cole and Teboul 2004). The ability to execute reciprocal behaviors develops with age, depending on the development of cognitive skills and emotional functioning (Perez and Gauvain 2005). At around 2 years of age, typically developing children play in proximity with each other and do the same type of activity but without mutual exchange, called parallel activity. Gradually, children begin to interact with other children and develop basic reciprocal behaviors, such as equal turn taking and object sharing (Eckerman et al. 1989; Warneken and Tomasello 2006). At 3 years of age, children start to share themes among playmates (e.g., building a sandcastle together) (Howes 1988). During middle childhood, collaborative reciprocal behaviors emerge when the children understand others’ goals and intentions; therefore, children can share their psychological states with others (Tomasello et al. 2005). This sharing behavior is composed of basic to complex reciprocal behaviors and enables children to play with a full variety of such behaviors to achieve a common goal, such as building a small play house together or making a joint drawing (van Ommeren et al. 2012). Therefore, reciprocal behaviors play a decisive role in child development in interactive play and social interaction.

Social interaction deficit is an obvious symptom in children with autism spectrum disorder (ASD), manifested by the lower frequency and complexity of reciprocity in their social development (Spence et al. 2004; van Ommeren et al. 2012). Children with ASD have been found to be unable to show appropriate reciprocal behaviors with peers and adults, including the failure to develop peer relationships, the lack of spontaneous seeking to share enjoyment, interests, or achievements with other people, and the lack of social or emotional reciprocity (Sally and Hill 2006). Several studies have also shown that children with ASD have lower social network centrality and less friendship reciprocity than do typically developing children (Chamberlain et al. 2007; Kasari et al. 2011; Rotheram-Fuller et al. 2010).

The neural correlates of human reciprocity have been investigated as possible intervention targets (Caceda et al. 2017; Kuss et al. 2015; Sakaiya et al. 2013; van den Bos et al. 2009). These neuroimaging results have indicated that the right middle dorsolateral prefrontal cortex (DLPFC), dorsal caudate, ventromedial prefrontal cortex (VMPFC), and precuneus are activated when individuals perform reciprocal actions (Sakaiya et al. 2013). The region of the DLPFC is known to be associated with human executive function, and the VMPFC and precuneus are known as the theory of mind (ToM) regions. Therefore, reciprocity performance may be affected by ToM and EF, which are respectively controlled by the ToM regions and the DLPFC. However, so far, no behavioral research has been conducted to directly examine the correlates of reciprocity in children with ASD.

ToM and EF have also been examined for impaired reciprocity in children with ASD. Children with ASD have been found to have impaired ToM (Adibsereshki et al. 2015; Baron-Cohen et al. 1985; Schuwerk et al. 2016), which may lead to impairment in social interaction, such as collaboration and reciprocity (Sally and Hill 2006). ToM is a cognitive ability (Adolphs 2001) with which a person understands and infers the mental states (e.g., belief, desires and intensions) of others and uses such information to explain and predict behaviors and actions (Premacka 1978). Therefore, ToM is essential for a person to develop reciprocal conversation and social interactions (Mastrangelo 2009). In Sally and Hill’s (2006) study, three types of strategy games were used to investigate the relationship between ToM and reciprocity in children with ASD. However, the results showed that children with ASD had impaired ToM ability in both first and second-order false belief tasks, but not in the reciprocity game. Their results also showed that the reciprocity of children with ASD was similar to that of typically developing children, even though social reciprocity deficit is an obvious limitation in the daily lives of children with ASD. It may be that children with ASD perform better on a structured computer reciprocity game and worse in unstructured daily life situations.

In addition to ToM, EF is another neural correlate of reciprocity. EF is a collection of interrelated cognitive and behavioral skills that are responsible for purposeful, goal-directed activity, include the highest levels of human functioning (Lezak 1995). EF allows us to access information, think about solutions, and implement those solutions. Currently, EF can be divided into cool and hot EF. Cool EF is defined as goal-directed and future-oriented skills that are manifested within abstract, decontextualized, nonemotional and analytical conditions, which is a more pure cognitive aspect of EF associated with the DLPFC (Metcalfe and Mischel 1999). Hot EF is goal-directed and future-oriented cognitive processes manifested within the contexts of endangering emotion and motivation (Zelazo and Carlson 2012; Zelazo and Mu¨ller 2002; Zelazo et al. 2005). Hot EF is the affective aspect of EF and associated with the orbitofrontal cortex (OFC), an area of the VMPFC, and other medial regions (Happaney et al. 2004) for reappraisal of the affective or motivational significance of stimuli (Rolls 2004). Traditionally, the cool aspect of EF has been predominately investigated in children with ASD, but the results have been inconsistent (Craig et al. 2016; Geurts et al. 2009; Hill 2004). Some have suggested that children with ASD have cool EF deficits (Hughes et al. 1994; Lai et al. 2017; Szatmari et al. 1989), while others have argued to the contrary (Dawson et al. 1998; McEvoy et al. 1993; Yerys et al. 2007). This inconsistency may cause difficulty in recognizing the relationship between EF and social interaction (Zimmerman et al. 2016). As regards hot EF, although children with ASD have hot EF deficit (Zimmerman et al. 2016), so far, no studies have investigated the relationship between social interaction and hot EF in children with ASD.

In summary, children with ASD have obvious deficits in reciprocity in their daily living contexts. The regions controlling ToM and EF have been found to be the neural correlates of reciprocity, but to the best of our knowledge, no clinical behavioral studies to date have examined the relationships of ToM and EF simultaneously with reciprocity in children with ASD to confirm the neuroimaging evidence. In addition, the structured computer game used in past studies might not reflect the reciprocal performances of children with ASD in real-life reciprocal contexts (Sally and Hill 2006). Therefore, to resolve the above two issues, this study aimed to investigate the correlates of reciprocity in children with ASD by considering cool EF, hot EF, and ToM with a more unstructured assessment in reciprocity. If clinicians are informed of these evidenced cognitive correlates of reciprocity, assessment and intervention plans could be better targeted to better improve the children’s reciprocity in their interactive play and social interaction.

Methods

Participants

Children with ASD were recruited from January 2015 to September 2015 from teaching hospitals, clinics, and developmental centers of Taiwan. The inclusion criteria were: (a) formal diagnosis of autistic disorder or Asperger’s syndrome according to the DSM-IV or DSM-IV-TR by a trained psychiatrist or pediatrician; (b) age of 4–12 years; (c) basic verbal ability, indicated by a score of 60 on the WISC-IV or WPPSI-IV (determined by a pilot study); (d) ability to follow orders and complete all procedures; (e) absence of symptoms associated with organic brain dysfunction (e.g., seizures), and (f) absence of uncorrectable visual or hearing impairments.

Measures

Reciprocity: Interactive Drawing Test (IDT)

The IDT was developed by Dr. Tineke Backer van Ommeren and Dr. Sander Begeer (van Ommeren et al. 2012). The IDT was designed to examine reciprocity in a joint, unstructured situation similar to unpredictable aspects of real-life social interactions. No drawing skill is needed in this test. The test elicits spontaneous interaction with the experimenter to achieve a mutual goal, which takes approximately 10 min. For administering the IDT, a table, drawing paper of size A3, four colored markers (red, blue, black, green), and a video camera are needed. The experimenter starts the test by saying, “We are going to draw together. You may choose a marker.” This is the only instruction for the children. The desired goal of the IDT is equal participation of the participants. The experimenter is instructed to be reciprocal with increasing dynamics (i.e., initiating his or her own objects and contributing to the child’s objects) and to elicit reciprocal behaviors by drawing incomplete objects and by giving examples of reciprocal responses (i.e., adding elements to the incomplete drawing of the child). Then the experimenter begins turn-taking by turning the paper to the child; this is repeated throughout the administration. The experimenter starts the test by drawing a line. The first thing the experimenter draws is a house. The experimenter has to finish it in five turns of only one line per turn. The experimenter also has to draw things that are not finished (e.g., a part of a window or part of a car) so that the child is always able to add meaningful elements to the objects drawn by the experimenter. In addition, the experimenter can partly color an object to join in the creation of the objects drawn by the child without contributing new elements. After a few turns, the experimenter makes a specific and new contribution; i.e., an interfering input. In later objects drawn by the child, the experimenter adds absurd and damaging input.

Three types of reciprocal behaviors are measured in this assessment. The first is the proportion of reciprocal drawing (i.e., joining in by adding meaningful elements) of total turns, which is also called total reciprocity. Reciprocal drawing (total reciprocity) can be divided into the proportion of reciprocity in the other’s initiative and reciprocity in the child’s own initiative, which refers to the amount of reciprocity in the objects drawn by the experimenter and the amount of reciprocity in the objects drawn by the child, respectively.

Theory of Mind: Theory of Mind Task Battery (ToMTB)

The ToMTB (Hutchins et al. 2012) was designed to assess the ToM understanding of younger and older children, who vary widely in their cognitive and linguistic profiles. The ToMTB consists of 15 test questions within 9 tasks. Tasks are presented as short vignettes arranged in ascending difficulty, including identifying emotions associated with facial expressions, understanding the visual perspective of the experimenter, inferring desire-based emotion, perception-based belief, perception-based action, and first- and second-order false beliefs. The tasks are presented in a storybook format. Each page has color illustrations and accompanying text. Memory control questions are included, and the control questions must be passed in order for credit to be given on the test questions. Appropriate for verbal and nonverbal children with ASD, the test is constructed such that children can respond by answering verbally or by pointing to a picture that shows the correct answer. Each ToM question is scored as pass (1) or fail (0), and no credit is awarded for a ToM target question unless the associated control questions are passed. The ToMTB has been shown to have good test–retest reliability and internal consistency (Hutchins et al. 2008).

Cool Executive Function: Computerized Dimensional Card Change Sort Task (DCCS)

The computerized DCCS (Diamond and Kirkham 2005; Dichter et al. 2010) was administered and scored for cool EF in this study. The picture files consist of two response icons (a red truck and a blue star) and two stimuli (a blue truck and a red star). No stimulus matches a model picture in both color and shape. Therefore, the correct response for sorting by color is always the wrong response for sorting by shape, and vice versa. During a trial, the word “color” or “shape” indicates the relevant sorting criterion for that trial.

In this study, each participant first completed 15 practice trials with performance feedback (i.e., “correct” or “incorrect”) prior to the test blocks. After the practice trials, they completed seven blocks of test trials without feedback. Each block used one of the two sorting criteria, either color or shape. Blocks 1, 3 and 6 used the “color” criterion, blocks 2, 5 and 7 used the “shape” criterion, and block 4 was a mixed task consisting of color and shape sorting rules (pseudo-randomly intermixed color and shape trials with 13 non-switch trials and 7 switch trials). Each block comprised 10 trials, except for the mixed block, with 20 trials. Reaction time (RT) from stimulus onset to key press was measured in milliseconds. As soon as a key was pressed, the stimulus and cue word disappeared, leaving only the model pictures. Consistent with the criteria of Diamond and Kirkham (2005), only correct responses were included in the RT analyses. Additionally, trials in which an RT was more than 2.5 standard deviations above the mean and those in which RT was < 200 ms were dropped. Shifting cost was calculated as the difference between the last two trials in the first block and the first two trials in the second block.

Hot Executive Function: Children’s Gambling Task (CGT)

The children’s hot executive function was measured with the Children’s Gambling Task (CGT) (Kerr and Zelazo 2004). The Children’s Gambling Task, a simplified version of the Iowa Gambling Task (Bechara et al. 1994), involves two decks of 50 cards. The cards are divided into two parts, a white top and a black bottom. Black happy faces appear on the top half and white sad faces appear on the bottom half (Fig. 1). The number of happy faces on each card refers to the number of rewards that the participant gains, and the number of sad faces refers to the number of rewards that the participant loses. Rewards are represented by candies or toys determined by the child. A plastic box is positioned between the two decks of cards. When children gain or lose rewards, candies are moved into/out of the plastic box from/to a plastic container kept by the experimenter.

Fig. 1
figure 1

Sample card face from the disadvantageous deck (gain = + 2, Loss = − 4)

The two decks of cards are different. One is advantageous, and the other is disadvantageous. A child who chooses from the advantageous deck consistently earns a net gain in rewards, and a child who chooses from the disadvantageous deck consistently experiences a net loss of rewards. The number of gains is the same across the cards, but the number of losses is not. In the advantageous deck, cards always provide a gain of one reward (i.e., they show one happy face) together with zero or one loss (for an average of five candies gained per block of ten cards). In the disadvantageous deck, cards always provide a gain of two rewards (i.e., they show two happy faces) together with losses of 0, 4, 5, or 6 candies (for an average of five candies lost per block of ten cards). The order of cards in each is fixed, and the gain/loss contingencies associated with each card in each deck are proportional to those used in the Iowa Gambling Task (Table 1). The CGT score is calculated by subtracting the number of disadvantageous deck choices from the number of advantageous deck choices.

Table 1 Outcomes of each card in the advantageous and disadvantageous decks

Severity of the Symptoms of Autism: Childhood Autism Rating Scale (CARS)

The children’s autistic symptom severity was measured with the CARS (Schopler et al. 1980, 1988), which is a 15-item behavioral rating scale. Each item ranges from 1 (no abnormality) to 4 (severe abnormality), so the total score of the CARS ranges from 15 to 60. The rating of the CARS is based not only on clinical observation and classroom observation but also on a caregiver interview (Matson et al. 2010; Schopler et al. 1988). The cutoff point of a diagnosis of autism is 30 points. The score range from 30 to 36 represents mild to moderate symptoms of autism, and a score higher than 37 suggests severe symptoms of autism. The psychometric properties of the CARS have been well established. The CARS has good internal consistency (r = .896) and good inter-rater reliability (r = .796) (Breidbord and Croudace 2013).

Verbal Ability: Wechsler Intelligence Scale for Children®-Fourth Edition (WISC®-IV) or Wechsler Preschool and Primary Scale of Intelligence™-Fourth Edition (WPPSI™-IV)

Children’s verbal ability was measured with the Verbal Comprehension Index (VCI) of the Chinese version of the Wechsler Intelligence Scale for Children®-Fourth Edition (WISC®-IV) (Chen and Chen 2007/2003) or the Wechsler Preschool and Primary Scale of Intelligence™-Fourth Edition (WPPSI™-IV) (Chen and Chen 2013/2012). In the current study, the verbal abilities of the children aged from 4 to 6 years old were measured with the WPPSI-IV, and the verbal abilities of children older than 6 years old were measured with the WISC-IV.

The WISC-IV is an intelligence test for children aged from 6 years old to 16 years old 11 months consisting of ten core subtests and five additional subtests. The VCI contains the Vocabulary, Similarities, and Comprehension subtests. The standard mean score of the VCI is 100, and the standard deviation is 15. The Chinese Version of the WISC-IV, published in 2007, has good reliability and validity (Chen et al. 2009). The WPPSI-IV is an assessment for measuring children’s cognitive development from age 2 years 6 months to age 7 years 7 months. It consists of 14 subtests. The VCI for children aged from 4 to 7 years 7 months contains the Information and Similarities subtests. The Chinese Version WPPSI-IV, published in 2013, is for children aged from 2 years 6 months to 7 years 11 months and also has good reliability and validity (Chen and Chen 2013).

Procedures

The protocol of this study was approved by the Institutional Review Board of two teaching hospitals. After recruitment, participants’ caregivers were informed of relevant details of this study, and informed consent forms were signed by those willing to participate in the study.

Children’s data were collected in two visits to avoid fatigue and prolonged assessment. Each visit took about 1 hour, or the time to complete the assessments. At the first visit, the child was assessed with the WISC-IV or WPPSI-IV, and children who did not achieve a score of 60 (e.g., based on the norm for 8-year-old children) on the VCI of the WISC-IV or WPPSI-IV were excluded. The criterion of basic verbal ability was decided based on the results of a previous pilot study for a child to understand and follow the instructions for the present study. Children with sufficient verbal ability were then assessed with the ToMTB. Their caregivers filled out the basic information sheet and were interviewed with the CARS. At the second visit, children were assessed with the DCCS, the CGT, and the IDT by a trained administrator.

All the data of the IDT were collected by the first author. Her training on the IDT was specifically provided by the developer of the IDT, Tineke Backer van Ommeren (van Ommeren et al. 2012) from May 2014 to September 2015. The training process included: (1) developing familiarity with the manual, instructions, scoring criteria, and drawing inputs of the IDT; (2) standardizing the procedures and scoring criteria of the IDT; (3) practicing evaluation with the IDT; (4) assessing a child with ASD with the IDT, recording the evaluation process, rating the child’s reciprocal behaviors, and sending the drawing and scoring form to Dr. van Ommeren; and (5) correcting the scoring based on suggestions by Dr. van Ommeren. Steps (4) and (5) were repeated until Dr. van Ommeren approved the assessment skills, such as giving special input accurately and coding behaviors appropriately.

Statistical Analysis

The collected data were analyzed in SPSS 17.0 for Windows (SPSS Inc., USA) for the descriptive analysis and hypothesis testing. For descriptive purposes, means and standard deviations were computed for the demographic data of the children. Pearson’s correlation analyses (two-tailed) were computed among cool EF, hot EF, ToM, reciprocity, age, and verbal ability. The cut off points for the Pearson correlation coefficients were 0.25, 0.5, and 0.75, indicating mild, moderate, and good levels of correlation (Portney and Watkins 2007). Three hierarchical regression models were used to identify the significant predictors of reciprocity in children with ASD while controlling for age, verbal ability, and severity of the symptoms of autism. The dependent variables were the three IDT variables: reciprocal drawing (total reciprocity), reciprocity in own initiative, and reciprocity in other’s initiative. Independent variables were introduced to the regression models in two steps. In the first step, all of the controlled-for variables, including the age, VCI from the WISC-IV or WPPSI-IV, and CARS total score were entered and denoted as the baseline. In the second step, the variables of interest, including ToMTB total score, DCCS, and CGT variables, were added to the baseline model, the combination of which was denoted as the augmented model. Comparing the augmented and baseline models allowed us to examine the additional contributions from the variables of interest while controlling for the effects of age, VCI, and CARS total score. The F-test of overall significance and model comparison, the coefficient of determination (R2) and R2 change (ΔR2), and the regression coefficients were provided to determine the significance of ToM, cool and hot EF. To finalize the model, non-significant variables of interest were removed from the augmented model. The significance level was set at alpha < 0.05.

Results

A total of 59 children diagnosed with ASD (50 boys, 9 girls) were included in the present study, including 12 children with Asperger’s syndrome and 47 diagnosed with autistic disorder. Participants were recruited from nine clinics, six hospitals, and one developmental center in Tainan, Kaohsiung, and Taipei. Originally, a total of 75 children with autistic disorder or Asperger’s syndrome were recruited in this study. In the end, 16 children were excluded because they could not finish the whole process (n = 1) or pass the verbal ability criterion (n = 14), or some data were missing. The mean age of the 59 participants was 88.8 months. The children’s mean VCI on the WISC-IV or WPPSI-IV was 104.9, indicating that our participants had moderate verbal comprehension ability. The mean score of autistic symptom severity on the CARS is 30.2, indicating that the included participants had mild autistic traits. The mean score of socioeconomic status (SES) was 39.4, indicating the medium level of the socioeconomic level of the participants’ families (Hollingshead 1957).

The results of descriptive analysis of ToM, cool EF, hot EF, and social interaction are shown in Table 2. The children’s ToMTB scores indicated that the children in our sample had basic ToM development on average and could understand the first order false belief task. As for the DCCS, the means of the children’s accuracy in the single block, accuracy in the mixed block, reaction time in the single block, reaction time in the mixed block, and shifting cost are listed in Table 2. The mixed block in the DCCS was more difficult than the single block for the children due to the lower accuracy and longer reaction time. The children’s CGT mean score indicated that the children chose more cards from the advantage deck than from the disadvantage deck. In the IDT, the mean percentage of the children’s reciprocal drawing was 67.3%, including the mean percentage of the children’s reciprocity in their own initiative, 36.1%, and the mean percentage of children’s reciprocity in the other’s initiative, 31.5%. The IDT results indicate that most of the children could participate in the joint drawing activity to add meaningful elements, but it was more difficult for them to add elements to the other’s initiative.

Table 2 Sample characteristics of children with autism spectrum disorder (N = 59)

The results of the Pearson correlation coefficients among the ToM, cool EF and hot EF with reciprocity are shown in Table 3. DCCS-single block accuracy (r = .482, p < .01), DCCS-mixed block accuracy (r = .501, p < .01), and CGT (r = .396, p < .05) were significantly correlated with the reciprocal drawing with mild to moderate positive correlation coefficients. The significant correlates of reciprocity in the child’s own initiative were the variables of the DCCS-single block accuracy (r = .342, p < .01) and DCCS-mixed block accuracy (r = .441, p < .01). However, none of the variables were found to be significantly correlated with reciprocity in the other’s initiative. Furthermore, the VCI (r = .380, p < .01) was found to be significantly correlated with reciprocity in the child’s own initiative.

Table 3 Correlations on measurements of interest in children with autism spectrum disorder (N = 59)

The results of the hierarchical regression analyses are summarized in Table 4. For the model of reciprocal drawing, the results showed that in addition to age, VCI, and CARS, the DCCS-mixed block accuracy and CGT made significant additional contributions to predict reciprocal drawing (total reciprocity) with an F-value of 10.473, accounting for an additional 26.1% of the total variance. For reciprocity in the child’s own initiative, in addition to age, VCI, and CARS, only the DCCS-mixed block accuracy was significant with an F-value of 13.498, which additionally explained 17.2% of the variance in reciprocity in the child’s own initiative. For the last dependent variable, no variables of interest were significant in explaining reciprocity in the other’s initiative while controlling for age, VCI, and CARS.

Table 4 The hierarchical regression models of reciprocity in children with autism spectrum disorder (N = 59)

Discussion

The present study examined the relationships among ToM, cool EF, hot EF, and reciprocity in children with autistic disorder and Asperger’s syndrome while considering their verbal ability and autistic symptom severity. The results suggest that cool EF and hot EF are both significantly correlated with total reciprocity, and that only verbal ability is significantly correlated with reciprocity in the child’s own initiative. The present study fills important gaps in previous research to provide better understanding of the correlates in reciprocity in children with autistic disorder and Asperger’s syndrome by conducting a behavioral study of reciprocity, considering hot EF as a correlate, and using a reciprocity task similar to unstructured and unpredictable aspects of real-life social interactions.

In this study, total turns of reciprocal drawing (i.e., joining in by adding meaningful elements) is referred to as total reciprocity. Reciprocal drawing (total reciprocity) can be divided into the proportion of reciprocity in the other’s initiative and reciprocity in the child’s own initiative, which refers to the amount of reciprocity in the objects drawn by the experimenter and children, respectively. According to the correlational results, for reciprocal drawing, both cool EF and hot EF were found to be significant correlates. Children with better cool EF and hot EF ability may perform more reciprocal behaviors. In addition, cool EF and verbal comprehension ability were the significant correlates of reciprocity in the child’s own initiative. That is, children with better cool EF and verbal ability may have better reciprocity in their own initiatives. However, the correlate of reciprocity in the other’s initiative was not identified in the present study, so further investigation is warranted to discover other correlates of children’s reciprocity in others’ initiatives, such as social interaction style.

Regarding the significant correlation between cool EF and reciprocal drawing, the cool EF task, the DCCS, focuses on children’s abilities of shifting attention, impulse control and sustained attention. Therefore, children who are able to correctly shift their attention based on the sorting rule, inhibit the impulse to follow the rules, and sustain attention in the task may perform more reciprocal drawing behaviors because it is easier for them to inhibit their own impulses and then shift and sustain their attention to the collaborative theme. The result is consistent with previous neural process research reporting that the DLPFC, the neural region for cool EF, is the neural correlate of reciprocal behaviors (Sakaiya et al. 2013).

On the other hand, hot EF, a goal-directed and future-oriented cognitive process, is manifested within the contexts of endangering emotion and motivation (Zelazo and Carlson 2012; Zelazo and Mu¨ller 2002; Zelazo et al. 2005). Hot EF was found to be significantly associated with the capacity for reciprocal drawing behavior. It could be that the IDT is a task that requires maintained motivation, so the importance of the drawing themes may require reappraisal. Therefore, children with better hot EF ability can maintain their attention and motivation on the reciprocal themes and contribute meaningful elements toward the collaborative goals.

Two possible reasons may explain the non-significant relationship between ToM and reciprocal drawing. First, the drawing theme of the experimenter was obvious, so the children may have used their cognitive ability more than ToM, social cognition, to speculate on the experimenter’s behaviors. Second, the IDT is an unstructured and unpredictable task, so it is hard for children with autistic disorder and Asperger’s syndrome to generalize their ToM ability to this unstructured situation. As a result, they are unable to give proper responses. Despite being able to perform well on the ToM task, they cannot perform well on a reciprocity task in an unstructured context. The findings of the present study are consistent with those of previous studies, in which children with ASD were found to have difficulties in using their ToM in an unstructured context (Begeer et al. 2010; Dahlgren and Trillingsgaard 1996; Tomasello et al. 2005). High functioning adolescents and adults with ASD are able to pass the ToM task at various levels of complexity (Dahlgren and Trillingsgaard 1996). They can give proper responses in structured situations (Begeer et al. 2010), but they cannot respond properly in unstructured and unpredictable situations. However, Tomasello et al. (2005) have found that children with ASD have fewer reciprocal behaviors, possibly due to difficulties in understanding others’ intentions, which is inconsistent with the non-significant correlation between ToM and reciprocity found in the present study. In summary, further study is still warranted to better elucidate the relationship between ToM and reciprocity with consideration of the task characteristics, such as the collaborative themes and unstructured or structured tasks used in measuring children’s reciprocal behaviors.

One possible explanation is that the reciprocity in their own initiatives of children with ASD was greater than the reciprocity in the other’s initiative in our study. The less reciprocity in the other’s initiative may be due to their restricted and repetitive behavioral characteristics, which cause them difficulty in shifting their attention to another’s drawing theme. For instance, many of the children in this study would only focus on their own drawing themes of dinosaurs, cars, and busses, so it was difficult for them to follow the experimenter’s initiative. A previous study by van Ommeren et al. (2012) also suggested that children with ASD would perform more reciprocity on their own initiatives because children want to avoid the confrontation of unexpected situations and prefer to lead the drawing themes.

As for the positive correlation of verbal ability and cool EF to reciprocity in the child’s own initiative, a possible explanation may be that better verbal comprehension ability and better accuracy in the cool EF task revealed that children having better cognitive ability can better execute reciprocal behaviors based on their own initiatives. On the other hand, children with better cognitive ability still cannot perform more reciprocity in others’ initiative behaviors. This may be influenced by their characteristic of restricted behaviors, which led to focusing on their own theme in the IDT test. As seen in the correlational results, the relationship between cool EF-shifting and reciprocity in the child’s own initiative was negative (r = − .246, p = .061), which is close to statistical significance. A larger sample size may be needed to better examine the relationship between reciprocity in the child’s own initiative and cool EF.

In this study, based on our observation, children’s reciprocity performance on the IDT could be divided into three types: active, passive, and proper reciprocity. Children of the active type presented much reciprocity in their own initiatives, and the drawing themes usually focused on typical objects for the child, such as cars and busses. The child responded less to the experimenter’s initiatives, a tendency common in younger children diagnosed with Asperger’s syndrome. Children of the passive type exhibited much reciprocity in the experimenter’s initiative, and the children responded properly to the experimenter’s drawing theme. However, the experimenter had to lead the drawing theme, and these children tended not to initiate their own drawing themes. Children of the proper type exhibited balanced reciprocity in their own initiatives and in the experimenter’s initiative. The child and the experimenter would take turns leading the drawing theme, which would increase the dynamic between the experimenter and the child. These types of reciprocal behaviors on the IDT in the current study are similar to the four types of social interaction styles: typical, active-but-odd, passive, and aloof (Castelloe and Dawson 1993; O’Brien 1996; Scheeren et al. 2012). Active-but-odd children actively seek social interaction, but they do so in an unusual way (e.g., holding a monologue about a particular interest), which is similar to the active type of IDT performance (e.g., children focus on their own specific drawing themes and cannot properly respond to the experimenter’s theme). Passive children do not initiate social interaction, but they do respond suitably to another’s initiative, resembling the passive type of IDT performance (e.g., seldom initiating their own drawing themes but being able to add proper and meaningful elements to the experimenter’s drawing theme). Typical children act properly in social interactions, similarly to the proper type in IDT performance (e.g., the children both initiate their own themes and participate in the experimenter’s drawing theme). Aloof children seek no social interaction and do not respond to another’s initiative. These children usually have less verbal ability and are low-functioning; thus, this type may not be a factor in the present study because of the verbal ability inclusion criterion. Regarding the similarities between the types of IDT performance and social interaction style, children’s social interaction styles might be an important correlate of reciprocity in the child’s own or the other’s initiative.

The present study has two main limitations. First, the findings cannot be generalized to lower-functioning and non-verbal children with ASD or to children diagnosed with PDD-NOS because the present study included only children with better functioning, verbal ability, and diagnosis of autistic disorder or Asperger’s syndrome. Second, children’s reciprocity may be correlated with their social interaction styles, but the present study did not assess social interaction styles. Therefore, further study is warranted to (1) recruit children with lower-functioning and less verbal ability and a diagnosis with PDD-NOS if the corresponding measurements adjusted for these children are available; (2) apply a ToM measurement that is less verbally demanding; and (3) add a measurement for evaluating social interaction styles.

Conclusion

The present study examined the relationships of ToM, cool EF, and hot EF to reciprocity in children with ASD. This study, which used a relatively unstructured reciprocity task to better reflect the children’s daily reciprocity performance, provides the first step in considering hot EF in reciprocity. The results showed that children with autistic disorder and Asperger’s syndrome who have better cool EF and hot EF may have a better capacity for reciprocity. For clinicians, on the basis of this research, assessment plans could be developed to evaluate children’s ToM, cool EF, hot EF, and reciprocity as the foundation of intervention. With consideration of the children’s cognitive correlates, targeted interventions could be accordingly developed to improve the children’s basic and collaborative reciprocal behaviors in their interactive play and social interaction, such as equal turn taking, object sharing, theme sharing, and understanding of psychological states. For researchers, it is recommended that future studies include more low-functioning and non-verbal children with autistic disorder and Asperger’s syndrome, apply a ToM measure that is less verbally demanding, and assess social interaction styles to better illustrate the cognitive correlates in reciprocity.