Effective classroom management is dependent on several factors, including classroom rules that establish behavioral expectations designed to prevent inappropriate behaviors while enhancing the classroom climate and desired behaviors (Buck, 1999; Doyle, 2006; Jones & Jones, 2012; Mastropieri & Scruggs, 2014). Researchers have indicated that a single and generalized (e.g., broad) set of approximately three to five rules is a sufficient and effective method for managing the classroom (Malone & Tietjens, 2000). Several systems that incorporate a single set of rules and contingencies for rule breaking or following include the Good Behavior Game, the Caught Being Good Game, the ADHD Classroom Kit, and the Caterpillar Game (e.g., Filcheck, McNeil, Greco, & Bernard, 2004; Floress, Rock, & Hailemariam, 2017; Wright & McCurdy, 2011).

Although one set of three to five rules may be easy to learn, behavior expectations change as students switch from one activity to another (Skinner & Skinner, 2007). Consequently, a single set of three to five rules rarely provides clear and specific expectations for all activities across the school day, leaving students and teachers unsure of what rules are in place during specific activities (Fudge, Reece, Skinner, & Cowden, 2007). Students cannot be expected to follow unknown rules, and teachers cannot support student rule following unless the teachers know what rules are in place at all times (Fudge et al., 2008).

To address problems associated with unclear classroom rules, educators developed a classroom management strategy known as the color wheel system (CWS; Skinner, Scala, Dendas, & Lentz, 2007). The CWS incorporates multiple sets of classroom rules (i.e., green rules, yellow rules, and red rules) designed to provide specific behavioral expectations for different classroom activities. For example, yellow rules are designed for academic activities (e.g., teacher-led instruction) and include rules such as to remain in your seat, raise your hand to speak, and keep your eyes on your work or the speaker. Green rules are used for other activities designed to allow for more free communication and may include rules such as use your inside voice and keep your hands and feet to yourself. Visual prompts (e.g., a physical color wheel and rule posters) are publicly displayed so that at any given moment in time, everyone in the class can know which set of rules is in place.

Even experienced educators have difficulty managing student behavior during transitions (Codding & Smyth, 2008; Hine, Ardoin, & Foster, 2015). The CWS allows educators to consistently apply procedures designed to transition the class from one set of rules to another as it transitions from one activity to another (Skinner et al., 2007). The red rules are designed for transition and include keep your desk cleared, keep your eyes on the speaker, and no talking. When changing from one activity to another, teachers provide temporal warnings (e.g., “The color wheel will be turned to red in 2 minutes.”), which cues students to stop what they are doing and put materials away before receiving directions and instructions for the next activity (Skinner et al., 2007). If students are following red rules, they will be attending to teacher-delivered directions for the next activity, allowing the teacher to quickly introduce the next activity and turn the color wheel to the appropriate color to begin this activity. For example, when transitioning from reading to spelling, the teacher may turn the wheel to red and inform the class that for their next activity they are to take out a pencil and their spelling workbook and turn to page 21. When the teacher turns the wheel to yellow, a student may raise his hand to have needs addressed (e.g., state, “I forgot a pencil.”). Addressing such needs before beginning the lesson should allow the lesson to proceed with fewer disruptions (Saecker et al., 2008).

Evidence suggests that the CWS is effective and modifiable across settings, teachers, and general education classrooms (Skinner & Skinner, 2007). A number of consultants evaluated the CWS using procedures (e.g., A-B designs) that did not control for threats to internal validity (e.g., Below, Skinner, Skinner, Sorrell, & Irwin, 2008; Choate, Skinner, Fearrington, Kohler, & Skolits, 2008; Hautau, Skinner, Pfaffman, Foster, & Clark, 2008; Saecker et al., 2008). Others applied procedures that allowed them to draw stronger cause-and-effect conclusions. Researchers used A-B-A-B withdrawal designs to decrease inappropriate vocalizations (Fudge et al., 2007) and enhance on-task behavior (Fudge et al., 2008) in elementary classrooms. Watson et al. (2016) used a multiple-baseline design to evaluate the CWS in three kindergarten classrooms. For each class, results showed meaningful reductions in inappropriate vocalizations after the CWS was applied. Blondin, Skinner, Parkhurst, Wood, and Snyder (2012) evaluated a modified CWS system that included a fourth set of blue rules designed for collaborative learning activities (i.e., quiet voices, talk about work only, ask three before me, hands and feet to self, follow directions) and found increases in on-task behavior in a fourth-grade classroom.

Several factors suggest that the CWS may be effective when working with students with autism spectrum disorder (ASD). Many students with ASD engage in high rates of disruptive behavior, including out-of-seat behavior and inappropriate comments or noises (Conroy, Asmus, Boyd, Ladwig, & Sellers, 2007; Machalicek, O’Reilly, Beretvas, Sigafoos, & Lancioni, 2007; Reese, Richman, Belmont, & Morse, 2005). These behaviors often distract classmates, disrupt educators who are trying to teach, and are incompatible with desired behaviors, such as attending to teacher instruction (Scattone, Wilczynski, Edwards, & Rabian, 2002). Additionally, students with ASD may be more likely to engage in undesired behaviors during transitions (McCoy, Mathur, & Czoka, 2010). As the CWS has reduced general education students’ inappropriate behaviors during transitions, it may also be effective with students with ASD.

Some students with ASD are easily distracted, have difficulty ceasing or stopping specific activities, and have difficulty attending (Chien et al., 2015; Goldstein, Johnson, & Minshew, 2001; Kleinhans, Akshoomoff, & Delis, 2005; Lequia, Wilkerson, Kim, & Lyons, 2015; Mostert-Kerckhoffs, Staal, Houben, & Jonge, 2015; Occelli, Esposito, Venuti, Arduino, & Zampini, 2013; Rydzewska, 2016). These students may benefit from the consistent procedures (e.g., temporal warnings that the color wheel will be turned to red in 2 min) used to elicit stopping one activity and to enhance attending (e.g., having desks cleared and eyes on the teacher as she introduces the next activity). Students with ASD have responded well to visual stimuli (Bedford, Pellicano, Mareschal, & Nardini, 2016; Cihak, 2011; Ganz & Flores, 2010; Pierce, Spriggs, Gast, & Luscre, 2013). The CWS provides a visual indicator of behavioral expectation at all times (i.e., the displayed color), as well as verbal prompts (e.g., temporal warnings when switching to red) that allow students to participate regularly in routine transitions.

In previous studies, researchers used description and demonstration (e.g., role-play with different scenarios) to teach students CWS procedures. For some students with ASD, additional strategies may be needed (Gray, 2000). Researchers investigating other complex behaviors have found evidence that social stories, when used with other research-supported behavioral procedures, may be an effective tool for teaching students with ASD how to behave in social situations (Karkhaneh et al., 2010; Rhodes, 2014). Social stories are short stories, which may also incorporate pictorial cues, designed to enhance students’ ability to interpret complex social situations and understand the appropriate behavioral response (Sansosti, Powell-Smith, & Kincaid, 2004).

Although researchers have evaluated the CWS across general education elementary school classrooms, this system has not been evaluated in classrooms serving students with ASD. The current study was designed to extend research on the CWS to students with ASD while using social stories to teach students CWS procedures. Specifically, a multiple-baseline design across self-contained elementary school classrooms was used to evaluate the effects of the CWS on inappropriate behaviors. In one classroom, the primary inappropriate behavior measured was out-of-seat behavior; in the other two classrooms, it was inappropriate vocalizations.

Method

Participants and Setting

The study was conducted in two kindergarten through second-grade (K–2) classrooms and one third- through fifth-grade (3–5) classroom in a self-contained school for students with ASD. Participants were 21 students who were placed in the school based on their current individualized education programs (IEPs). The school provided more intensive services than available at the home district. All students had a diagnosis of ASD and could follow one-step directions. Classroom A, a K–2 room, included six students, four boys and two girls, aged 5–8. All Classroom A students were considered nonvocal, as they did not communicate verbally through words. Classroom B, a room with grades 3–5, included five boys and three girls, aged 8–11. Classroom C, a K–2 room, had five boys and two girls, aged 5–8. All students in Classrooms B and C communicated their needs vocally through words.

Each classroom had one lead teacher and at least one teaching assistant. Other teaching assistants rotated between the classrooms as needed. All teachers possessed at least a bachelor’s degree in special education, early education, psychology, or a related field. Academic programming in each classroom was similar to that of same-aged peers; students were exposed to the Common Core Curriculum with extended standards. All students received daily individual behavior-analytic instruction and some had individual behavior management plans. Procedures used in the classrooms included structured and incidental teaching during formal group activities and play times, discrete trial training, working walks, token systems, visual supports, consistent reinforcement, prompting hierarchies, shaping and chaining of complex behaviors, ongoing data collection, and formal behavior plans to address severe challenging behaviors.

Materials

As opposed to the traditional pie-shaped paper-wedge color wheel, we chose to use preconstructed plastic traffic lights purchased through Italtrike® to serve as the color wheel (see Below et al., 2008). The traffic light had three colors (red, yellow, and green) and the teacher changed the color by turning the top clockwise until the desired color was reached. In Classrooms B and C, the teachers chose to mount the traffic light on the 3-ft pole included with the traffic light, but the teacher in Classroom A chose not to use the pole and instead mounted the traffic light directly onto the stand.

Red, yellow, and green colored poster boards were created with the corresponding rules written on each poster board. Accompanying the written rules was a photograph of students responding to each set of rules. The teachers were encouraged to adapt the rules for their classrooms. Table 1 displays the rules for each classroom. Teachers from Classroom A chose slightly different rules for yellow as their children were considered nonvocal.

Table 1 CWS Rules for each classroom

Design and Dependent Variables

A multiple-baseline across-classrooms design was used to evaluate the effects of the CWS on disruptive behavior. For all experimental phases, data were collected by trained graduate students using 20-s partial-interval time-sampling procedures. Each graduate student entered the classroom and sat in a designated area. During the 20-min session, 20-s intervals were signaled using a smartphone app with attached earbud headphones. Data were recorded by writing a slash over each interval in which any student engaged in the disruptive behavior during that interval.

During a modified problem-identification interview (Bergan, 1977), each teacher selected which disruptive behavior was measured in her classroom. Classroom B and C teachers selected inappropriate vocalizations. Inappropriate vocalizations were defined as talking or making noises (e.g., humming, singing, and grunting) without permission. Examples of inappropriate vocalizations included talking while the teacher was talking, talking during independent seatwork, or calling out an answer. Nonexamples included answering a question after being called on by the teacher or talking quietly when placed in small groups to work.

Classroom A contained only students considered nonvocal. Because the teacher did not want to reduce student talking, she selected out-of-seat behavior. Out-of-seat behavior was defined as the student’s bottom off his or her assigned chair or area (e.g., carpet) when the student was not given permission to move. Some examples of out-of-seat behavior include walking around the classroom, crawling on the floor, and taking another student’s chair. A nonexample was moving from the table to the carpet area when instructed to do so by the teacher.

Procedures

Soliciting Participants and Training Observers

At the end of the previous school year, the primary researcher conducted a 60-min workshop describing CWS procedures and results from studies evaluating the CWS. All three participating teachers attended the workshop and expressed interest in applying CWS procedures the following school year. Approximately six weeks into the school year, researchers observed each classroom and developed operational definitions and data-recording systems. Data were collected by the primary researcher and a secondary researcher who collected interobserver agreement data during 29% of the sessions. The data collectors were graduate students with experience collecting direct-observation data in classrooms. The data collectors were systematically trained on the CWS using the implementation guide developed by Skinner et al. (2007), as well as on the definition of the dependent variables and how to collect data using the time-sampling procedure. Data collectors practiced recording data during two 20-min in vivo sessions and were provided feedback. Practice ceased when interobserver agreement between the observers exceeded 90%; the observers were then cleared to collect data used for the current study.

Baseline

During 20-min sessions, researchers used 20-s partial-interval time sampling to record disruptive behaviors. In each classroom, data were collected between 8:30 a.m. and 10:30 a.m. on Mondays, Wednesdays, and Thursdays. For each classroom, specific 20-min sessions were selected because academic activities were scheduled during those sessions, and approximately 10 min into each session, the classroom typically transitioned from one activity to another. Activities typically observed during these sessions were whole-group and small-group academic tasks. The observer(s) sat at the back of the classroom with a data-collection sheet on a clipboard and a smartphone with an earpiece used to signal each 20-s interval.

An interval was scored if the disruptive behavior (out-of-seat behavior or inappropriate vocalizations) was observed at any time during that 20-s interval. This phase lasted a minimum of four sessions per class. The primary dependent variable was the percentage of intervals where any student in the class engaged in the inappropriate behavior. Class-wide data were plotted on a repeated-measures graph, and visual analysis of trends, levels, and variability was used to decide when to apply the intervention to each class.

Teacher and Student Training

For each classroom, after the baseline phase ended, the researcher implemented teacher training (one 60-min session), student training (one 45-min session), and 1 practice day over 3 consecutive school days. Teacher training was staggered across classrooms, was conducted during planning time, and included the teacher and assigned teaching assistant(s). The primary researcher described and demonstrated how to apply CWS procedures. Next, the teachers and researcher reviewed daily schedules and discussed what rules (colors) were appropriate for what activities. Finally, the teacher and teaching assistants practiced using the CWS procedures in different scenarios while others (e.g., researchers, teaching assistants) played the role of students.

Several key concepts were emphasized during the training (Skinner & Fudge, 2006), which had already been introduced during the initial workshop. Teachers were reminded to consistently use red when transitioning and to always provide 2-min and 30-s warnings before changing the traffic light to red. The researcher also indicated that red rules were the most difficult to follow; consequently, time on red to should be kept to a minimum (e.g., after moving to red, quickly introduce the next activity and move the wheel to yellow or green). The researcher emphasized that yellow rules are often used for teaching and learning time: when students attend to teacher instruction, complete individual assignments, write or read, and raise their hands to ask or answer questions. They were reminded that green rules were designed to set behavior expectations for free time or group activities.

Teachers were reminded that the CWS is designed to enhance rule following, not improve their ability to detect and punish rule breaking (Fudge et al., 2008). Thus, teachers were encouraged to frequently praise rule following with verbal praise and tangible rewards; when students were not following rules, teachers were encouraged to gently prompt rule following (e.g., remind students of the rules for yellow, or point to the traffic light). Teachers were told that time on red should never be used for punishment, but time on green could be used as a group reward. Finally, the teachers were reminded that although the CWS is a group intervention, individual behavior plans should always be followed when necessary. Teacher mastery of the training was completed when the teachers and assistants could accurately identify all components of the CWS procedure, answer scenario questions posed by the researchers, and accurately role-play using the CWS in two classroom situations.

During the first 45 min of the following day, the researcher and teachers introduced the CWS to the students. A social story (see Appendix), supplemented with colored photographs of the traffic light, was used to assist in introducing the CWS to the students. Each page of the social story was accompanied by a photograph to help students visually understand the story. The researcher and teacher then helped the class practice examples and nonexamples of rules for each color. For instance, the class showed the teacher how to behave during red rules and then how not to behave during red rules. When prompted, all students demonstrated appropriate behavior for each set of rules. Next, the class practiced transitions from each color. Training was complete once the class could successfully transition from one activity to another using each transition combination available (i.e., yellow to red, green to red, red to yellow, and red to green) as the teacher implemented CWS procedures correctly.

For each class, the researcher returned the following day to observe the CWS being implemented, to answer any questions from teachers or assistants, and to provide feedback and prompts as the teachers applied CWS procedures. At the end of this practice day, all students were able to follow the procedures without much additional prompting and the student training was concluded.

Implementation and Maintenance

The following day, researchers continued to collect intervention-phase data using procedures identical to baseline data-collection procedures. Teachers and teaching assistants were responsible for implementing CWS procedures in their classrooms. At the beginning of each intervention day, the teacher read the social story to the class and then the teacher and assistants followed the CWS procedures during all daily routines while in the classroom. Throughout the intervention phase, the primary researcher provided ongoing feedback to teachers and gave suggestions of how to increase correct use of the CWS. The researcher talked to the teachers after approximately 60% of the sessions to provide reminders such as to always go to red before yellow or green, to remember to use the 2-min and 30-s warnings, and to keep time on red brief. Several times, the researcher and teacher discussed periods during the observed class session that could have been conducted differently to adhere to the CWS. Following the last intervention-phase session, the researcher asked the teachers to continue using the CWS in their classrooms, and all agreed to keep the system in place. Researchers returned twice per week for the next 4 weeks to collect maintenance data on disruptive behaviors. During maintenance, no additional prompting or feedback was provided to the teachers. Data-collection procedures were identical to those used during baseline.

Analysis

For each session, the percentage of intervals in which the primary observer scored any occurrence of the problem behavior across the entire class was divided by the number of intervals observed and was displayed on a repeated-measures graph. Visual analysis of this graph was used to determine when to change phases during the study and to interpret results after the study was completed. Analysis of treatment effects were based on within-classroom, across-phase comparisons and focused on changes in level, variability, and trends. Across-classroom and across-phase comparisons were used to evaluate the influence of threats to internal validity (Kazdin, 2011).

Visual analysis of the repeated-measures graph was supplemented with statistical analysis including effect size calculations. Specifically, for each classroom, percentage of data points exceeding the median (PEM) was calculated by determining the percentage of data points in the intervention and maintenance phases exceeding the median data point in the baseline phase and then dividing by 100 (Ma, 2006). Although Ma (2006) suggested PEM interpretation guidelines, more conservative guidelines for nonparametric effect size values were applied (Bruni et al., 2017; Parker & Vannest, 2009).

Interobserver Agreement and Acceptability

For each class, two observers scored inappropriate behaviors, and interobserver agreement was calculated for 20% or more sessions per phase. In total, 29% of all sessions were checked for interobserver agreement, not including training sessions for the observers. Interobserver agreement was calculated on an interval-by-interval basis by dividing the number of agreements per session by the number of disagreements plus agreements and then multiplying by 100. Average interobserver agreement scores for Classrooms A, B, and C were 99.7% (range 96.7%–100%), 99.2% (range 95%–100%), and 98.3% (range 90%–100%).

After all data were collected, each teacher completed an 11-item acceptability measure that was created in conjunction with the teachers and principal at the school. The principal participated because she wanted data that would assist her in determining if the CWS should be used in other classrooms. Additionally, a four-item student acceptability measure was read to all students. This form used a smiley face and a frowny face to indicate responses of “yes” and “no,” respectively, to the questions. Two students from Classroom B did not complete the student acceptability form.

Results

For each classroom, Table 2 provides descriptive statistics for inappropriate behaviors by phases. Figure 1 provides the percentage of intervals scored for disruptive behaviors for each session. Baseline data for Classroom A were variable and showed an increasing trend in out-of-seat behavior ranging from 13% to 33% (M = 23.8%). The percentage of intervals scored with out-of-seat behavior decreased immediately after the intervention phase began and showed a steadily decreasing trend, followed by eight sessions of increased variability. During the intervention phase, class-wide out-of-seat behavior ranged from 6% to 31% (M = 15.4%). Maintenance data were generally lower and more stable than baseline- or intervention-phase data and ranged from 1% to 15% (M = 11.4%).

Table 2 Percentage of intervals with inappropriate behaviors per phase and effect size measures between phases
Fig. 1
figure 1

Percentage of intervals where disruptive behaviors occurred during baseline, intervention, and maintenance phases

In Classroom B, percentage of intervals with inappropriate vocalizations during baseline ranged from 71% to 90% (M = 80.2%). Initial baseline data were variable, followed by an increasing trend in class-wide inappropriate vocalization. During the intervention phase, there was a gradual, but fairly consistent (the exceptions were Sessions 12 and 13) decline in inappropriate vocalizations. During the intervention phase, inappropriate vocalizations ranged from 43% to 91% (M = 70.2%). Although the first maintenance session was higher than the last three intervention-phase data points, other maintenance-phase data were consistent with intervention-phase data and showed a decreasing trend in inappropriate vocalizations. Maintenance-phase data ranged from 33% to 75% (M = 50.3%).

Classroom C baseline data were variable (M = 68.2%; range 55%–81%), with an increasing trend. Intervals with inappropriate vocalizations immediately decreased after the CWS intervention was implemented and remained lower than all baseline data. Intervention-phase data were variable (M = 45.3%; range 30%–53%), with no consistent trend. Maintenance-phase data showed the least amount of variability, ranging from 23% to 36% (M = 29.7%), and all sessions were lower than the lowest baseline session. Visual analysis of Fig. 1 shows that after the CWS was applied to one classroom, other classrooms showed no concomitant decreases in inappropriate behavior.

Statistical Analysis

Table 2 displays descriptive data for each phase across classrooms. Analysis of mean data across all three classrooms shows a consistent pattern. For each classroom, average inappropriate behaviors decreased during the intervention phase and were lowest during the maintenance phase, which indicates a general decline in inappropriate behaviors over time after the CWS was implemented. Generally, out-of-seat behavior in Classroom A was lower than inappropriate vocalizations in Classrooms B and C. When comparing percentage change from baseline to intervention phases and from baseline to maintenance phases, Classroom B showed the smallest changes.

For each classroom, effect size estimates (i.e., PEM) were calculated comparing baseline-phase data to both intervention- and maintenance-phase data (see Table 2). PEM values from .93 to 1 were considered large, from .66 to .92 were medium, and below .65 were small (Bruni et al., 2017; Parker & Vannest, 2009). When comparing baseline phases to intervention phases, effect sizes for Classrooms A and B were medium and the effect size for Classroom C was large. Baseline to maintenance comparisons for all three classrooms indicated large effect sizes (PEM = 1) with no maintenance points for any classroom exceeding the median baseline point.

Student and Teacher Acceptability

All teachers and students were given acceptability measures at the completion of maintenance data collection. Table 3 displays the ratio of students agreeing with each statement. The four questions were administered by paper with each question typed, and then smiley and frowny faces below each question, and “Yes” or “No” typed below the corresponding face. After the teacher explained the task to the students, the primary researcher read each question while the teacher walked around the room helping the students complete the measure. In all classrooms, the majority of students liked using the CWS, thought the CWS was fun, and felt the CWS helped with class rules. In Classrooms B and C, the majority of students wanted to keep using the CWS, but in Classroom A, only half of the students wanted to keep using the CWS.

Table 3 Ratio of student agreement per classroom for statements measuring student acceptability of intervention

The teacher acceptability data were more variable than the student acceptability data. Table 4 displays the number of teachers in each classroom agreeing with statements measuring teacher acceptability of the intervention. In Classrooms A and C, the teachers rated all items as either “Yes” or “Maybe.” All teachers in Classrooms A and C indicated that they noticed positive changes in their students’ behaviors and would recommend the CWS to other teachers. Teachers in Classroom B generally rated the CWS negatively. Both teachers rated many items as “No” and did not feel that the CWS was beneficial to their students.

Table 4 Tally of teacher agreement for statements measuring teacher acceptability of intervention

Discussion

The current study was designed to replicate CWS research and extend this research to students with ASD. Visual analysis of our repeated-measures graph shows immediate decreases and/or decreasing trends in inappropriate behavior after CWS procedures were applied to each class and no concomitant decreases in classrooms still in baseline. This pattern suggests that the CWS, as opposed to other confounding variables, caused decreases in disruptive behavior in all three self-contained classrooms (Kazdin, 2011). These results support previous findings, which suggested that the CWS reduced inappropriate behaviors in general education classrooms (e.g., Fudge et al., 2007; Watson et al., 2016).

PEM suggests that the decreases in inappropriate behavior from baseline phase to intervention phase were medium for Classrooms A and B and large for Classroom C. More support for meaningful effects was found when comparing baseline-phase data to maintenance-phase data, which revealed large effects (PEM = 1) for each classroom. Additionally, when comparing phase means, a consistent pattern emerged; for each classroom, the highest phase means occurred during baseline, intervention means were lower than baseline, and maintenance means were the lowest. These data, along with visual analysis of Fig. 1, which suggests that the CWS was more effective as the study progressed, provide directions for future researchers.

Researchers should determine if the CWS is more effective or more consistently effective when students become more experienced and familiar with CWS routine and rules (Fudge et al., 2008). An alternative explanation is that teachers became more skilled at applying CWS procedures (e.g., applying procedures more consistently and with more integrity). In the current study, after the CWS was implemented, researchers failed to collect treatment integrity data; rather, during the intervention phase, they provided regular feedback regarding treatment integrity and suggestions on how to enhance integrity. This need for additional prompts and feedback suggests that future researchers should conduct more intensive training sessions and include more in vivo training opportunities before beginning to collect intervention-phase data (Fudge et al., 2007). Additionally, future researchers should collect treatment integrity data so that the relationships between treatment integrity and other factors (e.g., teacher training, treatment effectiveness, treatment acceptability) can be explored. Although students were reminded of the rules and praised or rewarded for rule following, the rate of praise was not specified and the consistency of praise was not measured. The medium effects of the CWS in Classroom B may be due to using reminders that functioned as unintended reinforcement for breaking the rules, giving praise to the students that was not reinforcing the rule following, or praising inconsistently.

Other limitations and directions for future researchers should be addressed. In the current study, the dependent variable for Classrooms B and C was inappropriate vocalization, and for Classroom A, whose students were considered nonvocal, the dependent variable was out-of-seat behavior. Future researchers should consider measuring the same inappropriate behaviors across classrooms. Also, researchers should consider measuring appropriate behaviors associated with learning, including on-task behavior (Fudge et al., 2008) and attending to and following teacher directions and instruction (Saecker et al., 2008).

Researchers have demonstrated how the CWS could be modified by using different rules, more rules, different stimuli, and the application of group contingencies (Below et al., 2008; Blondin et al., 2012; Hautau et al., 2008; Skinner & Skinner, 2007). In the current study, the procedures were modified to include the use of a social story to introduce the intervention to students. Although using social stories to teach CWS procedures was unique, this component was not formally evaluated. Researchers should determine if social stories can help students with ASD and other disabilities learn the rules and routines associated with the CWS.

Researchers should consider evaluating other components of the CWS. Previous researchers found that students with ASD often respond well to visual prompts (Ganz & Flores, 2010; Pierce et al., 2013). Thus, researchers could determine if having the behavioral expectations displayed at all times (i.e., the color displayed on the traffic light indicating behavioral expectations) enhanced students’ knowledge of behavioral expectations and consequently their rule-following behaviors. Perhaps the visual presence of the color on the traffic light serving as a visual-discriminative stimulus was related to the specific behavioral contingency in place, thus improving discriminability and stimulus control. Also, researchers should determine if having the light displayed enhanced teachers’ ability to consistently support desired behaviors (Fudge et al., 2007). As students with ASD often respond well to routine transitions (Cihak, 2011), researchers may want to evaluate the consistent application of transition procedures. Students with ASD often have difficulty attending (Chien et al., 2015). As the rules for red are designed to enhance attention to directions and instructions for the next activity (e.g., keep your desk cleared, keep your eyes on the speaker), researchers could determine if the temporal warnings and red rules enhanced students’ direction following and reduced the need for teachers to repeat directions (Saecker et al., 2008).

The teachers in the current study indicated other directions for future research. Several teachers queried whether a visual timer, such as a SMART board countdown clock, could be used in addition to a verbal temporal warning. Teachers also indicated that during class time the students appeared to respond to the physical presence of the stop light. Researchers may want to determine if enhancing the conspicuousness of the rule indicator (e.g., a wheel or, in this case, a traffic light) or the temporal cues (e.g., a countdown clock) enhances the effectiveness of the CWS. Interestingly, Teacher A may have successfully enhanced the conspicuousness of the traffic light by carrying it with her.

Across participants, acceptability measures showed that students and teachers generally rated the CWS positively; however, Classroom B teacher ratings were negative. During intervention and maintenance phases, Classrooms A and C showed larger average percentage decreases in inappropriate behaviors relative to baseline than did Classroom B. Thus, the lower acceptability rating may be related to effectiveness. However, no causal relationships can be drawn because other unmeasured factors may explain this relationship. For example, weak treatment integrity may have caused low effectiveness and acceptability. Additional research designed to evaluate these relationships must include treatment fidelity measures. The CWS has been shown to reduce undesired behavior in general education kindergarten students and enhance on-task behavior in general education elementary students (e.g., Fudge et al., 2008; Watson et al., 2016). The participating students with ASD also appeared to learn and respond to the three sets of rules, and the authors found evidence of treatment effects. These findings suggest the need for additional studies designed to enhance the effectiveness of CWS procedures when working with students with ASD. More longitudinal studies should be conducted to determine if the CWS accelerates learning by preventing behavior problems that reduce teaching and learning time while enhancing behaviors (e.g., direction following, academic engagement) associated with learning (Watson et al., 2016). Finally, researchers should continue to adapt and evaluate CWS procedures and determine if these procedures could be effective in inclusive classrooms and with students with other disabilities.

Implications for Practice

  • Applies the CWS for use with students with ASD;

  • Investigates the use of the CWS for both nonvocal and vocal students;

  • Describes how teachers/researchers can implement the CWS in other classrooms;

  • Incorporates social stories into the student training procedures.