Keywords

1 Introduction

Interpersonal distance is an important factor in nonverbal communication. High demands exist for quantitative evaluation of nonverbal communication in social skills development training for children with autism spectrum disorders (ASD). In the field of psychology, researchers have evaluated personal space and interpersonal distance for children with ASD who have difficulties using nonverbal cues [13]. In these prior studies, interpersonal distance data were collected by human observers reviewing videos. Data captured using this human-based method was investment of time. Regarding intervention for children with ASD, an easy and quantitative understanding of children’s behavior and dynamic response–such as approaching or avoiding in response to each therapeutic activity–is essential. This study proposes a technology-based method for the quantitative estimation of interpersonal distances and face-to-face behavior. The technology, Social Imaging [4], was used to identify and represent social behaviors. We designed a soft marker cap that can be easily worn by children and used with the motion capture technology to record changes in interpersonal distance between the child with ASD and their therapist. This prototype measurement system was introduced in a practicing therapy room and captured ongoing therapy for children with ASD. In this paper, we report the analysis results regarding the quantitative estimation of interpersonal distance from captured data.

2 Method

2.1 Measurement

The most common quantitative evaluation method for interpersonal distance is the stop-distance method [1]. This method requires two persons to approach one another until they feel discomfort or unpleasantness; the distance between their toe positions at this final point is measured. Although commonly used, this method presents an unnatural situation for measuring interpersonal distance. Motion capture systems, similar to the one proposed in this study, show promise for accurately measuring interpersonal distance in a natural environment without observation. However, many ASD children have hypersensitivity [5, 6] that makes them resist multiple motion capture markers on their body. In Japan, children wear caps in school and during physical education. Most ASD children are also accustomed to wearing a cap without elastic straps. Based on these observations, Fig. 1 shows the marker cap developed in this study capable of measuring children’s position and orientation.

Fig. 1.
figure 1

Soft marker cap for motion capture (Color figure online)

Optitrack’s Flex3 motion capture system and Motive software [7] were used in this study. When estimating interpersonal distance, close (approaching) and away (avoiding) motions for both children and therapists are important. Equations (13) describe these motions, where C(t) is the position of the child and T(t) is the position of the therapist, both at time t.

$$\begin{aligned} D(t)= & {} \frac{\alpha (t) }{\beta (t) }\end{aligned}$$
(1)
$$\begin{aligned} \alpha (t)= & {} \varDelta (C(t)-T(t)) \end{aligned}$$
(2)
$$\begin{aligned} \beta (t)= & {} \varDelta (C(t)-C(t-\tau )) \end{aligned}$$
(3)

If both the child and therapist remain stationary (do not move), \(\alpha = 0 \cap \beta =~0\). If only the child moves, \(\alpha \ne \beta \ne 0 \cap \alpha - \beta = 0\). If only the therapist moves, \(\alpha > 0 \cap \beta = 0\). If both the child and therapist move, \(\alpha ( t) - \alpha (t - \tau ) = 0 \cap \beta \ne 0\). If \(D(t) > 0\), the subjects are approaching. If \(D(t) < 0\), the subjects are avoiding. Motion capture technologies can also calculate position and orientation. Researchers in one prior study determined the field of view (FOV) for face-to-face communication to be 129 [8]. This study used this criterion when estimating interpersonal distances with and without face-to-face communication.

2.2 Experimentation

Figure 2 shows the experimental setup used in this study to quantitatively estimate interpersonal distance. Children with ASD and their therapists wore the marker cap (shown previously in Fig. 1) for motion capture during the one-hour therapy session. Four children with ASD participated in this experiment. Table 1 summarizes participant characteristics. All participants were male with chronological ages (CA) ranging from approximately 4 to 6 years old. Developmental ages (DA)-derived using the Kyoto Scale of Psychological Development [9]-ranged from approximately 2 to 4 years old. The Kyoto Scale scores various subcategories of physical motion, verbal-social skills, cognitive-adaptation skills, and total developmental age. Participants were also assessed using the Pervasive Development Disorders Autism Society Japan Rating Scale (PARS) [10] and the Childhood Autism Rating Scale (CARS) [11]. Both assessments are used to confirm ASD diagnosis. This experiment was approved by the Institutional Review Board, and participants joined in the experiment after providing informed consent.

Both two video cameras for human coder (professional psychologist) and the motion capture system recorded motion behavior of the child participants and their therapists during the standing therapy sessions. Physical motion under 0.5 m/s is considered stagnant movement in preschool-aged children [12]. The human coder observed and recorded (in ms) predefined behaviors during therapy activities. Table 2 lists the predefined behaviors captured by the human observer. The frame rate of motion capture for the technology was set to 60 fps.

Fig. 2.
figure 2

Experimental setup in therapy room

Table 1. Participant characteristics
Table 2. Predefined behaviors observed by human coder

3 Result

Figures 3, 4, 5 and 6 show the participating children’s approach (value in Eqs. 13) and face-to-face behavior recorded by both the human coder and motion capture system. Table 3 shows the rate of agreement between the human coder and the motion capture system when recording approaching behavior. For each of the participants, behavior data captured by the human coder and technology were similar when recording approaching behavior with one exception–participant B (Fig. 4). Comparatively, the rate of agreement when recording face-to-face behaviors was not as high.

Fig. 3.
figure 3

Participant A results for approaching and face-to-face behavior

Fig. 4.
figure 4

Participant B results for approaching and face-to-face behavior

Fig. 5.
figure 5

Participant C results for approaching and face-to-face behavior

Fig. 6.
figure 6

Participant D results for approaching and face-to-face behavior

Table 3. Agreement rate between human- and technology-based methods when recording approaching behavior

4 Discussion

This study confirmed that approaching behavior can be effectively captured using the proposed motion capture system. Additionally, we confirmed that the system can dynamically estimate changes in interpersonal distance, but the agreement rate is not as high likely because of the limited capture frame. More differences were observed between the human coder and technology when recording face-to-face behavior. The human observer considered face-to-face behavior for a shorter period of time compared with the motion capture system in many cases. The obtained motion data may be used to develop an advanced behavior model that improves face-to-face behavior recognition. In addition, the marker cap was occasionally out of alignment due to the actions of the children. Design enhancements are needed to reduce this occurrence.

5 Conclusion

In this study, a method and prototype system for quantitatively estimating interpersonal distance by using motion capture technology and a marker cap for children with ASD was proposed. Future work includes design improvements to the marker cap so that it remains in place and further analysis of the experimental data to improve upon estimation methods.