Introduction

The frontal aslant tract (FAT) is a newly identified tract (Catani et al. 2012) that connects the inferior frontal gyrus (IFG) with supplementary and pre- supplementary motor areas (SMA and pre-SMA, respectively). Recently, FAT was introduced as part of a “motor stream” that plays an important role in speech production (Dick et al. 2013), complementing the accepted dorsal and ventral language streams (Hickok and Poeppel 2007). Several recent studies suggest that FAT plays a role in speech production. First, the volume of the FAT is left lateralized in right handed individuals (Catani et al. 2012), similar to other language-related pathways such as the long segment of the superior longitudinal fasciculus (Thiebaut de Schotten et al. 2011). Second, intraoperative electrical stimulation of the left FAT results in speech arrest (Vassal et al. 2014). In addition, in primary progressive aphasia, microstructural measures of the left FAT correlate with speech fluency measured using a story telling task (Catani et al. 2013). In this study, we specifically target the FAT and examine its contribution to persistent developmental stuttering, a speech disorder that affects a person’s ability to fluently produce speech.

Persistent developmental stuttering is a speech disorder primarily characterized by prolongations, blocks and repetitions of sounds or syllables that occur while a person attempts to produce speech. These breakdowns in the production of fluent speech (stuttering events) are sometimes associated with other symptoms such as eye blinking, jaw jerking and other involuntary movements termed secondary or associated behaviors (Bloodstein and Ratner 2008). While this description of stuttering events clearly depends on motor aspects of speech production, it has previously been postulated that multiple factors interact in producing these disfluencies (Smith 1999; Smith et al. 2012). Specifically, one theoretical approach suggests that the motor difficulties observed during stuttering events are in fact the result of a core deficit in linguistic processing (Perkins et al. 1991; Postma and Kolk 1993). Persistent developmental stuttering therefore lies at the border between several systems including the motor and language networks, thus providing a unique opportunity to study the interface between these domains.

Producing fluent speech requires precise temporal coordination between distant brain regions. Indeed, functional imaging studies of persistent developmental stuttering, in which the fluent production is disturbed, indicate atypical patterns of activation in a wide network of cortical and sub-cortical regions including the right frontal operculum, bilateral auditory cortices, cerebellum and basal ganglia (for reviews, see Alm 2004; Brown et al. 2005). The distributed nature of these functional differences suggests that persistent developmental stuttering may be associated with atypical properties of white matter pathways, not only with specific localized cortical damage. Indeed, in the last decade, several studies have examined structural white matter differences between people who stutter and neurotypical controls.

The most replicable finding in studies of white matter in persistent developmental stuttering concerns fractional anisotropy (FA) reductions in the left Rolandic operculum (Chang et al. 2008; Connally et al. 2013; Cykowski et al. 2010; Sommer et al. 2002; Watkins et al. 2008), located caudally to Brodmann area 44 (BA44) and close to the primary motor representation of tongue, larynx and pharynx. Typically, differences in the left Rolandic operculum were ascribed to the superior longitudinal fasciculus/arcuate fasciculus (Chang et al. 2008; Connally et al. 2013; Cykowski et al. 2010; Watkins et al. 2008), a well-known pathway that is considered part of the language network (Hickok and Poeppel 2007). A second replicable finding is FA reduction in the corticospinal tracts (CSTs) of the motor network. Bilateral FA reductions in the CST were previously reported in children who stutter (Chang et al. 2008) as well as in adults who stutter (Cai et al. 2014). One study in adolescents and young individuals who stutter, reported that these FA differences are restricted to the right CST (Watkins et al. 2008), while another study showed a significant difference between the FA values measured in the left CST and those measured in the right CST of people who stutter compared with the same measurements calculated in controls (Connally et al. 2013). Differences in the left Rolandic operculum as well as in the CST suggest that both language pathways and motor connections are suspected contributors to persistent developmental stuttering.

To the best of our knowledge, ours is the first attempt to examine the role of the FAT in persistent developmental stuttering. However, the potential involvement of this tract in persistent developmental stuttering is implied by previous reports of structural and functional stuttering-related anomalies in the cortical endpoints of this tract: the IFG and the pre-SMA/SMA. Studies in people who stutter show gray matter volume differences in IFG (Beal et al. 2007, 2013; Cai et al. 2014; Chang et al. 2008; Kell et al. 2009) as well as white matter volume differences (Jäncke et al. 2004) and anisotropy reductions (Connally et al. 2013; Watkins et al. 2008) underneath this region. Comparing spontaneous recovery from persistent developmental stuttering with therapy-induced recovery, one study suggested the IFG as the only neural marker of optimal repair (Kell et al. 2009). Structural and functional differences related to persistent developmental stuttering were also reported in the SMA (Brown et al. 2005; Chang et al. 2008, 2011; Lu et al. 2010b). One recent study used graph theory and reported a lower degree of centrality in left SMA of adults who stutter (AWS) compared with fluent controls, indicating that the left SMA may serve as a hub in the speech network of the typical population but not in persistent developmental stuttering (Cai et al. 2014). The functionality of the IFG and SMA in typical populations, along with previous reports of their involvement in persistent developmental stuttering, suggests that the tract that connects these regions, the FAT, may play a role in this disorder.

In this study, we use diffusion magnetic resonance imaging and tractography to study the FAT in persistent developmental stuttering. We identify the FAT bilaterally in a group of AWS as well as in neurotypical adults. In addition, we identify the CST, a well-known motor tract which has previously been related to persistent developmental stuttering. We compare the volume estimations and average diffusion properties of these tracts in AWS vs. controls and complement this analysis with a more sensitive comparison of diffusion properties along the entire extent of the tract. Last, we correlate the diffusivity measures extracted from the tract profiles with a behavioral measure of speech fluency. By studying the functionality of the FAT in persistent developmental stuttering we aim to extend the current knowledge about the involvement of this tract in speech production and contribute to the understanding of the newly described “motor stream” in language.

Methods

Participants

A total of 34 individuals participated in this study. Participants were physically healthy and reported no history of neurological disease or psychiatric disorder. They were all native Hebrew speakers who signed a written informed consent to participate in the study. The research protocol was approved by the Helsinki committee of the Tel-Aviv Sourasky Medical Center and by the ethics committee of the faculty of humanities in Bar-Ilan University.

Participants were assigned to the group of AWS based on the following criteria: (a) a reported history of stuttering since childhood, (b) exhibited a minimum of three stuttering-like disfluencies (SLD; Ambrose and Yairi 1999) per 100 syllables during an unstructured interview (described below) and (c) scored a total of at least 10 on the Stuttering Severity Instrument (SSI-III; Riley 1994). Assignment of participants to the control group was based on their self-report of having no history of stuttering.

To ensure that all participants assigned to the group of AWS were indeed individuals who stutter, two experienced speech pathologists (O.A. and R.E.-V.) were asked to blindly confirm the original classification based on their impression from the audio-visual recording of an unstructured interview (see “Speaking tasks” below). Classification was performed separately by each speech pathologist, and only those participants who were classified by both speech pathologists as individuals who stutter were assigned to the group of AWS. Based on these criteria, 15 participants were assigned to the group of AWS (3 females, mean age 32 years, age range 19–52 years), and 19 were assigned to the control group (3 females, mean age 33 years, age range 19–53 years). Table 1 presents the average demographic characteristics of the AWS and the control participants.

Table 1 Subject demographics and fluency measures

Speaking tasks

To assess the frequency of stuttering, participants were evaluated during two speaking tasks: an unstructured interview and a reading task (Riley 1994).

Unstructured interview

Each participant was seated in a quiet room together with the experimenter (V.K.-D.), and was asked to talk for 10 min about a neutral topic, such as a recent travel experience, a movie or a book. The experimenter was instructed to refrain from interrupting the speaker, and to ask questions only when the participant was having difficulties finding a topic to talk about. The session was recorded simultaneously with a digital video camera (Sony DCR-DVD 106E, Sony Corporation of America, New York, NY, USA) and with a noise canceling microphone (Sennheiser PC21, Sennheiser Electronic Corporation, Berlin, Germany). Audio signals from the microphone were digitally recorded using audio processing software (Goldwave, Inc., St. John’s, Canada), on a mono channel, with a sampling rate of 48 kHz (16 bit).

Reading task

Each participant was seated in a quiet room together with the experimenter and was asked to read aloud one of three paragraphs from the standardized and phonetically balanced Thousand Islands reading passage (Amir and Levine-Yundof 2013). The three paragraphs were of similar size (on average: 200 syllables) and the different paragraphs were assigned to the participants in a random order. This task was recorded using a video camera (see above).

Evaluation of speech fluency

We obtained three measures of speech fluency: (a) Average speech rate, (b) Stuttering-like disfluencies (SLD) and (c) Percent of stuttered syllables. These measures were calculated based on the digital audio recordings of the unstructured interview. The audio recordings were used because of their superior auditory signal-to-noise ratio (SNR) compared to the video recordings, and based on reports that visual information does not improve the reliability of measuring stuttering frequency (MacDonald and Mallard 1979; Williams et al. 1963). To calculate the fluency measures, each interview was transcribed, until a minimum of 600 consecutive syllables was obtained (exact number of syllables slightly differed between participants as only complete sentences were analyzed). Disfluencies were first annotated by two independent trained research assistants and then re-evaluated by a speech pathologist (O.A.). There was a high level of agreement between the raters on the classification of disfluencies, but any cases of disagreement were discussed until full agreement between all raters was reached. To reduce potential bias, both the research assistants and the speech pathologist were blind to the participant’s group assignment (stuttering/control).

The three measures of speech fluency (average speech rate, SLD, percent of stuttered syllables) were calculated as follows: (a) average speech rate was measured in units of syllables per second (SPS). It was calculated as the ratio between the total number of analyzed syllables and the time it took the participant to produce them. This measure was obtained from a visual and audio inspection of the spectrographic display of the speech signal (as discussed in Finkelstein and Amir 2013; Rochman and Amir 2013); (b) SLD was calculated as the number of part-word repetitions, monosyllabic-word repetitions and disrhythmic phonations, per 100 syllables. Other disfluencies like interjections, revisions or phrase repetitions were excluded from this measure (Ambrose and Yairi 1999); (c) the percent of stuttered syllables was calculated based on the syllables that included any type of SLD (Yairi and Ambrose 2005).

Stuttering severity evaluation

We evaluated stuttering severity of each individual of the group of AWS by administering the stuttering severity evaluation instrument (Riley 1994). Following the administration protocol for adults, we evaluated the percent of stuttered syllables based on the transcription of the two speaking tasks (the unstructured interview and the reading task). Stuttering duration scores and physical concomitants were evaluated by two speech pathologists (O.A and R.E.-V.), based on the video recording of the speaking tasks. Taken together, the percent of stuttered syllables, stuttering duration scores and physical concomitants were used to obtain a total score of stuttering severity rate (SSI score).

Image acquisition

Magnetic resonance imaging (MRI) was performed on a 3T General Electric MRI scanner at the Tel-Aviv Sourasky Medical Center. The MRI protocol included standard anatomical and diffusion imaging sequences, acquired with an eight channel head-coil. Participants were asked to lie still during the scan, and their head motion was minimized by placing cushions around their heads. Functional MRI experiments were also included in the scan protocol but those data are not reported here.

T1 image acquisition

High resolution T1 weighted anatomical images were acquired using a 3D fast spoiled gradient echo (FSPGR) sequence. We collected about 150 axial slices (±12 slices), covering the entire cerebrum, with a spatial resolution of 1 × 1 × 1 mm voxel size.

Diffusion weighted image acquisition

A standard DTI protocol was applied by means of a single-shot spin-echo diffusion-weighted echo-planar imaging (DW-EPI) sequence. We collected ~68 axial slices, adjusting the number of slices to cover the entire cerebrum in each participant (FOV = 240 mm; 128 × 128 matrix; 2 mm thick axial slices; voxel size: ~2 × 2 × 2 mm). 19 diffusion-weighted volumes (b = 1,000 s/mm2) and one reference volume (b = 0 s/mm2) were acquired using a standard direction matrix (e.g., Sasson et al. 2010, 2012, 2013). This protocol was repeated twice for an improved signal-to-noise ratio. Scan repetitions were not averaged so that tensors were fit to the entire dataset from both scans (see “Data preprocessing”). Scanning 19 directions twice was motivated by the fact that short scan time (5:50 min per scan) reduces the chances of within-scan motion while maintaining robust anisotropy measurements (Jones 2004).

Software

All data analysis and statistics were performed using Matlab 2012b (The Mathworks, Nattick, MA, USA). For data preprocessing, we used the ‘mrDiffusion’ package (http://white.stanford.edu/newlm/index.php/Software). Tract identification and quantification were executed using ‘AFQ’, an automated segmentation tool (Yeatman et al. 2012).Visual inspection of the tracts and manual cleaning was performed via ‘Quench’, an interactive 3D visualization tool (Akers 2006; http://white.stanford.edu/newlm/index.php/Software#QUENCH).

Data preprocessing

As a first step, T1 images were aligned to the AC–PC orientation. Diffusion weighted images were corrected for Eddy-current distortions and subject motion (Rhode et al. 2004). Each diffusion weighted image was registered to the mean of the two non-diffusion weighted (b0) images and the mean b0 image was registered automatically to the T1 image, using a rigid body mutual information maximization algorithm (implemented in SPM5; Friston and Ashburner 2004). Then, the combined transform resulting from motion correction, eddy current correction and anatomical alignment was applied to the raw diffusion data once, and the data was resampled at exactly 2 × 2 × 2 mm isotropic voxels. By applying the combined transform, we achieved AC–PC aligned T1 registered images while only resampling the raw data once. Next, the table of gradient directions was appropriately adjusted to fit the resampled diffusion data (Leemans and Jones 2009).

We fitted the raw diffusion data with the tensor model using a standard least-squares algorithm. Then, we extracted the eigenvectors and eigenvalues of the tensor and calculated FA as the normalized standard deviation of the eigenvalues (Basser and Pierpaoli 1996). Using the eigenvalues, we also calculated mean diffusivity (MD) as the average of all three eigenvalues. Axial diffusivity (AD) and radial diffusivity (RD) were calculated as complementary measures and were respectively defined as the diffusivity along the principal axis (AD) and as the average diffusivity along the two remaining minor axes (RD).

Tract identification protocol

We identified the FAT and the CST in each participant’s left and right hemispheres. To identify these tracts we used a procedure composed of three steps: (1) whole brain fiber tractography (2) region-of-interest (ROI) based fiber tract segmentation and (3) fiber tract cleaning.

Step 1: whole brain fiber tractography

Whole brain fibers were tracked in the native space of each participant using a deterministic streamlines tracking algorithm (Basser et al. 2000; Mori et al. 1999) with a fourth-order Runge–Kutta path integration method (1 mm fixed step size, 8 seed points per voxel). The tracking algorithm was seeded with a white matter mask of all voxels with FA greater than 0.2 and tracking was halted when FA dropped below 0.15 or if the angle between the last and the next step direction was greater than 30° (Dougherty et al. 2007). Minimum streamline length was set to 20 mm.

Step 2: fiber tract segmentation

We used a multiple ROI approach to delineate the tracts in each participant. To segment the fibers, the whole brain fiber group (obtained in step 1) was intersected with these ROIs using logical operations (AND, NOT). Below we describe the protocol used to identify the ROIs of the FAT and the CST.

FAT

We propose a protocol for identifying the FAT that is based on several previous studies (Catani et al. 2012; Ford et al. 2010; Lawes et al. 2008). Figure 1 illustrates this protocol in an attempt to promote common practices in future studies of this tract.

Fig. 1
figure 1

FAT identification protocol. a Whole brain fiber tractography is overlaid on a sagittal T1 image of a representative participant. b ROIs (in red) are defined on the MNI template (ICBM 2009a Nonlinear Asymmetric; Fonov et al. 2011). The left IFG ROI is overlaid on a sagittal image and the left SMA/pre-SMA ROI is overlaid on an axial image. c The left hemisphere ROIs are shown after they were back transformed from the MNI space to the native space of an individual participant. d The left FAT resulting from the intersection of the whole brain fiber group (illustrated in a) with the back transformed ROIs (shown in c). The tract is displayed in a 3D view, with coronal and axial images added for orientation. FAT frontal aslant tract, IFG inferior frontal gyrus, SMA supplementary motor area, MNI Montreal neurological institute

Two ROIs were defined on the Montreal Neurological Institute (MNI) template (ICBM 2009a Nonlinear Asymmetric template; Fonov et al. 2011). The first ROI was defined on a sagittal slice at the level of x = 45 (for the right tract) or x = −45 (for the left tract; see Fig. 1b, right image). This ROI (IFG) included all voxels above the Sylvian fissure and below the inferior frontal sulcus. Anteriorly, it was bounded by a coronal slice at the anterior end of the pars triangularis as it is seen on sagittal slices x = ±45. Posteriorly, it was bordered by a coronal slice at the most ventral end of the precentral sulcus as it is seen on the same sagittal slices (x = ±45). The second ROI was defined on axial slice z = 45 (see Fig. 1b, left image). This ROI (SMA/pre-SMA), encompassed a rectangle that was medially bordered by the mid-sagittal plane; laterally, it was bordered by a sagittal plane at the most medial point where the precentral sulcus is still seen on z = 45. Anteriorly, this ROI was bordered by a coronal slice at the level of the anterior portion of the genu of the corpus callosum and posteriorly, it was bordered by a coronal slice at the level of the precentral sulcus defined on the mid-sagittal plane.

These four ROIs (left and right IFG, left and right SMA/pre-SMA) were back transformed into an individual’s native space based on a non-linear transformation calculated between each individual’s volume anatomy and the MNI template (as implemented in Yeatman et al. 2012). The whole brain fiber group was intersected with the transformed left ROIs (left IFG, left SMA/pre-SMA) to obtain the left tract and with the right transformed ROIs (right IFG, right SMA/pre-SMA) to obtain the right tract.

CST

The identification of the CST was based on a standard protocol by Wakana and colleagues (2007). All ROIs necessary for the tract identification were anatomically defined in every participant (by V.K.-D.). The first ROI encompassed the cerebral peduncle, marked on an axial plane at the level of the decussation of the superior cerebellar peduncle (see Figure 4 of Wakana et al. 2007). To define the second ROI, we visually inspected the tract that resulted from the intersection of the whole brain fiber group (step 1) with the first ROI. The second ROI was then drawn around the fibers that project to the primary motor cortex, as identified on the most ventral axial slice where the branching of the central sulcus is seen (see Figure 4 of Wakana et al. 2007). The whole brain fiber group was intersected with the left ROIs to obtain the left tract and with the right ROIs to obtain the right tract. As a final step, a logical NOT operation was applied on the resulting tracts with an ROI that covers the whole mid-sagittal plane. This last step was aimed to exclude all tracts that cross the midline via the pontine crossing fibers (Wakana et al. 2007).

Step 3: fiber tract cleaning

To remove outlier tracts, we used an automated cleaning procedure that removed fibers extending over 4 standard deviations from the mean fiber length or spatially deviating more than 5 standard deviations from the core of the tract (see Yeatman et al. 2012 for further details). Next, we manually inspected all the tracts in each individual using a gesture-based interface (‘Quench’, see “Software”) and excluded single fibers that clearly did not fit the tract definition. For example, in the FAT we removed tracts that did not reach the IFG and in the CST we removed tracts that reached primary sensory (rather than motor) cortex.

To make sure that this manual cleaning phase was not biased, we calculated the ratio between the number of streamlines that were manually excluded from the fibers and the number of streamlines that were found in the original tract (i.e. before cleaning). These exclusion ratios were then compared between AWS and controls using two-tailed t tests with unequal variance (see “Results” and Figure S1).

Evaluation of FAT endpoints

To calculate the MNI coordinates of the FAT endpoints for each participant, we first calculated a nonlinear transformation between the T1 volume anatomy of the participant and the MNI T1-template, using the mutual information maximization algorithm implemented in SPM5 (Friston and Ashburner 2004). We then applied the inverse transformation to assign an MNI coordinate to each voxel in the participants’ native space. The centers of mass of the medial and lateral FAT endpoints were calculated for each participant, and their MNI coordinates were averaged across participants. We then used the automated anatomical labeling (AAL) atlas (Tzourio-Mazoyer et al. 2002) to assign anatomical labels to the average MNI coordinates of the medial and lateral FAT endpoints.

Fiber tract quantification and group comparisons

Tract diffusivity measures

For each participant and each tract, tract-FA and tract-MD measures were calculated as the average FA and MD across all voxels covered by the tract, respectively. These diffusion measures were compared between the groups using two-tailed t tests with unequal variance. Following up on these analyses, tract-AD and tract-RD were computed and compared between the groups in a similar fashion. Significance was corrected for 16 comparisons (4 tracts × 4 diffusion measures) using the false discovery rate (FDR) correction (Benjamini and Hochberg 1995), with alpha set at 0.05. To partial out the effect of age, all four diffusion measures were entered into ANCOVAs (analysis of covariance), with Age as a covariate.

Volume estimation

The volume of each tract was estimated as the number of voxels covered by one streamline or more. This volume estimation was divided by the volume estimation of the whole brain fiber group (that was obtained by applying the same procedure to the whole brain fiber group). This normalization process was aimed to assure that differences in tract volume do not reflect a general difference in white matter volume between the groups.

Normalized volume estimates were compared between the groups using two-tailed t tests with unequal variance. The FDR was controlled based on the number of tracts with alpha set at 0.05.

Lateralization indices

Three lateralization indices (LI) were computed for the FAT and CST, comparing volume estimates, FA and MD measures of the left and right tracts. For example, the volume lateralization index (VOL_LI) of the FAT was calculated based on the volume estimates of the left FAT (VOLL_FAT) and the volume estimate of the right FAT (VOLR_FAT) using the following equation:

$${\text{VOL}}\_{\text{LI}}_{\text{FAT}} = ({\text{VOL}}_{{{\text{L}}\_{\text{FAT}}}} - {\text{VOL}}_{{{\text{R}}\_{\text{FAT}}}} )/({\text{VOL}}_{{{\text{L}}\_{\text{FAT}}}} + {\text{VOL}}_{{{\text{R}}\_{\text{FAT}}}} )$$

The lateralization indices were compared between the groups using two-tailed t tests with unequal variance while controlling the FDR based on the number of tracts with alpha set at 0.05.

FA profiles along the tract

FA and MD profiles of each tract were obtained by sampling the full length of the tract at 100 equi-spaced nodes and calculating FA and MD in each node as a weighted average across the streamlines of that tract (Yeatman et al. 2012). Profiles were calculated along the entire length of the tracts, i.e. between the cortical endpoints of the tracts.

FA and MD profiles were compared between the groups using multiple two-tailed t tests. A permutation-based multiple comparisons correction (Nichols and Holmes 2002) was used to calculate the critical cluster of adjacent significant t tests. Significance was corrected for 400 comparisons (4 tracts × 100 nodes in each tract) setting the corrected alpha to 0.05. We report clusters of nodes in which (1) all neighboring nodes significantly differed between the groups at the level of 0.05 (uncorrected) and (2) the cluster of significant values was larger than the critical cluster size (Nichols and Holmes 2002; Yeatman et al. 2012).

Brain–behavior correlations

In clusters showing a significant group difference, we further aimed to examine how individual variability in tract properties predicts behavioral properties. To this end, we extracted diffusivity measures from the clusters of nodes that significantly differed between the groups, and then analyzed the correlation between diffusivity measure and speech fluency within the group of AWS and the group of controls separately.

To calculate a correlation measure, we had to reduce the dimensionality of the tract profile into a single diffusivity measure. This was done by extracting MD measures from a fixed sized window within the cluster of nodes where significant differences were found and averaging the MD values measured within this window. The size of the window was arbitrarily set to 11 nodes that include the middle-most node and 5 additional nodes on each side. We further validated that the results were not specific to a window size of 11 nodes by replicating the correlation analyses with different window sizes (see Figure S4).

Speech fluency was assessed using the speech rate (measured in SPS). This specific measure was chosen for its proximity to the fluency measure used by Catani et al. (2013) in their study of the FAT in primary progressive aphasia.

We calculated Spearman’s rank-order correlations to assess the link between an individual’s diffusion data and their speech rate. Spearman correlations were used due to evidence for non-normal distribution of both MD and speech rate as indicated by the Kolmogorov–Smirnoff test (Corder and Foreman 2009). Importantly, the correlations were calculated separately for AWS and controls, to avoid spurious correlations caused by the significant group differences found in both MD and in SPS. Fisher’s Z transformations were used to assess the significance of the difference between the correlation coefficients measured in AWS and control participants (Sheskin 2003). Finally, partial Spearman correlations were calculated between MD and SPS within each group, controlling for the effect of Age. This latter analysis served to confirm that the correlation effects are not driven by the large age range in our sample.

Results

Behavioral results

Participants in the group of AWS scored an average SSI score of 24.07 (standard deviation 7.38, range 10–41.5). The wide range of SSI scores demonstrated that this group consisted of different degrees of symptoms, ranging from very mild to very severe. Specifically, one participant was rated as exhibiting very mild stuttering, one mild, eight moderate, four severe and one very severe. On average, AWS and controls were of similar age and had similar handedness and education levels (see Table 1). As expected, the groups differed significantly in all three measures of speech fluency (speech rate: t(32) = 3.71, p < 10−3, SLD: t(32) = 2.66, p < 0.05, percent of stuttered syllables: t(32) = 6.14, p < 10−6, see Table 1).

Spearman’s rank-order correlations between speech rate (in SPS) and all the other measures used for the assessment of speech fluency and stuttering severity (SLD, % stuttered syllables, SSI score) showed a significant negative correlation in AWS (SPS-SLD: r s = −0.72, p < 0.005; SPS-percent stuttered syllables: r s = −0.68, p < 0.01; SPS-SSI score: r s = −0.6, p < 0.05).

Tract identification

The left FAT and the bilateral CST were identified successfully in all participants (N = 34). The right FAT was identified in 33 of the 34 participants, but could not be traced in one AWS. Figure 2 shows the tracts of interest identified in eight representative participants (four participants of each group). Figure 3 zooms in on the endpoints of the left FAT in a subset of these participants. As is evident in these figures, the FAT connects medial and lateral frontal cortices.

Fig. 2
figure 2

Bilateral FAT (cyan) and bilateral CST (green) shown in eight representative individuals. Tracts are displayed in a 3D view, with coronal and axial images added for orientation. Top row shows four AWS (a1a4) and bottom row shows four control participants (c1c4). AWS adults who stutter, FAT frontal aslant tract, CST corticospinal tract

Fig. 3
figure 3

The FAT endpoints. The projections of the left FAT (cyan) are shown in four representative participants (individuals a1, a2, c1 and c2 of Fig. 2). The lateral projections are overlaid on sagittal T1 images (a) and the medial projections are overlaid on axial T1 images (b). For illustration purposes, the tracts are surrounded by squares and the data outside the squares is slightly dampened. The mean lateral (c) and medial (d) MNI coordinates of the left FAT endpoints are shown on an MNI template (ICBM 2009a Nonlinear Asymmetric template; Fonov et al. 2011)

In a further analysis, we calculated the mean MNI coordinates of the center of mass of the FAT endpoints across all participants. We used the automated anatomical labeling (AAL) atlas (Tzourio-Mazoyer et al. 2002) to assign anatomical labels to these endpoints. This procedure revealed that in our sample, the FAT terminates medially in the SMA/pre-SMA (the AAL atlas includes both regions under the label SMA) or in the adjacent superior frontal gyrus (MNI coordinates [−13, 12, 68] [14, 12, 70]). Laterally, the FAT terminates in the pars triangularis of the inferior frontal gyrus (MNI coordinates [−53, 22, 13] and [56, 23, 15]).

The exclusion ratios that indicate the relative number of streamlines excluded during manual cleaning stage (see “Methods”) showed no significant difference between the groups (p > 0.1; Figure S1).

Elevated mean diffusivity in bilateral frontal aslant tracts of AWS compared with controls

For each tract, we compared FA and MD values of the AWS and control participants averaged across the entire tract (termed tract-FA and tract-MD, respectively). This analysis revealed that AWS have higher tract-MD values in bilateral FAT compared with fluent controls (Fig. 4b, left FAT: t(32) = 3.24, p < 0.005; right FAT: t(31) = 3.55, p < 0.005; both effects were significant when controlling the FDR with alpha set at 0.05). No significant group differences were found in average tract-MD of the bilateral CST (p > 0.1) and similarly, no significant differences were observed in average tract-FA values of either tract (p > 0.09, Fig. 4a).

Fig. 4
figure 4

Group comparison of tract-FA and tract-MD values. Average FA (a) and MD (b) values measured in bilateral FAT and bilateral CST are shown in AWS (red) and in controls (blue), with error bars denoting ±1 standard error of the mean. Asterisks denote significant group differences after controlling for the FDR with alpha set at 0.05 (see the main text for further details). Significant increase in MD is observed in bilateral FAT of AWS compared with controls. Note that FA and MD values are measured in different units, hence the difference in the y axis range of a and b. AWS adults who stutter, FAT frontal aslant tract, CST corticospinal tract, L left, R right, FA fractional anisotropy, MD mean diffusivity, FDR false discovery rate, a.u. arbitrary units, ms millisecond

To further identify the source of the group differences in tract-MD, we followed up on these results with a comparison of tract-AD and tract-RD values (Fig. 5). We found that both AD and RD extracted from bilateral FAT are increased in AWS compared with controls (AD in left FAT: t(32) = 2.68, p < 0.05; AD in right FAT: t(31) = 3.38, p < 0.005; RD in left FAT: t(32) = 3, p < 0.01, RD in right FAT: t(31) = 3.26, p < 0.005; both effects were significant when controlling the FDR with alpha set at 0.05).

Fig. 5
figure 5

Group comparison of tract-AD and tract-RD values. Average AD (a) and RD (b) values measured in bilateral FAT and bilateral CST are shown in AWS (red) and in controls (blue), with error bars denoting ±1 standard error of the mean. Asterisks denote significant group differences after controlling for the FDR at alpha level = 0.05 (see the main text for further details). Significant increase in both AD and RD are observed in bilateral FAT of AWS compared with controls. AWS adults who stutter, FAT frontal aslant tract, CST corticospinal tract, L left, R right, AD axial diffusivity, RD radial diffusivity, FDR false discovery rate, ms millisecond

Altogether, 16 group comparisons were conducted in this analysis (4 tracts × 4 diffusion measures). All the reported group differences in tract-MD, tract-AD and tract-RD of bilateral FAT were significant when controlling the FDR for 16 comparisons with alpha set at 0.05.

To partial out the effect of age on the results, we entered each diffusion measure into an ANCOVA, with Age as a covariate. The results show that in MD, AD and RD, the effects of Group remain significant after controlling for Age and that there is no significant Group by Age interaction. In FA, no significant Group effect and no significant Group by Age interaction is found (see Table 2).

Table 2 ANCOVA test results

Similar tract volumes and lateralization indices in AWS and in fluent controls

Comparing the normalized volumes of the tracts did not reveal any significant group differences (p > 0.1). Similarly, no significant group differences were found in the LIs calculated over volume estimations, FA and MD measures (p > 0.09). The mean LIs measured in the CST and in the FAT were close to zero in both groups, with absolute mean LI values below 0.1, indicating bilaterality.

Increased mean diffusivity observed along the left tract profiles

Differences in diffusion measures between the groups may be masked in the averaged tract-diffusivity estimates, due to large variability along the length of the tract (Yeatman et al. 2011). We therefore, generated profiles describing FA and MD along the tracts and then compared the profiles between the groups using multiple t tests and a permutation based multiple comparison correction (Nichols and Holmes 2002; Yeatman et al. 2012 and see “Methods” of this paper).

This analysis showed that MD values measured in AWS are significantly higher than those measured in fluent controls, in a large cluster of nodes measured within the left FAT (Fig. 6a, cluster ranges across nodes 55–90) as well as in a large cluster of nodes measured within the left CST (Fig. 6b, cluster ranges across nodes 74–97). In the right FAT and in the right CST we did not find any cluster of nodes that significantly differed between the groups and was large enough to survive the multiple comparison cluster-based threshold that was used (Figure S2). In addition, no significant difference was found between the FA profiles of AWS and controls along any of the tracts (Figure S3).

Fig. 6
figure 6

Group comparison of MD profiles and brain-behavior correlations. MD profiles are shown for the left FAT (a) and the left CST (b). The colored line indicates the average profile of the AWS (red) and controls (blue), with grey regions denoting ±1 standard error of the mean (AWS: dark grey; controls: light grey). Significant group-differences between the profiles are observed in nodes 55–90 of the left FAT and in nodes 74–97 of the left CST (see text). Rectangles overlaid on the profiles represent fixed sized windows centered around the midpoint of the significant cluster of nodes (11 nodes in each window). In c and d, MD values of individual participants measured within these windows are plotted against their SPS. In the left FAT (c), data show that MD is negatively correlated with speech rate (measured in SPS) in AWS (red; r s = −0.7, p < 0.005) but not in controls (blue; r s = 0.19, p > 0.4). No significant correlation is found in the left CST (d) (in AWS: r s = −0.56, p = ~0.03, in controls r s = 0.005, p > 0.9). Note that a trend is observed in the CST of AWS, yet it does not reach significance after correcting for multiple comparisons (see “Methods”). AWS adults who stutter, FAT frontal aslant tract, CST corticospinal tract, MD mean diffusivity, ms millisecond, IFG inferior frontal gyrus and SMA supplementary motor area

Mean diffusivity within the left FAT negatively correlates with speech fluency in AWS

To establish the functional contribution of the left FAT and the left CST in AWS, we measured brain-behavior correlations between the MD values measured within these tracts and the speech rates. For each participant, we extracted a single measure of MD from a fixed size window located within the cluster of nodes where the groups significantly differed (see “Methods”). We correlated the resulting MD value of each participant with their speech rate (estimated using SPS). To avoid spurious correlations caused by the significant group differences found in both MD and SPS separately, Spearman correlations were calculated separately for AWS and controls (altogether, four correlations were computed: 2 groups × 2 tracts).

Results show that in the left FAT, higher MD values predict lower SPS in AWS (r s = −0.7, p < 0.005; FDR corrected for four comparisons with alpha set to 0.05) but not in controls (r s = 0.19, p > 0.4). Importantly, the correlation coefficients calculated in the two groups differ significantly (Fisher’s Z = 2.77, p < 0.01), and a partial correlation analysis revealed that the negative correlation observed in the left FAT of AWS remains significant after controlling for Age (p < 0.001, FDR corrected for four comparisons with alpha set to 0.05). No significant correlation is found in the left CST of both groups (in AWS: r s = −0.56, p = ~0.03, in controls: r s = 0.005, p > 0.9). Notably, in the CST of AWS, a trend is observed, however it does not reach significance following the control for FDR.

To evaluate the stability of these correlations, we modified the size of the window used for MD extraction and we measured the Spearman correlation coefficients obtained using different window sizes (Figure S4). This analysis showed a stable pattern of results, verifying that the pattern of correlations between MD and speech rate does not depend on the particular window size used in the main analysis. Specifically, the results did not change when we used the entire cluster of nodes that significantly differed between the groups in the tract profile analysis (nodes 55–90 of the left FAT, nodes 74–97 of the left CST). Within these clusters, MD-SPS correlations were significant in AWS (r s = −0.65, p < 0.05; FDR corrected for four comparisons with alpha set to 0.05) but not in controls (r s = 0.01, p > 0.97). The correlation coefficients differed significantly between the groups (Fisher’s Z = 2.04, p < 0.05), and the negative correlation observed in the left FAT of AWS remained significant after controlling for Age using a partial correlation analysis (p < 0.05). In addition, no significant correlation was found in the left CST of both groups (in AWS: r s = −0.53, p = ~0.04, in controls: r s = −0.14, p > 0.5). Similar to the main analysis (of 11 nodes), a trend was observed in the left CST of AWS, yet it did not reach significance following the control for FDR.

Discussion

Our results show that the bilateral FAT is involved in speech production in a clinical population of individuals with persistent developmental stuttering. Specifically, we show that the microstructural properties of this tract, assessed using mean diffusivity, significantly differ between individuals who stutter and fluent controls. Beyond group comparisons, our data further show that the mean diffusivity calculated within the left FAT predicts individual speech rate in individuals who stutter. By demonstrating the involvement of the FAT in a developmental disorder that disrupts speech fluency, our results strengthen the view that the FAT plays a role in speech production, possibly via a “language motor stream” (Dick et al. 2013).

Our data show that AWS have higher MD values in the left FAT compared with controls. Further, the MD values calculated within this tract negatively correlate with the speech rate measured in individuals who stutter: the lower the MD, the higher the speech rate (and the closer it is to the average speech rate measured in controls). While these results demonstrate brain-behavior correlations with MD in the left FAT, the previous analysis of functional correlations within the left FAT in primary progressive aphasia (Catani et al. 2013) showed a negative correlation between speech fluency and RD values (along with a positive correlation between speech fluency and FA values). MD and RD values typically correlate in a positive fashion (see Figure 6 of De Santis et al. 2014). Thus, it appears that despite the different diffusion measures and the different assessment tools used for measuring fluency, both studies imply that less restriction on water diffusion within the left FAT is associated with a lower degree of fluency. By showing that this brain-behavior correlation is unique to AWS and does not hold in controls, our findings suggest that the left FAT may be an important predictor of fluency in clinical populations, but not necessarily in the neurotypical population.

In the right FAT, our data show a significant group difference in the average MD of the entire tract with no significant difference in the profile analysis. Specifically, the profile analysis of the right FAT shows several clusters of nodes that differ between the groups, yet none of these clusters is large enough to survive the cluster-based multiple comparison correction that we apply (see “Methods”). This suggests that differences in MD values of the right FAT are equally spread in multiple (not necessarily neighboring) points along the right FAT.

In light of the persistent developmental stuttering literature, our findings in bilateral FAT are not surprising. The involvement of both hemispheres in the etiology of stuttering has long been proposed (Kushner 2012; Travis 1931) and many functional imaging studies continue to indicate the contribution of both hemispheres to developmental stuttering (Biermann-Ruben et al. 2005; Brown et al. 2005; Chang et al. 2009; Kell et al. 2009; Lu et al. 2010a; Watkins et al. 2008; Xuan et al. 2012). Structural imaging studies of white matter in developmental stuttering have, for the most part, highlighted left frontal white matter abnormalities as the core deficit in the disorder (Chang et al. 2008; Cykowski et al. 2010; Sommer et al. 2002; Watkins et al. 2008). However, some of these studies also point to stuttering-related white matter differences in the right hemisphere (Chang et al. 2008, 2010; Connally et al. 2013; Watkins et al. 2008). Our findings are consistent with the view that both hemispheres contribute to fluent speech production in individuals who stutter, as well as with the general proposal that the underlying mechanisms required for speech production (like sensory-motor transformation) occur bilaterally (Cogan et al. 2014).

One view of persistent developmental stuttering maintains that the core deficit is left hemispheric (Chang et al. 2008; Kell et al. 2009), while the right hemisphere is recruited to compensate for the left hemisphere impairment (Preibisch et al. 2003). However, while some authors suggest that the right hemisphere involvement is beneficial, i.e., enhances fluency (Braun et al. 1997; Kell et al. 2009; Preibisch et al. 2003) others suggest that it is not (Brown et al. 2005; Chang et al. 2010; Foundas et al. 2004; Fox et al. 2000; Moore 1984). Our data show the same pattern of results (MD elevation in AWS compared with controls) in both the left and right FAT, which is difficult to reconcile with the idea that the right tract is compensating for the left impairment. However, because all participants of this study were adults, we cannot be conclusive about the functional interpretation of the differences that we find. All the observed differences can equally constitute the cause for stuttering or reflect compensation processes following years of stuttering.

We originally examined the FAT in the context of persistent developmental stuttering based on its recent depiction as a language production pathway (Dick et al. 2013). However, in fMRI, functional connectivity between medial-dorsal and ventral-lateral frontal regions has been related to domain general error monitoring (Dosenbach et al. 2006; Eckert et al. 2009; Vaden et al. 2013). This “cingulo-opercular network” has been proposed to relay information between the dorsal paracingulate/anterior cingulate cortex and the lateral anterior insula. While these regions do not overlap precisely with the endpoints of the FAT (Fig. 3, see also Catani et al. 2012), it is still possible that the FAT takes part in relaying information related to error monitoring in speech production. We offer this possibility here because, first, it is well-known that fiber tracking algorithms are prone to errors near the gray matter, where FA drops, and can very well skip a sulcus on their way to cortex (Ben-Shachar et al. 2007). Second, error monitoring deficits in speech production have often been proposed as part of the core deficit in stuttering, and this finding has been supported by recent electrophysiological evidence (Arnstein et al. 2011; but see Postma and Kolk 1992 for an alternative view). Future studies will be necessary to determine whether the FAT is related to error monitoring, and whether this relation generalizes to other domains beyond speech.

Within the CST, our data show an increase in MD of AWS compared with controls, in the inferior portion of the left tract. The pattern of this result is in agreement with a recent study that shows a significant reduction in FA values measured in the cerebellar peduncles of people who stutter compared with controls (Figure 4 of Connally et al. 2013). Both studies are compatible with less restricted diffusivity in the CST at the level of the cerebellar peduncles in people who stutter. Moreover, each of these studies extends this information in complementary ways: Our results indicate that the difference is limited to the left tract, while the study by Connally et al. (2013) shows that the differences extend over the inferior, middle and superior cerebellar peduncles (each averaged across the two hemispheres). In our sample of participants, FA values of the CST did not differ significantly between the groups. Previous studies of developmental stuttering that report FA differences in the CST used whole brain voxel-based methods (Cai et al. 2014; Chang et al. 2008; Watkins et al. 2008). Interestingly, the one study that used both tractography and Tract-Based-Spatial-Statistics (TBSS) reported FA differences in the CST only when using TBSS (Connally et al. 2013). Therefore, the discrepancy between our results and the previous reports of stuttering-related FA differences in the CST may be caused by the different methodologies that were employed.

In this study, we report differences in MD that are not accompanied by parallel differences in FA. Although it is common to find MD and FA effects that go in opposite directions, the two metrics are not necessarily correlated (Figure 6 of De Santis et al. 2014). MD and FA provide complementary information about the eigenvalues obtained during tensor fit: While MD measures the average of the three eigenvalues of the tensor, FA quantifies their normalized standard deviation. An MD difference without an FA difference suggests an overall difference in eigenvalues which is not specific to one direction of diffusivity, and thus does not translate into a difference in anisotropy. Indeed, we show that the increase in MD is derived from an increase in both AD and RD (Figs. 4, 5). Such a pattern of joint increase in AD and RD implies that within these tracts there are fewer constraints to diffusion (reduced tissue density) in a manner not specific to a certain direction.

MD values rely on water diffusion that is sensitive to the microstructure of the underlying brain tissue, yet the interpretation of these values in terms of tissue properties should be attempted cautiously (Jones et al. 2013). In white matter, MD values are affected by many factors including water content, membrane density, myelin and axonal count (Alexander et al. 2007; Burzynska et al. 2010; Schmierer et al. 2007). Any combination of these factors may have conspired to generate the effects reported here, but most would affect FA in an opposite manner. We hypothesize that increased MD values in AWS could stem from a combination of factors: For example, noisy communication (reduced synchrony) between the IFG and SMA could have led to excessive pruning of axons through the FAT, increasing MD and reducing FA, as well as to a more coherent fiber organization within the FAT, which would elevate FA back to its typical range. Elevated membrane permeability has similarly been proposed as a mechanism to explain developmental differences in MD that covary with slowed information transfer, without affecting FA (Scantlebury et al. 2014). Admittedly, such hypotheses are impossible to test directly with DTI alone, and would be more directly testable using imaging methods geared for quantifying more specific tissue properties (e.g., Assaf et al. 2008; Mezer et al. 2013; Stikov et al. 2011).

MD values may be affected by other factors that stem from the methodology, like partial volume averaging across different tissue types and crossing fibers (Vos et al. 2011, 2012). We consider partial volume effects an unlikely explanation for the effects reported here because the anisotropy values in regions that showed a significant group difference were greater than 0.35 (values typical to white matter), and because the FAT is not directly adjacent to the ventricles, as verified by individual inspection of the tracts. Thus, FAT voxels are unlikely to partially sample gray matter or CSF in addition to white matter.

Functionally, our results show that lower MD values are associated with a person’s ability to produce more syllables during a fixed time period. This implies that in this sample of AWS, lower MD values predict faster transmission between inferior frontal language regions and the pre-SMA/SMA involved in speech planning and production. This interpretation of the results is in agreement with previous reports in either older adults, younger adults or children, linking lower MD values with enhanced processing speed, and therefore with faster transmission between cortical regions (Bakhtiari et al. 2014; Sasson et al. 2010; Scantlebury et al. 2014). Future studies are expected to shed more light on this idea using methods that tap into more specific tissue properties, combined with direct estimates of transmission speed (Horowitz et al. 2014).

Limitations

Several limitations of the current study should be acknowledged. First, the sample size is limited (N = 15 and 19 for AWS and controls, respectively), which reduces the statistical power of our analyses. However, even with this limited power, we were able to detect significant differences in diffusion properties between AWS and controls, as well as within group correlations with behavior. This limitation is still relevant to null effects reported here, particularly the absence of significant group differences in FA. A recent analysis has shown that MD requires a relatively smaller sample size compared with FA, for the detection of equivalent effect sizes (De Santis et al. 2014). It is therefore possible that future studies with larger samples will detect FA differences that have not been detected here. Second, our sample spans a relatively large age range (19–52 years old participants in both groups). While the groups were well-matched in mean-age and age range (see Table 1), this age range still raises the concern that age could somehow drive the effects we find, especially given the known variation in white matter properties during adulthood (Lebel et al. 2012b). To address this concern, we conducted ANCOVA analyses with Age as a covariate to examine the contribution of age to the group differences that were found, and we calculated partial correlations to control for the effect of Age on the brain-behavior correlations that showed a significant effect. None of these analyses indicated a significant contribution of Age to the reported effects. Third, in this study we used a scan protocol with limited angular resolution (19 diffusion directions). We chose this number of directions to ensure a short duration scan (5:50 min) that will reduce the chance of within scan head motion, and we improved the signal-to-noise ratio by repeating the acquisition twice. While simulation studies recommend the use of at least 30 diffusion gradient-encoding directions for the purpose of tractography (Jones 2004), a study of human subjects demonstrated that similarly robust measurements are achieved using 6 and 30 directions in many white matter tracts delineated using deterministic tractography (Lebel et al. 2012a). The authors of the latter study conclude that using repeated scans (as used here) along with at least six directions (we use 19) should be considered appropriate for the purposes of deterministic tractography.

Conclusions

Our study shows differences in diffusion measures of the bilateral frontal aslant tract and the left CST in persistent developmental stuttering. Further, the data show an association between diffusion measures of the left frontal aslant tract and behavioral measures of speech fluency in adults who stutter. This association is anatomically selective in that it does not hold in the other tracts targeted here, or in controls. By demonstrating the involvement of the bilateral frontal aslant tracts in persistent developmental stuttering, our findings strengthen the view that this tract plays a role in the production of fluent speech and should be considered in future studies of language and its disorders.