Background

Attention-deficit/hyperactivity disorder (ADHD) is a common neurodevelopmental disorder that affects 3 to 10% of school-aged children (Schubiner and Katragadda 2008). The symptoms in affected children include inattention, hyperactivity, and impulsivity, which are often exhibited as inappropriate behavior, noncompliance, and negative social behavior (DuPaul et al. 2001). A multidisciplinary strategy is suggested for improving ADHD symptoms, but psychostimulant treatment, including methylphenidate (MPH), has remained a first-line therapeutic option for ADHD (Subcommittee on Attention-Deficit/Hyperactivity et al. 2011; Van der Oord et al. 2008).

The proposed mechanism of action of MPH is that it blocks both dopamine and norepinephrine transporters in the prefrontal cortex and the striatum preferentially (Berridge et al. 2006; Volkow et al. 2002). Previous functional MRI (fMRI) studies supported evidence that MPH could upregulate the activation of fronto-striatal regions in the ADHD group when tasks were being performed (Czerniak et al. 2013; Epstein et al. 2007; Konrad et al. 2007; Rubia et al. 2009). In addition, MPH-elicited changes were also consistently reported in the cerebellar regions, which showed significant increase in activity and metabolism (Czerniak et al. 2013; Epstein et al. 2007; Rubia et al. 2011; Volkow et al. 1997). Those results suggested that functional impairment in prefrontal-striatal-cerebellar circuits may play a crucial role in ADHD pathophysiology.

Recently, a growing body of evidence also pointed out that ADHD subjects also have impaired intrinsic functional architecture, including disrupted large-scale resting network organization as well as local spontaneous activity and connectivity (An et al. 2013a; Castellanos and Aoki 2016; Castellanos and Proal 2012; Fair et al. 2012; Zang et al. 2007). Although significant alterations were found across extensive brain regions, MPH effects on the resting-state functional network (RSFN) and correlated clinical improvement have been less investigated. In a previous study, MPH elicited local connectivity alteration in widespread frontal, parietal and cerebellar regions, but only a cluster in the superior parietal area showed a correlation with symptom improvement (An et al. 2013b). Another recent study revealed single-dose MPH induced decreased within-RSFN connectivity in the visual and cerebellar network as well as reduced visual network connectivity to executive and default mode networks (Silk et al. 2016). Those RSFN changes might have a mediating role in clinical improvement, but their contribution has not yet been fully discovered.

There are a number of methods for characterizing the architecture of the RSFN, including both local and large-scale approaches. For large-scale changes, a number of analytical approaches have been introduced, such as Independent Component Analysis (ICA) with dual regression (Beckmann and Smith 2004) and graph theoretical analysis (Bullmore and Sporns 2009). Furthermore, local spontaneous regional activity was estimated by data-driven approaches, including amplitude of low frequency fluctuation (ALFF) and fractional ALFF (fALFF) (Zang et al. 2007; Zou et al. 2008). Each approach has different foci of interest; thus, they can represent multi-faceted aspects of MPH treatment. We hypothesized that subsequent methods, ICA with dual regression, graph theoretical analysis, ALFF and fALFF, would capture various aspects of the intrinsic functional recovery of the ADHD subject. In addition, the findings from local and large-scale analyses could have a collaborative role in behavioral or clinical improvement with various levels of contribution.

In sum, our specific aim was to explore the significant changes in both local and large-scale RSFNs derived by short-term MPH treatment and association between RSFN change and improvement in clinical manifestation. To clarify the MPH-driven effect, we recruited ADHD subjects without prior exposure to any psychotropic medication and acquired resting-state functional magnetic resonance imaging (R-fMRI) before methylphenidate treatment and twelve weeks later. Paired analyses within the ADHD group were conducted to investigate the main effects of MPH using a variety of methods, including ICA with dual regression, graph theoretical analysis, and ALFF/fALFF. We also contrasted the RSFN changes with age-matched typically developing children (TDC). Finally, we adopted an ensemble learning algorithm called ‘random forest’ to determine the collaborative contribution of neuroimaging findings to symptom improvement.

Methods

Participants

Forty-three children with medication-naïve ADHD, aged 7 to 16 years old, and 28 age-matched TDC subjects were recruited by advertisements at the child and adolescent psychiatric clinic in Daejeon’s St. Mary’s Hospital. The Diagnostic and Statistical Manual of Mental Disorders - Fourth Edition (DSM-IV) criteria based on structured diagnostic interviews (Kiddie-Schedule for Affective Disorders and Schizophrenia-Present and Life time Version-Korean Version: K-SADS-PL) were used for diagnosis. The reliability and validity of the K-SADS-PL were previously determined (Kaufman et al. 1997; Kim et al. 2004). The full-range of The Korean Educational Development Institute-Wechsler Intelligence Scales for Children (KEDI-WISC) III (Park et al. 1996) was administered to each subject. Subjects were excluded if they had 1) any neurological disease or insult, developmental disorders including pervasive developmental disorders, or medical disorders that could affect brain development, 2) any history of substance abuse, psychotropic medication, significant fetal exposure to alcohol or drugs, or perinatal or neonatal complications, 3) full-scale Intelligence Quotients below 70, or 4) claustrophobia or irremovable metal materials. We recruited right-handed subjects only. Among 43 ADHD patients, 18 were excluded (13: subclinical ADHD, 2: brain abnormalities, 3: below 70 of IQ). At the baseline evaluation, a total of 25 ADHD subjects were included in the study. Among them, 5 subjects were additionally excluded in our analysis: 1 subject had poor adherence to medication (below 80% administration of prescribed medication according to self-reported medication administration chart), 1 subject had clinical suspicion of autism spectrum disorder rather than ADHD during clinical follow-up, and 3 subjects were loss to follow-up before twelfth week of treatment. Finally, 20 ADHD subjects completed the whole 12-week MPH treatment and MRI scanning at the end of the treatment. There was no significant difference in terms of age (Mann-Whitney U = 32.0, p = 0.221), IQ (Mann-Whitney U = 24.0, p = 0.077), K-ARS total score (Mann-Whitney U = 43.0, p = 0.749), Inattention score (Mann-Whitney U = 44.5, p = 0.830), Hyperactivity-impulsivity score (Mann-Whitney U = 41.0, p = 0.643) and K-CPRS score (Mann-Whitney U = 38.0, p = 0.498) between the included and excluded subjects. Among 28 TDC, one subject was excluded because of non-attendance at baseline evaluation. Finally, 20 psychotropic medication-naïve children with ADHD and 27 TDC were included in the further analysis. The imaging data used for this study was the expanded dataset from Choi et al. (2013).

Daejeon’s St. Mary’s Hospital Institutional Review Board approved all procedures. The purpose of the study was explained to the subjects and parents, who gave their written informed consent.

Clinical assessment and methylphenidate prescription

The severity of ADHD symptoms was evaluated with the Korean version of Dupaul’s ADHD rating scale (K-ARS) at the first interview. The validity and reliability of K-ARS were established by So and colleagues (2002). In addition, the behavioral domains of each subject were rated by their parents using the Korean version of Conner’s Parent Rating Scale (K-CPRS), for which the validity and reliability were determined in a previous study (Park et al. 2003).

After baseline R-fMRI acquisition, extended release form of MPH tablets (Concerta ®) was prescribed to 20 ADHD subjects. Two participants who addressed the problems of swallowing difficulty, were treated with Metadate CD ®. MPH doses were gradually titrated and adjusted to achieve desired symptom improvement for each subject while carefully monitored for adverse effects and drug compliance. The titration was limited if participants had intolerable side effects, or ADHD symptoms had been reduced to ‘much improved’ or ‘very much improved’ by the Clinical Global Impression – Global Improvement (CGI-I) scale. Symptom assessment and follow-up R-fMRI were acquired after twelve weeks later from the initial visit. The rating of the pre- and post-treatment states (PreTx and PostTx) of subjects and MPH prescription were conducted by an author. (J.C.).

Image acquisition

All images were acquired using a 1.5 T Philips scanner at Daejeon St. Mary’s Hospital. Before starting the MR scan, the children were given time to adapt to the scanner environment. The subject’s head was stabilized with cushions and taped to help minimize movement. The R-fMRI scan was acquired with an echo-planar imaging (EPI) sequence with the following imaging parameters: repetition time (TR) = 2000 ms; echo time (TE) = 35 ms; flip angle (FA) = 70°; voxel size =3.4375 × 3.4375 × 5 mm3; imaging matrix =64 × 64; field-of-view (FOV) = 240 × 240 mm2, number of volumes =210. The subjects were instructed to keep their eyes closed and not to think of anything particular during R-fMRI scans. A T1-weighted anatomical image was also acquired using a magnetization-prepared, gradient-echo sequence (TR = 25 ms; TE = 4.6 ms; FOV = 240 × 240 mm2; voxel size = 1 × 1 × 1 mm3). To relieve the subjects’ anxiety, animation films were screened during structural MR scanning if they maintained minimal body movement. R-fMRI and structural MRI were acquired after twelve weeks of MPH treatment using the same scanner with identical scanning procedures and parameters. Because our study aimed to investigate short-term treatment effect among ADHD subjects whose symptoms were well-controlled by regular administration schedule, we scheduled PostTx MRI scanning conducted 2 h apart from MPH administration in the morning of the day of the scan.

Data preprocessing

The MRI data were preprocessed with the FMRIB Software Library (FSL, www.fmrib.ox.ac.uk/fsl) version 5.0, Analysis of Functional NeuroImages (http://afni.nimh.nih.gov/afni/), and the FreeSurfer image analysis suite (http://surfer.nmr.mgh.harvard.edu/) version 5.3. Preprocessing of R-fMRI data was performed sequentially with the following procedure. First, we discarded the initial five volumes from each R-fMRI data item to reduce T1 saturation effects. Rigid body transformation and slice timing correction were performed on the remaining 205 images using FSL software. After skull stripping, motion correction, removing of linear and quadratic trends, and band-pass filtering (0.01–0.1 Hz) of the R-fMRI data were performed to reduce physiological noise and signal drift using CompCor methods (Behzadi et al. 2007). Then, we applied spatial smoothing on all the valid voxels using a Gaussian kernel of 6 mm full-width at half-maximum. Next, R-fMRI volumes were registered with corresponding high-resolution T1-weighted images that had been co-registered to the Montreal Neurological Institute (MNI) T1 template.

Spatially-independent, global-level RSFN components were extracted using the Multivariate Exploratory Linear Optimized Decomposition into Independent Components (MELODIC) module of the FSL software (Beckmann and Smith 2004). Multisession temporal concatenation in a MELODIC module was used to extract the probabilistic independent component (IC) for group analysis. A dual regression analysis was conducted to investigate the component-wise connectivity difference between groups. We used a publically available IC template suggested by Smith and colleagues, which had been proved to represent resting brain’s functional dynamics across a large dataset (Smith et al. 2009). Among 20 RSFNs in the template, we selected 10 well-matched pairs of RSFNs which we considered as non-artefactual IC components. The RSFNs were used in further analysis as follows: left frontoparietal network (LFPN), right frontoparietal network (RFPN), central executive network (CEN), Sensory/Motor network (SM), auditory network (AN), default mode network (DMN), cerebellum (CBL), lateral visual processing network (LVN), medial visual processing network (MVN) and visual network-occipital pole (OVN).

Considering heterogeneity of the participants’ ages and their unequal developmental stages, registration to a common standard 3D space would result in a biased transformation of a certain region. Rather than using the same approach used for a global-level approach, we adopted surface-based registration provided by the Freesurfer package. This approach reduced not only the inter-participant variability caused by the individual cortical folding pattern, but it also diminished the substantial spread of activation along the distant regions after spatial smoothing, which provided a critical advantage for local-level analysis. For this purpose, ALFF and fALFF were calculated from pre-processed R-fMRI data which were resampled into a 3 mm iso-voxel grid by averaging the square root of the power spectrum across 0.009–0.08 Hz and the power within the low-frequency range divided by the total power in the entire frequency range, respectively. Then, the high-resolution structural data of each participant were reconstructed to the cortical surface using methods based on the Freesurfer processing pipeline (Dale et al. 1999; Fischl and Dale 2000; Fischl et al. 1999a, b): registration to the Talairach atlas, bias field correction, skull stripping, intensity normalization, surface modeling, and spherical mapping and registration. ALFF and fALFF values in each cortical voxel were registered to the corresponding vertices in each subjects’ surface model and spatial smoothing was applied to the surface vertices with a 5 mm full-width at half-maximum Gaussian kernel. Finally, Z-transformation of ALFF and fALFF from each subject space were performed and warped to the common spherical model called ‘fsaverage’ to compute group statistics.

Imaging data analysis

Functional connectivity analyses using dual regression were performed using time courses of the 10 RSFNs from each subject. (Filippini et al. 2009; Littow et al. 2010; Veer et al. 2010). Time course matrices from spatial IC maps in group ICA were used to estimate subjects-specific spatial maps, and then the connectivity difference from each RSFN was tested for statistical significance in a pairwise manner between the PreTx and PostTx ADHD groups. We also performed the same procedure to contrast PreTx and PostTx ADHD participants relative to TDC. Statistical analyses were conducted with 10,000 replicated permutation tests with the Threshold-Free Cluster Enhancement methods implemented in the FSL archive to reduce the possibility of error from arbitrary inference. Finally, we defined clusters that showed a family-wise error (FWE) corrected p-value less than 0.05 had significant functional connectivity.

Next, graph theoretical analysis among 10 RSFNs was conducted with the GRETNA toolbox (Wang et al. 2015), an open-source package built on Matlab program (Mathworks Inc.). From the mean time series data of 10 RSFNs, a partial correlation coefficient (PCC) was estimated between every two networks after controlling for the effect of other connections on them. We took absolute values from a negative PCC to quantify the connectivity strength and then constructed a graph network of each group with a 10 by 10 weighted matrix. We also assumed that those large-scale networks might interact with each other during the resting state. Thus, we analyzed only fully-connected networks ranging from a similarity threshold of 0.01 (dense connection) to 0.06 (scarce connection). The network characteristics used in the current study were as follows: clustering coefficient (CC), global and local efficiency (Eglobal and Elocal), and modularity. The CC is the ratio of the number of existing connections to all possible connections of the subgraph connected to designated nodes (Onnela et al. 2005; Watts and Strogatz 1998), Eglobal and Elocal are the inverse of the harmonic mean of the minimum path length within the entire network and all possible subgraphs, respectively (Latora and Marchiori 2001, 2003). Modularity is the difference of the expected edges between the intra-module and the inter-module, and the community structures of the module illustrated the group organizations of RSFNs (Newman 2006). For the network-wise comparison, each global metric was normalized using the mean and standard deviation of those from 1000 randomly generated graphs. Then, all properties of the ADHD group were compared in a pairwise manner and also contrasted with TDC by a two-sample t-test with a significance level of p < 0.05.

Finally, surface-based ALFF and fALFF analyses were performed with the general linear model implemented in the Freesurfer package. For estimating the MPH effect, we conducted a paired t-test between PreTx and PostTx data and then also compared each group with the TDC group, respectively. To reduce false results, we employed the cluster-forming threshold at p < 0.0001, which was the conservative standard suggested in other studies (Silver et al. 2011), and all results were adjusted for FWE corrected p < 0.05 based on the 10,000 Monte-Carlo simulations.

All neuroimaging analyses were controlled for effects of demographic covariates including age, sex, and IQ.

Regression analysis with the random forest algorithm

The significant features extracted from the disparate R-fMRI analyses might be associated with clinical improvement of ADHD symptoms. However, conventional correlation analysis could only determine a one to one relationship between a single neuroimaging finding and symptom improvement, which is inappropriate for estimating integrative influence from multiple features. We adopted the random forest algorithm (Breiman 2001), an ensemble machine learning method, to explore the relationship between multiple neuroimaging features and clinical improvement. It operates by constructing a multitude of decision trees and expresses the role of a single feature, including weak and multivariate ones by the production of a feature-importance measure. In particular, the random forest algorithm has a considerable advantage in estimating high-order interactions and nonlinearity relationships between features (Strobl et al. 2009). In this framework, this approach provides an opportunity to explore minor features which might also be associated with clinical improvement. We tested the predictive power of all significant features from imaging analyses to explain clinical improvements measured from K-ARS and its inattention and hyperactivity-impulsivity subscale (K-ARS-I and K-ARS-H), and K-CPRS among 14 ADHD subjects who completed each symptom assessment (K-ARS and K-CPRS) both before and after twelve weeks of MPH treatment. To examine the paired difference after MPH administration, we normalized each participant’s PostTx data using the respective mean and standard deviation of PreTx, then calculated difference between Z-transformed values. Clinical variables were transformed to percent change improvement from the PreTx state.

Regression analysis was conducted with the ‘randomForestSRC’ package implemented in R version 3.2.2. (RCoreTeam 2016), and the final results were summarized from 2000 bootstrap replicates to obtain a robust model that produced fewer error rates. Imaging features that had a smaller mean squared error than a randomly shuffled dataset were chosen as a significant predictor of symptom severity. The variable importance was calculated for each neuroimaging feature as the difference between out-of-bag error estimates from original bootstrapped trees and that of randomly permuted trees. Finally, we examined the effects of the demographic variables including age, sex, and IQ with random forest regression by adding them as explanatory variables to determine their effect on regression.

Results

Demographic characteristics

In the between-group analysis, there were no meaningful differences in age, height, and weight between the ADHD and TDC groups (Table 1). We recruited more boys than girls in both groups up to four-fold, but the proportion between the two groups did not differ from each other. In contrast, verbal, performance and full-scale IQ were significantly lower in participants with ADHD. In the ADHD group, 14 subjects were diagnosed as combined subtype ADHD, and six subjects were inattentive subtypes.

Table 1 Demographic and clinical characteristics of participants

Average MPH dose at the twelfth week of treatment was 29.90 ± 13.56 mg (ranging from 18 mg to 72 mg). Treatment response to MPH was discernible when compared to the initial symptom assessment, and the total, inattention, and hyperactivity-impulsivity scores from K-ARS were significantly reduced to 17.87 ± 7.59 (paired t = 10.34, p < 0.001), 10.07 ± 4.15 (paired t = 12.33, p < 0.001), and 7.80 ± 4.26 (paired t = 7.06, p < 0.001), respectively. K-CPRS score also dropped to 10.14 ± 3.58 (paired t = 9.10, p < 0.001), which suggested marked symptom improvement after twelve weeks of MPH administration. According to the CGI-I rating, 18 ADHD subjects had met criteria for clinical response (10 subjects with ‘very much improved’, 8 subjects with ‘much improved’ and 2 subjects with ‘minimally improved’), and 10 subjects had K-ARS total score of ≤18 which could be accounted as symptomatic remission at the twelfth week of treatment.

ICA with dual regression

In the ADHD group, LFPN connectivity to the left insula was significantly increased after MPH treatment and meaningful changes were also seen in intra-component areas, including the middle and dorsolateral prefrontal cortex (DLPFC) and part of the caudate nucleus (Table 2, Fig. 1a, Family-wise error corrected p = 0.002). In addition, connectivity between the RFPN and the pontine tegmentum of the brainstem was increased in PostTx state (Family-wise error corrected p = 0.021). However, we could not find any treatment-related decreased connectivity from any of the 10 RSFNs and other brain regions. Compared to the TDC group, the PreTx ADHD group showed reduced intra-component connectivity between the RFPN and bilateral precuneus and the right dorsolateral prefrontal cortex (Fig. 1b, Family-wise error corrected p = 0.034), and the PostTx ADHD group was found to have increased LFPN connectivity to the left insula, left DLPFC and left superior occipital area (Fig. 1c, Family-wise error corrected p = 0.003).

Table 2 Significant results from all three resting-state fMRI analysis among pre- and post-treatment ADHD group and typically developing children
Fig. 1
figure 1

Large-scale connectivity changes identified from dual regression analysis. (a) Paired analysis before and after 1-month MPH treatment. Increased connectivity was found between left frontoparietal network and left insula and dorsolateral prefrontal cortex as well as right frontoparietal network and pontine reticular area after treatment. (b) Strong right frontoparietal network connectivity to precuneus and right dorsolateral prefrontal cortex was found in typically developing children relative to medication-naïve ADHD group. (c) After treatment, ADHD group showed increased left frontoparietal connectivity to left insula and dorsolateral prefrontal cortex than typically developing children. PreTx, pre-treatment ADHD; PostTx, post-treatment ADHD; TDC, typically developing children

Graph theoretical analysis

Paired comparison of global metrics revealed that network-wise CC and Elocal were significantly decreased after MPH treatment at a threshold of 0.05. The PostTx group also showed reduced CC of LFPN (threshold of 0.02), CBL (threshold of 0.05), and OVN (threshold of 0.06) among nodal metrics (Table 2, Fig. 2a, b). Global graph metrics of PreTx group did not differ from the TDC group, but the nodal metrics analysis showed elevated CC of LPFN (threshold of 0.04) and Elocal of DMN (threshold ranged from 0.02 to 0.04). Under MPH treatment, a lower global CC was identified at a threshold ranging from 0.03 to 0.05), as well as a reduced CC of OVN (threshold of 0.06) and CBL (threshold of 0.05 and 0.06), and Elocal of CBL (threshold ranged from 0.04 to 0.06). We could not find any meaningful differences in modularity and Eglobal in all the between-group comparisons. However, a qualitative investigation of community structure illustrated that functional integration of LFPN to DMN after MPH treatment at a threshold of 0.01 and 0.02, which was also found in the TDC group (Fig. 2c).

Fig. 2
figure 2

Large-scale intrinsic functional networks form graph theoretical analysis at similarity threshold 0.02. (a) and (b) Sagittal and axial view of clustering coefficient map among pre- and post-treatment ADHD and typically developing children. Significant difference was found in left frontoparietal networks. (c) Community structure of graph theoretical networks. Post-treatment ADHD group showed integration of left frontoparietal networks likewise typically developing children. PreTx, pre-treatment ADHD; PostTx, post-treatment ADHD; TDC, typically developing children; LFPN, Left frontoparietal network; RFPN, Right frontoparietal network; CEN, Central executive network; SM, Sensory/Motor network; AN, Auditory network; DMN, Default mode network; CBL, Cerebellum; LVN, Lateral visual processing network; MVN, Medial visual processing network; OVN, Visual network-occipital pole

ALFF and fALFF analysis

In a pairwise analysis of the ADHD group, surface clusters in the bilateral superior parietal cortex (SPC) showed reduced ALFF, while regions including the right inferior temporal gyrus (ITG), right temporal pole and inferior frontal gyrus (IFG) had an increased ALFF after MPH treatment (Table 2, Fig. 3a). However, paired analysis of fALFF did not reach the levels of statistical significance after correction for multiple comparisons. In addition, we could not find any significant changes from PreTx ADHD children relative to the controls. Meanwhile, PostTx ADHD group had an increased ALFF in the left DLPFC relative to TDC, (Fig. 3b), and a decreased fALFF in the right superior frontal gyrus as well as an increased fALFF in the left orbitofrontal and right ITG (Fig. 3c).

Fig. 3
figure 3

Local resting-state activity difference using ALFF/fALFF approach. (a) Pairwise comparison revealed ALFF reduction in bilateral superior parietal cortex while significant activity enhancement was found in right inferior frontal and temporal area. (b) ALFF of left DLPFC was also found to be increased in ADHD with MPH treatment group. (c) Post-treatment ADHD group showed decreased fALFF in right medial orbitofrontal cortex relative to typically developing children and increased fALFF in right inferior frontal and left inferior temporal cortex. PreTx, pre-treatment ADHD; PostTx, post-treatment ADHD; TDC, typically developing children; ALFF, Amplitude of low frequency fluctuation; fALFF, fractional amplitude of low frequency fluctuation

Imaging features explaining clinical variables

Among the 14 ADHD subjects, significant changes across five local and large-scale RSFNs could explain the 27.1% variance of symptom improvement measured by the K-ARS total score (Fig. 4). The highest variable importance was found in a reduced ALFF in the right SPC (15.35%), followed by an ALFF change in the left SPC (6.69%) decreased network-wise Elocal (6.39%), CC of CBL (4.95%) and network-wise CC (2.52%). A combination of those features also had predictive power for both K-ARS-I (13.85%) and K-ARS-H (32.95%), and the most important feature explaining K-ARS-I and K-ARS-H was the ALFF changes in the left (11.84%) and right SPC (31.31%), respectively. In addition, 13.72% of K-CPRS improvement could be explained by collaboration of neuroimaging features consisting of a reduced network-wise CC (8.64%), ALFF in the right IFG (7.65%), CC of CBL (5.83%), ALFF in the right ITG (3.12%) and the right SPC (0.59%). However, we could not identify the contribution of other significant neuroimaging features in symptom improvement. Features with higher importance tended to have a linear relationship with the clinical outcome in the partial correlation analysis, but a meaningful contribution of non-linear predictors was also found.

Fig. 4
figure 4

Estimated feature importance for clinical improvement and summarized explanatory power from random Forest regression in ADHD group. Feature importance and explanatory power has been demonstrated for explaining (a) total K-ARS score, (b) Inattentive domain score of K-ARS, (c) Hyperactivity/Impulsivity domain score of K-ARS and (d) K-CPRS. Features colored in black had significance level of p < 0.05, while those colored in gray did not show significance in Spearman’s rank order correlation analysis. K-ARS, Dupaul’s ADHD rating scale, Korean version; K-ARS-I, Inattentive domain score of K-ARS; K-ARS-H, Hyperactivity/Impulsivity domain score of K-ARS; K-CPRS, Conner’s parent rating scale, Korean version; CC, Clustering coefficient; Elocal, Local efficiency

In all regression analyses, inclusion of age, sex and IQ resulted in a loss of predictive power (K-ARS total score, 21.68%; K-ARS-I 1.22%; K-ARS-H, 28.21%, K-CPRS, 4.51% variance explained with demographic variables), which indicated that clinical improvements were not meaningfully affected by demographic characteristics of the participants.

Discussion

Our study investigated the local and large-scale changes in the RSFN after MPH administration and their integrative influence on symptom improvement. Using ICA with dual regression and graph theoretical analysis, we revealed treatment relevant alterations in large-scale connectivity which increased the LFPN connectivity to the left insular and frontal cortex, increased the RFPN connectivity to the brainstem, decreased CC in the LFPN, CBL, OVN and overall network. In addition, localized changes were presented both as decreased ALFF over the superior parietal cortex and elevated ALFF in the right IFG and ITG. With the random forest algorithm, the subset of neuroimaging features could explain the considerable variance of symptom improvement measured by K-ARS and K-CPRS after twelve weeks of MPH treatment.

Dual regression results revealed that the PostTx group was associated with increased connectivity between the LFPN and the left insula relative to both treatment-naïve ADHD and TDC. Current results were parallel with previous studies in which activity and connectivity within the ventrolateral prefrontal cortex (including insula) were upregulated by MPH treatment (Sripada et al. 2013; Cubillo et al. 2014; Mueller et al. 2014). The insular cortex is a hub region of the SN that plays a pivotal role in allocating resources between the frontoparietal network and the DMN to elicit adaptive responses (Dosenbach et al. 2007). In view of the decreased or immature connectivity between the SN and the frontoparietal network in the ADHD group (Choi et al. 2013; Sidlauskaite et al. 2016; Sripada et al. 2014), our finding might indicate upregulation of the hypoconnected intrinsic network by MPH-induced endogenous dopamine stimulation (Arnsten and Pliszka 2011). In addition, the left DLPFC and caudate nucleus also showed increased intra-component connectivity, which is strongly linked to executive function deficit in ADHD (Castellanos et al. 2006). It is consistent with the previous finding that regional blood flow of those regions increased after MPH treatment (Kim et al. 2001). Another noteworthy finding in the pairwise comparison was increased RFPN connectivity to the ascending reticular activation system (ARAS). The ARAS plays a key role in arousal and attention (Paus 2000), which has abundant noradrenergic projections from locus coeruleus (Sarter et al. 2006; Broyd et al. 2005). Thus, enhanced connectivity strength to the ARAS might be attributed to the noradrenergic facilitation effect by MPH. Additionally, the PreTx group showed decreased intra-component connectivity in a small part of the precuneus and right DLPFC relative to TDC, while this difference disappeared after treatment. Our findings might support the idea that MPH-induced recovery of diminished regional connectivity between the DMN and the frontoparietal network, which was suggested in a previous study (Sripada et al. 2013).

In graph theoretical analysis, a decrease in global network connectivity (network-wise CC and Elocal) was identified in the PostTx group, relative to the PreTx and TDC groups. In addition, the nodal CC also decreased in LFPN, CBL and OVN after treatment. Recent literature suggested that the ADHD group had a higher average connectivity in the subnetwork towards the regular graph during the resting state (Silk et al. 2016; Wang et al. 2009), but aberrant connections were downregulated by MPH treatment (Silk et al. 2016). In addition, ADHD subjects showed enhanced CBL connectivity to the frontoparietal and visual networks (Kucyi et al. 2015), while recent studies demonstrated that MPH treatment normalized regional connectivity in the fronto-parietal, CBL and visual networks (Yang et al. 2016; Mueller et al. 2014; Rosenberg et al. 2016; Sripada et al. 2013). Taking all the evidence into account, our findings support the previous hypothesis that functional improvement might be associated with MPH-induced functional segregation of the resting network (Sripada et al. 2013; Yang et al. 2016). Resting connectivity suppression by MPH was also found in a comparison between the PostTx and TDC groups. Meanwhile, our study also illustrated a functional integration of the LFPN after MPH treatment, although we could not find any quantitative change in modularity. Our finding could be interpreted as a delayed consolidation DMN over development in ADHD, which was suggested in previous studies (Fair et al. 2010; Castellanos et al. 2006) In addition, MPH treatment might have a differential effect on a resting network, including functional segregation across the global network as well as reorganization of a unified module linked to the DMN.

MPH treatment also modulated local spontaneous activity in the bilateral SPC, right IFG, and right ITG. In addition, the PostTx group also showed increased ALFF in the IFG, ITG, and DLPFC relative to the TDC. Our findings are generally corroborated by the results from An et al. that suggested that a single administration of MPH had resulted in an increased ReHo in the orbital and inferior frontal area and a decreased ReHo in the superior parietal cortex compared with a placebo (An et al. 2013b). As previous studies pointed out that the right IFG is the key region for response inhibition, sustained attention and attentional control for relevant stimulus (Hampshire et al. 2010). The inferior to middle temporal cortex have been less associated with ADHD pathophysiology, but greater activation of those regions have been shown to be related to low response time variability (Spinelli et al. 2011). In addition, MPH elicited downregulation of the bilateral SPC regions, which was significantly correlated with improvement of symptom severity and attentional performance (An et al. 2013b; Cho et al. 2007; Kim et al. 2012). Taking that evidence into account, we can consider that the normalization of baseline hyperactivity in SPC and upregulation of IFG, DLPFC and ITG might be associated with recovery of the attentional processing network. We could not find a significant ALFF/fALFF difference between the TDC and PreTx groups as previous studies reported (An et al. 2013a; Zang et al. 2007). Negative results could be driven by a small kernel size for spatial smoothing or a robust threshold for multiple comparison corrections relative to previous studies. Furthermore, comparison upon a two-dimension surface could provide better test-retest reliability using improved intra- and inter-subject registration quality (Zuo et al. 2013); however, the key structures for ADHD, including the striatum and cerebellum, cannot be covered in a surface mask. Therefore, local spontaneous activity differences between ADHD and TDC would need further exploration.

In the multivariate regression analysis, a considerable variance of K-ARS and K-CPRS was explained by the collaboration of neuroimaging features from both local and large-scale RSFNs. ALFF changes in the bilateral SPC had greater importance in predicting changes of the K-ARS and subdomain score. The contribution of the network-wise CC was prominent in explaining K-CPRS and also associated with K-ARS improvement. The included features and their importance were different between K-ARS and K-CPRS, which might be derived from rater’s characteristics (trained clinician vs. parent) and categories of assessment (symptoms in the DSM-IV diagnostic criteria vs. general childhood behavioral problems). High importance features tended to have a linear relationship in the partial correlation analysis, which might suggest a robust association between the recovery of intrinsic functional connectivity and clinical improvement. Furthermore, our approach could also reveal the substantial role of features with the minor importance that could not be detected in the conventional analysis that depended on a linearity assumption. Converging evidence suggested that dysfunctions associated with ADHD pathophysiology encompass both a number of large-scale and local changes in RSFNs which are largely interconnected to each other (Castellanos and Proal 2012; Di et al. 2013; Zang et al. 2007). In that sense, our approach might suggest the joint influence from diverse RSFN features on symptom improvement. However, some features, including large-scale changes in LFPN, RFPN and OVN connectivity, did not have any importance in predicting therapeutic effects. Fronto-parietal and visual networks have been implicated in ADHD pathophysiology, but impairments of such networks were identified mostly from task-based fMRI studies (Rubia et al. 2011; Silk et al. 2008). Thus, changes in a task-positive network might have limited influence on the intrinsic functional organization, as well as predicting symptom improvement. Finally, any explanatory benefit from including the age, sex and IQ might suggest that major symptom improvement was attributed to MPH-induced RSFN alteration, rather than subjects’ demographic characteristics.

To our knowledge, this is the first study to use multi-level R-fMRI analysis to explore MPH effect in ADHD subjects. In addition, machine learning application to R-fMRI data analysis showed promising results that could estimate the relationship between complex neuroimaging findings and subsequent clinical symptom improvement. However, there are some limitations to be declared of our study. First, the sample size of 20 ADHD and 27 TDC was relatively small, which limited generalization of our findings. In addition, a small number of subjects (n = 14) in random forest regression analysis could lower the performance of decision tree algorithm because of underestimation of the interrelationship among features. Second, a particular form of MPH was administered to ADHD participants as open-label, and the study protocol did not include a randomization process and/or a placebo-controlled group. This might confound symptom assessment to a favorable outcome after MPH treatment. Third, functional changes found in our analysis might be driven partly by pharmacodynamic properties of MPH, rather than recovery of symptoms. However, the study design constrained the inclusion of the TDC group with MPH treatment because of ethical considerations. Finally, most analyses in our study targeted the functional changes in the cortex; thus, subcortical contributions to clinical improvement could not be estimated. Further study with improved design will be needed to overcome these issues.

Conclusion

This study investigated both large-scale and localized RSFN changes in ADHD subjects after twelve weeks of MPH treatment. Furthermore, we estimated the integrated effects of multi-dimensional changes across RSFNs to clinical improvement with a novel ensemble learning algorithm. Among subsequent changes elicited by MPH, features strongly associated with K-ARS improvement increased local spontaneous activity in the bilateral SPC, reduced the CC of the global network and CBL. In addition, we predicted K-CPRS from a combination of reduced network-wise connectivity and normalized baseline activity across the right frontal, temporal, and parietal cortex. Our findings suggested that the neural mechanism of symptom improvement by MPH treatment could be associated with alteration of spontaneous activity in the SPC or widespread brain regions as well as functional segregation of the large-scale intrinsic functional network.