TractoFormer: A Novel Fiber-Level Whole Brain Tractography Analysis Framework Using Spectral Embedding and Vision Transformers

Zhang, Fan; Xue, Tengfei; Cai, Weidong; Rathi, Yogesh; Westin, Carl-Fredrik; O’Donnell, Lauren J.

doi:10.1007/978-3-031-16431-6_19

Fan Zhang¹²,
Tengfei Xue^12,13,
Weidong Cai¹³,
Yogesh Rathi¹²,
Carl-Fredrik Westin¹² &
…
Lauren J. O’Donnell¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13431))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

7502 Accesses
3 Citations

Abstract

Diffusion MRI tractography is an advanced imaging technique for quantitative mapping of the brain’s structural connectivity. Whole brain tractography (WBT) data contains over hundreds of thousands of individual fiber streamlines (estimated brain connections), and this data is usually parcellated to create compact representations for data analysis applications such as disease classification. In this paper, we propose a novel parcellation-free WBT analysis framework, TractoFormer, that leverages tractography information at the level of individual fiber streamlines and provides a natural mechanism for interpretation of results using the attention mechanism of transformers. TractoFormer includes two main contributions. First, we propose a novel and simple 2D image representation of WBT, TractoEmbedding, to encode 3D fiber spatial relationships and any feature of interest that can be computed from individual fibers (such as FA or MD). Second, we design a network based on vision transformers (ViTs) that includes: 1) data augmentation to overcome model overfitting on small datasets, 2) identification of discriminative fibers for interpretation of results, and 3) ensemble learning to leverage fiber information from different brain regions. In a synthetic data experiment, TractoFormer successfully identifies discriminative fibers with simulated group differences. In a disease classification experiment comparing several methods, TractoFormer achieves the highest accuracy in classifying schizophrenia vs control. Discriminative fibers are identified in left hemispheric frontal and parietal superficial white matter regions, which have previously been shown to be affected in schizophrenia patients.

This work is supported by the following NIH grants: R01MH119222, R01MH125860, P41EB015902, R01MH074794.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Deep Fiber Clustering: Anatomically Informed Unsupervised Deep Learning for Fast and Effective White Matter Parcellation

Deep White Matter Analysis: Fast, Consistent Tractography Segmentation Across Populations and dMRI Acquisitions

White Matter Fiber Segmentation Using Functional Varifolds

Keywords

1 Introduction

Diffusion MRI (dMRI) tractography is an advanced imaging technique that enables in vivo reconstruction of the brain’s white matter (WM) connections [1]. Tractography provides an important tool for quantitative mapping of the brain’s connectivity using measures of connectivity or tissue microstructure [2]. These measures have shown promise as potential biomarkers for disease classification using machine learning [3,4,5], which can improve our understanding of the brain in health and disease [6].

Defining a good data representation of tractography for machine learning is still an open challenge, especially at the fiber level. Performing whole brain tractography (WBT) on one individual subject can generate hundreds of thousands (or even millions) of fiber streamlines. WBT data is usually parcellated to create compact representations for data analysis applications. While most popular analyses of the brain’s structural connectivity rely on coarse-scale WM parcellations [2], recent studies have demonstrated the power of analyzing WBT at much finer scales of parcellation using high-resolution connectomes [7, 8]. While such approaches enable WBT analysis at a very high resolution (e.g., a 32k × 32k connectivity matrix), they are still quite high-dimensional and not able to represent information directly extracted from individual fibers.

Another challenge in machine learning for tractography analysis is the limited sample size (number of subjects) of many dMRI datasets. Developing data augmentation methods to increase sample size is a known challenge in structural connectivity research [9]. Small sample sizes limit the use of recently proposed advanced learning techniques such as Transformers [10] and Vision Transformers (ViTs) [11], which are highly accurate [12] but usually require a large number of samples to avoid overfitting [13].

Finally, an important challenge in deep learning for neuroimaging is to be able to pinpoint location(s) in the brain that are predictive of disease or affected by disease [14]. While interpretability is a well-known challenge in deep learning [15, 16], newer methods such as ViTs have shown advances in interpretability for vision tasks [17, 18].

In this paper, we propose a novel parcellation-free WBT analysis framework, TractoFormer, that leverages tractography information at the level of individual fiber streamlines and provides a natural mechanism for interpretation of results using the self-attention scheme of ViTs. TractoFormer includes two main contributions. First, we propose a novel 2D image representation of WBT, referred to as TractoEmbedding, based on a spectral embedding of fibers from tractography. Second, we propose a ViT-based network that performs effective and interpretable group classification. In the rest of this paper, we first describe the TractoFormer framework, then we illustrate its performance in two experiments: classification of synthetic data with true group differences, and disease classification between schizophrenia and control.

2 Methods

2.1 Diffusion MRI Datasets and Tractography

We use two dMRI datasets. The first dataset is used to create the embedding space and includes data from 100 subjects (29.1 ± 3.7 years; 54 F, 46 M) from the Human Connectome Project (www.humanconnectome.org) [19], with 18 b = 0 and 90 b = 3000 images, TE/TR = 89/5520 ms, resolution = 1.25 × 1.25 × 1.25 mm³. The second dataset is used for experimental evaluations and includes data from 103 healthy controls (HCs) (31.1 ± 8.7; 52 F, 51 M) and 47 schizophrenia (SCZ) patients (35.8 ± 8.8; 36 F and 11 M) from the Consortium for Neuropsychiatric Phenomics (CNP) (https://openfmri.org/dataset/ds000030) [20], with 1 b = 0 and 64 b = 1000 images, TE/TR = 93/9000 ms, resolution = 2 × 2 × 2 mm³. WBT is performed using the two-tensor unscented Kalman filter (UKF) method [21, 22] (via SlicerDMRI [23, 24]) to generate about one million fibers per subject. UKF has been successful in neuroscientific applications such as disease classification [25] and population statistical comparison [26], and it allows estimation of fiber-specific microstructural properties (including FA and MD). Fiber tracking parameters are as in [27]. Tractography from the 100 HCP are co-registered, followed by alignment of each CNP WBT using a tractography-based registration [28].

2.2 TractoEmbedding: A 2D Image Representation of WBT

The TractoEmbedding process includes three major steps (illustrated in Fig. 1). First, we perform spectral embedding to represent each fiber in WBT as a point in a latent space. Spectral embedding is a learning technique that performs dimensionality reduction based on the relative similarity of each pair of points in a dataset, and it has been successfully used for tractography computing tasks such as fiber segmentation [29], fiber clustering [30], and tract atlas creation [27]. To enable a robust and consistent embedding of WBT data from different subjects for population-wise analysis, we first create a groupwise embedding space using a random sample of fibers from co-registered tractography data from 100 subjects (see Sect. 2.1 for data details). This process uses spectral embedding [31] with a pairwise fiber affinity based on mean closest point distance [30, 32]. Next, to embed new WBT data, it is aligned to the 100-subject data [28], followed by computing pairwise fiber affinities to the population tractography sample. Then, each fiber of the new WBT data is spectrally embedded into the embedding space, resulting in an embedding coordinate vector for each fiber. We note that our process of spectral embedding is similar to that used for tractography clustering [30] and we refer the readers to [30] for details.

In the second step, the coordinates of each fiber of the new WBT data are discretized onto a 2D grid for creation of an image. Each dimension of the embedding coordinate vector corresponds to the eigenvectors of the affinity matrix sorted in descending order. A higher order indicates a higher importance of the coordinate to locate the point in the embedding space. A previous work has applied embedding coordinates for effective visualization of tractography data [33]. In our study, we choose the first two dimensions for each point and discretize them onto a 2D embedding grid^{Footnote 1}. A grid size parameter defines the image resolution.

In the third step, we map the measure of interest associated with each fiber to the corresponding pixel on the embedding grid as its intensity value. This generates a 2D image, i.e., the TractoEmbedding image. When multiple fibers that are spatially proximate are mapped to the same voxel, we can compute summary statistics from these fibers, such as max, min, and mean (mean is used in our experiments).

TractoEmbedding has several advantages (as illustrated in Fig. 2). First, TractoEmbedding is a 2D image that preserves the relative spatial relationship of every fiber pair in WBT in terms of the pixel neighborship in the 2D image (Fig. 2(1)). In this way, TractoEmbedding enables image-based computer vision techniques such as CNNs and ViT to leverage fiber spatial similarity information. (In the case where multiple fibers are mapped to the same voxel, to quantify the similarity of such fibers, we computed the mean pairwise fiber distance (MPFD) across the fibers. The average of MPFDs across all voxels with multiple fibers is 5.7 mm, which is a low value representing highly similar fibers through the same voxel.) Second, TractoEmbedding enables a multi-channel representation where each channel represents fibers from certain brain regions. This allows independent and complementary analysis of WBT anatomical regions, such as the left hemispheric, the right hemispheric and the commissural fibers in our current study. Thus, the TractoEmbedding is a 3-channel 2D image (Fig. 2(2)). Third, multiple TractoEmbedding images are generated by performing random downsampling of each subject’s input WBT (Fig. 2(3)). This naturally and effectively increases training sample size for data augmentation for learning-based methods, which is particularly important for methods that require a large number of samples. Fourth, TractoEmbedding can be generally used to encode any possible features of interest that can be computed at the level of individual tractography streamlines (Fig. 2(4)). This enables TractoEmbedding’s application in various tractography-based neuroscientific studies where particular WM properties are of interest. Fifth, TractoEmbedding allows a WBT representation at different scales in terms of the resolution of the embedding grid (Fig. 2(5)). With a low resolution, multiple fibers tend to be mapped into the same voxel, enabling WBT analysis at a coarse-scale fiber parcel level; with a high resolution, an individual fiber (or a few fibers) is mapped to any particular voxel, enabling WBT analysis at a fine-scale individual fiber level.

2.3 TractoFormer: A ViT-Based Framework for Group Classification

Figure 3 shows the proposed TractoFormer architecture, which leverages an ensemble of three ViTs to process the three-channel input TractoEmbedding images. Our design aims to address the aforementioned challenges of sample size/overfitting and interpretability. First, we leverage the multi-sample data augmentation (Fig. 2(3)) to reduce the known overfitting issue of ViTs on small sample size datasets [13]. Second, we leverage the self-attention scheme in ViT to identify discriminative fibers that are most useful to differentiate between groups. The interpretation of the ViT attention maps [11] is aided by our proposed multi-channel architecture, which can enable inspection of the independent contributions of different brain regions.

In detail, for each input channel, we use a light-weight ViT architecture (see Sect. 2.4 for details). An ensemble of the predictions is performed by averaging the logit outputs across the ViTs. For data augmentation, for each input subject, we create 100 TractoEmbedding images using randomly downsampled WBT data (80% of the fibers). For interpretation of results, in each ViT we compute the average of the attention weights for each token across all heads per layer, then recursively multiply the averaged weights for the first to the last layer, and finally map the joint token attention scores back to the input image space^{Footnote 2}. This generates an attention score map where the values indicate the importance of the corresponding pixels when classifying the TractoEmbedding image (as shown in Fig. 3). We identify the pixels with higher scores using a threshold T, and then identify the fibers that are mapped to these pixels when performing TractoEmbedding. These fibers are thus the ones that are highly important when classifying the TractoEmbedding image. We refer to the identified fibers as the discriminative fibers.

2.4 Implementation Details and Parameter Setting

Our method is implemented using PyTorch (v1.7.1) [34]. For each ViT, we use 3 layers with 8 heads, a hidden size of 128, and a dropout rate of 0.2 (grid search for {3, 4, 5}, {4, 6, 8}, {128, 256}, and {0.2, 0.3}, respectively). Adam [35] is used for optimization with a learning rate 1e−3 and a batch size 64 for a total of 200 epochs. Early stopping is adopted when there is no accuracy improvement in 20 continuous epochs. 5-fold cross-validation is performed for each experiment below and the mean accuracy and F1 scores are reported. T is set to be the mean + 2 stds of the scores in an attention map. The computation is performed using NVIDIA GeForce 1080 Ti. On average, each epoch (training and validation) takes ~30 s with 2 GB GPU memory usage when using data augmentation and 160 × 160 resolution. The code will be made available upon request.

2.5 Experimental Evaluation

Exp 1: Synthetic Data.

The goal is to provide a proof-of-concept evaluation to assess if the proposed TractoFormer can 1) successfully classify groups with true WM differences and 2) identify the fibers with group differences in the WBT data for interpretation. To do so, we create a realistic synthetic dataset with true group differences, as follows. From the 103 CNP HC data, we add white Gaussian noise (signal-to-noise ratio at 1 [36, 37]) to the actual measured mean FA value of each fiber in the WBT data. Repeating this process twice generates two synthetic groups of G1 and G2, each with 103 subjects. We then modify the mean FA of the fibers belonging to the corticospinal tract (CST) (a random tract selected for demonstration) in G2 to have a true group difference. To do so, we decrease the mean FA of each CST fiber in G2 by 20%, a synthetic change suggested to introduce a statistically significant difference in tractography-based group comparison analysis [36]. We apply the TractoFormer to this synthetic data to perform group classification and identify the discriminative fibers.

Exp 2: Disease Classification Between HC and SCZ.

The goal is to evaluate the proposed TractoFormer in a real neuroscientific application for brain disease classification. Previous studies have revealed widespread WM changes in SCZ patients using dMRI techniques [38]. In our paper, we apply TractoFormer to investigate the performance of using tractography data to classify between HC and SCZ in the CNP dataset. For interpretation purposes, we compute a group-wise attention map by averaging the attention maps from all subjects that are classified as SCZ, from which the discriminative TractoEmbedding pixels and discriminative fibers are identified. We compare our method with three baseline methods. The first one performs group classification using fiber parcel level features and a 1D CNN network [39], referred to as the FC-1DCNN method. Briefly, for each subject, WBT parcellation is performed using a fiber clustering atlas [27], resulting in a total of 1516 parcels per subject. The mean feature of interest (i.e., FA or MD) along each parcel is computed, leading to a 1D feature vector with 1516 values per subject. Then, a 1D CNN is applied to the feature vectors to perform group classification. For parameters, we follow the suggested settings in the author’s implementation^{Footnote 3}. The second method performs group classification using track-density images (TDI) [40] and 3D ResNet [41], referred to as the TDI-3DResNet method. Briefly, a 3D TDI, where each voxel represents streamline count, is generated per subject and fed into a 3D ResNet for group classification. The third baseline method performs group classification using TractoEmbedding images, but instead of using the proposed ViT, it applies ResNet [41], a classic CNN architecture that has been shown to be highly successful in many applications. We refer to this method as ResNet. For the ResNet and TractoFormer methods, we perform classification with and without data augmentation. We also provide interpretability results using Class Activation Maps (CAMs) [42] in ResNet.

3 Results and Discussion

Exp 1: Synthetic Data.

TractoFormer achieved, as expected, 100% group classification accuracy because of the added synthetic feature changes to G2. Figure 4 shows the identified discriminative fibers in one example G2 subject based on its subject-specific attention map and the G2-group-wise attention map. The discriminative fibers are generally similar to the CST fibers with synthetic changes.

Exp 2: Disease Classification Between HC and SCZ.

Table 1 shows the classification results of each compared method. In general, the FA measure gives the best result. The FC-1DCNN method generates lower accuracy and F1 scores than the methods that benefit from data augmentation. Regarding the 3 TractoEmbedding-based methods, we can observe that including data augmentation greatly improves the classification performance. The ensemble architecture gives the best overall result (at resolution 160 × 160 with FA feature), with a mean accuracy of 0.849 and a mean F1 of 0.770. Figure 5 gives a visualization of the discriminative fibers from group-wise and subject-specific attention maps. In general, our results suggest that the superficial fibers in the frontal and parietal lobes have high importance when classifying SCZ and HC under study. Multiple studies have suggested these white matter regions are affected in SCZ [43,44,45]. In ResNet (at resolution 160 × 160 with FA feature), CAM identifies the fibers related to the brainstem and cerebellum. The ViT- and ResNet-based methods focus on different brain regions, possibly explaining the accuracy difference of the two methods.

Table 1. Comparison across different methods: the mean accuracy and the mean F1 (the first and second values, respectively, per cell) across the cross-validation are reported.

Full size table

4 Conclusion

We present a novel parcellation-free WBT analysis framework, TractoFormer, which leverages tractography information at the level of individual fiber streamlines and provides a natural mechanism for interpretation of results using attention. We propose random sampling of tractography as an effective data augmentation strategy for small sample size WBT datasets. Future work could include an investigation of ensembles of different fiber features in the same network, multi-scale learning to use TractoEmbedding images with different resolutions together, and/or combination with advanced computer vison data augmentation methods. Overall, TractoFormer suggests the potential for deep learning analysis of WBT represented as images.

Notes

1.
While embeddings from the first 3 dimensions can be used to generate 3D TractoEmbedding images, our unpublished results show that this decreases group classification performance potentially due to the data sparsity where many voxels on the 3D grid do not have any mapped fibers.
2.
Following instructions from: https://github.com/jeonsworld/ViT-pytorch.
3.
https://github.com/H2ydrogen/Connectome_based_prediction.

References

Basser, P.J., Pajevic, S., Pierpaoli, C., Duda, J., Aldroubi, A.: In vivo fiber tractography using DT-MRI data. Magn. Reson. Med. 44, 625–632 (2000)
Article Google Scholar
Zhang, F., Daducci, A., He, Y., et al.: Quantitative mapping of the brain’s structural connectivity using diffusion MRI tractography: a review. Neuroimage 249, 118870 (2022)
Article Google Scholar
Zhan, L., et al.: Comparison of nine tractography algorithms for detecting abnormal structural brain networks in Alzheimer’s disease. Front. Aging Neurosci. (2015). https://doi.org/10.3389/fnagi.2015.00048
Article Google Scholar
Deng, Y., et al.: Tractography-based classification in distinguishing patients with first-episode schizophrenia from healthy individuals. Prog. Neuropsychopharmacol. Biol. Psychiatry 88, 66–73 (2019)
Article Google Scholar
Hu, M., X., et al.: Structural and diffusion MRI based schizophrenia classification using 2D pretrained and 3D naive Convolutional Neural Networks. Schizophr. Res. (2021). https://doi.org/10.1016/j.schres.2021.06.011
Article Google Scholar
Brown, C.J., Hamarneh, G.: Machine Learning on human connectome data from MRI. arXiv [cs.LG] (2016). http://arxiv.org/abs/1611.08699
Mansour, L.S., Tian, Y., Yeo, B.T.T., Cropley, V., Zalesky, A.: High-resolution connectomic fingerprints: mapping neural identity and behavior. Neuroimage 229, 117695 (2021)
Article Google Scholar
Cole, M., Murray, K., et al.: Surface-Based Connectivity Integration: an atlas-free approach to jointly study functional and structural connectivity. Hum. Brain Mapp. 42, 3481–3499 (2021)
Article Google Scholar
Barile, B., Marzullo, A., Stamile, C., Durand-Dubief, F., Sappey-Marinier, D.: Data augmentation using generative adversarial neural networks on brain structural connectivity in multiple sclerosis. Comput. Methods Programs Biomed. 206, 106113 (2021)
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems 30 (2017)
Google Scholar
Dosovitskiy, A., Beyer, L., et al.: An image is worth 16 × 16 words: transformers for image recognition at scale. In: ICLR (2021)
Google Scholar
Han, K., et al.: A survey on vision transformer. IEEE Trans. Pattern. Anal. Mach. Intell. (2022). https://doi.org/10.1109/TPAMI.2022.3152247
Steiner, A., Kolesnikov, A., et al.: How to train your ViT? Data, augmentation, and regularization in vision transformers. TMLR (2022)
Google Scholar
Hofmann, S.M., Beyer, F., Lapuschkin, S., et al.: Towards the interpretability of deep learning models for human neuroimaging. bioRxiv, p. 2021.06.25.449906 (2021)
Google Scholar
Zhang, Q.-S., Zhu, S.-C.: Visual interpretability for deep learning: a survey. Front. Inf. Technol. Electron. Eng. 19(1), 27–39 (2018). https://doi.org/10.1631/FITEE.1700808
Article Google Scholar
Lombardi, A., Diacono, D., Amoroso, N., et al.: Explainable deep learning for personalized age prediction with brain morphology. Front. Neurosci. 15, 674055 (2021)
Article Google Scholar
Chefer, H., Gur, S., Wolf, L.: Generic attention-model explainability for interpreting bi-modal and encoder-decoder transformers. In: ICCV, pp. 397–406 (2021)
Google Scholar
Abnar, S., Zuidema, W.: Quantifying attention flow in transformers. In: ACL 2020. pp. 4190–4197 (2020)
Google Scholar
Van Essen, D.C., et al.: The WU-Minn Human Connectome Project: an overview. Neuroimage 80, 62–79 (2013)
Article Google Scholar
Poldrack, R.A., et al.: A phenome-wide examination of neural and cognitive function. Sci. Data 3, 160110 (2016)
Article Google Scholar
Malcolm, J.G., Shenton, M.E., Rathi, Y.: Filtered multitensor tractography. IEEE Trans. Med. Imaging 29, 1664–1675 (2010)
Article Google Scholar
Reddy, C.P., Rathi, Y.: Joint multi-fiber NODDI parameter estimation and tractography using the unscented information filter. Front. Neurosci. 10, 166 (2016)
Article Google Scholar
Norton, I., Essayed, W.I., Zhang, F., Pujol, S., et al.: SlicerDMRI: open source diffusion MRI software for brain cancer research. Cancer Res. 77, e101–e103 (2017)
Article Google Scholar
Zhang, F., Noh, T., et al.: SlicerDMRI: diffusion MRI and tractography research software for brain cancer surgery planning and visualization. JCO Clin. Can. Inform. 4, 299–309 (2020)
Google Scholar
Zhang, F., Savadjiev, P., Cai, W., et al.: Whole brain white matter connectivity analysis using machine learning: an application to autism. Neuroimage 172, 826–837 (2018)
Article Google Scholar
Hamoda, H.M., et al.: Abnormalities in thalamo-cortical connections in patients with first-episode schizophrenia: a two-tensor tractography study. Brain Imaging Behav. 13(2), 472–481 (2018). https://doi.org/10.1007/s11682-018-9862-8
Article Google Scholar
Zhang, F., Wu, Y., et al.: An anatomically curated fiber clustering white matter atlas for consistent white matter tract parcellation across the lifespan. Neuroimage 179, 429–447 (2018)
Article Google Scholar
O’Donnell, L.J., Wells, W.M., Golby, A.J., Westin, C.-F.: Unbiased groupwise registration of white matter tractography. In: Ayache, N., Delingette, H., Golland, P., Mori, K. (eds.) MICCAI 2012. LNCS, vol. 7512, pp. 123–130. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33454-2_16
Chapter Google Scholar
Vercruysse, D., Christiaens, D., Maes, F., Sunaert, S., Suetens, P.: Fiber bundle segmentation using spectral embedding and supervised learning. In: CDMRI, pp. 103–114 (2014)
Google Scholar
O’Donnell, L.J., Westin, C.-F.: Automatic tractography segmentation using a high-dimensional white matter atlas. IEEE Trans. Med. Imaging. 26, 1562–1575 (2007)
Article Google Scholar
Fowlkes, C., Belongie, S., Chung, F., Malik, J.: Spectral grouping using the Nyström method. IEEE Trans. Pattern Anal. Mach. Intell. 26, 214–225 (2004)
Article Google Scholar
Moberts, B., Vilanova, A., van Wijk, J.J.: Evaluation of fiber clustering methods for diffusion tensor imaging. In: IEEE Conference on Visualization, pp. 65–72 (2005)
Google Scholar
Jianu, R., Demiralp, C., Laidlaw, D.H.: Exploring 3D DTI fiber tracts with linked 2D representations. IEEE Trans. Vis. Comput. Graph. 15, 1449–1456 (2009)
Article Google Scholar
Paszke, A., et al.: PyTorch: an imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems 32 (2019)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: ICLR (2015)
Google Scholar
Zhang, F., Wu, W., Ning, L., et al.: Suprathreshold fiber cluster statistics: Leveraging white matter geometry to enhance tractography statistical analysis. Neuroimage 171, 341–354 (2018)
Article Google Scholar
Smith, S.M., Nichols, T.E.: Threshold-free cluster enhancement: addressing problems of smoothing, threshold dependence and localisation in cluster inference. Neuroimage 44, 83–98 (2009)
Article Google Scholar
Kelly, S., et al.: Widespread white matter microstructural differences in schizophrenia across 4322 individuals: results from the ENIGMA Schizophrenia DTI Working Group. Mol. Psychiatry 23, 1261–1269 (2018)
Article Google Scholar
He, H., Zhang, F., et al.: Model and predict age and sex in healthy subjects using brain white matter features: a deep learning approach. In: ISBI, pp. 1–5 (2022)
Google Scholar
Calamante, F., Tournier, D., et al.: Track-density imaging (TDI): super-resolution white matter imaging using whole-brain track-density mapping. Neuroimage 53(4), 1233–1243 (2010)
Article Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR, pp. 770–778 (2016)
Google Scholar
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., Torralba, A.: Learning deep features for discriminative localization. In: CVPR, pp. 2921–2929 (2016)
Google Scholar
Nazeri, A., Chakravarty, M., et al.: Alterations of superficial white matter in schizophrenia and relationship to cognitive performance. Neuropsychopharmacology 38, 1954–1962 (2013)
Article Google Scholar
Makris, N., Seidman, L.J., Ahern, T., Kennedy, D.N., et al.: White matter volume abnormalities and associations with symptomatology in schizophrenia. Psychiatry Res. 183, 21–29 (2010)
Article Google Scholar
Ji, E., Guevara, P., et al.: Increased and decreased superficial white matter structural connectivity in schizophrenia and bipolar disorder. Schizophr. Bull. 45, 1367–1378 (2019)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Harvard Medical School, Boston, USA
Fan Zhang, Tengfei Xue, Yogesh Rathi, Carl-Fredrik Westin & Lauren J. O’Donnell
The University of Sydney, Sydney, NSW, Australia
Tengfei Xue & Weidong Cai

Authors

Fan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tengfei Xue
View author publications
You can also search for this author in PubMed Google Scholar
Weidong Cai
View author publications
You can also search for this author in PubMed Google Scholar
Yogesh Rathi
View author publications
You can also search for this author in PubMed Google Scholar
Carl-Fredrik Westin
View author publications
You can also search for this author in PubMed Google Scholar
Lauren J. O’Donnell
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fan Zhang .

Editor information

Editors and Affiliations

Rochester Institute of Technology, Rochester, NY, USA
Linwei Wang
Chinese University of Hong Kong, Hong Kong, Hong Kong
Qi Dou
University of Virginia, Charlottesville, VA, USA
P. Thomas Fletcher
National Center for Tumor Diseases (NCT/UCC), Dresden, Germany
Stefanie Speidel
Case Western Reserve University, Cleveland, OH, USA
Shuo Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, F., Xue, T., Cai, W., Rathi, Y., Westin, CF., O’Donnell, L.J. (2022). TractoFormer: A Novel Fiber-Level Whole Brain Tractography Analysis Framework Using Spectral Embedding and Vision Transformers. In: Wang, L., Dou, Q., Fletcher, P.T., Speidel, S., Li, S. (eds) Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13431. Springer, Cham. https://doi.org/10.1007/978-3-031-16431-6_19

Download citation

DOI: https://doi.org/10.1007/978-3-031-16431-6_19
Published: 15 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16430-9
Online ISBN: 978-3-031-16431-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)