1 Introduction

Tractography is currently at the heart of human brain connectomics studies [15]. However, biases and limitations of existing tractography pipelines have recently been highlighted [4], such as the reconstruction of many non-existent connections (false positive streamlines), poor spatial coverage of existing connections, and the difficulty of injecting anatomical priors beyond manual dissection and tissue classes from T1-weighted segmentations.

Currently, tracking algorithms depend on local models with assumptions about the nature of the underlying DWI signal. In 2015, [13] proposed a machine learning approach to fiber tractography based on a random-forest classifier. They successfully demonstrated how a purely data-driven approach can be used to reconstruct streamlines from the raw diffusion signal. Their method works well on 2D synthetic data and shows promising qualitative results on in vivo data. However, it has yet to be shown how well machine learning (and particularly deep learning) approaches can perform quantitatively on more realistic data and how well they can generalize to unseen data. In this paper, our main contributions are the first deep learning models for this problem and their evaluation, namely (1) a local reconstruction model based on a multilayer perceptron, (2) a sequential reconstruction model based on a recurrent neural network, (3) a careful quantitative evaluation of performance on the phantom of the ISMRM 2015 Tractography Challenge, and (4) a qualitative examination of the streamlines generated in data unseen during training. Our method outperforms or is competitive with current state-of-the-art deterministic and probabilistic tractography algorithms that are robust to crossing fibers. In particular, compared with the 96 other tractography methods submitted to the challenge, ours is the only approach able to recover more than 50% of the spatial coverage of the ground-truth bundles while keeping overreaching false connections below 50%. Our recurrent neural network is a promising deep learning solution for tractography based on raw DWI. It includes a notion of the history of followed directions, which makes it robust to crossing fibers and to a wide range of geometries, and gives it the flexibility to include priors and learn how to reduce false-positive connections.

2 Using Deep Learning for Tractography

Given a diffusion dataset and sequences of spatial coordinates, the goal is to train a model to predict the tracking direction to follow at each point. In the context of tractography, such a deterministic model can then be used in an iterative process for streamline creation.

We chose to focus on deep learning models because of their well-known ability to discover and extract meaningful structures directly from raw data [9]. Our models build on two types of architectures: a Feed-Forward Neural Network (FFNN) and a Recurrent Neural Network (RNN) [7]. While the FFNN is a local model and serves as a good baseline, it has the same weakness as existing methods, i.e. it is not able to learn streamline structure. To address this weakness, we used an RNN, because this family of models can process whole sequences as input. In our case, treating streamlines as sequences of coordinates in 3D space, our hypothesis is that a recurrent model should be able to learn fiber or bundle structure from the diffusion signal in order to make better predictions and solve classic problems like crossing fibers.

Model inputs: As in [13], to be independent of the gradient scheme, the raw diffusion signal is first resampled to D gradient encodings evenly distributed on the sphere (we used \(D=100\)). We also normalized each diffusion-weighted image by the b=0 image. A streamline is represented as a sequence \(\varvec{S}\) of M equally-spaced spatial coordinates \(P_i = (x_i, y_i, z_i)\). The diffusion signal is evaluated at each of these points using trilinear interpolation in voxel space, resulting in a sequence of M vectors of D dimensions representing the diffusion information along the streamline. In all our models, we also tried giving the previous direction as a supplementary input, as in [13]. Note that the spatial coordinates themselves are not given as input to the model. This choice makes the model invariant to brain size and translation, reducing the preprocessing needed before feeding data to the model and improving generalization.
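As a concrete illustration of this input pipeline, the following minimal sketch (not the authors' released code) evaluates an already resampled and b0-normalized DWI volume along a streamline with trilinear interpolation; the function name and array layout are our own assumptions.

```python
import numpy as np
from scipy.ndimage import map_coordinates

def dwi_along_streamline(dwi, streamline):
    """dwi: (X, Y, Z, D) resampled diffusion volume in voxel space.
    streamline: (M, 3) voxel coordinates of equally-spaced points.
    Returns an (M, D) array of interpolated diffusion features."""
    streamline = np.asarray(streamline, dtype=np.float64)
    coords = streamline.T                      # shape (3, M), as expected by map_coordinates
    M, D = len(streamline), dwi.shape[-1]
    features = np.empty((M, D), dtype=np.float32)
    for d in range(D):
        # order=1 performs trilinear interpolation of one gradient volume
        features[:, d] = map_coordinates(dwi[..., d], coords, order=1, mode="nearest")
    return features
```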

2.1 Models

FFNN. The FFNN sees all streamline coordinates as individual, independent local data points. The output of the model is a 3-dimensional normalized vector. The model is represented in Fig. 1(a). To remove the directional ambiguity when no previous direction is given, we consider the output vector as an undirected axis instead of a direction. To this end, the loss function is defined as the negative squared cosine similarity.
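The axis-invariant objective can be written in a few lines; this is a minimal PyTorch sketch under our own naming, not the authors' implementation. Squaring the cosine similarity makes \(\varvec{d}\) and \(-\varvec{d}\) equivalent, i.e. the output is treated as an undirected axis.

```python
import torch
import torch.nn.functional as F

def neg_squared_cosine_loss(pred, target):
    """pred, target: (batch, 3) direction vectors (targets need not be normalized)."""
    cos = F.cosine_similarity(pred, target, dim=-1)   # values in [-1, 1]
    return -(cos ** 2).mean()                          # minimized when the axes are aligned
```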

RNN. The general idea behind the RNN is to model an internal state that is updated with each new observation in the input sequence and can be used to make predictions. Through its updatable internal state, the model can “remember” relevant features about the past. In this case, we used a Gated Recurrent Unit (GRU) [3] type of RNN.

Fig. 1.

Architecture of the proposed models. (a) Given a streamline \(\varvec{S}\), diffusion information is evaluated at each point \(P_i\) using trilinear interpolation (DWI(\(P_i\))). The resulting vector is provided to the FFNN to predict a direction \(\varvec{\widehat{d}}_i\) (orange), which is compared against its associated target direction \(\varvec{d}_i\) (green). (b) Unlike the FFNN, the RNN has recurrent connections at each step, allowing it to pass information to itself along the sequence. (c) Given a starting point \(P_0\), a new streamline \(\widehat{\varvec{S}}\) is generated by iteratively predicting a new direction \(\varvec{\widehat{d}}_i\) and feeding the estimated new position \(\widehat{P}_{i+1}\) back to the model. Note how the predicted direction \(\varvec{\widehat{d}}_i\) is influenced by prior information along the streamline through \(\varvec{h}_{j<i}\).

Figure 1(b) shows that for each point \(P_i\) in the streamline, the diffusion information (DWI(\(P_i\))) is used to update the internal state \(\varvec{h}_i\) of the model. From there, at each step along the streamline, the model predicts the direction to follow, \(\varvec{\widehat{d}}_i\). The loss function is defined as the mean squared error (MSE) between the model’s prediction \(\varvec{\widehat{d}}_i\) and the target \(\varvec{d}_i\) (i.e. the next normalized segment of the streamline).
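A minimal PyTorch sketch of such a sequence model is given below; the class name, layer sizes and batch layout are illustrative assumptions, not the exact architecture used in our experiments.

```python
import torch
import torch.nn as nn

class GRUTracker(nn.Module):
    """Maps the D-dimensional diffusion features along a streamline to a
    3-dimensional direction prediction at every step."""
    def __init__(self, d_in=100, hidden=500, layers=1):
        super().__init__()
        self.gru = nn.GRU(d_in, hidden, num_layers=layers, batch_first=True)
        self.head = nn.Linear(hidden, 3)

    def forward(self, x, h=None):
        out, h = self.gru(x, h)        # out: (batch, M, hidden) internal states h_i
        return self.head(out), h       # (batch, M, 3) predicted directions d_hat_i

# Example: 8 streamlines of 40 points; targets are the next normalized segments.
model = GRUTracker()
x = torch.randn(8, 40, 100)
targets = torch.randn(8, 40, 3)
d_hat, _ = model(x)
loss = nn.functional.mse_loss(d_hat, targets)
```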

2.2 Tractography

Tractography is performed by using a fully trained model. Streamline generation follows an iterative process, as in classical streamline-based tractography techniques [11], illustrated in Fig. 1(c). From a seed point \(P_0=(x_0,y_0,z_0)\), a new streamline is created with the initial seed \(\widehat{\varvec{S}}=\{P_0\}\). Next, the model is given the DWI data at the latest streamline coordinate \(P_i\) to obtain a predicted direction \(\varvec{\widehat{d}}_i\). The next point \(P_{i+1}=P_i+\alpha \varvec{\widehat{d}}_i\) is then computed, where \(\alpha \) is a chosen step size, as in standard streamline-based tractography algorithms. Points are iteratively generated until a stopping criterion is met (e.g. curvature too high, exiting the WM mask). The whole process is repeated as many times as required to produce a full tractogram.
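The loop below is a minimal sketch of this generation procedure; `model_step`, `features_at` and `stop` are hypothetical callables we introduce for illustration (they stand for the trained model, the DWI interpolation of Sect. 2 and the stopping criterion, respectively).

```python
import numpy as np

def generate_streamline(model_step, features_at, stop, seed, step_size, max_points=1000):
    """model_step(features, state) -> (direction, new_state); features_at(p) -> DWI(p)."""
    points = [np.asarray(seed, dtype=np.float64)]
    state = None                               # recurrent hidden state h (unused by the FFNN)
    for _ in range(max_points):
        d_hat, state = model_step(features_at(points[-1]), state)
        d_hat = d_hat / np.linalg.norm(d_hat)  # normalize the predicted direction
        points.append(points[-1] + step_size * d_hat)
        if stop(points):                       # e.g. curvature too high, left the WM mask
            break
    return np.asarray(points)
```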

3 Related Work

The work of Neher et al. [13] hypothesizes that tractography can be improved by considering local neighborhood features, adding a directional prior to promote straight fibers, and using a fiber deflection protocol to help the model recover from mistakes. More precisely, their model makes predictions based on a voting mechanism, using local direction proposals from multiple sample positions in the vicinity of the current location. Each direction proposal is obtained by a classification over 100 possible directions (weighted using the previous direction), along with a streamline termination probability. If fiber termination is the more likely option, a deflection is attempted by rotating the sample point 180\(^{\circ }\) around the previous direction and classifying a second time.

In our approach, the problem is framed as a regression task over normalized directions instead of a vote over discretized directions. To produce a prediction, fewer computations are thus needed at the output than computing and voting over many proposals. Regression also allows the model to output more precise directions and is therefore better suited to exploiting small variations in direction. In addition, if straight fibers are supported by the data, a directional prior should not be necessary: a deep learning model should be able to learn the right structure on its own, which is why our model does not include such a prior.

While Neher et al. consider the neighborhood of the current position, they do not consider the full evolution of the streamline up to each point. Our hypothesis is that there are high-order dependencies between the next direction of the streamline and all previous directions. Consequently, our recurrent approach provides a natural mechanism for integrating past information along the streamline to predict the next direction. These two approaches are not mutually exclusive, however, and would probably benefit from each other.

Finally, in a deep learning context, learning a stopping criterion along with the direction to follow is more complex. It would require careful engineering and balancing of the loss function so that one objective does not dominate the other during training, especially with a recurrent approach. This is beyond the scope of this paper and left for future work.

4 Experiments

We quantify the performance of our methods on the 2015 ISMRM challenge dataset [10] and evaluate it using the Tractometer connectivity metrics [4]. In doing so, we can compare ourselves to the 96 original challenge submissions [1]. We then qualitatively evaluate our method by tracking on in vivo data.

Tracking parameters: Tracking was done with a step size of one voxel (1.0 mm for the ISMRM challenge data, 0.625 mm for HCP). Seeding was done with 1 seed per voxel in the WM mask, and tracking was constrained to a dilated WM mask. All streamlines leaving the dilated mask were automatically terminated, and streamlines shorter than 20 mm or longer than 200 mm were discarded. Streamlines with a curvature higher than 20\(^{\circ }\) (half-cone angle between successive segments) were also discarded.
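For clarity, here is a minimal sketch of the length and curvature filters described above, assuming equally-spaced points and a step size in mm; the thresholds follow the text, the function name is ours.

```python
import numpy as np

def keep_streamline(points, step_size, min_len=20.0, max_len=200.0, max_angle_deg=20.0):
    """Return True if the streamline passes the length and curvature criteria."""
    length = step_size * (len(points) - 1)          # total length in mm
    if not (min_len <= length <= max_len):
        return False
    segs = np.diff(np.asarray(points, dtype=np.float64), axis=0)
    segs /= np.linalg.norm(segs, axis=1, keepdims=True)
    # angle between consecutive unit segments must stay under the threshold
    cos = np.clip((segs[:-1] * segs[1:]).sum(axis=1), -1.0, 1.0)
    return np.degrees(np.arccos(cos)).max() <= max_angle_deg
```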

4.1 ISMRM2015 Challenge

For the first experiment, we chose to reproduce the training environment of Neher et al. [13] by using training data generated specifically for the subject of interest. They determined the optimal method for generating their training data to be deterministic streamline tractography (DET) based on constrained spherical deconvolution (CSD), as implemented in MRtrix [14]. To stay consistent with their approach, a tractogram was generated using CSD-DET on a denoised and distortion-corrected version of the ISMRM2015 challenge dataset. The resulting 92 K streamlines were then split into training and validation sets (90%/10% split).

We trained models with one to four layers, varying the layer size between 500 and 1000, and used the Adam optimizer with early stopping. The full code is available online (see Footnote 1). For each type of model, the one with the lowest validation error was selected for tracking and Tractometer evaluation.
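A minimal sketch of the selection procedure (Adam with early stopping on validation loss) is shown below; the patience, epoch budget and the assumption that `model(x)` returns predicted directions are our own illustrative choices, not the exact training setup.

```python
import copy
import torch

def train_with_early_stopping(model, loss_fn, train_loader, valid_loader,
                              patience=10, max_epochs=500):
    opt = torch.optim.Adam(model.parameters())
    best_val, best_state, wait = float("inf"), None, 0
    for _ in range(max_epochs):
        model.train()
        for x, y in train_loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
        model.eval()
        with torch.no_grad():
            val = sum(loss_fn(model(x), y).item() for x, y in valid_loader)
        if val < best_val:                           # keep the best validation model
            best_val, best_state, wait = val, copy.deepcopy(model.state_dict()), 0
        else:
            wait += 1
            if wait >= patience:                     # stop when validation error plateaus
                break
    model.load_state_dict(best_state)
    return model
```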

We report in Table 1 the valid connection ratio and the associated number of valid bundles (true positives), the invalid connection ratio and the associated number of invalid bundles (false positives), and the volumetric bundle overlap and overreach, in percentages. Drawing conclusions from only one of these last two metrics can be misleading (e.g. a model can have both the best overlap and the worst overreach). These metrics are related to precision and recall measures and are combined into the \(\text {F}_1\)-measure. Note that the “_PD” model suffix indicates that the previous direction was given as input to the model. We report as baselines the mean ISMRM challenge results and submission 6_1 [1], a CSD-DET based method comparable to the one used to generate our training data.
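For reference, the combination uses the usual harmonic mean (stated here for completeness; the exact mapping of overlap and overreach onto recall- and precision-like quantities follows the Tractometer definition [4]): \(\text {F}_1 = \frac{2 \cdot \text {precision} \cdot \text {recall}}{\text {precision} + \text {recall}}\).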

We see that the local model (FFNN) is already competitive with the mean ISMRM challenge scores. Its ability to estimate the main diffusion axis and to tune its predictions according to the streamlines seen during training allows it to improve all mean scores except the number of invalid connections (+3.9 VC, +7.6 IC, −10 NC, +1.6 VB, −214 IB, +12.2 OL, −1.6 OR, +11.6 F1). Surprisingly, adding the previous direction as input worsened the model’s performance. We think that the optimization process allowed the model to achieve good performance simply by copying, most of the time, the previous direction given as input. Indeed, when looking at the generated streamlines, while they do a good job of covering the brain (the FFNN_PD model recovered 22 bundles out of 25), they are mostly straight and miss important connections.

Going from the local model to the recurrent model (RNN) provided different insights. Without the previous direction as input, the model generated a higher proportion of valid connections, but overall very few streamlines (as seen in the overlap metric, 7.7%). With the previous direction, however, the model achieved very good coverage of the challenge bundles (64.4% overlap), while dropping slightly below 50% VC. It achieved the best F1 score of all our models. In comparison, no submission to the ISMRM2015 challenge achieved an overlap higher than 50% while keeping overreach under 50% [1, 10]. Figure 2 shows how the left CST is reconstructed with high coverage and low overreach. We believe that the recurrent model, by accumulating “memories” about the past of the streamline, is able to exploit the previous direction without committing the same mistakes as the local model. This ability to “memorize” is what sets this model apart from classical methods.

Table 1. Quantitative evaluation on the ISMRM 2015 Tractography Challenge.
Fig. 2.

(a) Left CST generated by the RNN model using the ISMRM2015 challenge data. (b) Ground truth mask as defined by the ISMRM2015 challenge. (c) Overlapping and (d) overreaching voxels of the generated bundle with respect to the ground truth mask.

4.2 In Vivo Tracking

Using the models trained in the first experiment (Sect. 4.1), we tracked on an unseen brain (HCP subject #100307). As a gold standard, we used a virtual ROI-based dissection made by an expert neuroanatomist [2, 12]. Streamlines used for the dissection were generated using Particle Filtering Tractography [6] with default parameters, based on a multi-shell constrained spherical deconvolution reconstruction of spherical harmonics order 8 [5, 8]. The resulting bundles are shown in Fig. 3. Visual evaluation shows results in line with the first experiment. The local model does a good job of recovering the bundles, but has poor coverage. The recurrent model is much closer to the expert segmentation in most of the recovered bundles. We suspect that the RNN would gain even more by training on much larger datasets with multiple subjects.

Fig. 3.

Expert, FFNN and RNN bundles obtained in experiment 4.2, with bundles colored consistently across methods.

5 Conclusion

We propose the first deep learning alternatives to traditional local modeling approaches for tractography based on raw DWI. Our FFNN model provides a first performance baseline for local deep models. We also present a novel approach where the past of the streamline is taken into account by a recurrent model in order to make better predictions. Compared to the other ISMRM2015 submissions, ours proved to be the only technique able to recover more than 50% of spatial coverage while keeping overreaching false connections below 50%. We also show that deep learning models can generalize to new DWI data unseen at training time. These results show that deep learning is a promising approach to tractography.

While we believe that deep learning will be able to discover new pathways by learning global streamline structure, we still do not have enough accurate data to explore this avenue. In future work, as data become available, we plan to train on incomplete datasets (i.e. with one or more bundles removed) in order to assess the reconstruction and discovery capabilities of our models. Furthermore, we will explore how modifying the output of the RNN (e.g. predicting the parameters of a distribution) can improve the power of the model.