MRI white matter lesion segmentation using an ensemble of neural networks and overcomplete patch-based voting

https://doi.org/10.1016/j.compmedimag.2018.05.001Get rights and content

Highlights

  • A novel method to segment white matter hyperintense lesions on FLAIR images is presented.

  • The proposed method uses an overcomplete patch-wise labelling within an neural network ensemble-based classifier.

  • Comparison with state of the art methods show the competitive results in terms of accuracy and efficiency.

Abstract

Accurate quantification of white matter hyperintensities (WMH) from Magnetic Resonance Imaging (MRI) is a valuable tool for the analysis of normal brain ageing or neurodegeneration. Reliable automatic extraction of WMH lesions is challenging due to their heterogeneous spatial occurrence, their small size and their diffuse nature. In this paper, we present an automatic method to segment these lesions based on an ensemble of overcomplete patch-based neural networks. The proposed method successfully provides accurate and regular segmentations due to its overcomplete nature while minimizing the segmentation error by using a boosted ensemble of neural networks. The proposed method compared favourably to state of the art techniques using two different neurodegenerative datasets.

Introduction

White matter hyperintensities (WMH) are regions of increased MR signal in T2-Weighted (T2 W) and FLuid Attenuated Inversion Recovery (FLAIR) images that are distinct from cavitations (Wardlaw et al., 2013). The number, size and location of WMH can provide important information into the aetiology and progression of various diseases. This has been extensively studied in normal ageing, cerebrovascular disease, dementia (Kuo and Lipsitz, 2004; Debette and Markus, 2010) and its influence on co-morbidities (Lee et al., 2015). The presence, topography and volume of WMH is used as biomarkers for stroke (Kuller et al., 2004; Wong et al., 2002), small vessel cerebrovascular disease (CVD) (Schmidt et al., 2004), dementia (Debette and Markus, 2010) and in multiple sclerosis (MS) (Filippi and Rocca, 2011).

In clinical practice, qualitative visual rating scales have been frequently used (Scheltens et al., 1998). However, in order to use WMH volume and spatial location as a biomarker, lesions need to be accurately and precisely segmented. Some promising early-automated methods have been used in longitudinal clinical studies (Mäntylä et al., 1997), with later studies focused on improving the sensitivity, specificity and robustness of automated WMH segmentation. Manual and semi-automated segmentation of WMH is a tedious process requiring trained observers and several hours per image for manual delineation by an expert making it unsuitable for routine clinical and research usage (Udupa et al., 1997). Moreover, manual segmentation is prone to inter and intrarater variability.

With many large clinical studies investigating ageing, cerebrovascular disease, and dementia, there is a need for robust, repeatable, accurate, and automated techniques for the segmentation of WMH. In recent years, several methods have been proposed to automatically segment WMH in CVD and in MS. While the underlying pathology is different, the radiological signatures of MS and CVD are sufficiently similar that methods developed for one have good performance for the other (Caligiuri et al., 2015). Demyelinating lesions of MS and cerebrovascular disease appear as hyperintense regions on T2 W and FLAIR images. Initial approaches to segment of WMH relied on the higher intensity in lesions compared to surrounding tissue to threshold the image after correction for inhomogeneities (Jack et al., 2001; Souplet et al., 2008). The hyperintensity assumption is challenged by the natural variation in intensity found in normal tissues across the brain such as the septum pellucidum and CSF flow artefacts around the ventricles (Neema et al., 2010). Other problem includes residual intensity inhomogeneity, even after correction.

To address these issues, more complex methods have been proposed. These methods can be classified into unsupervised and supervised. Unsupervised methods rely on the natural separation of image features using clustering type approaches. For example, the lesion growth algorithm (LGA) publicly available as part of the lesion segmentation toolbox (LST) has been widely used (Schmidt et al., 2012). In this method, both T1W and FLAIR images are required to first compute a map of possible candidate lesions whose centres are then used as seeds to segment the entire lesions using region growing. Also included in the LST toolbox, a more recent method, the lesion prediction algorithm (LPA), only requires FLAIR images as input. Within the same category, (Weiss et al., 2013) proposed a dictionary learning-based approach that segments lesions as outliers from a projection of the dataset onto a normative dictionary. Similarly (Raniga et al., 2011) used a generative model to segment lesions by detecting outlier tissue. More classical unsupervised approaches have also been proposed (Admiraal-Behloul et al., 2005).

Supervised methods require training datasets where WMH lesions are manually annotated by experts. This type of methods can work on single channel (FLAIR or T2W) or multi-channel data (FLAIR or T2W and T1W and PDW). Supervised methods for WMH segmentation typically involve machine learning methods at a voxel level with pre and/or post processing steps to improve the sensitivity and specificity of the results. Such methods have used support vector machines (Lao et al., 2008), k-nearest neighbours (Steenwijk et al., 2013), random forests (Ithapu et al., 2014; Geremia et al., 2010: Jesson and Arbel, 2015), artificial neural networks (Dyrby et al., 2008), deep learning (Brosch et al., 2015; Ghafoorian et al., 2016; Valverde et al., 2017) or multiatlas patch-based label fusion methods (Guizard et al., 2015). All these methods were trained on either single or multi-channel voxel intensities jointly with some other context-related features and typically within a standardized anatomical space. Independently of the features used, these methods perform the classification step at the voxel level, and do not take into account label spatial correlations, which might affect their performance.

To overcome the lack of local consistency (i.e. each voxel is labelled independently of neighbour voxels) of the methods performing voxel-wise classification, we propose an automatic pipeline for hyperintense lesion segmentation based on the use of patch-wise neural network classifier that segments the lesions taking in consideration patch labels local context in an overcomplete manner which further reduces classification errors. This pipeline benefits from some pre-processing steps aimed to improve the image quality and to locate it in a standardized geometrical and intensity space. The proposed method which extends a previous method recently published (Manjón et al., 2016) uses a boosting based ensemble learning strategy to minimize the classification error. In the following sections, the proposed method is described and compared to manual assessment and two state-of-the-art methods. This comparison is performed on data from two datasets.

Section snippets

AIBL dataset

In this work, we used a set of 128 subjects (including a wide range of white matter lesion severity, aged 38.6–92.1, male/female: 60/68) from the Australian Imaging Biomarkers and Lifestyle (AIBL) study (www.aibl.csiro.au) (Ellis et al., 2009). FLAIR scans were acquired for all the subjects on a 3T Siemens Magnetom TrioTim scanner using the following parameters: TR/TE: 6000/421 ms, flip angle: 120°, TI: 2100 ms, slice thickness: 0.90 mm, image matrix: 256 × 240, in-plane spacing: 0.98 mm. The

Experiments and results

All experiments were performed using MATLAB 2015a and its neural network toolbox on a standard PC (intel i7-6700 and 16 GB RAM) running Windows 10.

Discussion

In this paper, we have presented a new method to segment hyperintense lesions on FLAIR images based on an ensemble of overcomplete patch-wise neural network classifiers. We have shown that the proposed overcomplete patch-wise approach significantly improved the voxel-wise network by enforcing the regularity of the segmentations and by minimizing the variance of the classification error due to the aggregation of many patch contributions. We used a boosting strategy to combine an ensemble of

Conclusion

We have proposed a simple yet effective method to segment white matter hyperintense lesion on FLAIR images. The proposed method benefited from its overcomplete patch-based nature and boosting approach to provide regular and accurate segmentations. The proposed method compared favourably with many state-of-the-art methods in two different MRI datasets and can be a good choice to perform large-scale brain analysis studies.

Acknowledgements

This research has been done thanks to the Australian distinguished visiting professor grant from the CSIRO (Commonwealth Scientific and Industrial Research Organisation) and the Spanish “Programa de apoyo a la investigación y desarrollo (PAID-00-15)” of the Universidad Politécnica de Valencia. This research was partially supported by the Spanish grant TIN2013-43457-R from the Ministerio de Economia y competitividad. This study has been carried out also with support from the French State,

References (42)

  • H. Wang et al.

    A learning-based wrapper method to correct systematic errors in automatic image segmentation: consistently improved performance in hippocampus, cortex and brain segmetnation

    NeuroImage

    (2011)
  • B. Avants et al.

    Advanced Normalization Tools: V1.0

    (2009)
  • Leo Breiman

    Bagging Predictors. Technical Report 421

    (1994)
  • T. Brosch et al.

    Deep Convolutional Encoder Networks for Multiple Sclerosis Lesion Segmentation. MICCAI 2015, Volume 9351 of the Series Lecture Notes in Computer Science

    (2015)
  • M.E. Caligiuri et al.

    Automatic detection of white matter hyperintensities in healthy aging and pathology using magnetic resonance imaging: a review

    Neuroinformatics

    (2015)
  • S. Debette et al.

    The clinical importance of white matter hyperintensities on brain magnetic resonance imaging: systematic review and meta-analysis

    Br. Med. J.

    (2010)
  • J. Dolz et al.

    Deep CNN Ensembles and Suggestive Annotations for Infant Brain MRI Segmentation

    (2017)
  • K.A. Ellis et al.

    The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer's disease

    Int Psychogeriatr. 2009

    (2009)
  • M. Filippi et al.

    MR imaging of multiple sclerosis

    Radiology

    (2011)
  • M. Ghafoorian et al.

    Transfer learning for domain adaptation in MRI: application in brain lesion segmentation

    Med. Phys.

    (2016)
  • E. Geremia et al.

    Spatial decision forests for MS lesion segmentation in multi-channel MR images

  • Cited by (32)

    • Cerebral microbleeds in dementia with Lewy bodies

      2022, Parkinsonism and Related Disorders
      Citation Excerpt :

      The number of CMB in each of these locations and the total number of CMB were recorded. FLAIR images were used to automatically segment white matter hyperintensities (WMH) using the HyperIntensity Segmentation Tool (HIST), which is based on an ensemble of neural network classifiers [9]. WMH segmentations were then visually examined for quality assurance, and manual edits were made accordingly.

    • Automatic segmentation of white matter hyperintensities from brain magnetic resonance images in the era of deep learning and big data – A systematic review

      2021, Computerized Medical Imaging and Graphics
      Citation Excerpt :

      The proposed method was cross validated with leave-one-subject-out evaluation in six studies (Stone et al., 2016; Moeskops et al., 2018; Jiang et al., 2018; Knight et al., 2018; Li et al., 2018; Wu et al., 2019a). Fourteen studies described a method for false positive removal (Sudre et al., 2017; Manjón et al., 2018; Stone et al., 2016; Moeskops et al., 2018; Rincón et al., 2017; Bowles et al., 2017; Li et al., 2018; Ghafoorian et al., 2017; Roy et al., 2015; Zhan et al., 2017; Hong et al., 2020; Fiford et al., 2020; Rachmadi et al., 2020; Ding et al., 2020). Fazekas visual rating scale was used in the validation of the results in nine studies (Rachmadi et al., 2018; Qin et al., 2018; Guerrero et al., 2017; Griffanti et al., 2016; Ling et al., 2018; Moeskops et al., 2018; Jiang et al., 2018; Bowles et al., 2017; Rachmadi et al., 2020).

    • Automatic labeling of cortical sulci using patch- or CNN-based segmentation techniques combined with bottom-up geometric constraints

      2020, Medical Image Analysis
      Citation Excerpt :

      The UNET method labels a brain in only twenty seconds, whereas the SPAM method takes about ten minutes. It is interesting to note that patch MAS approaches are also beginning to integrate deep learning techniques (Manjón et al., 2018), probably due to their ability to effectively summarize the data and for their rapidity of execution. Furthermore, the top-down refinement of bottom-up regularization significantly improves the results.

    View all citing articles on Scopus
    View full text