Reliability of brain volume measurements: A test-retest dataset

Maclaren, Julian; Han, Zhaoying; Vos, Sjoerd B; Fischbein, Nancy; Bammer, Roland

doi:10.1038/sdata.2014.37

Download PDF

Data Descriptor
Open access
Published: 14 October 2014

Reliability of brain volume measurements: A test-retest dataset

Julian Maclaren¹,
Zhaoying Han¹,
Sjoerd B Vos^1,2,
Nancy Fischbein¹ &
…
Roland Bammer¹

Scientific Data volume 1, Article number: 140037 (2014) Cite this article

15k Accesses
88 Citations
16 Altmetric
Metrics details

Subjects

Abstract

Evaluation of neurodegenerative disease progression may be assisted by quantification of the volume of structures in the human brain using magnetic resonance imaging (MRI). Automated segmentation software has improved the feasibility of this approach, but often the reliability of measurements is uncertain. We have established a unique dataset to assess the repeatability of brain segmentation and analysis methods. We acquired 120 T1-weighted volumes from 3 subjects (40 volumes/subject) in 20 sessions spanning 31 days, using the protocol recommended by the Alzheimer's Disease Neuroimaging Initiative (ADNI). Each subject was scanned twice within each session, with repositioning between the two scans, allowing determination of test-retest reliability both within a single session (intra-session) and from day to day (inter-session). To demonstrate the application of the dataset, all 3D volumes were processed using FreeSurfer v5.1. The coefficient of variation of volumetric measurements was between 1.6% (caudate) and 6.1% (thalamus). Inter-session variability exceeded intra-session variability for lateral ventricle volume (P<0.0001), indicating that ventricle volume in the subjects varied between days.

Design Type(s)	observation design • reference design • replicate design • time series design
Measurement Type(s)	structural magnetic resonance imaging
Technology Type(s)	Magnetic Resonance Imaging
Factor Type(s)	Session
Sample Characteristic(s)	Homo sapiens • brain

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Examining the identification of age-related atrophy between T1 and T1 + T2-FLAIR cortical thickness measurements

Article Open access 02 August 2019

Heidi Lindroth, Veena A. Nair, … Robert D. Sanders

Structural and functional multi-platform MRI series of a single human volunteer over more than fifteen years

Article Open access 31 October 2019

Simon Duchesne, Louis Dieumegarde, … Olivier Potvin

A predictive model using the mesoscopic architecture of the living brain to detect Alzheimer’s disease

Article Open access 20 June 2022

Marianna Inglese, Neva Patel, … Eric O. Aboagye

Background & Summary

Quantitative measurement of the volume of human brain structures using magnetic resonance imaging (MRI) may have application in the monitoring of neurodegenerative disease progression or the assessment of general structural abnormalities^1–3. Traditionally, volumetric data for this purpose have been obtained using time-consuming manual segmentation methods. Developments in freely available automated segmentation software, however, have greatly improved the practicality of the process^4–8. A further recent development has been the introduction of standardized imaging protocols and online databases containing structural MRI brain data from both patients and normal controls. One notable example is the database and set of protocols made available by the MRI core of the Alzheimer’s Disease Neuroimaging Initiative (ADNI)⁹.

The combination of freely available data and automatic segmentation software has the potential to make volumetric brain MRI an indispensible tool for both research and clinical medicine. It is now possible to compare quantitative brain volume measurements from a patient with control data from databases of normal subjects within only a few minutes¹⁰. Since brain volume measurements are a known biomarker for neurodegenerative diseases, such as Alzheimer’s disease^2,11, this general approach may have great clinical value as a screening tool. Performing volumetric measurements at multiple time points in the same patient may also be a powerful means of measuring changes in the brain. Recent longitudinal results have shown that atrophy in medial temporal lobe (MTL) structures is predictive of subsequent memory decline in elderly subjects¹². Additionally, performing longitudinal MRI studies improves predictive power in assessing the prognosis of individuals with mild cognitive impairment (MCI)¹³, and predicting the development of Alzheimer’s disease in non-demented subjects¹⁴ as well as MCI patients¹⁵.

We have begun to investigate the feasibility of employing the above approach at our institution. However, clinical implementation requires thorough validation, and one existing cause for concern regarding this methodology is that the repeatability (i.e., test-retest reliability) of individual measurements is still largely unknown. This prevents robust comparison of a single patient measurement against those in a database of normal subjects. It also confounds the analysis of longitudinal measurements from an individual, since it is crucial to know whether any apparent trend is simply due to measurement variation. That is, without knowing the fluctuations of our measurements in a single subject, we cannot be certain what disease effects we can detect. A number of confounding factors can be controlled, such as the scanner, imaging protocol, and software version used for data processing. MRI data are inherently noisy, however, and effects such as patient positioning and field of view selection are not constant between scans. There is also evidence of possible day-to-day variations in the brain structures of interest, for example due to hydration levels^16,17 or the menstrual cycle¹⁸.

The goal of this study was to generate a publicly available test dataset to assist in the validation of the repeatability of current and future segmentation methods. We believe that this will be of great utility to other researchers who may not have the imaging facilities or resources to collect such a dataset. To demonstrate the utility of this dataset, we use it to obtain basic test-retest reliability information about the volume data corresponding to brain regions of clinical interest. This includes an analysis to test the hypothesis that test-retest reliability (repeatability) of measurements can be measured using multiple scans in a single session (intra-session) and that changes from day to day (inter-session) do not affect repeatability.

Methods

An experiment was designed to allow separate calculation of intra- and inter-session test-retest reliability (Figure 1). Human data collection was performed with the approval of the Stanford University Institutional Review Board (IRB) and all subjects gave their written informed consent. A total of 120 T1-weighted volumes were acquired from 3 subjects (40 scans/subject). Each subject was scanned twice on twenty different days within a 31-day period. Subjects were repositioned between the two scans in each session, and reregistered on the scanner console, so that all scans were treated as separate measurements (with a resulting break of ~5 min between scans). This ensured that confounding effects such as subject positioning were consistent between and within sessions.

We used a GE MR750 3T scanner (software version DV22.0_V02_1122.a, XRMB gradient set) and the ADNI-recommended T1-weighted imaging protocol for this system (accelerated sagittal 3D IR-SPGR, 27 cm FOV, 256 x 256 matrix slice, 1.2 mm slice thickness, standard 8-channel phased array head coil, TR: 7.3 ms, TE: 3 ms, TI: 400 ms, flip angle: 11 deg., acquisition time: 5 min 37 s). We chose to use the ADNI protocol⁹ because it exhibits excellent contrast between gray and white matter and has become standardized, so it is relevant to the research community.

Other data recorded throughout the study included subject weight, time of day, and phantom data for quality assurance and scanner stability. Both the ADNI phantom¹⁹ and the ACR (American College of Radiology) phantom were used for this purpose.

Data Records

Following the guidelines of our IRB, written informed consent was obtained from each subject allowing their data to be shared online. The test-retest dataset is available through FigShare (Data Citation 1). It consists of a single zip file containing the following:

A folder containing the brain imaging data, comprising 120 data files, separated by subject number into three folders. Each file contains an anonymized MRI brain volume in NifTI-1 format. Note that these files have not been processed by FreeSurfer in any way and are not corrected for B1 inhomogeneity or otherwise modified. Conversion from DICOM format was performed using a publicly available and often-used converter, known as ‘dcm2nii’, which is available online at http://www.mccauslandcenter.sc.edu/mricro/mricron/dcm2nii.html, together with extensive documentation. More information about the NIfTI file format is available from the Neuroimaging Informatics Technology Initiative webpage at http://nifti.nimh.nih.gov/.
A text file (‘anonymized_dicom_header.txt’) containing the anonymized DICOM header from Subject 2. Researchers can use this file to obtain any extra information about the scan protocol that is not adequately described in this manuscript.
Three files in Microsoft Excel format, indicating the day and time each image volume was acquired from each subject, the weight of each subject at the time of each scan, and the age and gender of each subject.
DICOM and NifTI files corresponding to 3D scans of the ADNI phantom and ACR phantom.

Technical Validation

All 3D volumes were visually inspected at the time of acquisition to check for severe motion corruption or any other potentially confounding artifacts. No significant or unexpected artifacts were observed, and so no scans were repeated. Regular QA tests on the scanner did not indicate any performance issues during the 31-day period of the study. To further validate the technical quality of the dataset, and to demonstrate its utility, we processed and analyzed the data using the methods described in this section.

Data processing

All 3D volumes were processed using FreeSurfer^6–8 (freely available online at http://surfer.nmr.mgh.harvard.edu/), which provides quantitative volume data for a range of brain structures. We ran FreeSurfer without user intervention (‘recon-all –i data1.dcm –all’), since this is the mode of operation that would be used in an automated pipeline processing patient data. The computing hardware, operating system (Ubuntu 12.04.1) and FreeSurfer version (v5.1) were kept constant for the duration of this study, in line with recommendations made by Gronenschild et al.²⁰, who showed that these factors can affect results. We chose to restrict our analysis to the lateral ventricle volumes as well as a select group of subcortical structures of particular interest to us, including the hippocampus, amygdala, putamen, pallidum, caudate and thalamus. In all cases, volumes from the left and right hemispheres were summed together to generate a single value for each structure.

Data analysis

We applied the following statistical analysis to the FreeSurfer output to assess variability for measurements obtained together on the same day (intra-session) and measurements from day to day (inter-session). Paired acquisitions allowed the intra-session variability to be computed using the expression for standard deviation from paired data²¹. The following analysis was applied to each brain structure of interest and for each subject. Denoting the standard deviation of intra-session measurements by σ_s, then

σ_{s} = \sqrt{\sum {(x_{i}^{'} - x_{i}^{''})}^{2} / 2 m},

where $x_{i}^{'}$ and $x_{i}^{''}$ are the ith paired measurements over the m pairs (or sessions), evaluated separately for each subject and assuming that the variance is the same for all samples. Intra-session variability reflects manual repositioning differences, noise and segmentation errors, but not biological variations occurring from day to day. The total standard deviation, σ_t, is computed across all 40 measurements from each subject.

In each case, the standard deviation is then converted to coefficient of variation (CV) expressed as a percentage, namely

\begin{array}{l} C V_{s} = \frac{σ_{s}}{\bar{x}} \times 100 \\ C V_{t} = \frac{σ_{t}}{\bar{x}} \times 100 \end{array}

where CV_s is the intra-session coefficient of variation, CV_t is the total coefficient of variation, and $\bar{x}$ is the arithmetic mean of all 40 volume values for the given structure and subject. We use the coefficient of variation since the normalization by the mean volume is somewhat more intuitive than absolute standard deviations. This is particularly the case when comparing between the segmentation repeatability of different structures, which may have very different volumes. Finally, CV_s and CV_t data from each subject were pooled by taking the root-mean-square average, as proposed by Gluer et al.²², since taking the arithmetic mean directly is not valid.

A Monte Carlo permutation test²³ was used to evaluate observed differences between CV_t and CV_s for each structure under the null hypothesis of ‘no difference’, i.e., that CV_s=CV_t. In brief, the scan order of all 40 volumes for each subject is randomly permuted N times. The above analysis is repeated for each permutation and the value obtained for |CV_s−CV_t| is recorded. The two-sided P-value is then calculated as the proportion of the N permutations where the absolute difference |CV_s−CV_t| was equal to or greater than the value obtained with the measured data. In this work, we used N=100,000.

Data analysis results

Figure 2 shows typical segmentation results obtained using FreeSurfer. The volumetric data segmented in this example were acquired from Subject 2 on Day 1 of the study. Despite generally good segmentation accuracy, minor differences are visible between the two scans from one session. This is to be expected, since the input data differ slightly due to noise, subject positioning, and other confounding factors.

**Figure 2: Example FreeSurfer segmentation.**

Quantitative differences between segmentation volumes are shown in Figure 3 for several brain structures from Subject 2. For the thalamus, caudate, pallidum and hippocampus, the differences between each scan visually appear to be random. However, in the case of the lateral ventricles it is evident that inter-session variance exceeds intra-session variance, since values obtained on the same day (neighboring bars in the plot) are typically closer in magnitude to each other than values obtained on different days.

**Figure 3: Scan-to-scan volume differences.**

Results from the numerical analysis show total coefficient of variation values between 1.6% (caudate) and 6.1% (thalamus) for the structures analyzed (Table 1). The mean structure volume appears to have little influence on the coefficient of variation value. The lateral ventricles are the only structures showing a statistically significant difference between intra- and inter-session variability. This difference is also substantial: accounting for variance between sessions more than doubles the total coefficient of variation for lateral ventricle volume from 1.6% to 3.4%.

Table 1 Measurement variation by structure.

Full size table

Based on the results shown in Table 1, one might expect that true ventricle size changes are occurring between days. To investigate this further, registered volumes were compared directly. Figure 4 shows three orthogonal slices from Subject 2, acquired at the time of the maximum (Day 7, Measurement 2, shown in top row) and minimum (Day 18, Measurement 1, shown in bottom row) lateral ventricle volume. The reduced ventricle size is visible, particularly when the images are viewed dynamically (see video available through FigShare (Data Citation 1)).

**Figure 4: Apparent ventricle volume changes.**

In order to explore the apparent change in ventricle size, we investigated the relationship between the lateral ventricle volumes and (a) subject weight at the time of each scan, (b) image signal-to-noise ratio (SNR), and (c) time of day of scan (Figure 5). No correlation with weight or image SNR (Supplementary Figure 1) was observed, but in the case of time of day, a statistically significant positive correlation (r=0.66, P=0.0016) was observed for Subject 3, but not for Subjects 1 or 2.

**Figure 5: Lateral ventricle volume vs time of day.**

Usage Notes

The analysis and results presented in this work demonstrate how the test-retest dataset can be used to determine the reliability of brain volume measurements. The coefficient of variation (CV) values shown in Table 1 indicate the repeatability of the volumetric measurements obtained using our FreeSurfer-based pipeline. The CV values shown (from 0.9% to 6.1%, depending on the structure) could be used for effect size calculations for volumes of individual structures in a study using the same processing pipeline. However, we did not attempt to perform a comprehensive valuation of FreeSurfer itself, or to compare its output to that from other software packages or manual segmentation, as the reliability of FreeSurfer has already been evaluated on numerous occasions^24–26. We also limited our analysis to volumes of the lateral ventricles, cortical white matter, and the sub-cortical structures of particular interest to us. We hope that researchers developing alternative software packages will use the test-retest dataset to evaluate and compare their processing methods in brain regions of interest to them.

The main unique aspect of our test-retest dataset is the large number of scans performed on each subject over a short period (one month) and that two back-to-back scans were performed within each session, with the patient repositioned between each scan. This can be used to separate intra- from inter-session variation, using the methods demonstrated in this work. In particular, the data indicate that variance between days exceeds variance within a session for lateral ventricle volume and that this is due to actual volume changes of the ventricles between days, rather than measurement errors.

The observed ventricle volume changes may be due to subject hydration effects, since dehydration can cause shrinkage of brain tissue and enlargement of ventricle volume^16,17. But unlike these previous studies, our subjects were not required to deliberately become dehydrated or to consume excess fluid, and we did not observe a reduction in body mass correlating with ventricular volume. We refrained from using a hydration protocol, as it is not standard practice to subject patients to such a measure for routine imaging. Despite this, the maximum observed ventricle volume changes in our study were of similar magnitude to that found in other studies where subjects underwent a ‘thermal-exercise dehydration protocol’^17,27. It is therefore interesting to note that these effects also occur in normal subjects, and it is possible that other confounding factors are affecting water balance in the brain. The time of day did not appear to be a strong predictor of this effect (Figure 5). Regardless of the cause, this effect may need to be considered when looking for signs of ventricular enlargement using MR-based methods such as VIENA²⁸. It is unclear whether the effect would be more pronounced in older subjects where ventricles are larger. While larger ventricles are potentially easier to segment, leading to less error in the volume estimates, absolute volume changes from day to day would lead to a smaller percentage change.

Although the lateral ventricles were the only structures where we were able to reject the null hypothesis of identical intra- and inter-session variance, a ‘conservation of volume’ assumption within the skull would suggest that volume changes should occur in other brain structures to compensate for the ventricle volume gain or loss. Perhaps future advances in segmentation methods will allow these changes to be detected in this dataset. Nonetheless, the data indicate that changes from day to day are likely to be very small for structures other than the ventricles.

Regular measurements of the QA phantoms and other standard scanner QA measurements gave no evidence of scanner-related variation over the 31-day period. We have also not observed any changes in the brain volume data that would indicate this was a limiting factor in the study. A more important limitation is that the subjects (healthy, age 26–31) are not representative of the general population or any specific patient group. With only three subjects, it is also possible that the inter-scan variability does not hold for a larger population. Nonetheless, the data are sufficient to perform basic test-retest reliability analysis, which was the goal of the work.

Our experimental protocol differed significantly from that used in previous studies examining test-retest reliability^29,30, since our subjects were imaged many times, rather than on only two occasions. While we specifically aimed at comparing intra- and inter-session variation, previous work has largely focused on the influence of confounding factors such as pulse sequence, imaging geometry, parallel imaging, scanner drifts and choice of segmentation software. We have collected a unique dataset, complimentary to existing studies, which we hope will become a valuable tool to assess repeatability of segmentation methods.

Additional information

How to cite this article: Maclaren, J. et al. Reliability of brain volume measurements: A test-retest dataset. Sci. Data 1:140037 doi: 10.1038/sdata.2014.37 (2014).

References

Jack, C. R. Jr et al. Medial temporal atrophy on MRI in normal aging and very mild Alzheimer's disease. Neurology 49, 786–794 (1997).
Article Google Scholar
Dubois, B. et al. Research criteria for the diagnosis of Alzheimer's disease: revising the NINCDS-ADRDA criteria. Lancet Neurology 6, 734–746 (2007).
Article Google Scholar
Jack, C. R. Jr et al. Rates of hippocampal atrophy correlate with change in clinical status in aging and AD. Neurology 55, 484–489 (2000).
Article Google Scholar
Kovacevic, S., Rafii, M. S. & Brewer, J. B. High-throughput, fully automated volumetry for prediction of MMSE and CDR decline in mild cognitive impairment. Alzheimer Dis. Assoc. Disord. 23, 139–145 (2009).
Article Google Scholar
Brewer, J. B., Magda, S., Airriess, C. & Smith, M. E. Fully-automated quantification of regional brain volumes for improved detection of focal atrophy in Alzheimer disease. AJNR Am. J. Neuroradiol. 30, 578–580 (2009).
Article CAS Google Scholar
Dale, A. M., Fischl, B. & Sereno, M. I. Cortical surface-based analysis. I. Segmentation and surface reconstruction. Neuroimage 9, 179–194 (1999).
Article CAS Google Scholar
Fischl, B. FreeSurfer. NeuroImage 62, 774–781 (2012).
Article Google Scholar
Fischl, B. et al. Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron 33, 341–355 (2002).
Article CAS Google Scholar
Jack, C. R. Jr et al. The Alzheimer's Disease Neuroimaging Initiative (ADNI): MRI methods. J. Magn. Reson. Imaging 27, 685–691 (2008).
Article Google Scholar
Brewer, J. B. Fully-automated volumetric MRI with normative ranges: translation to clinical practice. Behavioural Neurology 21, 21–28 (2009).
Article CAS Google Scholar
Schuff, N. et al. MRI of hippocampal volume loss in early Alzheimer's disease in relation to ApoE genotype and biomarkers. Brain 132, 1067–1077 (2009).
Article CAS Google Scholar
Murphy, E. A. et al. Six-month atrophy in MTL structures is associated with subsequent memory decline in elderly controls. Neuroimage 53, 1310–1317 (2010).
Article CAS Google Scholar
McEvoy, L. K. et al. Mild cognitive impairment: baseline and longitudinal structural MR imaging measures improve predictive prognosis. Radiology 259, 834–843 (2011).
Article Google Scholar
Sluimer, J. D. et al. Accelerating regional atrophy rates in the progression from normal aging to Alzheimer's disease. European Radiology 19, 2826–2833 (2009).
Article Google Scholar
Karas, G. et al. Amnestic mild cognitive impairment: structural MR imaging findings predictive of conversion to Alzheimer disease. AJNR Am. J. Neuroradiol. 29, 944–949 (2008).
Article CAS Google Scholar
Duning, T. et al. Dehydration confounds the assessment of brain atrophy. Neurology 64, 548–550 (2005).
Article CAS Google Scholar
Kempton, M. J. et al. Dehydration affects brain structure and function in healthy adolescents. Hum. Brain Mapp. 32, 71–79 (2011).
Article Google Scholar
Hagemann, G. et al. Changes in brain size during the menstrual cycle. PLoS ONE 6, e14655 (2011).
Article ADS CAS Google Scholar
Gunter, J. L. et al. Measurement of MRI scanner performance with the ADNI phantom. Medical Physics 36, 2193–2205 (2009).
Article ADS Google Scholar
Gronenschild, E. H. et al. The effects of FreeSurfer version, workstation type, and Macintosh operating system version on anatomical volume and cortical thickness measurements. PLoS ONE 7, e38234 (2012).
Article ADS CAS Google Scholar
Massart, D. L. et al. Handbook of Chemometrics and Qualimetrics: Part A (Elsevier Science Pub Co., 1997).
Google Scholar
Gluer, C. C. et al. Accurate assessment of precision errors: how to measure the reproducibility of bone densitometry techniques. Osteoporosis Int. 5, 262–270 (1995).
Article CAS Google Scholar
Good, P . Permutation, Parametric and Bootstrap Tests of Hypotheses (Springer, 2005).
MATH Google Scholar
Tae, W. S., Kim, S. S., Lee, K. U., Nam, E. C. & Kim, K.W. Validation of hippocampal volumes measured using a manual method and two automated methods (FreeSurfer and IBASPM) in chronic major depressive disorder. Neuroradiology 50, 569–581 (2008).
Article Google Scholar
Jovicich, J. et al. MRI-derived measurements of human subcortical, ventricular and intracranial brain volumes: Reliability effects of scan sessions, acquisition sequences, data analyses, scanner upgrade, scanner vendors and field strengths. Neuroimage 46, 177–192 (2009).
Article Google Scholar
Lehmann, M. et al. Atrophy patterns in Alzheimer's disease and semantic dementia: a comparison of FreeSurfer and manual volumetric measurements. NeuroImage 49, 2264–2274 (2010).
Article Google Scholar
Kempton, M. J. et al. Effects of acute dehydration on brain morphology in healthy humans. Hum. Brain Mapp. 30, 291–298 (2009).
Article Google Scholar
Vrenken, H. et al. Validation of the automated method VIENA: An accurate, precise, and robust measure of ventricular enlargement. Hum. Brain Mapp. 35, 1101–1110 (2013).
Article Google Scholar
Wonderlick, J. S. et al. Reliability of MRI-derived cortical and subcortical morphometric measures: effects of pulse sequence, voxel geometry, and parallel imaging. Neuroimage 44, 1324–1333 (2009).
Article CAS Google Scholar
Morey, R. A. et al. Scan-rescan reliability of subcortical brain volumes derived from automated segmentation. Hum. Brain Mapp. 31, 1751–1762 (2010).
PubMed PubMed Central Google Scholar

Data Citations

Maclaren, J., Han, Z., Vos, S. B., Fischbein, N., & Bammer, R. FigShare https://doi.org/10.6084/m9.figshare.929651 (2014)

Download references

Acknowledgements

We thank Christoph Seeger, Alexander Brost and Norbert Schuff for valuable discussions and Andrew Gentles for his advice concerning the statistical methods used in this work. We would also like to acknowledge the generosity of the authors of FreeSurfer (Anders Dale, Bruce Fischl and the entire development team) for making their software freely available to the public.

Author information

Authors and Affiliations

Department of Radiology, Center for Quantitative Neuroimaging, Stanford University, Stanford, 94305, California, USA
Julian Maclaren, Zhaoying Han, Sjoerd B Vos, Nancy Fischbein & Roland Bammer
Image Sciences Institute, University Medical Center Utrecht, Utrecht, 3584 CX, The Netherlands
Sjoerd B Vos

Authors

Julian Maclaren
View author publications
You can also search for this author in PubMed Google Scholar
Zhaoying Han
View author publications
You can also search for this author in PubMed Google Scholar
Sjoerd B Vos
View author publications
You can also search for this author in PubMed Google Scholar
Nancy Fischbein
View author publications
You can also search for this author in PubMed Google Scholar
Roland Bammer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.M. co-designed the study, performed the imaging experiments, analyzed the data, and wrote the manuscript with contributions from all authors. Z.H. provided advice on the study design, performed the imaging experiments, processed the data in FreeSurfer, and contributed to the manuscript. S.B.V. co-designed the study, performed the imaging experiments, formatted the submitted data, and contributed to the manuscript. N.F. provided advice on the study design and contributed to the manuscript. R.B. provided advice on the study design, managed the project, and contributed to the manuscript.

Corresponding author

Correspondence to Julian Maclaren.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

ISA-Tab metadata

Supplementary information

Supplementary Figure 1 (PDF 51 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc/4.0/ Metadata associated with this Data Descriptor is available at http://www.nature.com/sdata/ and is released under the CC0 waiver to maximize reuse.

Reprints and permissions

About this article

Cite this article

Maclaren, J., Han, Z., Vos, S. et al. Reliability of brain volume measurements: A test-retest dataset. Sci Data 1, 140037 (2014). https://doi.org/10.1038/sdata.2014.37

Download citation

Received: 21 March 2014
Accepted: 02 September 2014
Published: 14 October 2014
DOI: https://doi.org/10.1038/sdata.2014.37

This article is cited by

Bildgebende Diagnostik und der Einsatz von künstlicher Intelligenz beim Management von Organmetastasen
- Nithya Bhasker
- Felix Schön
- Stefanie Speidel
Die Onkologie (2023)
A longitudinal multi-scanner multimodal human neuroimaging dataset
- Colin Hawco
- Erin W. Dickie
- Aristotle N. Voineskos
Scientific Data (2022)
Effect of head motion-induced artefacts on the reliability of deep learning-based whole-brain segmentation
- Péter Kemenczky
- Pál Vakli
- Zoltán Vidnyánszky
Scientific Reports (2022)
Anthropomorphic brain phantoms for use in MRI systems: a systematic review
- Noelle Crasto
- Abirami Kirubarajan
- Dafna Sussman
Magnetic Resonance Materials in Physics, Biology and Medicine (2022)
Brain volume refinement (BVeR): automatic correction tool as an alternative to manual intervention on brain segmentation
- Antonio Carlos da Silva Senra Filho
- Fabrício Henrique Simozo
- Luiz Otávio Murta Junior
Research on Biomedical Engineering (2021)

Subjects

Abstract

Similar content being viewed by others

Background & Summary

Methods

Data Records

Technical Validation

Data processing

Data analysis

Data analysis results

Usage Notes

Additional information

References

References

Data Citations

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

ISA-Tab metadata

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links