Performance changes due to differences in training data for cerebral aneurysm detection in head MR angiography images

Nomura, Yukihiro; Hanaoka, Shouhei; Nakao, Takahiro; Hayashi, Naoto; Yoshikawa, Takeharu; Miki, Soichiro; Watadani, Takeyuki; Abe, Osamu

doi:10.1007/s11604-021-01153-1

Performance changes due to differences in training data for cerebral aneurysm detection in head MR angiography images

Original Article
Published: 14 June 2021

Volume 39, pages 1039–1048, (2021)
Cite this article

Japanese Journal of Radiology Aims and scope Submit manuscript

Yukihiro Nomura ORCID: orcid.org/0000-0001-6471-9936¹,
Shouhei Hanaoka²,
Takahiro Nakao¹,
Naoto Hayashi¹,
Takeharu Yoshikawa¹,
Soichiro Miki¹,
Takeyuki Watadani² &
…
Osamu Abe²

6 Citations
Explore all metrics

Abstract

Purpose

The performance of computer-aided detection (CAD) software depends on the quality and quantity of the dataset used for machine learning. If the data characteristics in development and practical use are different, the performance of CAD software degrades. In this study, we investigated changes in detection performance due to differences in training data for cerebral aneurysm detection software in head magnetic resonance angiography images.

Materials and methods

We utilized three types of CAD software for cerebral aneurysm detection in MRA images, which were based on 3D local intensity structure analysis, graph-based features, and convolutional neural network. For each type of CAD software, we compared three types of training pattern, which were two types of training using single-site data and one type of training using multisite data. We also carried out internal and external evaluations.

Results

In training using single-site data, the performance of CAD software largely and unpredictably fluctuated when the training dataset was changed. Training using multisite data did not show the lowest performance among the three training patterns for any CAD software and dataset.

Conclusion

The training of cerebral aneurysm detection software using data collected from multiple sites is desirable to ensure the stable performance of the software.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A deep learning algorithm may automate intracranial aneurysm detection on MR angiography with high diagnostic performance

Article 30 May 2020

Automated detection of intracranial aneurysms using skeleton-based 3D patches, semantic segmentation, and auxiliary classification for overcoming data imbalance in brain TOF-MRA

Article Open access 25 July 2023

Deep Learning for Detection of Intracranial Aneurysms from Computed Tomography Angiography Images

Article 09 September 2022

References

Giger ML, Chan HP, Boone J. Anniversary paper: history and status of CAD and quantitative image analysis: the role of medical physics and AAPM. Med Phys. 2008;35(12):5799–820.
Article Google Scholar
van Ginneken B, Schaefer-Prokop CM, Prokop M. Computer-aided diagnosis: how to move from the laboratory to the clinic. Radiology. 2011;261(3):719–32.
Article Google Scholar
Litjens G, Kooi T, Bejnordi BE, et al. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60–88.
Article Google Scholar
Yasaka K, Akai H, Kunimatsu A, Kiryu S, Abe O. Deep learning with convolutional neural network in radiology. Jpn J Radiol. 2018;36(4):257–72.
Article Google Scholar
Hu Z, Tang J, Wang Z, Zhang K, Zhang L, Sun Q. Deep learning for image-based cancer detection and diagnosis—a survey. Pattern Recogn. 2018;83:134–49.
Article Google Scholar
Weikert T, Cyriac J, Yang S, Nesic I, Parmar V, Stieltjes B. A practical guide to artificial intelligence-based image analysis in radiology. Invest Radiol. 2020;55(1):1–7.
Article Google Scholar
Bluemke DA, Moy L, Bredella MA, et al. Assessing radiology research on artificial intelligence: a brief guide for authors, reviewers, and readers–from the Radiology editorial board. Radiology. 2020;294(3):487–9.
Article Google Scholar
Park SH, Han K. Methodologic guide for evaluating clinical performance and effect of artificial intelligence technology for medical diagnosis and prediction. Radiology. 2018;286(3):800–9.
Article Google Scholar
Gibson E, Hu Y, Ghavami N, et al. Inter-site variability in prostate segmentation accuracy using deep learning. MICCAI 2018, LNCS. 2018;11073:506–14.
Onofrey JA, Casetti-Dinescu DI, Lauritzen AD, et al. Generalizable multi-site training and testing of deep neural networks using image normalization. In: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), 2019, pp. 348–351.
Remedios SW, Roy S, Bermudez C, et al. Distributed deep learning across multisite datasets for generalized CT hemorrhage segmentation. Med Phys. 2020;47(1):89–98.
Article Google Scholar
Therrien R, Doyle S. Role of training data variability on classifier performance and generalizability. Proc SPIE. 2018;10581:1058109.
Google Scholar
Zech JR, Badgeley MA, Liu M, Costa AB, Titano JJ, Oermann EK. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study. PLoS Med. 2018;15(11):e1002683.
Article Google Scholar
Arimura H, Li Q, Korogi Y, et al. Automated computerized scheme for detection of unruptured intracranial aneurysms in three-dimensional magnetic resonance angiography. Acad Radiol. 2004;11(10):1093–104.
Article Google Scholar
Kobashi SK, Konda K, Hata Y. Computer-aided diagnosis of intracranial aneurysms in MRA images with case-based reasoning. IEICE Trans Inf Syst. 2006;E89-D(1):340–50.
Article Google Scholar
Yang X, Blezek DJ, Cheng LT, Ryan WJ, Kallmes DF, Erickson BJ. Computer-aided detection of intracranial aneurysms in MR angiography. J Digit Imaging. 2011;24(1):86–95.
Article Google Scholar
Nomura Y, Masutani Y, Miki S, et al. Performance improvement in computerized detection of cerebral aneurysms by retraining classifier using feedback data collected in routine reading environment. J Biomed Graph Comput. 2014;4(4):12–21.
Google Scholar
Hanaoka S, Nomura Y, Nemoto M, et al. HoTPiG: a novel geometrical feature for vessel morphometry and its application to cerebral aneurysm detection. MICCAI 2015, LNCS. 2015;9350:103–10.
Jin Z, Arimura H, Kakeda S, Yamashita F, Sasaki M, Korogi Y. An ellipsoid convex enhancement filter for detection of asymptomatic intracranial aneurysm candidates in CAD frameworks. Med Phys. 2016;43(2):951–60.
Article Google Scholar
Nakao T, Hanaoka S, Nomura Y, et al. Deep neural network-based computer-assisted detection of cerebral aneurysms in MR angiography. J Magn Reson Imaging. 2018;47(4):948–53.
Article Google Scholar
Sichtermann T, Faron A, Sijben R, Teichert N, Freiherr J, Wiesmann M. Deep learning-based detection of intracranial aneurysms in 3D TOF-MRA. Am J Neuroradiol. 2019;40(1):25–32.
Article CAS Google Scholar
Ueda D, Yamamoto A, Nishimori M, et al. Deep learning for MR angiography: automated detection of cerebral aneurysms. Radiology. 2019;290(1):187–94.
Article Google Scholar
Hanaoka S, Nomura Y, Takenaga T, et al. HoTPiG: a novel graph-based 3-D image feature set and its applications to computer-assisted detection of cerebral aneurysms and lung nodules. Int J Comput Assist Radiol Surg. 2019;14(12):2095–107.
Article Google Scholar
Joo B, Ahn SS, Yoon PH, et al. A deep learning algorithm may automate intracranial aneurysm detection on MR angiography with high diagnostic performance. Eur Radiol. 2020;30(11):5785–93.
Article Google Scholar
Chen G, Wei X, Lei H, et al. Automated computer-assisted detection system for cerebral aneurysms in time-of-flight magnetic resonance angiography using fully convolutional network. Biomed Eng Online. 2020;19(1):38.
Article Google Scholar
Hou W, Mei S, Gui Q, et al. 1D CNN-based intracranial aneurysms detection in 3D TOF-MRA. Complexity. 2020;2020:7023754.
Google Scholar
Nyul LG, Udupa JK. On standardizing the MR image intensity scale. Magn Reson Med. 1999;42(6):1072–81.
Article CAS Google Scholar
Jiang X, Zhou Z, Ding X, Deng X, Zou L, Li B. Level set based hippocampus segmentation in MR images with improved initialization using region growing. Comput Math Methods Med. 2017;2017:5256346.
Article Google Scholar
Koenderink JJ. Solid shape. Cambridge, MA: MIT Press; 1990.
Google Scholar
Dorai C, Jain AK. COSMOS-a representation scheme for 3D free-form objects. IEEE Trans Pattern Anal Mach Intell. 1997;19(10):1115–30.
Article Google Scholar
Nemoto M, Shimizu A, Hagihara Y, Kobatake H, Nawano S. Improvement of tumor detection performance in mammograms by feature selection from a large number of features and proposal of fast feature selection method. Syst Comput Jpn. 2006;37(12):56–68.
Article Google Scholar
Li Q, Sone S, Doi K. Selective enhancement filters for nodules, vessels, and airway walls in two- and three-dimensional CT scans. Med Phys. 2003;30(8):2040–51.
Article Google Scholar
Schapire RE, Freund Y, Bartlett P, Lee WS. Boosting the margin: a new explanation for the effectiveness of voting methods. Ann Statist. 1998;26(5):1651–86.
Google Scholar
Kim K, Choi HI, Oh K. Object detection using ensemble of linear classifiers with fuzzy adaptive boosting. J Image Video Proc. 2017. https://doi.org/10.1186/s13640-017-0189-y.
Article Google Scholar
Chakraborty DP, Berbaum KS. Observer studies involving detection and localization: modeling, analysis, and validation. Med Phys. 2004;31(8):2313–30.
Article Google Scholar
Metz CE. Receiver operating characteristic analysis: a tool for the quantitative evaluation of observer performance and imaging systems. J Am Coll Radiol. 2006;3(6):413–22.
Article Google Scholar
Setio AAA, Traverso A, de Bel T, et al. Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge. Med Image Anal. 2017;42:1–13.
Article Google Scholar
Chang K, Balachandar N, Lam C, et al. Distributed deep learning networks among institutions for medical imaging. J Am Med Inform Assoc. 2018;25(8):945–54.
Article Google Scholar
Nomura Y, Miki S, Hayashi N, et al. Novel platform for development, training, and validation of computer-assisted detection/diagnosis software. Int J Comput Assist Radiol Surg. 2020;15(4):661–72.
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Dr. Shogo Nishiyama (Fuchinobe General Hospital, Kanagawa, Japan) for providing Dataset C. The Department of Computational Radiology and Preventive Medicine, The University of Tokyo Hospital, is sponsored by HIMEDIC Inc. and Siemens Healthcare K.K. This study was supported by the Japan Agency for Medical Research and Development (AMED) under Grant Number 19lk1010038h0001.

Author information

Authors and Affiliations

Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
Yukihiro Nomura, Takahiro Nakao, Naoto Hayashi, Takeharu Yoshikawa & Soichiro Miki
Department of Radiology, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
Shouhei Hanaoka, Takeyuki Watadani & Osamu Abe

Authors

Yukihiro Nomura
View author publications
You can also search for this author in PubMed Google Scholar
Shouhei Hanaoka
View author publications
You can also search for this author in PubMed Google Scholar
Takahiro Nakao
View author publications
You can also search for this author in PubMed Google Scholar
Naoto Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Takeharu Yoshikawa
View author publications
You can also search for this author in PubMed Google Scholar
Soichiro Miki
View author publications
You can also search for this author in PubMed Google Scholar
Takeyuki Watadani
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Abe
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yukihiro Nomura.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest with regard to the present study.

Ethical approval

This study was approved by the ethical review boards of our institutions.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix: Methods used in CAD software

Shape index

The shape index [29, 30] represents a local feature of the surface. The value ranges from 0 to 1. The values of the voxels of a typical cerebral aneurysm range from 0.8 to 1.0, showing a peak or dome.

Selective enhancement filters derived from Hessian matrix

The Hessian matrix is a 3 × 3 square matrix composed of local second-order derivatives of the image. The selective enhancement filters for dot, line, and plane structures were derived from the eigenvalues of the Hessian matrix [32].

HoTPiG feature set

HoTPiG [18, 23] is a voxel-based feature set derived from a graph structure extracted from a binary image of the target structure (e.g., vessel system). The HoTPiG feature set is defined at each node (i.e., each voxel) in a given graph based on a 3D histogram of shortest path distances between the node of interest and each of its neighboring node pairs.

Mahalanobis distance ratio

The Mahalanobis distance is the distance between a point and a distribution. In the classification, first, the Mahalanobis distance d_j (j = aneurysm, normal) between a feature vector at the target voxel and the multivariate distributions of two classes, which are estimated during training, are calculated. After that, the Mahalanobis distance ratio γ is calculated as follows:

$$\gamma = \frac{{d_{{normal}} }}{{d_{{aneurysm}} }}$$

(1)

If γ is above a certain threshold, the target voxel is classified as an aneurysm.

AdaBoost algorithm

AdaBoost is an adaptive algorithm to boost a sequence of classifiers [33, 34]. The AdaBoost algorithm chooses a good set of weak classifiers in rounds. On each round, it chooses the optimal classifier, which consists of feature values and the threshold, so that some misclassified data in the previous round would be correctly classified.

About this article

Cite this article

Nomura, Y., Hanaoka, S., Nakao, T. et al. Performance changes due to differences in training data for cerebral aneurysm detection in head MR angiography images. Jpn J Radiol 39, 1039–1048 (2021). https://doi.org/10.1007/s11604-021-01153-1

Download citation

Received: 25 February 2021
Accepted: 08 June 2021
Published: 14 June 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s11604-021-01153-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Performance changes due to differences in training data for cerebral aneurysm detection in head MR angiography images