Radiomics feature stability of open-source software evaluated on apparent diffusion coefficient maps in head and neck cancer

Korte, James C.; Cardenas, Carlos; Hardcastle, Nicholas; Kron, Tomas; Wang, Jihong; Bahig, Houda; Elgohari, Baher; Ger, Rachel; Court, Laurence; Fuller, Clifton D.; Ng, Sweet Ping

doi:10.1038/s41598-021-96600-4

Download PDF

Article
Open access
Published: 03 September 2021

Radiomics feature stability of open-source software evaluated on apparent diffusion coefficient maps in head and neck cancer

James C. Korte^1,2,
Carlos Cardenas³,
Nicholas Hardcastle^1,4,
Tomas Kron^1,5,
Jihong Wang³,
Houda Bahig⁶,
Baher Elgohari^7,8,
Rachel Ger⁹,
Laurence Court³,
Clifton D. Fuller⁷ &
…
Sweet Ping Ng^7,10,11

Scientific Reports volume 11, Article number: 17633 (2021) Cite this article

4991 Accesses
26 Citations
3 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 17 September 2021

This article has been updated

Abstract

Radiomics is a promising technique for discovering image based biomarkers of therapy response in cancer. Reproducibility of radiomics features is a known issue that is addressed by the image biomarker standardisation initiative (IBSI), but it remains challenging to interpret previously published radiomics signatures. This study investigates the reproducibility of radiomics features calculated with two widely used radiomics software packages (IBEX, MaZda) in comparison to an IBSI compliant software package (PyRadiomics). Intensity histogram, shape and textural features were extracted from 334 diffusion weighted magnetic resonance images of 59 head and neck cancer (HNC) patients from the PREDICT-HN observational radiotherapy study. Based on name and linear correlation, PyRadiomics shares 83 features with IBEX and 49 features with MaZda, a sub-set of well correlated features are considered reproducible (IBEX: 15 features, MaZda: 18 features). We explore the impact of including non-reproducible radiomics features in a HNC radiotherapy response model. It is possible to classify equivalent patient groups using radiomic features from either software, but only when restricting the model to reliable features using a correlation threshold method. This is relevant for clinical biomarker validation trials as it provides a framework to assess the reproducibility of reported radiomic signatures from existing trials.

Robustness of radiomic features in CT images with different slice thickness, comparing liver tumour and muscle

Article Open access 15 April 2021

Robust imaging habitat computation using voxel-wise radiomics features

Article Open access 11 October 2021

The ImSURE phantoms: a digital dataset for radiomic software benchmarking and investigation

Article Open access 12 November 2022

Introduction

Extracting textural features from medical images provides additional information¹ to capture changes in tumour heterogeneity that may complement existing shape based metrics². Radiomics^3,4 is the high-throughput extraction of image features from standard-of-care medical images, with the hypothesis that macroscopic image features offer insight into disease process at a molecular level⁵. Radiomics analysis has been widely adopted in oncology research, showing potential to identify magnetic resonance image (MRI) based biomarkers for clinical outcomes in head and neck cancers (HNC)⁶. The evolution of radiomics features during treatment, commonly referred to as delta-radiomics, may offer more information than a single time point to identify biomarkers during radiotherapy or chemotherapy and has been explored in HNC with CT imaging^7,8,9,10, positron emission tomography (PET) imaging¹¹ and more recently in MRI studies^{12,13,14,15,16}.

A review of head and neck cancer studies⁶ details the investigation of MRI radiomics features for applications such as image segmentation, histopathological classification and prognostic or predictive biomarkers. Previous HNC studies explore a range of MRI sequences, from anatomical imaging such as T1 weighted, T2 weighted and short tau inversion recovery (STIR) to functional imaging such as diffusion weighted (DW-MRI) and dynamic contrast enhanced imaging (DCE-MRI). This study focuses on radiomics features calculated on apparent diffusion coefficient (ADC) maps derived from DW-MRI images. The apparent diffusion coefficient has been correlated to cellularity in many tumour types¹⁷ and has been linked to cell proliferation in head and neck squamous cell carcinoma¹⁸. During radiotherapy, early changes in ADC have been linked to treatment response outcomes for multiple tumour types, making it a potential candidate for biological image-guided adaptive radiotherapy on MRI guided radiotherapy systems¹⁹.

Developing a radiomics model is often considered as a series of discrete tasks each with its own challenges⁵, with the variability of each task known to impact model performance^20,21,22. In MRI studies, these effects have been investigated with regard to image acquisition^{23,24,25,26,27}, region of interest segmentation^23,28,29,30, image pre-processing^28,29,31,32, feature extraction^33,34,35 and feature reduction combined with classifier training^13,36,37. To address the known variability issues of features extracted with different software^33,34,35 the image biomarker standardisation initiative (IBSI)³⁸ has proposed a set of feature extraction guidelines. In head and neck cancer, MRI studies have reported feature extraction with software such as MazDa^24,39,40,41, IBEX⁴² and in-house solutions based on MATLAB^{37,42,43,44,45,46,47,48,49}, none of which adhear to the IBSI guidelines.

Radiomic analysis generates hundreds of features, making feature reduction a crucial step to prevent overfitting when developing a radiomics model. Validation studies^50,51 select a small set of features based on previously reported radiomic signatures. The IBSI guidelines should mitigate known feature reproducibility issues^33,34,35 in future studies, but feature uncertainty remains a problem when interpreting previously reported radiomic signatures. This study investigates the correlation between features generated with open-source radiomic software packages (IBEX⁵² and MaZda⁵³) used in many published studies against an open-source tool (PyRadiomics⁵⁴) which follows the IBSI guidelines. We then explore the impact of non-reproducible radiomics features on a HNC radiotherapy response model using through therapy ADC radiomics features. Our comparison focuses on DW-MRI of head and neck cancer but provides general confidence on which previously reported radiomics features can be reproduced with software that adheres to the IBSI guidelines.

Results

Variation in radiomic features

Radiomics features were extracted from 334 apparent diffusion coefficient maps from the prospective PREDICT-HN study⁵⁹(Fig. 1) with PyRadiomics, IBEX and MaZda. A total of 314 features were extracted per ADC map (PyRadiomics: 125, IBEX: 110, MaZda: 79) including intensity histogram, shape and texture features. Based on name similarity, equation similarity and linear correlation, we identified that PyRadiomics and IBEX have 83 shared features and PyRadiomics and MaZda have 49 shared features. The linear correlation of PyRadiomics and IBEX features (Fig. 2) and PyRadiomics and MaZda features (Supplementary Figure 2) shows high correlation between intensity histogram features with a range of correlation between features in the shape and texture classes. A summary of correlation between features shared with PyRadiomics (Fig. 3) shows that on average IBEX extracts more highly correlated shape features, with MaZda extracting more highly correlated first order, GLCM and GLRM features. Features with a high Pearson’s coefficient ($r>0.901$) were considered reproducible between software packages. IBEX had 15 reproducible features (intensity histogram: 5, shape: 4, GLCM(1): 4, GLCM(4): 1, NGTDM: 1) and MaZda had 18 reproducible features (intensity histogram: 5, shape: 2, GLCM(1): 3, GLCM(4): 3, GLCM(7): 2, GLRLM: 3). For full detail of the feature names and correlations see Supplementary Tables 2, 3 and 4.

The PyRadiomics feature set was composed of 18 intensity histogram, 14 shape, 24 GLCM (at three neighbourhoods), 16 GLRLM and 5 NGTDM features. The IBEX feature set had (shared/total) 7/13 intensity histogram, 6/18 shape, 18/21 GLCM (at three neighbourhoods), 11/11 GLRLM and 5/5 NGTDM features. The MaZda feature set had (shared/total) 6/14 intensity histogram, 6/22 shape, 10/12 GLCM (at three neighbourhoods) and 7/7 GLRLM features. Shared features that had dissimilar names included shape features (Flatness, Roundness), (Volume, Area), (VoxelVolume, VoxelSize), (SurfaceArea, Perimeter), (SurfaceVolumeRatio, SurfaceAreaDensity), (FerretDiameter, Maximum3DDiameter), GLCM features (SumSquares, Variance), (InverseDifference, Homogeneity1), (InverseDifferenceMoment, Homogeneity2), (JointEnergy, AngularSecondMoment), (ClusterTendancy, JointEnergy) and one GLRLM feature (RunPercentage, Fraction).

Variation in patient modelling

To investigate the impact of feature variability on patient modelling, 36 patients with imaging at all eight time points were clustered into two patient groups using an unsupervised learning method. Radiomic features were calculated throughout radiotherapy on apparent diffusion maps coefficient (ADC) maps, derived from diffusion weighted MRI. Hierarchal clustering based on all shared radiomics features (Fig. 4a, Supplementary Figure 1a) resulted in different patient groups, with six patients classified differently between PyRadiomics and IBEX and fifteen patients classified differently between PyRadiomics and MaZda groups. When clustering with a reduced set of reproducible features we observed nearly identical patient groups, with one patient classified differently between PyRadiomics and IBEX radiomics features (Fig. 4b) and identical PyRadiomics and MaZda groups (Supplementary Figure 2b). The change in clustering similarity over a range of reproducibility thresholds (Fig. 5) highlights that as the reliability threshold increases the number of included radiomics features decreases, with a general trend of increasing clustering similarity for a threshold above 0.90. To cluster identical patient groups in both software a more stringent reproducibility threshold ($r>0.965$) was required and reduced the IBEX feature set to 7 features (intensity histogram: 3, GLCM(1): 4) and the MaZda feature set to 5 intensity histogram features.

Discussion

Existing studies of variability in radiomics feature extraction^33,34,35 explore a range of image modalities and feature extraction software. A study of mammograms and HNC computed tomography images³³ also extracted IBEX and MaZda features, but compared them against one another and with two in-house software packages. One feature extraction study of HNC patients was less comparable as it analysed PET images³⁵ and compared two in-house radiomics software packages. A study of HNC patients who had both CT and MRI imaging³⁴ also extracted PyRadiomics features but compared them to features extracted with Moddicom⁵⁵ and the radiomics extension to CERR⁵⁶. Whilst that study³⁴ also extracted features from MRI images of HNC patients, the features were from T2 weighted images, whereas our study is the first, to our knowledge, to investigate feature extraction stability on ADC maps from diffusion weighted MRI.

Our study observed a similar trend of feature extraction reproducibility to existing studies; intensity histogram features have high reproducibility across feature extraction software, with shape and textural features being more software package dependent. The study of mammograms and HNC CT images³³ demonstrated high intra-class correlation (ICC) of intensity histogram features and low ICC of GLCM features, though this was partially attributed to the use of default GLCM extraction settings for each software. The study of HNC patients with both CT and MRI images³⁴ showed that CT features had high Spearman correlation of intensity histogram, shape and GLCM features and lower ICC for GLRLM and grey-level size zone matrix (GLSZM) features. The T2 weighted MRI features compared in that study had high Spearman correlation for intensity histogram and shape features between PyRadiomics and CERR and only shape features highly correlated between PyRadiomics and Moddicom; the lack of correlation with intensity histogram features was attributed to Moddicom performing an image intensity correction on the T2 weighted MRI. In our study NGTDM features were extracted with IBEX and had a mixture of well correlated (1/5) and uncorrelated (4/5) features, the previously mentioned PET study³⁵ also extracted NGTDM features but only a percentage of grouped textural features above an ICC threshold was reported. The similarity of our findings with those previously reported may indicate that variability in radiomics features due to the extraction software can be considered independent of the imaging modality.

Two radiomics feature reproducibility studies^34,35 have investigated the impact of feature extraction variability on HNC model performance. The joint MRI and CT imaging study³⁴ performed hierarchical clustering of patients for each feature class separately (intensity histogram, GLCM, etc.) and observed consistent clustering of clinical variables (TN category, GTV volume) for radiomics feature classes with a high reproducibility. The PET study³⁵ demonstrated that models of local tumour control built on features from two different radiomics packages can stratify patients into very similar low or high risk recurrence groups, with the model features being highly correlated between the two software packages (ICC > 0.9). These studies and our results demonstrate that it is possible to generate equivalent radiomics models with different feature extraction software, but that care must be taken to ensure that features are highly correlated between the software packages. Additionally, our study extends this observation to patient classification based on radiomics features prior to treatment, at multiple time points during treatment and post-treatment.

A limitation of this and previous studies^33,34,35 is the ambiguity in matching features between different software, firstly identifying shared features and secondly matching the feature extraction settings which often differ between software package. Name similarity does not guarantee equation similarity, for example the shape feature “SurfaceVolumeRatio” is calculated with a voxel based volume in IBEX and a mesh based volume in PyRadiomics. Equation similarity can be difficult to establish due to different notations, such as the formulation of the shape feature “SurfaceArea” between PyRadiomics and IBEX (Supplementary Table 6) which appear quite different but have a high correlation (0.998). We observed that equation similarity does not guarantee strong feature correlation, even with well-matched extraction settings, for a number of IBEX GLCM features such as “AutoCorrelation”, “MaximumProbability” and “JointEnergy” (Supplementary Table 7). Alternatively, minor differences in equations with non-ideally matched extraction settings showed moderate correlation, such as MaZda GLCM features “DifferenceVariance” and “InverseDifferenceMoment” (Supplementary Table 7). The naming issue can be avoided in future for a sub-set of features that have a unique identifier as defined in the IBSI guidelines³⁸. To assist with consistent naming and feature extraction settings an ontology based radiomics workflow has been proposed⁵⁷.

It is challenging to match the feature extraction settings between PyRadiomics, IBEX and MaZda. We were unable to define a reduced intensity range in MaZda for GLCM and GLRLM features, which may have negatively affected the feature correlation results, though we observed higher correlations in these feature classes from MaZda than with IBEX that had a matched intensity range and bin width. Similarly, we were unable to define identical directions for GLCM and GLRLM feature extraction across the software packages, averaging over different directions may have negatively affected our feature correlation results. A newer version of MaZda (qmazda) was selected for this study as it supports batch feature extraction, from the software documentation it was inferred that there was minimal change to the feature extraction code from earlier MaZda versions. The region of interest was calculated with a Plastimatch derived label map for PyRadiomics and MaZda and was calculated directly from the DICOM structure file (RTSTRUCT) by IBEX, which may have introduced variability in the image region used for feature extraction. We explored radiomics features calculated on the original ADC map to avoid additional variability from image filtration (Wavelet, Laplacian or Gaussian, etc.) between software implementations. Image filtration reproducibility between software using MRI images may be worth investigating as differences in wavelet filtered features from PET images have been previously reported³⁵. Whilst our investigation of feature variability on patient modelling demonstrated general classification differences between software, it is challenging to quantify a clinical impact as the patient groups are not correlated with a clinical outcome. There is some evidence that unsupervised feature clustering is correlated with clinical outcomes in HNC³⁴.

Magnetic resonance imaging offers insight into disease related anatomical and functional changes and is ideal for radiomics analysis as multiple MRI sequences can provide complimentary information⁵⁸. This study reports the variability of radiomics features extracted from ADC maps, that are a relatively quantitative image, generating features not significantly influenced by scanner manufacturer or magnetic field strength²⁶; radiomics features extracted from T1 and T2 weighted images are less reproducible across scanner and not recommended at this point for multicentre trials²⁴. The increasing uptake of MRI for simulation and therapy in radiotherapy departments offers an unprecedented opportunity to characterise tumour response and personalise patient treatment. This study provides information on how feature extraction software can impact the reproducibility of a radiomics workflow, which we should endeavour to optimise in order to accelerate discovery through reproducible research and data sharing.

Conclusion

This work highlights feature and model reproducibility issues due to different radiomic analysis software. We propose a correlation threshold method to select reproducible features and demonstrate that the identified features from both software generate an equivalent model. This is relevant for the selection of radiomic features in clinical biomarker validation trials as it provides a framework to assess the reproducibility of radiomic signatures from existing studies.

Material and methods

Study cohort

The imaging data used in this study was collected as part of the prospective PREDICT-HN study⁵⁹ (Fig. 1). The trial imaged 59 patients with head and neck squamous cell carcinoma who were treated with curative intent radiotherapy, patient details are summarised in Table 1. Trial participants were imaged prior to radiotherapy, weekly during radiotherapy (following fraction 5, 10, 15, 20, 25 and 30) and two to three months post-radiotherapy. Imaging data was acquired for all patients prior to radiotherapy (n = 59), with a lower number of images acquired during radiotherapy over the first three weeks (n = 40), at weeks four and five (n = 39), at week six (n = 38) and post-radiotherapy (n = 39).

Table 1 Patient characteristics.

Full size table

Ethics approval and consent to participate

The PREDICT-HN study⁵⁹ was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board at the University of Texas MD Anderson Cancer Center. The study is registered on clinicaltrials.gov with registration number NCT03491176, date of registration 09/04/2018 (retrospectively registered), date of enrolment of the first participant 30/05/2017. All study participants were 18 years and older, informed consent was obtained from all participants.

Imaging protocol

MRI imaging was performed on a Siemens 1.5 T Aera scanner to acquire both anatomical and functional images. The gross tumour volume (GTV) was contoured on pre-treatment T2 weighted turbo-spin-echo (T2w-TSE) images (voxel size = 0.5 mm, FOV = 256 × 256 mm, axial slices = 12, slice thickness = 2 mm, TE = 80 ms, TR = 4800 ms, FA = 90°, ETL = 15, pixel bandwidth = 300 Hz) by a radiation oncologist. Throughout treatment, the pre-treatment contours were rigidly registered to the through treatment images and manually adjusted to anatomical boundaries, then propagated onto the apparent diffusion coefficient (ADC) maps and visually verified. Diffusion weighted images (DWI) were acquired with the BLADE^60,61 sequence (voxel size = 2 mm, FOV = 256 × 256 mm, axial slices = 25, slice thickness = 4 mm, TE = 50 ms, TR = 5400 ms, FA = 90°, b = 0,800 s/mm2, ETL = 15, pixel bandwidth = 1220 Hz). Apparent diffusion coefficient maps were calculated from the DWI images using the default mono-exponential model.

Feature extraction

Features were extracted from all the ADC maps (n = 334) using PyRadiomics (version 2.1.0), IBEX (version 1.0 Beta) and MaZda (qmazda 19.02). All available intensity histogram (IHIST), shape, grey-level co-occurrence matrix (GLCM⁶²), grey-level run length matrix (GLRLM⁶³) and neighbourhood grey-tone difference matrix (NGTDM⁶⁴) features were calculated on the original ADC map only; the reproducibility of image filtration prior to feature extraction was considered outside the scope of this study. The region of interest for feature extraction was calculated directly from the DICOM radiotherapy contours (RTSTRUCT) with IBEX and an intermediate step to convert the DICOM contours to a binary label map with Plastimatch (version 1.7.3)⁶⁵ was required for PyRadiomics and MaZda. The feature extraction settings (Supplementary Table 1) were set as the default IBEX settings and matched as closely as possible between the three software packages, based on available documentation. ADC maps were discretised (256 bins, bin width = 16) prior to calculation of IHIST and NGTDM features and with a reduced data range (100 bins, bin width = 21) for GLCM and GLRLM features. GLCM features were calculated at a series of neighbourhoods (1, 4, 7), asymmetric features were calculated with PyRadiomics and IBEX, symmetric features were calculated with MaZda to compensate for a reduced range of direction angles. Symmetric NGTDM features were calculated with a neighbourhood of three. Texture features (GLCM, GLRLM, NGTDM) were calculated in the axial plane over the three dimensional region of interest,an example of this is the NGTDM matrices being constructed with a 2D neighbourhood rather than a 3D neighbourhood. We calculated the average of all GLCM and GLRLM feature directions as PyRadiomics does not report or allow the specification of features for individual directions.

Variation in radiomic features

To determine the relationship between features generated with the established radiomics software (IBEX/MaZda) and the IBSI compliant software (PyRadiomics) we performed a linear regression analysis on radiomics features extracted from ADC maps of all patients at all time points. A linear fit was calculated between PyRadiomics and IBEX or MaZda for every feature in a given feature class (IHIST, SHAPE, GLCM, GLRLM, NGTDM). Features with an invalid value (i.e. infinity due to division by zero) were excluded during the linear regression. A list of shared radiomics features was collated by identifying features extracted by both PyRadiomics and the alternative software package, and was based primarily on name and equation similarity. A small number of features that showed a high linear correlation but had dissimilar names, for example ‘SurfaceArea’ and ‘Perimeter’, were also included. The list of shared features, feature correlations and the number of images used per feature correlation can be found in Supplementary Tables 2, 3 and 4. Full details of the equation comparison are in Supplementary Tables 5–9.

Variation in patient modelling

To demonstrate the potential impact of incorporating non-reproducible features in a radiomics model we used unsupervised learning to identify two groups of patients, based on radiomic features at pre-treatment, throughout radiotherapy and post-radiotherapy. Separate radiomics models were generated based on PyRadiomics and IBEX/MaZda features, first using all features and then with the sub-set of reproducible features. Reproducible features were selected as those with a high Pearson’s correlation coefficient ($r> 0.901$) as calculated for the analysis of variation in radiomics features, the correlation threshold was determined as per the sensitivity analysis described below. Features that contained an invalid number (i.e. infinity) for any patient in the modelling cohort were excluded. Patients (n = 36) with image data for all time points were grouped with SciPy^66,67 using Ward's minimum variance clustering method⁶⁸ on scaled radiomics features (z-score standardisation) with an automatic minimum clustering threshold to generate no more than two clusters.

To test the sensitivity of clustering to the selected correlation threshold, we performed the clustering over a range of thresholds ($r>0.0$ to $r>0.999$, with an increment of 0.001) and measured the clustering similarity as the percentage of patients clustered by IBEX or MaZda into the same groups as PyRadiomics. We define clustering similarity as,

$$ Similarity_{i} \;\left( \% \right) = \frac{{ \left| {A_{PyRad} \cap A_{i} } \right| + |B_{PyRad} \cap B_{i} |}}{{\left| {A_{PyRad} \cup B_{PyRad} } \right|}}100, $$

where ${A}_{PyRad}$ and ${B}_{PyRad}$ are patient groups from clustering with PyRadiomics features, ${A}_{i}$ and ${B}_{i}$ are patient groups from clustering with either IBEX or MaZda features, set intersection is denoted with $\cap $, set union is denoted with $\cup $ and the number of patients in a set is denoted with $|A|$. Due to the possibility that unsupervised clustering can return similar groups but in a different order, the similarity metric was calculated as,

$$Similarity_{i} \;\left( \% \right) = max \left( {\frac{{ \left| {A_{PyRad} \cap A_{i} } \right| + |B_{PyRad} \cap B_{i} |}}{{\left| {A_{PyRad} \cup B_{PyRad} } \right|}} , \frac{{ \left| {A_{PyRad} \cap B_{i} } \right| + |B_{PyRad} \cap A_{i} |}}{{\left| {A_{PyRad} \cup B_{PyRad} } \right|}}} \right) 100.$$

Change history

17 September 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41598-021-98445-3

References

Castellano, G. et al. Texture analysis of medical images. Clin. Radiol. 59(12), 1061–1069 (2004).
Article CAS PubMed Google Scholar
Jaffe, C. C. Measures of response: RECIST, WHO, and new alternatives. J. Clin. Oncol. 24(20), 3245–3251 (2006).
Article PubMed Google Scholar
Kumar, V. et al. Radiomics: The process and the challenges. Magn. Reson. Imaging 30(9), 1234–1248 (2012).
Article PubMed PubMed Central Google Scholar
Lambin, P. et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 48(4), 441–446 (2012).
Article PubMed PubMed Central Google Scholar
Gillies, R. J., Kinahan, P. E. & Hricak, H. Radiomics: Images are more than pictures, they are data. Radiology 278(2), 563–577 (2016).
Article PubMed Google Scholar
Jethanandani, A. et al. Exploring applications of radiomics in magnetic resonance imaging of head and neck cancer: A systematic review. Front. Oncol. 8, 21 (2018).
Article Google Scholar
Cunliffe, A. et al. Lung texture in serial thoracic computed tomography scans: Correlation of radiomics-based features with radiation therapy dose and radiation pneumonitis development. Int. J. Radiat. Oncol. Biol. Phys. 91(5), 1048–1056 (2015).
Article PubMed PubMed Central Google Scholar
Fave, X. et al. Delta-radiomics features for the prediction of patient outcomes in non-small cell lung cancer. Sci. Rep. 7, 11 (2017).
Article ADS CAS Google Scholar
Goh, V. et al. Assessment of response to tyrosine kinase inhibitors in metastatic renal cell cancer: CT texture as a predictive biomarker. Radiology 261(1), 165–171 (2011).
Article PubMed Google Scholar
Rao, S. X. et al. CT texture analysis in colorectal liver metastases: A better way than size and volume measurements to assess response to chemotherapy?. United Eur. Gastroenterol. J. 4(2), 257–263 (2016).
Article CAS Google Scholar
Carvalho, S. et al. Early variation of FDG-PET radiomics features in NSCLC is related to overall survival—The “delta radiomics” concept. Radiother. Oncol. 118, S20–S21 (2016).
Article Google Scholar
Boldrini, L. et al. Delta radiomics for rectal cancer response prediction with hybrid 035T magnetic resonance-guided radiotherapy (MRgRT): A hypothesis-generating study for an innovative personalized medicine approach. Radiol. Med. 124(2), 145–153 (2019).
Article MathSciNet PubMed Google Scholar
Chang, Y. et al. An investigation of machine learning methods in delta-radiomics feature analysis. PLoS ONE 14(12), e0226348 (2019).
Article CAS PubMed PubMed Central Google Scholar
Crombe, A. et al. T-2-based MRI Delta-radiomics improve response prediction in soft-tissue sarcomas treated by neoadjuvant chemotherapy. J. Magn. Reson. Imaging 50(2), 497–510 (2019).
Article PubMed Google Scholar
Jeon, S. H. et al. Delta-radiomics signature predicts treatment outcomes after preoperative chemoradiotherapy and surgery in rectal cancer. Radiat. Oncol. 14, 10 (2019).
Article Google Scholar
Zhang, Z. J. et al. A predictive model for distinguishing radiation necrosis from tumour progression after gamma knife radiosurgery based on radiomic features from MR images. Eur. Radiol. 28(6), 2255–2263 (2018).
Article PubMed Google Scholar
Surov, A., Meyer, H. J. & Wienke, A. Correlation between apparent diffusion coefficient (ADC) and cellularity is different in several tumors: A meta-analysis. Oncotarget 8(35), 59492–59499 (2017).
Article PubMed PubMed Central Google Scholar
Surov, A. et al. Histogram analysis parameters of apparent diffusion coefficient reflect tumor cellularity and proliferation activity in head and neck squamous cell carcinoma. Oncotarget 9(34), 23599–23607 (2018).
Article PubMed PubMed Central Google Scholar
van Houdt, P. J., Yang, Y. & van der Heide, U. A. Quantitative magnetic resonance imaging for biological image-guided adaptive radiotherapy. Front. Oncol. 10(3190), 615643 (2021).
Article PubMed PubMed Central Google Scholar
Leger, S. et al. A comparative study of machine learning methods for time-to-event survival data for radiomics risk modelling. Sci. Rep. 7, 11 (2017).
Article CAS Google Scholar
Parmar, C. et al. Machine learning methods for quantitative radiomic biomarkers. Sci. Rep. 5, 11 (2015).
Article Google Scholar
Zhao, B. S. et al. Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci. Rep. 6, 7 (2016).
CAS Google Scholar
Fiset, S. et al. Repeatability and reproducibility of MRI-based radiomic features in cervical cancer. Radiother. Oncol. 135, 107–114 (2019).
Article PubMed Google Scholar
Fruehwald-Pallamar, J. et al. Texture-based analysis of 100 MR examinations of head and neck tumors—Is it possible to discriminate between benign and malignant masses in a multicenter trial?. Rofo-Fortschritte Auf Dem Gebiet Der Rontgenstrahlen Und Der Bildgebenden Verfahren 188(2), 195–202 (2016).
CAS Google Scholar
Mayerhoefer, M. E. et al. Effects of magnetic resonance image interpolation on the results of texture-based pattern classification a phantom study. Investig. Radiol. 44(7), 405–411 (2009).
Article Google Scholar
Peerlings, J. et al. Stability of radiomics features in apparent diffusion coefficient maps from a multi-centre test-retest trial. Sci. Rep. 9, 10 (2019).
Article CAS Google Scholar
Yang, F. et al. Evaluation of radiomic texture feature error due to MRI acquisition and reconstruction: A simulation study utilizing ground truth. Phys. Med. Eur. J. Med. Phys. 50, 26–36 (2018).
Google Scholar
Bologna, M. et al. Assessment of stability and discrimination capacity of radiomic features on apparent diffusion coefficient images. J. Digit. Imaging 31(6), 879–894 (2018).
Article PubMed PubMed Central Google Scholar
Duron, L. et al. Gray-level discretization impacts reproducible MRI radiomics texture features. PLoS ONE 14(3), 14 (2019).
Article CAS Google Scholar
Saha, A., Harowicz, M. R. & Mazurowski, M. A. Breast cancer MRI radiomics: An overview of algorithmic features and impact of inter-reader variability in annotating tumors. Med. Phys. 45(7), 3076–3085 (2018).
Article PubMed Google Scholar
Collewet, G., Strzelecki, M. & Mariette, F. Influence of MRI acquisition protocols and image intensity normalization methods on texture classification. Magn. Reson. Imaging 22(1), 81–91 (2004).
Article CAS PubMed Google Scholar
Moradmand, H., Aghamiri, S. M. R. & Ghaderi, R. Impact of image preprocessing methods on reproducibility of radiomic features in multimodal magnetic resonance imaging in glioblastoma. J. Appl. Clin. Med. Phys. 21(1), 179–190 (2020).
Article PubMed Google Scholar
Foy, J. J. et al. Variation in algorithm implementation across radiomics software. J. Med. Imaging 5(4), 10 (2018).
Article Google Scholar
Liang, Z. G. et al. Comparison of radiomics tools for image analyses and clinical prediction in nasopharyngeal carcinoma. Br. J. Radiol. 92(1102), 9 (2019).
Article Google Scholar
Bogowicz, M. et al. Post-radiochemotherapy PET radiomics in head and neck cancer—The influence of radiomics implementation on the reproducibility of local control tumor models. Radiother. Oncol. 125(3), 385–391 (2017).
Article PubMed Google Scholar
Qian, Z. H. et al. Differentiation of glioblastoma from solitary brain metastases using radiomic machine-learning classifiers. Cancer Lett. 451, 128–135 (2019).
Article CAS PubMed Google Scholar
Zhang, B. et al. Radiomic machine-learning classifiers for prognostic biomarkers of advanced nasopharyngeal carcinoma. Cancer Lett. 403, 21–27 (2017).
Article CAS PubMed Google Scholar
Zwanenburg, A. et al. Image biomarker standardisation initiative. arXiv preprint arXiv:1612.07003 (2016).
Brown, A. M. et al. Multi-institutional validation of a novel textural analysis tool for preoperative stratification of suspected thyroid tumors on diffusion-weighted MRI. Magn. Reson. Med. 75(4), 1708–1716 (2016).
Article PubMed Google Scholar
Fruehwald-Pallamar, J. et al. Texture-based and diffusion-weighted discrimination of parotid gland lesions on MR images at 3.0 T. Nmr Biomed. 26(11), 1372–1379 (2013).
Article PubMed Google Scholar
Meyer, H. J. et al. MRI texture analysis reflects histopathology parameters in thyroid cancer—A first preliminary study. Transl. Oncol. 10(6), 911–916 (2017).
Article PubMed PubMed Central Google Scholar
Hou, Z. et al. Radiomic analysis in T2W and SPAIR T2W MRI: Predict treatment response to chemoradiotherapy in esophageal squamous cell carcinoma. J. Thorac. Disease 10(4), 2256 (2018).
Article Google Scholar
Jansen, J. F. A. et al. Texture analysis on parametric maps derived from dynamic contrast-enhanced magnetic resonance imaging in head and neck cancer. World J. Radiol. 8(1), 90–97 (2016).
Article PubMed PubMed Central Google Scholar
Liu, J. et al. Use of texture analysis based on contrast-enhanced MRI to predict treatment response to chemoradiotherapy in nasopharyngeal carcinoma. J. Magn. Reson. Imaging 44(2), 445–455 (2016).
Article PubMed Google Scholar
Ouyang, F. S. et al. Exploration and validation of radiomics signature as an independent prognostic biomarker in stage III–IVb nasopharyngeal carcinoma. Oncotarget 8(43), 74869–74879 (2017).
Article PubMed PubMed Central Google Scholar
Ren, J. L. et al. Magnetic resonance imaging based radiomics signature for the preoperative discrimination of stage I–II and III–IV head and neck squamous cell carcinoma. Eur. J. Radiol. 106, 1–6 (2018).
Article ADS PubMed Google Scholar
Wang, G. Y. et al. Pretreatment MR imaging radiomics signatures for response prediction to induction chemotherapy in patients with nasopharyngeal carcinoma. Eur. J. Radiol. 98, 100–106 (2018).
Article PubMed Google Scholar
Zhang, B. et al. Advanced nasopharyngeal carcinoma: Pre-treatment prediction of progression based on multi-parametric MRI radiomics. Oncotarget 8(42), 72457–72465 (2017).
Article PubMed PubMed Central Google Scholar
Zhang, B. et al. Radiomics features of multiparametric MRI as novel prognostic factors in advanced nasopharyngeal carcinoma. Clin. Cancer Res. 23(15), 4259–4269 (2017).
Article ADS PubMed Google Scholar
Leijenaar, R. T. H. et al. External validation of a prognostic CT-based radiomic signature in oropharyngeal squamous cell carcinoma. Acta Oncol. 54(9), 1423–1429 (2015).
Article CAS PubMed Google Scholar
Lucia, F. et al. External validation of a combined PET and MRI radiomics model for prediction of recurrence in cervical cancer patients treated with chemoradiotherapy. Eur. J. Nucl. Med. Mol. Imaging 46(4), 864–877 (2019).
Article PubMed Google Scholar
Zhang, L. et al. Ibex: An open infrastructure software platform to facilitate collaborative work in radiomics. Med. Phys. 42(3), 1341–1353 (2015).
Article PubMed PubMed Central Google Scholar
Szczypinski, P. M. et al. MaZda—A software package for image texture analysis. Comput. Methods Prog. Biomed. 94(1), 66–76 (2009).
Article Google Scholar
van Griethuysen, J. J. M. et al. Computational radiomics system to decode the radiographic phenotype. Can. Res. 77(21), E104–E107 (2017).
Article CAS Google Scholar
Dinapoli, N. et al. Moddicom: A complete and easily accessible library for prognostic evaluations relying on image features, in 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 771–774 (2015).
Apte, A. P. et al. Technical note: Extension of CERR for computational radiomics: A comprehensive MATLAB platform for reproducible radiomics research. Med. Phys. 45(8), 3713–3720 (2018).
Article Google Scholar
Shi, Z. et al. Technical note: Ontology-guided radiomics analysis workflow (O-RAW). Med. Phys. 46(12), 5677–5684 (2019).
Article PubMed Google Scholar
Lecler, A. et al. Combining multiple magnetic resonance imaging sequences provides independent reproducible radiomics features. Sci. Rep. 9, 8 (2019).
Article CAS Google Scholar
Ng, S. P. et al. Predicting treatment response based on dual assessment of magnetic resonance imaging kinetics and circulating tumor cells in patients with head and neck cancer (PREDICT-HN): matching ‘liquid biopsy’ and quantitative tumor modeling. BMC Cancer 18, 8 (2018).
Article CAS Google Scholar
Pipe, J. G. Motion correction with PROPELLER MRI: Application to head motion and free-breathing cardiac imaging. Magn. Reson. Med. 42(5), 963–969 (1999).
Article CAS PubMed Google Scholar
Wintersperger, B. J. et al. Brain magnetic resonance imaging at 3 T using BLADE compared with standard rectilinear data sampling. Investig. Radiol. 41(7), 586–592 (2006).
Article Google Scholar
Haralick, R. M., Shanmugam, K. & Dinstein, I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. SMC3(6), 610–621 (1973).
Article Google Scholar
Galloway, M. M. Texture analysis using gray level run lengths. Comput. Graphics Image Process. 4(2), 172–179 (1975).
Article Google Scholar
Amadasun, M. & King, R. Textural features corresponding to textural properties. IEEE Trans. Syst. Man Cybern. 19(5), 1264–1274 (1989).
Article Google Scholar
Sharp, G. C. et al. Plastimatch-an open source software suite for radiotherapy image processing, in Proceedings of the XVI’th International Conference on the use of Computers in Radiotherapy (ICCR), Amsterdam, Netherlands (2010).
Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 17(3), 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Ward, J. H. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58(301), 236–244 (1963).
Article MathSciNet Google Scholar

Download references

Acknowledgements

This project is supported by funding from the Peter MacCallum Cancer Foundation, RSNA Fellow Grant, Mike Hogg Fund, and RANZCR research grants.

Author information

Authors and Affiliations

Department of Physical Science, Peter MacCallum Cancer Centre, 305 Grattan St, Melbourne, VIC, 3000, Australia
James C. Korte, Nicholas Hardcastle & Tomas Kron
Department of Biomedical Engineering, University of Melbourne, Melbourne, Australia
James C. Korte
Department of Radiation Physics, University of Texas MD Anderson Cancer Center, Houston, USA
Carlos Cardenas, Jihong Wang & Laurence Court
Centre for Medical Radiation Physics, University of Wollongong, Wollongong, Australia
Nicholas Hardcastle
Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, Australia
Tomas Kron
Radiation Oncology Department, Centre Hospitalier de l’Université de Montréal, Montreal, Canada
Houda Bahig
Department of Radiation Oncology, University of Texas MD Anderson Cancer Center, Houston, USA
Baher Elgohari, Clifton D. Fuller & Sweet Ping Ng
Clinical Oncology & Nuclear Medicine Department, Mansoura University, Mansoura, Egypt
Baher Elgohari
Department of Radiation Oncology, Mayo Clinic, Phoenix, AZ, USA
Rachel Ger
Department of Radiation Oncology, Peter MacCallum Cancer Centre, Melbourne, Australia
Sweet Ping Ng
Department of Radiation Oncology, Olivia Newton-John Cancer Wellness and Research Centre, Melbourne, Australia
Sweet Ping Ng

Authors

James C. Korte
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Cardenas
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas Hardcastle
View author publications
You can also search for this author in PubMed Google Scholar
Tomas Kron
View author publications
You can also search for this author in PubMed Google Scholar
Jihong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Houda Bahig
View author publications
You can also search for this author in PubMed Google Scholar
Baher Elgohari
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Ger
View author publications
You can also search for this author in PubMed Google Scholar
Laurence Court
View author publications
You can also search for this author in PubMed Google Scholar
Clifton D. Fuller
View author publications
You can also search for this author in PubMed Google Scholar
Sweet Ping Ng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.K. wrote the main manuscript text and prepared the figures and tables. J.K. and S.P. performed the radiomics feature extraction. J.K., S.P. and N.H. performed the radiomics analysis and data interpretation. S.P., C.C., J.W. and C.F. recruited patients and supervised data collection on the PREDICT-HN study. All authors reviewed the manuscript.

Corresponding author

Correspondence to James C. Korte.

Ethics declarations

Competing interests

TK has a Research Collaboration Agreement with Varian Medical Systems. NH has a Clinical Research Collaborations Program Grant from Varian Medical Systems. SPN was funded by the Australian Postgraduate Award, the Royal Australian and New Zealand College of Radiologists (RANZCR) Research Grant and the Radiological Society of North America (RSNA) Fellow Grant. CDF received funding support from the National Institutes of Health (NIH)/National Institute for Dental and Craniofacial Research (NIDCR) (1R01DE025248-01/R56DE025248-01). CDF was previously funded via the National Science Foundation (NSF), Division of Mathematical Sciences, Joint NIH/NSF Initiative on Quantitative Approaches to Biomedical Big Data (QuBBD) Grant (NSF DMS-1557679) and is currently supported by the NIH National Cancer Institute (NCI)/Big Data to Knowledge (BD2K) Program (1R01CA214825-01), the NIH/NCI Head and Neck Specialized Programs of Research Excellence (SPORE) Developmental Research Program Career Development Award (P50CA097007-10); the NCI Paul Calabresi Clinical Oncology Program Award (K12 CA088084-06); a General Electric Healthcare/MD Anderson Center for Advanced Biomedical Imaging In-Kind Award; an Elekta AB/MD Anderson Department of Radiation Oncology Seed Grant; the Center for Radiation Oncology Research (CROR) at MD Anderson Cancer Center Seed Grant; and the MD Anderson Institutional Research Grant (IRG) Program. CDF has received speaker travel funding from Elekta AB. Supported in part by the NIH/NCI Cancer Center Support (Core) Grant CA016672 to The University of Texas MD Anderson Cancer Center (P30 CA016672). JCK, CC, JW, HB, BE, RG and LC have no potential competing interests to declare.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: The original version of this Article contained an error in the order of the Figures. Figures 1, 2, 3, 4 and 5 were published as Figures 5, 1, 2, 3, and 4 respectively.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Korte, J.C., Cardenas, C., Hardcastle, N. et al. Radiomics feature stability of open-source software evaluated on apparent diffusion coefficient maps in head and neck cancer. Sci Rep 11, 17633 (2021). https://doi.org/10.1038/s41598-021-96600-4

Download citation

Received: 22 March 2021
Accepted: 10 August 2021
Published: 03 September 2021
DOI: https://doi.org/10.1038/s41598-021-96600-4

This article is cited by

Utilization of texture features of volumetric ADC maps in differentiating between serous cystadenoma and intraductal papillary neoplasms
- Golnoosh Ansari
- Mohammad Mirza-Aghazadeh-Attari
- Ihab R. Kamel
Abdominal Radiology (2024)
Phantom-based radiomics feature test–retest stability analysis on photon-counting detector CT
- Alexander Hertel
- Hishan Tharmaseelan
- Isabelle Ayx
European Radiology (2023)
Radiomics under 2D regions, 3D regions, and peritumoral regions reveal tumor heterogeneity in non-small cell lung cancer: a multicenter study
- Xingping Zhang
- Guijuan Zhang
- Yanchun Zhang
La radiologia medica (2023)
The impact of inter-observer variation in delineation on robustness of radiomics features in non-small cell lung cancer
- Gargi Kothari
- Beverley Woon
- Shankar Siva
Scientific Reports (2022)
Evaluation of radiomics feature stability in abdominal monoenergetic photon counting CT reconstructions
- Hishan Tharmaseelan
- Lukas T. Rotkopf
- Matthias F. Froelich
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Variation in radiomic features

Variation in patient modelling

Discussion

Conclusion

Material and methods

Study cohort

Ethics approval and consent to participate

Imaging protocol

Feature extraction

Variation in radiomic features

Variation in patient modelling

Change history

17 September 2021

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links