Original contributionRadiomics: the process and the challenges
Introduction
“Radiomics” involves the high-throughput extraction of quantitative imaging features with the intent of creating mineable databases from radiological images [1]. It is proposed that such profound analyses and mining of image feature data will reveal quantitative predictive or prognostic associations between images and medical outcomes. In cancer, current radiological practice is generally qualitative, e.g., “a peripherally enhancing spiculated mass in the lower left lobe.” When quantitative, measurements are commonly limited to dimensional measurements of tumor size via one-dimensional (Response Evaluation Criteria In Solid Tumors [RECIST]) or two-dimensional (2D) (World Health Organization) long-axis measures [2]. These measures do not reflect the complexity of tumor morphology or behavior, nor, in many cases, are changes in these measures predictive of therapeutic benefit [3]. When additional quantitative measures are obtained, they generally average values over an entire region of interest (ROI).
There are efforts to develop a standardized lexicon for the description of such lesions [4], [5] and to include these descriptors via annotated image markup into quantitative, mineable data [6], [7]. However, such approaches do not completely cover the range of quantitative features that can be extracted from images, such as texture, shape or margin gradients. In focused studies, texture features have been shown to provide significantly higher prognostic power than ROI-based methods [8], [9], [10], [11]. The modern rebirth of radiomics (or radiogenomics) was articulated in two papers by Kuo and colleagues. Following a complete manual extraction of numerous (> 100) image features, a subset of 14 features was able to predict 80% of the gene expression pattern in hepatocellular carcinoma using computed tomographic (CT) images [12]. A similar extraction of features from contrast-enhanced magnetic resonance images (MRI) of glioblastoma was able to predict immunohistochemically identified protein expression patterns [13]. Although paradigm shifting, these analyses were performed manually, and the studies were consequently underpowered. In the current iteration of radiomics, image features have to be extracted automatically and with high throughput, putting a high premium on novel machine learning algorithm development.
The goal of radiomics is to convert images into mineable data, with high fidelity and high throughput. The radiomics enterprise can be divided into five processes with definable inputs and outputs, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification, (d) databases and data sharing and (e) ad hoc informatics analyses. Each of these steps must be developed de novo and, as such, poses discrete challenges that have to be met (Fig. 1). For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Informatics databases that allow for incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Variation in results may come from variations in any of these individual processes. Thus, after optimization, another level of challenge is to harmonize and standardize the entire process, while still allowing for improvement and process evolution.
Section snippets
Image acquisition and reconstruction challenges
In routine clinical image acquisition, there is wide variation in imaging parameters such as image resolution (pixel size or matrix size and slice thickness), washout period in the case of positron emission tomography (PET) imaging, patient position, and the variations introduced by different reconstruction algorithms and slice thicknesses, which are different for each scanner vendor. Even this simple set of imaging issues can create difficulty in comparing results obtained across institutions
Segmentation challenges
Segmentation of images into VOIs such as tumor, normal tissue and other anatomical structures is a crucial step for subsequent informatics analyses. Manual segmentation by expert readers is often treated as ground truth. However, it suffers from high interreader variability and is labor intensive; thus, it is not feasible for radiomics analysis requiring very large data sets. Many automatic and semiautomatic segmentation methods have been developed across various image modalities like CT, PET
Feature extraction and qualification
Once tumor regions are defined, imaging features can be extracted. These features describe characteristics of the tumor intensity histogram (e.g., high or low contrast), tumor shape (e.g., round or spiculated), texture patterns (e.g., homogeneous or heterogeneous), as well as descriptors of tumor location and relations with the surrounding tissues (e.g., near the heart).
Deidentification
To follow the principle of providing the minimum amount of confidential information (i.e., patient identifiers) necessary to accommodate downstream analysis of imaging data, raw DICOM image data can be stripped of identified headers and assigned a deidentified number. Maintaining deidentified images and clinical data is an important patient privacy safeguard [77]. In the context of DICOM images, Supplement 142 from the DICOM Standards Committee provides guidance in the process of deidentifying
Statistical and radioinformatics analysis
Analysis within radiomics must evolve appropriate approaches for identifying reliable, reproducible findings that could potentially be employed within a clinical context. Applying the existing bioinformatics “toolbox” to radiomics data is an efficient first step since it eliminates the necessity to develop new analytical methods and leverages accepted and validated methodologies. Radiomics-specific analysis issues will exist, as in any field; therefore, an important step in achieving consensus
Acknowledgment
Radiomics of NSCLC U01 CA143062.
References (97)
- et al.
Radiomics: extracting more information from medical images using advance feature analysis
Eur J Cancer
(2012) RECIST: right time to renovate?
Lancet Oncol
(2007)- et al.
The integration of PET–CT scans from different hospitals into radiotherapy treatment planning
Radiother Oncol
(2008) - et al.
Blood glucose level normalization and accurate timing improves the accuracy of PET-based treatment response predictions in rectal cancer
Radiother Oncol
(2010) - et al.
Diffusion-weighted magnetic resonance imaging as a cancer biomarker: consensus and recommendations
Neoplasia
(2009) - et al.
Feasibility of pathology-correlated lung imaging for accurate target definition of lung tumors
Int J Radiat Oncol Biol Phys
(2007) - et al.
Individual tooth segmentation from CT images using level set method with shape and intensity prior
Pattern Recognition
(2010) A level set method based on the Bayesian risk for medical image segmentation
Pattern Recognition
(2010)- et al.
Fronts propagating with curvature-dependent speed: algorithms based on Hamilton–Jacobi formulations
J Comput Phys
(1988) - et al.
Non-rigid image registration of brain magnetic resonance images using graph-cuts
Pattern Recognition
(2011)
Segmentation of elastographic images using a coarse-to-fine active contour model
Ultrasound Med Biol
Active contours driven by local and global intensity fitting energy with application to brain MR image segmentation
Comput Med Imaging Graph
Interactive segmentation with intelligent scissors
Graph Models Image Process
Segmentation of the central-chest lymph nodes in 3D MDCT images
Comput Biol Med
Variability of target volume delineation in cervical esophageal cancer
Int J Radiat Oncol Biol Phys
An evaluation of the variability of tumor-shape definition derived by experienced observers from CT images of supraglottic carcinomas (ACRIN protocol 6658)
Int J Radiat Oncol Biol Phys
Quantitative histogram analysis of images
Comput Phys Commun
Exploring feature-based approaches in PET images for predicting cancer treatment outcomes
Pattern Recognition
Texture analysis using gray level run lengths
Comput Graph Image Process
Texture analysis of medical images
Clin Radiol
Optimum compactness structures derived from the regular octahedron
Engineering Structures
Image segmentation feature selection and pattern classification for mammographic microcalcifications
Comput Med Imaging Graph
Measures of response: RECIST, WHO, and new alternatives
J Clin Oncol
Creating and curating a terminology for radiology: ontology modeling and analysis
J Digit Imaging
Mapping LIDC, RadLex™, and lung nodule image features
J Digit Imaging
The annotation and image mark-up project 1
Radiology
Imaging tumor vascular heterogeneity and angiogenesis using dynamic contrast-enhanced magnetic resonance imaging
Clin Cancer Res
Quantifying spatial heterogeneity in dynamic contrast‐enhanced MRI parameter maps
Magn Reson Med
Textural analysis of contrast‐enhanced MR images of the breast
Magn Reson Med
Characterization of image heterogeneity using 2D Minkowski functionals increases the sensitivity of detection of a targeted MRI contrast agent
Magn Reson Med
Decoding global gene expression programs in liver cancer by noninvasive imaging
Nat Biotechnol
Identification of noninvasive imaging surrogates for brain tumor gene-expression modules
Proc Natl Acad Sci
FDG PET and PET/CT: EANM procedure guidelines for tumour PET imaging: version 1.0
Eur J Nucl Med Mol Imaging
Standards for PET image acquisition and quantitative data analysis
J Nucl Med
Developing a quality control protocol for diffusion imaging on a clinical MRI system
Phys Med Biol
Quantifying tumor vascular heterogeneity with dynamic contrast-enhanced magnetic resonance imaging: a review
J Biomed Biotechnol
Reproducibility of dynamic contrast-enhanced MRI in human muscle and tumours: comparison of quantitative and semi-quantitative analysis
NMR Biomed
Quantification of perfusion and permeability in breast tumors with a deconvolution-based analysis of second-bolus T1-DCE data
J Magn Reson Imaging
Comparison of quantitative parameters in cervix cancer measured by dynamic contrast-enhanced MRI and CT
Magn Reson Med
Dynamic contrast-enhanced MRI in ovarian cancer: initial experience at 3 tesla in primary and metastatic disease
Magn Reson Med
Comparison of model-based arterial input functions for dynamic contrast-enhanced MRI in tumor bearing rats
Magn Reson Med
The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans
Med Phys
The Reference Image Database to Evaluate Response to therapy in lung cancer (RIDER) project: a resource for the development of change-analysis software
Clin Pharmacol Ther
Region growing: a new approach
IEEE Trans Image Process
Segmentation of pulmonary nodules in thoracic CT scans: a region growing approach
IEEE Trans Med Imaging
Cited by (1686)
Artificial intelligence in skeletal metastasis imaging
2024, Computational and Structural Biotechnology JournalClinical applications of artificial intelligence in identification and management of bacterial infection: Systematic review and meta-analysis
2024, Saudi Journal of Biological SciencesTexture and Radiomics inspired Data-Driven Cancerous Lung Nodules Severity Classification
2024, Biomedical Signal Processing and Control