Radiomics: the process and the challenges

doi:10.1016/j.mri.2012.06.010

Magnetic Resonance Imaging

Volume 30, Issue 9, November 2012, Pages 1234-1248

https://doi.org/10.1016/j.mri.2012.06.010 Get rights and content

Abstract

“Radiomics” refers to the extraction and analysis of large amounts of advanced quantitative imaging features with high throughput from medical images obtained with computed tomography, positron emission tomography or magnetic resonance imaging. Importantly, these data are designed to be extracted from standard-of-care images, leading to a very large potential subject pool. Radiomics data are in a mineable form that can be used to build descriptive and predictive models relating image features to phenotypes or gene–protein signatures. The core hypothesis of radiomics is that these models, which can include biological or medical data, can provide valuable diagnostic, prognostic or predictive information. The radiomics enterprise can be divided into distinct processes, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification and (d) databases and data sharing for eventual (e) ad hoc informatics analyses. Each of these individual processes poses unique challenges. For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Also, segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Furthermore, informatics databases that allow incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Each of these processes will be discussed in turn, as well as some of their unique challenges and proposed approaches to solve them. The focus of this article will be on images of non-small-cell lung cancer.

Introduction

“Radiomics” involves the high-throughput extraction of quantitative imaging features with the intent of creating mineable databases from radiological images [1]. It is proposed that such profound analyses and mining of image feature data will reveal quantitative predictive or prognostic associations between images and medical outcomes. In cancer, current radiological practice is generally qualitative, e.g., “a peripherally enhancing spiculated mass in the lower left lobe.” When quantitative, measurements are commonly limited to dimensional measurements of tumor size via one-dimensional (Response Evaluation Criteria In Solid Tumors [RECIST]) or two-dimensional (2D) (World Health Organization) long-axis measures [2]. These measures do not reflect the complexity of tumor morphology or behavior, nor, in many cases, are changes in these measures predictive of therapeutic benefit [3]. When additional quantitative measures are obtained, they generally average values over an entire region of interest (ROI).

There are efforts to develop a standardized lexicon for the description of such lesions [4], [5] and to include these descriptors via annotated image markup into quantitative, mineable data [6], [7]. However, such approaches do not completely cover the range of quantitative features that can be extracted from images, such as texture, shape or margin gradients. In focused studies, texture features have been shown to provide significantly higher prognostic power than ROI-based methods [8], [9], [10], [11]. The modern rebirth of radiomics (or radiogenomics) was articulated in two papers by Kuo and colleagues. Following a complete manual extraction of numerous (> 100) image features, a subset of 14 features was able to predict 80% of the gene expression pattern in hepatocellular carcinoma using computed tomographic (CT) images [12]. A similar extraction of features from contrast-enhanced magnetic resonance images (MRI) of glioblastoma was able to predict immunohistochemically identified protein expression patterns [13]. Although paradigm shifting, these analyses were performed manually, and the studies were consequently underpowered. In the current iteration of radiomics, image features have to be extracted automatically and with high throughput, putting a high premium on novel machine learning algorithm development.

The goal of radiomics is to convert images into mineable data, with high fidelity and high throughput. The radiomics enterprise can be divided into five processes with definable inputs and outputs, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification, (d) databases and data sharing and (e) ad hoc informatics analyses. Each of these steps must be developed de novo and, as such, poses discrete challenges that have to be met (Fig. 1). For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Informatics databases that allow for incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Variation in results may come from variations in any of these individual processes. Thus, after optimization, another level of challenge is to harmonize and standardize the entire process, while still allowing for improvement and process evolution.

Section snippets

Image acquisition and reconstruction challenges

In routine clinical image acquisition, there is wide variation in imaging parameters such as image resolution (pixel size or matrix size and slice thickness), washout period in the case of positron emission tomography (PET) imaging, patient position, and the variations introduced by different reconstruction algorithms and slice thicknesses, which are different for each scanner vendor. Even this simple set of imaging issues can create difficulty in comparing results obtained across institutions

Segmentation challenges

Segmentation of images into VOIs such as tumor, normal tissue and other anatomical structures is a crucial step for subsequent informatics analyses. Manual segmentation by expert readers is often treated as ground truth. However, it suffers from high interreader variability and is labor intensive; thus, it is not feasible for radiomics analysis requiring very large data sets. Many automatic and semiautomatic segmentation methods have been developed across various image modalities like CT, PET

Feature extraction and qualification

Once tumor regions are defined, imaging features can be extracted. These features describe characteristics of the tumor intensity histogram (e.g., high or low contrast), tumor shape (e.g., round or spiculated), texture patterns (e.g., homogeneous or heterogeneous), as well as descriptors of tumor location and relations with the surrounding tissues (e.g., near the heart).

Deidentification

To follow the principle of providing the minimum amount of confidential information (i.e., patient identifiers) necessary to accommodate downstream analysis of imaging data, raw DICOM image data can be stripped of identified headers and assigned a deidentified number. Maintaining deidentified images and clinical data is an important patient privacy safeguard [77]. In the context of DICOM images, Supplement 142 from the DICOM Standards Committee provides guidance in the process of deidentifying

Statistical and radioinformatics analysis

Analysis within radiomics must evolve appropriate approaches for identifying reliable, reproducible findings that could potentially be employed within a clinical context. Applying the existing bioinformatics “toolbox” to radiomics data is an efficient first step since it eliminates the necessity to develop new analytical methods and leverages accepted and validated methodologies. Radiomics-specific analysis issues will exist, as in any field; therefore, an important step in achieving consensus

Acknowledgment

Radiomics of NSCLC U01 CA143062.

References (97)

P. Lambin et al.
Radiomics: extracting more information from medical images using advance feature analysis
Eur J Cancer
(2012)
A. Burton
RECIST: right time to renovate?
Lancet Oncol
(2007)
M. Ollers et al.
The integration of PET–CT scans from different hospitals into radiotherapy treatment planning
Radiother Oncol
(2008)
M.H. Janssen et al.
Blood glucose level normalization and accurate timing improves the accuracy of PET-based treatment response predictions in rectal cancer
Radiother Oncol
(2010)
A.R. Padhani et al.
Diffusion-weighted magnetic resonance imaging as a cancer biomarker: consensus and recommendations
Neoplasia
(2009)
J. Stroom et al.
Feasibility of pathology-correlated lung imaging for accurate target definition of lung tumors
Int J Radiat Oncol Biol Phys
(2007)
H. Gao et al.
Individual tooth segmentation from CT images using level set method with shape and intensity prior
Pattern Recognition
(2010)
Y.T. Chen
A level set method based on the Bayesian risk for medical image segmentation
Pattern Recognition
(2010)
S. Osher et al.
Fronts propagating with curvature-dependent speed: algorithms based on Hamilton–Jacobi formulations
J Comput Phys
(1988)
R.W.K. So et al.
Non-rigid image registration of brain magnetic resonance images using graph-cuts
Pattern Recognition
(2011)

W. Liu et al.

Segmentation of elastographic images using a coarse-to-fine active contour model

Ultrasound Med Biol

(2006)

L. Wang et al.

Active contours driven by local and global intensity fitting energy with application to brain MR image segmentation

Comput Med Imaging Graph

(2009)

E.N. Mortensen et al.

Interactive segmentation with intelligent scissors

Graph Models Image Process

(1998)

K. Lu et al.

Segmentation of the central-chest lymph nodes in 3D MDCT images

Comput Biol Med

(2011)

P. Tai et al.

Variability of target volume delineation in cervical esophageal cancer

Int J Radiat Oncol Biol Phys

(1998)

J.S. Cooper et al.

An evaluation of the variability of tumor-shape definition derived by experienced observers from CT images of supraglottic carcinomas (ACRIN protocol 6658)

Int J Radiat Oncol Biol Phys

(2007)

O. Holub et al.

Quantitative histogram analysis of images

Comput Phys Commun

(2006)

I. El Naqa et al.

Exploring feature-based approaches in PET images for predicting cancer treatment outcomes

Pattern Recognition

(2009)

M.M. Galloway

Texture analysis using gray level run lengths

Comput Graph Image Process

(1975)

G. Castellano et al.

Texture analysis of medical images

Clin Radiol

(2004)

J. Suárez et al.

Optimum compactness structures derived from the regular octahedron

Engineering Structures

(2008)

J. Fu et al.

Image segmentation feature selection and pattern classification for mammographic microcalcifications

Comput Med Imaging Graph

(2005)

C.C. Jaffe

Measures of response: RECIST, WHO, and new alternatives

J Clin Oncol

(2006)

D.L. Rubin

Creating and curating a terminology for radiology: ontology modeling and analysis

J Digit Imaging

(2008)

P. Opulencia et al.

Mapping LIDC, RadLex™, and lung nodule image features

J Digit Imaging

(2011)

D.S. Channin et al.

The annotation and image mark-up project 1

Comparison of model-based arterial input functions for dynamic contrast-enhanced MRI in tumor bearing rats

Magn Reson Med

(2009)

Jackson E, Ashton E, Evelhoch J, Buonocore M, Karczmar G, Rosen M, et al. Multivendor, multisite DCE-MRI phantom...

S.G. Armato et al.

The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans

Med Phys

(2011)

S.G. Armato et al.

The Reference Image Database to Evaluate Response to therapy in lung cancer (RIDER) project: a resource for the development of change-analysis software

Clin Pharmacol Ther

(2008)

Basu S, Hall L, Goldgof D, Gu Y, Kumar V, Choi J, et-al. Developing a classifier model for lung tumors in CT-scan...

S. Hojjatoleslami et al.

Region growing: a new approach

IEEE Trans Image Process

(1998)

J. Dehmeshki et al.

Segmentation of pulmonary nodules in thoracic CT scans: a region growing approach

IEEE Trans Med Imaging

(2008)

Cited by (1686)

Artificial intelligence in skeletal metastasis imaging
2024, Computational and Structural Biotechnology Journal
In the field of metastatic skeletal oncology imaging, the role of artificial intelligence (AI) is becoming more prominent. Bone metastasis typically indicates the terminal stage of various malignant neoplasms. Once identified, it necessitates a comprehensive revision of the initial treatment regime, and palliative care is often the only resort. Given the gravity of the condition, the diagnosis of bone metastasis should be approached with utmost caution. AI techniques are being evaluated for their efficacy in a range of tasks within medical imaging, including object detection, disease classification, region segmentation, and prognosis prediction in medical imaging. These methods offer a standardized solution to the frequently subjective challenge of image interpretation.This subjectivity is most desirable in bone metastasis imaging. This review describes the basic imaging modalities of bone metastasis imaging, along with the recent developments and current applications of AI in the respective imaging studies. These concrete examples emphasize the importance of using computer-aided systems in the clinical setting. The review culminates with an examination of the current limitations and prospects of AI in the realm of bone metastasis imaging. To establish the credibility of AI in this domain, further research efforts are required to enhance the reproducibility and attain robust level of empirical support.
Clinical applications of artificial intelligence in identification and management of bacterial infection: Systematic review and meta-analysis
2024, Saudi Journal of Biological Sciences
Pneumonia is declared a global emergency public health crisis in children less than five age and the geriatric population. Recent advancements in deep learning models could be utilized effectively for the timely and early diagnosis of pneumonia in immune-compromised patients to avoid complications. This systematic review and meta-analysis utilized PRISMA guidelines for the selection of ten articles included in this study. The literature search was done through electronic databases including PubMed, Scopus, and Google Scholar from 1st January 2016 till 1 July 2023. Overall studies included a total of 126,610 images and 1706 patients in this meta-analysis. At a 95% confidence interval, for pooled sensitivity was 0.90 (0.85–0.94) and I2 statistics 90.20 (88.56 – 91.92). The pooled specificity for deep learning models' diagnostic accuracy was 0.89 (0.86–––0.92) and I2 statistics 92.72 (91.50 – 94.83). I2 statistics showed low heterogeneity across studies highlighting consistent and reliable estimates, and instilling confidence in these findings for researchers and healthcare practitioners. The study highlighted the recent deep learning models single or in combination with high accuracy, sensitivity, and specificity to ensure reliable use for bacterial pneumonia identification and differentiate from other viral, fungal pneumonia in children and adults through chest x-rays and radiographs.
Radiomics-based Machine Learning to Predict the Recurrence of Hepatocellular Carcinoma: A Systematic Review and Meta-analysis
2024, Academic Radiology
Recurrence of hepatocellular carcinoma (HCC) is a major concern in its management. Accurately predicting the risk of recurrence is crucial for determining appropriate treatment strategies and improving patient outcomes. A certain amount of radiomics models for HCC recurrence prediction have been proposed. This study aimed to assess the role of radiomics models in the prediction of HCC recurrence and to evaluate their methodological quality.
Databases Cochrane Library, Web of Science, PubMed, and Embase were searched until July 11, 2023 for studies eligible for the meta-analysis. Their methodological quality was evaluated using the Radiomics Quality Score (RQS). The predictive ability of the radiomics model, clinical model, and the combined model integrating the clinical characteristics with radiomics signatures was measured using the concordance index (C-index), sensitivity, and specificity. Radiomics models in included studies were compared based on different imaging modalities, including computed tomography (CT), magnetic resonance imaging (MRI), ultrasound/sonography (US), contrast-enhanced ultrasound (CEUS).
A total of 49 studies were included. On the validation cohort, radiomics model performed better (CT: C-index = 0.747, 95% CI: 0.70–0.79; MRI: C-index = 0.788, 95% CI: 0.75–0.83; CEUS: C-index = 0.763, 95% CI: 0.60–0.93) compared to the clinical model (C-index = 0.671, 95% CI: 0.65–0.70), except for ultrasound-based models (C-index = 0.560, 95% CI: 0.53–0.59). The combined model outperformed other models (CT: C-index = 0.790, 95% CI: 0.76–0.82; MRI: C-index = 0.826, 95% CI: 0.79–0.86; US: C-index = 0.760, 95% CI: 0.65–0.87), except for CEUS-based combined models (C-index = 0.707, 95% CI: 0.44–0.97).
Radiomics holds the potential to predict HCC recurrence and demonstrates enhanced predictive value across various imaging modalities when integrated with clinical features. Nevertheless, further studies are needed to optimize the radiomics approach and validate the results in larger, multi-center cohorts.
Texture and Radiomics inspired Data-Driven Cancerous Lung Nodules Severity Classification
2024, Biomedical Signal Processing and Control
In the process of developing artificial consciousness to mimic human intelligence, situational decision making, and self realization, steadfast progress has been noted in the field of machine learning. The machine learning with its advantages of accommodating small dataset in the training of models accounts for its necessity in practical life. The inherent information deeply ingrained in the sea of data is extracted via radiomics and 2D texture analysis that constitutes the feature set to model the framework of the algorithm for classification, and is named as texture and radiomics features based severity classification (TRFSC). In the process, thoracic CT scans of 1018 patients in LIDC-IDRI dataset are considered for the classification of benign and malignant lung nodules. The image and annotated tumor mask information are used to extract volumetric and 2D weighted reconstructed image features. The estimated 71 features of each subject are utilized to construct the best classification model using SVM, LDA, linear regression, KNN, Bayes, and boosted trees classifiers. The performance of the classifiers are evaluated using accuracy, specificity, sensitivity, and AUC metrics. This work achieves accuracy of 0.913, specificity of 0.92, sensitivity of 0.90, and AUC of 0.96 for the SVM classifier when compared with other different classifiers. The application of various parametric values in the process of feature extraction and discrimination of classes, provides the flexibility to choose the best possible model for classification purposes. The proposed method is compared quantitatively with other classification algorithms on the ground of performance to showcase its applicability and relevance as a classification algorithm to discriminate the two benign and malignant categories.
A distributed feature selection pipeline for survival analysis using radiomics in non-small cell lung cancer patients
2024, Scientific Reports
Radiomics analysis for distinctive identification of COVID-19 pulmonary nodules from other benign and malignant counterparts
2024, Scientific Reports

View all citing articles on Scopus

View full text

Original contributionRadiomics: the process and the challenges

Abstract

Introduction

Section snippets

Image acquisition and reconstruction challenges

Segmentation challenges

Feature extraction and qualification

Deidentification

Statistical and radioinformatics analysis

Acknowledgment

Eur J Cancer

Lancet Oncol

Radiother Oncol

Radiother Oncol

Neoplasia

Int J Radiat Oncol Biol Phys

Pattern Recognition

Pattern Recognition

J Comput Phys

Pattern Recognition

Ultrasound Med Biol

Comput Med Imaging Graph

Graph Models Image Process

Comput Biol Med

Int J Radiat Oncol Biol Phys

Int J Radiat Oncol Biol Phys

Comput Phys Commun

Pattern Recognition

Comput Graph Image Process

Clin Radiol

Engineering Structures

Comput Med Imaging Graph

Measures of response: RECIST, WHO, and new alternatives

J Clin Oncol

Creating and curating a terminology for radiology: ontology modeling and analysis

J Digit Imaging

Mapping LIDC, RadLex™, and lung nodule image features

J Digit Imaging

The annotation and image mark-up project 1

Radiology

Imaging tumor vascular heterogeneity and angiogenesis using dynamic contrast-enhanced magnetic resonance imaging

Clin Cancer Res

Quantifying spatial heterogeneity in dynamic contrast‐enhanced MRI parameter maps

Magn Reson Med

Textural analysis of contrast‐enhanced MR images of the breast

Magn Reson Med

Characterization of image heterogeneity using 2D Minkowski functionals increases the sensitivity of detection of a targeted MRI contrast agent

Magn Reson Med

Decoding global gene expression programs in liver cancer by noninvasive imaging

Nat Biotechnol

Identification of noninvasive imaging surrogates for brain tumor gene-expression modules

Proc Natl Acad Sci

FDG PET and PET/CT: EANM procedure guidelines for tumour PET imaging: version 1.0

Eur J Nucl Med Mol Imaging

Standards for PET image acquisition and quantitative data analysis

J Nucl Med

Developing a quality control protocol for diffusion imaging on a clinical MRI system

Phys Med Biol

Quantifying tumor vascular heterogeneity with dynamic contrast-enhanced magnetic resonance imaging: a review

J Biomed Biotechnol

Reproducibility of dynamic contrast-enhanced MRI in human muscle and tumours: comparison of quantitative and semi-quantitative analysis

NMR Biomed

Quantification of perfusion and permeability in breast tumors with a deconvolution-based analysis of second-bolus T1-DCE data

J Magn Reson Imaging

Comparison of quantitative parameters in cervix cancer measured by dynamic contrast-enhanced MRI and CT

Magn Reson Med

Dynamic contrast-enhanced MRI in ovarian cancer: initial experience at 3 tesla in primary and metastatic disease

Magn Reson Med

Comparison of model-based arterial input functions for dynamic contrast-enhanced MRI in tumor bearing rats

Magn Reson Med

The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans

Med Phys

The Reference Image Database to Evaluate Response to therapy in lung cancer (RIDER) project: a resource for the development of change-analysis software

Clin Pharmacol Ther

Region growing: a new approach

IEEE Trans Image Process

Segmentation of pulmonary nodules in thoracic CT scans: a region growing approach

IEEE Trans Med Imaging

Original contribution
Radiomics: the process and the challenges