Elsevier

Magnetic Resonance Imaging

Volume 30, Issue 9, November 2012, Pages 1234-1248

Original contribution
Radiomics: the process and the challenges

https://doi.org/10.1016/j.mri.2012.06.010

Abstract

“Radiomics” refers to the extraction and analysis of large amounts of advanced quantitative imaging features with high throughput from medical images obtained with computed tomography, positron emission tomography or magnetic resonance imaging. Importantly, these data are designed to be extracted from standard-of-care images, leading to a very large potential subject pool. Radiomics data are in a mineable form that can be used to build descriptive and predictive models relating image features to phenotypes or gene–protein signatures. The core hypothesis of radiomics is that these models, which can include biological or medical data, can provide valuable diagnostic, prognostic or predictive information. The radiomics enterprise can be divided into distinct processes, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification and (d) databases and data sharing for eventual (e) ad hoc informatics analyses. Each of these individual processes poses unique challenges. For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Also, segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Furthermore, informatics databases that allow incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Each of these processes will be discussed in turn, as well as some of their unique challenges and proposed approaches to solve them. The focus of this article will be on images of non-small-cell lung cancer.

Introduction

“Radiomics” involves the high-throughput extraction of quantitative imaging features with the intent of creating mineable databases from radiological images [1]. It is proposed that such profound analyses and mining of image feature data will reveal quantitative predictive or prognostic associations between images and medical outcomes. In cancer, current radiological practice is generally qualitative, e.g., “a peripherally enhancing spiculated mass in the lower left lobe.” When quantitative, measurements are commonly limited to dimensional measurements of tumor size via one-dimensional (Response Evaluation Criteria In Solid Tumors [RECIST]) or two-dimensional (2D) (World Health Organization) long-axis measures [2]. These measures do not reflect the complexity of tumor morphology or behavior, nor, in many cases, are changes in these measures predictive of therapeutic benefit [3]. When additional quantitative measures are obtained, they generally average values over an entire region of interest (ROI).

There are efforts to develop a standardized lexicon for the description of such lesions [4], [5] and to incorporate these descriptors, via annotated image markup, into quantitative, mineable data [6], [7]. However, such approaches do not completely cover the range of quantitative features that can be extracted from images, such as texture, shape or margin gradients. In focused studies, texture features have been shown to provide significantly higher prognostic power than ROI-based methods [8], [9], [10], [11]. The modern rebirth of radiomics (or radiogenomics) was articulated in two papers by Kuo and colleagues. Following a complete manual extraction of numerous (>100) image features, a subset of 14 features was able to predict 80% of the gene expression pattern in hepatocellular carcinoma using computed tomographic (CT) images [12]. A similar extraction of features from contrast-enhanced magnetic resonance images (MRI) of glioblastoma was able to predict immunohistochemically identified protein expression patterns [13]. Although paradigm shifting, these analyses were performed manually, and the studies were consequently underpowered. In the current iteration of radiomics, image features have to be extracted automatically and with high throughput, putting a high premium on novel machine learning algorithm development.

The goal of radiomics is to convert images into mineable data, with high fidelity and high throughput. The radiomics enterprise can be divided into five processes with definable inputs and outputs, each with its own challenges that need to be overcome: (a) image acquisition and reconstruction, (b) image segmentation and rendering, (c) feature extraction and feature qualification, (d) databases and data sharing and (e) ad hoc informatics analyses. Each of these steps must be developed de novo and, as such, poses discrete challenges that have to be met (Fig. 1). For example, optimum protocols for image acquisition and reconstruction have to be identified and harmonized. Segmentations have to be robust and involve minimal operator input. Features have to be generated that robustly reflect the complexity of the individual volumes, but cannot be overly complex or redundant. Informatics databases that allow for incorporation of image features and image annotations, along with medical and genetic data, have to be generated. Finally, the statistical approaches to analyze these data have to be optimized, as radiomics is not a mature field of study. Variation in results may come from variations in any of these individual processes. Thus, after optimization, another level of challenge is to harmonize and standardize the entire process, while still allowing for improvement and process evolution.
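The requirement that features not be redundant can be illustrated with a simple correlation filter. The following is a minimal sketch and not a method taken from this article: the feature names, the toy values and the 0.95 threshold are all assumptions chosen for illustration, and the greedy rule is only one of many possible redundancy checks.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    # A constant feature has no defined correlation; treat it as 0 here.
    return cov / (sx * sy) if sx and sy else 0.0


def drop_redundant(features, threshold=0.95):
    """Greedy filter: keep a feature only if its absolute correlation
    with every feature kept so far is below the threshold."""
    kept = []
    for name, values in features.items():
        if all(abs(pearson(values, features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical feature table: one value per tumor for each feature.
features = {
    "volume": [1.0, 2.0, 3.0, 4.0, 5.0],
    "voxel_count": [2.0, 4.0, 6.0, 8.0, 10.5],  # nearly proportional to volume
    "entropy": [3.0, 1.0, 4.0, 1.0, 5.0],
}
kept = drop_redundant(features)
```

In this toy table the second feature carries almost the same information as the first and is dropped, while the weakly correlated third feature is retained.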

Image acquisition and reconstruction challenges

In routine clinical image acquisition, there is wide variation in imaging parameters such as image resolution (pixel size or matrix size and slice thickness), washout period in the case of positron emission tomography (PET) imaging and patient position, as well as variation introduced by different reconstruction algorithms, which differ between scanner vendors. Even this simple set of imaging issues can create difficulty in comparing results obtained across institutions

Segmentation challenges

Segmentation of images into volumes of interest (VOIs) such as tumor, normal tissue and other anatomical structures is a crucial step for subsequent informatics analyses. Manual segmentation by expert readers is often treated as ground truth. However, it suffers from high interreader variability and is labor intensive; thus, it is not feasible for radiomics analyses requiring very large data sets. Many automatic and semiautomatic segmentation methods have been developed across various imaging modalities such as CT, PET
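Interreader variability of the kind described above is commonly summarized with an overlap score between two segmentations. The sketch below uses the Dice coefficient, which is a standard overlap measure but is not named in this snippet; the two reader contours are hypothetical square regions, represented as sets of (row, column) voxel coordinates for simplicity.

```python
def dice_coefficient(mask_a, mask_b):
    """Dice overlap between two binary masks given as collections of
    voxel coordinates: 2 * |A and B| / (|A| + |B|)."""
    a, b = set(mask_a), set(mask_b)
    if not a and not b:
        return 1.0  # two empty masks agree trivially
    return 2.0 * len(a & b) / (len(a) + len(b))


# Hypothetical contours drawn by two readers on the same slice.
reader_1 = {(r, c) for r in range(10, 20) for c in range(10, 20)}
reader_2 = {(r, c) for r in range(12, 22) for c in range(10, 20)}  # shifted 2 rows

score = dice_coefficient(reader_1, reader_2)
```

A score of 1 means identical masks and 0 means no overlap. Here each mask holds 100 voxels and 80 are shared, so the score is 0.8.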

Feature extraction and qualification

Once tumor regions are defined, imaging features can be extracted. These features describe characteristics of the tumor intensity histogram (e.g., high or low contrast), tumor shape (e.g., round or spiculated), texture patterns (e.g., homogeneous or heterogeneous), as well as descriptors of tumor location and relations with the surrounding tissues (e.g., near the heart).
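As an illustration of the intensity histogram descriptors mentioned above, the following minimal sketch computes a few first-order features from the voxel intensities inside a tumor region. The function name, the choice of eight bins and the toy intensity lists are assumptions made for this example; the article's own feature definitions are not reproduced here.

```python
import math
from collections import Counter


def first_order_features(intensities, n_bins=8):
    """Mean, variance, range and histogram entropy of a voxel intensity list."""
    values = list(intensities)
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    lo, hi = min(values), max(values)
    # Equal-width bins; fall back to width 1 if all intensities are identical.
    width = (hi - lo) / n_bins or 1.0
    counts = Counter(min(int((v - lo) / width), n_bins - 1) for v in values)
    # Shannon entropy (bits) of the binned histogram.
    entropy = sum((k / n) * math.log2(n / k) for k in counts.values())
    return {"mean": mean, "variance": variance,
            "range": hi - lo, "entropy": entropy}


# Hypothetical regions: one uniform, one spanning many intensity levels.
homogeneous = first_order_features([100] * 64)
heterogeneous = first_order_features(range(64))
```

The uniform region yields zero variance and zero entropy, whereas the region whose intensities spread evenly over all eight bins reaches the maximum entropy of 3 bits, which matches the homogeneous versus heterogeneous distinction drawn in the text.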

Deidentification

To follow the principle of providing the minimum amount of confidential information (i.e., patient identifiers) necessary to accommodate downstream analysis of imaging data, raw DICOM image data can be stripped of identifying headers and assigned a deidentified number. Maintaining deidentified images and clinical data is an important patient privacy safeguard [77]. In the context of DICOM images, Supplement 142 from the DICOM Standards Committee provides guidance in the process of deidentifying
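The idea of stripping identifying headers and assigning a deidentified number can be sketched as follows. This is an illustration only: the header is modeled as a plain dictionary rather than a real DICOM file, the list of identifying keys is a small illustrative subset and not the Supplement 142 profile, and the `DeidentifiedID` field and `SUBJ-` numbering scheme are hypothetical.

```python
# Illustrative subset of identifying attributes; NOT the Supplement 142 list.
IDENTIFYING_KEYS = {
    "PatientName",
    "PatientID",
    "PatientBirthDate",
    "InstitutionName",
    "ReferringPhysicianName",
}


def deidentify(header, key_map):
    """Return a copy of the header without identifying fields, tagged with
    a study number. key_map links original IDs to study numbers and would
    be stored separately under access control."""
    patient_id = header["PatientID"]
    if patient_id not in key_map:
        key_map[patient_id] = "SUBJ-{:04d}".format(len(key_map) + 1)
    clean = {k: v for k, v in header.items() if k not in IDENTIFYING_KEYS}
    clean["DeidentifiedID"] = key_map[patient_id]  # hypothetical field name
    return clean


# Hypothetical headers from two scans of the same patient.
key_map = {}
scan_1 = deidentify(
    {"PatientName": "DOE^JANE", "PatientID": "12345",
     "Modality": "CT", "SliceThickness": 2.5}, key_map)
scan_2 = deidentify(
    {"PatientName": "DOE^JANE", "PatientID": "12345",
     "Modality": "PT", "SliceThickness": 3.0}, key_map)
```

Because the mapping is keyed on the original patient identifier, repeat scans of one patient receive the same study number, so images and clinical data can still be linked after the identifiers are removed.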

Statistical and radioinformatics analysis

Analysis within radiomics must evolve appropriate approaches for identifying reliable, reproducible findings that could potentially be employed within a clinical context. Applying the existing bioinformatics “toolbox” to radiomics data is an efficient first step since it eliminates the necessity to develop new analytical methods and leverages accepted and validated methodologies. Radiomics-specific analysis issues will exist, as in any field; therefore, an important step in achieving consensus

Acknowledgment

Radiomics of NSCLC U01 CA143062.

References (97)

  • W. Liu et al.

    Segmentation of elastographic images using a coarse-to-fine active contour model

    Ultrasound Med Biol

    (2006)
  • L. Wang et al.

    Active contours driven by local and global intensity fitting energy with application to brain MR image segmentation

    Comput Med Imaging Graph

    (2009)
  • E.N. Mortensen et al.

    Interactive segmentation with intelligent scissors

    Graph Models Image Process

    (1998)
  • K. Lu et al.

    Segmentation of the central-chest lymph nodes in 3D MDCT images

    Comput Biol Med

    (2011)
  • P. Tai et al.

    Variability of target volume delineation in cervical esophageal cancer

    Int J Radiat Oncol Biol Phys

    (1998)
  • J.S. Cooper et al.

    An evaluation of the variability of tumor-shape definition derived by experienced observers from CT images of supraglottic carcinomas (ACRIN protocol 6658)

    Int J Radiat Oncol Biol Phys

    (2007)
  • O. Holub et al.

    Quantitative histogram analysis of images

    Comput Phys Commun

    (2006)
  • I. El Naqa et al.

    Exploring feature-based approaches in PET images for predicting cancer treatment outcomes

    Pattern Recognition

    (2009)
  • M.M. Galloway

    Texture analysis using gray level run lengths

    Comput Graph Image Process

    (1975)
  • G. Castellano et al.

    Texture analysis of medical images

    Clin Radiol

    (2004)
  • J. Suárez et al.

    Optimum compactness structures derived from the regular octahedron

    Engineering Structures

    (2008)
  • J. Fu et al.

    Image segmentation feature selection and pattern classification for mammographic microcalcifications

    Comput Med Imaging Graph

    (2005)
  • C.C. Jaffe

    Measures of response: RECIST, WHO, and new alternatives

    J Clin Oncol

    (2006)
  • D.L. Rubin

    Creating and curating a terminology for radiology: ontology modeling and analysis

    J Digit Imaging

    (2008)
  • P. Opulencia et al.

    Mapping LIDC, RadLex™, and lung nodule image features

    J Digit Imaging

    (2011)
  • D.S. Channin et al.

    The annotation and image mark-up project 1

    Radiology

    (2009)
  • Rubin DL, Mongkolwat P, Kleper V, Supekar K, Channin DS. Medical imaging on the semantic web: annotation and image...
  • A. Jackson et al.

    Imaging tumor vascular heterogeneity and angiogenesis using dynamic contrast-enhanced magnetic resonance imaging

    Clin Cancer Res

    (2007)
  • C.J. Rose et al.

    Quantifying spatial heterogeneity in dynamic contrast‐enhanced MRI parameter maps

    Magn Reson Med

    (2009)
  • P. Gibbs et al.

    Textural analysis of contrast‐enhanced MR images of the breast

    Magn Reson Med

    (2003)
  • H.C. Canuto et al.

    Characterization of image heterogeneity using 2D Minkowski functionals increases the sensitivity of detection of a targeted MRI contrast agent

    Magn Reson Med

    (2009)
  • E. Segal et al.

    Decoding global gene expression programs in liver cancer by noninvasive imaging

    Nat Biotechnol

    (2007)
  • M. Diehn et al.

    Identification of noninvasive imaging surrogates for brain tumor gene-expression modules

    Proc Natl Acad Sci

    (2008)
  • R. Boellaard et al.

    FDG PET and PET/CT: EANM procedure guidelines for tumour PET imaging: version 1.0

    Eur J Nucl Med Mol Imaging

    (2010)
  • R. Boellaard

    Standards for PET image acquisition and quantitative data analysis

    J Nucl Med

    (2009)
  • I. Delakis et al.

    Developing a quality control protocol for diffusion imaging on a clinical MRI system

    Phys Med Biol

    (2004)
  • X. Yang et al.

    Quantifying tumor vascular heterogeneity with dynamic contrast-enhanced magnetic resonance imaging: a review

    J Biomed Biotechnol

    (2011)
  • S.M. Galbraith et al.

    Reproducibility of dynamic contrast-enhanced MRI in human muscle and tumours: comparison of quantitative and semi-quantitative analysis

    NMR Biomed

    (2002)
  • S. Makkat et al.

    Quantification of perfusion and permeability in breast tumors with a deconvolution-based analysis of second-bolus T1-DCE data

    J Magn Reson Imaging

    (2007)
  • C. Yang et al.

    Comparison of quantitative parameters in cervix cancer measured by dynamic contrast-enhanced MRI and CT

    Magn Reson Med

    (2010)
  • A.N. Priest et al.

    Dynamic contrast-enhanced MRI in ovarian cancer: initial experience at 3 tesla in primary and metastatic disease

    Magn Reson Med

    (2010)
  • D.M. McGrath et al.

    Comparison of model-based arterial input functions for dynamic contrast-enhanced MRI in tumor bearing rats

    Magn Reson Med

    (2009)
  • Jackson E, Ashton E, Evelhoch J, Buonocore M, Karczmar G, Rosen M, et al. Multivendor, multisite DCE-MRI phantom...
  • S.G. Armato et al.

    The Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): a completed reference database of lung nodules on CT scans

    Med Phys

    (2011)
  • S.G. Armato et al.

    The Reference Image Database to Evaluate Response to therapy in lung cancer (RIDER) project: a resource for the development of change-analysis software

    Clin Pharmacol Ther

    (2008)
  • Basu S, Hall L, Goldgof D, Gu Y, Kumar V, Choi J, et al. Developing a classifier model for lung tumors in CT-scan...
  • S. Hojjatoleslami et al.

    Region growing: a new approach

    IEEE Trans Image Process

    (1998)
  • J. Dehmeshki et al.

    Segmentation of pulmonary nodules in thoracic CT scans: a region growing approach

    IEEE Trans Med Imaging

    (2008)