Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A transcriptomics-based meta-analysis combined with machine learning approach identifies a secretory biomarker panel for diagnosis of pancreatic adenocarcinoma

View ORCID ProfileIndu Khatri, Manoj K. Bhasin
doi: https://doi.org/10.1101/2020.04.16.20061515
Indu Khatri
1Division of IMBIO, Department of Medicine, Beth Israel Lahey Health, Harvard Medical School, Boston MA 02215
3Division of Immunohematology and Blood transfusion, Leiden University Medical Center, Leiden, The Netherlands, 2333ZA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Indu Khatri
Manoj K. Bhasin
1Division of IMBIO, Department of Medicine, Beth Israel Lahey Health, Harvard Medical School, Boston MA 02215
2Department of Pediatrics and Biomedical Informatics, Children Healthcare of Atlanta, Emory School of Medicine, Atlanta, GA 30322
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: manoj.bhasin@emory.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Pancreatic ductal adenocarcinoma (PDAC) is largely incurable due to late diagnosis and absence of markers that are concordant with expression in several sample sources (i.e. tissue, blood, plasma) and platform (i.e. Microarray, sequencing). We optimized meta-analysis of 19 PDAC (tissue and blood) transcriptome studies from multiple platforms. The key biomarkers for PDAC diagnosis with secretory potential were identified and validated in different cohorts. Machine learning approach i.e. support vector machine supported by leave-one-out cross-validation was used to build and test the classifier. We identified a 9-gene panel (IFI27, ITGB5, CTSD, EFNA4, GGH, PLBD1, HTATIP2, IL1R2, CTSA) that achieved ∼0.92 average sensitivity and ∼0.90 specificity in discriminating PDAC from non-tumor samples in five training-sets on cross-validation. This classifier accurately discriminated PDAC from chronic-pancreatitis (AUC=0.95), early stages of progression (Stage I and II (AUC=0.82), IPMA and IPMN (AUC=1), IPMC (AUC=0.81)). The 9-gene marker outperformed the previously known markers in blood studies particularly (AUC=0.84). The discrimination of PDAC from early precursor lesions in non-malignant tissue (AUC>0.81) and peripheral blood (AUC>0.80) may facilitate early blood-diagnosis and risk stratification upon validation in prospective clinical-trials. Furthermore, the validation of these markers in proteomics and single-cell transcriptomics studies suggest their prognostic role in the diagnosis of PDAC.

Competing Interest Statement

BIDMC will be filling patent on behalf of MB and IK on the use of biomarker panel for early PDAC diagnosis. MB is an equity holder at BiomaRx and Canomiks.

Funding Statement

This study was supported through BIDMC CAO Innovation grant.

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The datasets used and/or analysed during the current study are available in public repositories GEO and ArrayExpress. The codes and DE genes per dataset will be available via GitHub (https://github.com/IKhatri-Git/Secretory-gene-classifier).

  • Abbreviations

    AUC
    area under the curve
    CA 19-9
    Carbohydrate antigen 19-9
    CP
    chronic pancreatitis
    GEO
    gene expression omnibus
    GGH
    γ-glutamyl hydrolase
    HPA
    Human Protein Atlas
    IPMA
    intraductal papillary-mucinous adenoma
    IPMC
    intraductal papillary-mucinous carcinoma
    IPMN
    intraductal papillary mucinous neoplasm
    LOOCV
    leave-one-out cross-validation
    noTM
    no transmembrane segments
    PanIN
    pancreatic intraepithelial neoplasia
    PC
    pancreatic cancer
    PDAC
    Pancreatic ductal adenocarcinoma
    ROC
    receiver operating characteristic
    SVM
    support vector machines
    TCGA
    tissue cancer genome atlas
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted April 22, 2020.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    A transcriptomics-based meta-analysis combined with machine learning approach identifies a secretory biomarker panel for diagnosis of pancreatic adenocarcinoma
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    A transcriptomics-based meta-analysis combined with machine learning approach identifies a secretory biomarker panel for diagnosis of pancreatic adenocarcinoma
    Indu Khatri, Manoj K. Bhasin
    medRxiv 2020.04.16.20061515; doi: https://doi.org/10.1101/2020.04.16.20061515
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    A transcriptomics-based meta-analysis combined with machine learning approach identifies a secretory biomarker panel for diagnosis of pancreatic adenocarcinoma
    Indu Khatri, Manoj K. Bhasin
    medRxiv 2020.04.16.20061515; doi: https://doi.org/10.1101/2020.04.16.20061515

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Oncology
    Subject Areas
    All Articles
    • Addiction Medicine (161)
    • Allergy and Immunology (416)
    • Anesthesia (91)
    • Cardiovascular Medicine (860)
    • Dentistry and Oral Medicine (159)
    • Dermatology (97)
    • Emergency Medicine (250)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (394)
    • Epidemiology (8562)
    • Forensic Medicine (4)
    • Gastroenterology (384)
    • Genetic and Genomic Medicine (1753)
    • Geriatric Medicine (167)
    • Health Economics (373)
    • Health Informatics (1244)
    • Health Policy (621)
    • Health Systems and Quality Improvement (468)
    • Hematology (196)
    • HIV/AIDS (374)
    • Infectious Diseases (except HIV/AIDS) (10303)
    • Intensive Care and Critical Care Medicine (553)
    • Medical Education (192)
    • Medical Ethics (51)
    • Nephrology (212)
    • Neurology (1678)
    • Nursing (97)
    • Nutrition (251)
    • Obstetrics and Gynecology (326)
    • Occupational and Environmental Health (451)
    • Oncology (929)
    • Ophthalmology (263)
    • Orthopedics (102)
    • Otolaryngology (172)
    • Pain Medicine (114)
    • Palliative Medicine (40)
    • Pathology (253)
    • Pediatrics (536)
    • Pharmacology and Therapeutics (253)
    • Primary Care Research (208)
    • Psychiatry and Clinical Psychology (1770)
    • Public and Global Health (3841)
    • Radiology and Imaging (624)
    • Rehabilitation Medicine and Physical Therapy (320)
    • Respiratory Medicine (520)
    • Rheumatology (208)
    • Sexual and Reproductive Health (168)
    • Sports Medicine (158)
    • Surgery (190)
    • Toxicology (36)
    • Transplantation (101)
    • Urology (76)