Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Integrating Protein-protein Interaction Networks and Machine Learning to Identify Biomarkers of Cancer Onset

View ORCID ProfileHongyue Chen, Min Ma, Haoyu Liu, Qian Yang, View ORCID ProfileJiqiu Wang, View ORCID ProfileJie Zheng
doi: https://doi.org/10.1101/2025.11.21.25340742
Hongyue Chen
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
3Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Hongyue Chen
Min Ma
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Haoyu Liu
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qian Yang
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
4Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jiqiu Wang
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jiqiu Wang
Jie Zheng
1Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2Shanghai National Clinical Research Center for metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumour, Shanghai Digital Medicine Innovation Center, Lifecycle Health Management Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
4Medical Research Council (MRC) Integrative Epidemiology Unit, University of Bristol, Bristol, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jie Zheng
  • For correspondence: epxjz{at}bristol.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Recent large-scale plasma proteomic studies have identified a set of biomarkers for the diagnosis of early cancer onset, but the predictive performance is still a challenging problem. Most existing studies have treated proteins as independent markers, ignoring their functional interdependencies within the biological network. We consider that protein–protein interaction (PPI) networks can capture coordinated biological signals to enhance the predictive performance. We identified 1,605 high-confidence PPI pairs of proteins (corresponding to 1,155 unique proteins) from the STRING database (confidence scores>0.9). The plasma proteomic data of these pairs were extracted from a subset of 38,585 UK Biobank participants with Olink measurements (noted as UKB-PPP). The univariate Cox regression (p<0.05) integrated with elastic-net machine learning models was used to build PPI predictive model on 23 cancer types, which seven cancer types with robust PPI associated with were included in the final predictive model. In general, models included proteomics features outperformed those based on age, sex, and lifestyles. Incorporating PPI-derived interaction features further improved prediction performance in three of the seven cancer models, with melanoma showing a significant improvement compared to the base model in C-index (ΔC-index = 0.13). In summary, integrating PPI networks with proteomic models could provide predictive gains in specific cancer types and underscore the value of molecular interaction patterns as complementary biomarkers for cancer onset.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This research was funded by the Noncommunicable Chronic Diseases-National Science and Technology Major Project (2024ZD0531500, 2024ZD0531502), the National Key Research and Development Program of China (2022YFC2505200, 2022YFC2505201, 2022YFC2505203), and the National Natural Science Foundation of China (32500519, 32570728). These funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

We thank the participants, contributors, and researchers of the UKB for making data available for this study. We thank the research and development teams at the 13 participating UKB-PPP companies (Alnylam Pharmaceuticals, Amgen, AstraZeneca, Biogen, Calico, Bristol-Myers Squibb, Genetech, GlaxoSmithKline (GSK), Janssen Pharmaceuticals, Novo Nordisk, Pfizer, Regeneron, and Takeda) for funding the study. All 13 companies listed as part of the UKB-PPP were involved in the generation of the proteomic data used in the present study.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data availability

Full information on how to access UKB data can be found at its website (https://www.ukbiobank.ac.uk/use-our-data/).

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted November 22, 2025.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Integrating Protein-protein Interaction Networks and Machine Learning to Identify Biomarkers of Cancer Onset
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Integrating Protein-protein Interaction Networks and Machine Learning to Identify Biomarkers of Cancer Onset
Hongyue Chen, Min Ma, Haoyu Liu, Qian Yang, Jiqiu Wang, Jie Zheng
medRxiv 2025.11.21.25340742; doi: https://doi.org/10.1101/2025.11.21.25340742
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Integrating Protein-protein Interaction Networks and Machine Learning to Identify Biomarkers of Cancer Onset
Hongyue Chen, Min Ma, Haoyu Liu, Qian Yang, Jiqiu Wang, Jie Zheng
medRxiv 2025.11.21.25340742; doi: https://doi.org/10.1101/2025.11.21.25340742

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Oncology
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (868)
  • Anesthesia (306)
  • Cardiovascular Medicine (4483)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (615)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15283)
  • Forensic Medicine (31)
  • Gastroenterology (1134)
  • Genetic and Genomic Medicine (6651)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4606)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1624)
  • Hematology (545)
  • HIV/AIDS (1276)
  • Infectious Diseases (except HIV/AIDS) (15965)
  • Intensive Care and Critical Care Medicine (1111)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (675)
  • Neurology (6699)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1153)
  • Occupational and Environmental Health (961)
  • Oncology (3370)
  • Ophthalmology (989)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (670)
  • Pediatrics (1704)
  • Pharmacology and Therapeutics (700)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5497)
  • Public and Global Health (9288)
  • Radiology and Imaging (2225)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1202)
  • Rheumatology (598)
  • Sexual and Reproductive Health (721)
  • Sports Medicine (536)
  • Surgery (722)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (267)