Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

KMSubtraction: Reconstruction of unreported subgroup survival data utilizing published Kaplan-Meier survival curves

Joseph J. Zhao, Nicholas L. Syn, Benjamin Kye Jyn Tan, Dominic Wei Ting Yap, Chong Boon Teo, Yiong Huak Chan, Raghav Sundar
doi: https://doi.org/10.1101/2021.09.04.21263111
Joseph J. Zhao
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicholas L. Syn
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Benjamin Kye Jyn Tan
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dominic Wei Ting Yap
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chong Boon Teo
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yiong Huak Chan
2Biostatistics Unit, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: raghav_sundar@nuhs.edu.sg medcyh@nus.edu.sg
Raghav Sundar
1Yong Loo Lin School of Medicine, National University of Singapore, Singapore
3Department of Haematology-Oncology, National University Health System, Singapore
4Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore
5The N.1 Institute for Health, National University of Singapore, Singapore
6Singapore Gastric Cancer Consortium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: raghav_sundar@nuhs.edu.sg medcyh@nus.edu.sg
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

BACKGROUND Data from certain subgroups of clinical interest may not be presented in primary manuscripts or conference abstract presentations. In an effort to enable secondary data analyses, we propose a workflow to retrieve unreported subgroup survival data from published Kaplan-Meier (KM) curves.

METHODS We developed KMSubtraction, an R-package that retrieves patients from unreported subgroups by matching participants on KM curves of the overall cohort to participants on KM curves of a known subgroup with follow-up time. By excluding matched patients, the opposing unreported subgroup may be retrieved. Reproducibility and limits of error of the KMSubtraction workflow were assessed by comparing unmatched patients against the original survival data of subgroups from published datasets and simulations. Monte Carlo simulations were utilized to evaluate the effect of the reported subgroup proportion, missing data, censorship proportion in the overall and subgroup cohort, sample size and number-at-risk table intervals on the limits of error of KMSubtraction. 3 matching algorithms were explored – minimal cost bipartite matching, Mahalanobis distance matching, and nearest neighbor matching by logistic regression.

RESULTS The validation exercise found no material systematic error and demonstrates the robustness of KMSubtraction in deriving unreported subgroup survival data. Limits of error were small and negligible on marginal Cox proportional hazard models comparing reconstructed and original survival data of unreported subgroups. Extensive Monte Carlo simulations demonstrate that datasets with high reported subgroup proportion (r=0.467, p<0.001), small dataset size (r=-0.374, p<0.001) and high proportion of missing data in the unreported subgroup (r=0.553, p<0.001) were associated with uncertainty are likely to yield high limits of error with KMSubtraction.

CONCLUSION While KMSubtraction demonstrates robustness in deriving survival data from unreported subgroups, the implementation of KMSubtraction should take into consideration the aforementioned limitations. The limits of error of KMSubtraction, as reflected by the mean |ln(HR)| from converged Monte Carlo simulations may guide the interpretation of reconstructed survival data of unreported subgroups.

Competing Interest Statement

RS has received honoraria from Bristol-Myers Squibb, Lilly, Roche, Taiho, Astra Zeneca, DKSH and MSD; has advisory activity with Bristol-Myers Squibb, Merck, Eisai, Bayer, Taiho, Novartis, MSD and AstraZeneca; received research funding from MSD and Paxman Coolers; and has received travel grants from AstraZeneca, Eisai, Roche and Taiho Pharmaceutical.

Funding Statement

Not applicable.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

RS is supported by the National Medical Research Council (NMRC) (NMRC/Fellowship/0059/2018 and NMRC/TA/0014/2020). All other authors have no funding to declare.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Conflict of interest: RS has received honoraria from Bristol-Myers Squibb, Lilly, Roche, Taiho, Astra Zeneca, DKSH and MSD; has advisory activity with Bristol-Myers Squibb, Merck, Eisai, Bayer, Taiho, Novartis, MSD and AstraZeneca; received research funding from MSD and Paxman Coolers; and has received travel grants from AstraZeneca, Eisai, Roche and Taiho Pharmaceutical.

  • Financial disclosures: RS is supported by the National Medical Research Council (NMRC) (NMRC/Fellowship/0059/2018 and NMRC/TA/0014/2020). All other authors have no funding to declare.

Data Availability

Not applicable, the data was derived from simulations.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted September 10, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
KMSubtraction: Reconstruction of unreported subgroup survival data utilizing published Kaplan-Meier survival curves
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
KMSubtraction: Reconstruction of unreported subgroup survival data utilizing published Kaplan-Meier survival curves
Joseph J. Zhao, Nicholas L. Syn, Benjamin Kye Jyn Tan, Dominic Wei Ting Yap, Chong Boon Teo, Yiong Huak Chan, Raghav Sundar
medRxiv 2021.09.04.21263111; doi: https://doi.org/10.1101/2021.09.04.21263111
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
KMSubtraction: Reconstruction of unreported subgroup survival data utilizing published Kaplan-Meier survival curves
Joseph J. Zhao, Nicholas L. Syn, Benjamin Kye Jyn Tan, Dominic Wei Ting Yap, Chong Boon Teo, Yiong Huak Chan, Raghav Sundar
medRxiv 2021.09.04.21263111; doi: https://doi.org/10.1101/2021.09.04.21263111

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (174)
  • Allergy and Immunology (421)
  • Anesthesia (97)
  • Cardiovascular Medicine (901)
  • Dentistry and Oral Medicine (170)
  • Dermatology (102)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (407)
  • Epidemiology (8790)
  • Forensic Medicine (4)
  • Gastroenterology (405)
  • Genetic and Genomic Medicine (1864)
  • Geriatric Medicine (179)
  • Health Economics (388)
  • Health Informatics (1292)
  • Health Policy (644)
  • Health Systems and Quality Improvement (492)
  • Hematology (207)
  • HIV/AIDS (395)
  • Infectious Diseases (except HIV/AIDS) (10567)
  • Intensive Care and Critical Care Medicine (564)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (218)
  • Neurology (1756)
  • Nursing (103)
  • Nutrition (266)
  • Obstetrics and Gynecology (343)
  • Occupational and Environmental Health (461)
  • Oncology (965)
  • Ophthalmology (283)
  • Orthopedics (107)
  • Otolaryngology (177)
  • Pain Medicine (118)
  • Palliative Medicine (43)
  • Pathology (264)
  • Pediatrics (557)
  • Pharmacology and Therapeutics (266)
  • Primary Care Research (219)
  • Psychiatry and Clinical Psychology (1845)
  • Public and Global Health (3986)
  • Radiology and Imaging (655)
  • Rehabilitation Medicine and Physical Therapy (344)
  • Respiratory Medicine (535)
  • Rheumatology (215)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (166)
  • Surgery (197)
  • Toxicology (37)
  • Transplantation (106)
  • Urology (80)