Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Multiclass risk models for ovarian malignancy: an illustration of prediction uncertainty due to the choice of algorithm

View ORCID ProfileLedger Ashleigh, View ORCID ProfileCeusters Jolien, Valentin Lil, Testa Antonia, Caroline VAN Holsbeke, Franchi Dorella, Bourne Tom, Froyman Wouter, Timmerman Dirk, View ORCID ProfileBen VAN Calster
doi: https://doi.org/10.1101/2023.07.25.23293141
Ledger Ashleigh
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ledger Ashleigh
Ceusters Jolien
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
2Department of Oncology, Leuven Cancer Institute, Laboratory of Tumor Immunology and Immunotherapy, KU Leuven, Leuven, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ceusters Jolien
Valentin Lil
3Department of Obstetrics and Gynecology, Skåne University Hospital, Malmö, Sweden
4Department of Clinical Sciences Malmö, Lund University, Sweden
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Testa Antonia
5Department of Woman, Child and Public Health, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
6Dipartimento Universitario Scienze della Vita e Sanità Pubblica, Università Cattolica del Sacro Cuore, Rome, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Caroline VAN Holsbeke
7Department of Obstetrics and Gynecology, Ziekenhuis Oost-Limburg, Genk, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Franchi Dorella
8Preventive Gynecology Unit, Division of Gynecology, European Institute of Oncology IRCCS, Milan, Italy
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bourne Tom
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
9Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
10Queen Charlotte’s and Chelsea Hospital, Imperial College, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Froyman Wouter
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
9Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Timmerman Dirk
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
9Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ben VAN Calster
1Department of Development and Regeneration, KU Leuven, Leuven, Belgium
11Department of Biomedical Data Sciences, Leiden University Medical Centre (LUMC), Leiden, Netherlands
12EPI-centre, KU Leuven, Leuven, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ben VAN Calster
  • For correspondence: Ben.vancalster@kuleuven.be
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

OBJECTIVE To compare performance and probability estimates of six algorithms to estimate the probabilities that an ovarian tumor is benign, borderline malignant, stage I primary invasive, stage II-IV primary invasive, or secondary metastatic.

MATERIALS AND METHODS Models were developed on 5909 patients (recruited 1999-2012) and validated on 3199 patients (2012-2015). Nine clinical and ultrasound predictors were used. Outcome was based on histology following surgery within 120 days after the ultrasound examination. We developed models using multinomial logistic regression (MLR), Ridge MLR, random forest (RF), XGBoost, neural networks (NN), and support vector machines (SVM).

RESULTS Benign tumors were most common (62%), secondary metastatic tumors least common (5%). XGBoost, RF, NN and MLR had similar performance: c-statistics for benign versus any type of malignant tumors were 0.92, multiclass c-statistics 0.54-0.55, average Estimated Calibration Indexes 0.03-0.07, and Net Benefits at the 10% malignancy risk threshold 0.33-0.34. Despite poorer discrimination and calibration performance for Ridge MLR and in particular SVM, Net Benefits were similar for all models. The estimated probabilities often differed strongly between models. For example, the probability of a benign tumor differed by more than 20 percentage points in 29% of the patients, and by more than 30 percentage points in 16% of the patients.

DISCUSSION Several regression and machine learning models had very good and similar performance in terms of discrimination, calibration and clinical utility. Nevertheless, individual probabilities often varied substantially.

CONCLUSION Machine learning did not outperform MLR. The choice of algorithm can strongly affect probabilities given to a patient.

Competing Interest Statement

LV reported receiving grants from the Swedish Research Council, Malmö University Hospital and Skåne University Hospital, Allmänna Sjukhusets i Malmö Stiftelse för bekämpande av cancer (the Malmö General Hospital Foundation for Fighting Against Cancer), Avtal om läkarutbildning och forskning (ALF)–medel, and Landstingsfinansierad Regional Forskning during the conduct of the study; and teaching fees from Samsung outside the submitted work. DT and BVC reported receiving grants from the Research Foundation–Flanders (FWO) and Internal Funds KU Leuven during the conduct of the study. TB reported receiving grants from NIHR Biomedical Research Centre, speaking honoraria and departmental funding from Samsung Healthcare and grants from Roche Diagnostics, Illumina, and Abbott. No other disclosures were reported. All other authors declare no competing interests.

Funding Statement

The study is supported by the Research Foundation Flanders (FWO) (projects G049312N, G0B4716N, 12F3114N, G097322N), and Internal Funds KU Leuven (projects C24/15/037, C24M/20/064). DT is a senior clinical investigator of FWO, and WF was a clinical fellow of FWO. TB is supported by the National Institute for Health Research (NIHR) Biomedical Research Centre based at Imperial College Healthcare National Health Service (NHS) Trust and Imperial College London. The views expressed are those of the authors and not necessarily those of the NHS, NIHR, or Department of Health. LV is supported by the Swedish Research Council (grant K2014-99X-22475-01-3, Dnr 2013-02282), funds administered by Malmo University Hospital and Skane University Hospital, Allmanna Sjukhusets i Malmo Stiftelse for bekampande av cancer (the Malmo General Hospital Foundation for fighting against cancer), and two Swedish governmental grants (Avtal om lakarutbildning och forskning (ALF)-medel and Landstingsfinansierad Regional Forskning). The funders of the study had no role in the study design, data collection, data analysis, data interpretation, writing of the report, or in the decision to submit the paper for publication.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Ethics Committee of the University Hospitals Leuven gave ethical approval for this work (S64709)

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted July 27, 2023.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Multiclass risk models for ovarian malignancy: an illustration of prediction uncertainty due to the choice of algorithm
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Multiclass risk models for ovarian malignancy: an illustration of prediction uncertainty due to the choice of algorithm
Ledger Ashleigh, Ceusters Jolien, Valentin Lil, Testa Antonia, Caroline VAN Holsbeke, Franchi Dorella, Bourne Tom, Froyman Wouter, Timmerman Dirk, Ben VAN Calster
medRxiv 2023.07.25.23293141; doi: https://doi.org/10.1101/2023.07.25.23293141
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Multiclass risk models for ovarian malignancy: an illustration of prediction uncertainty due to the choice of algorithm
Ledger Ashleigh, Ceusters Jolien, Valentin Lil, Testa Antonia, Caroline VAN Holsbeke, Franchi Dorella, Bourne Tom, Froyman Wouter, Timmerman Dirk, Ben VAN Calster
medRxiv 2023.07.25.23293141; doi: https://doi.org/10.1101/2023.07.25.23293141

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (280)
  • Allergy and Immunology (579)
  • Anesthesia (140)
  • Cardiovascular Medicine (1947)
  • Dentistry and Oral Medicine (252)
  • Dermatology (184)
  • Emergency Medicine (333)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (699)
  • Epidemiology (11102)
  • Forensic Medicine (8)
  • Gastroenterology (624)
  • Genetic and Genomic Medicine (3168)
  • Geriatric Medicine (309)
  • Health Economics (561)
  • Health Informatics (2042)
  • Health Policy (863)
  • Health Systems and Quality Improvement (782)
  • Hematology (310)
  • HIV/AIDS (682)
  • Infectious Diseases (except HIV/AIDS) (12720)
  • Intensive Care and Critical Care Medicine (707)
  • Medical Education (317)
  • Medical Ethics (92)
  • Nephrology (334)
  • Neurology (2986)
  • Nursing (164)
  • Nutrition (463)
  • Obstetrics and Gynecology (589)
  • Occupational and Environmental Health (614)
  • Oncology (1552)
  • Ophthalmology (477)
  • Orthopedics (185)
  • Otolaryngology (266)
  • Pain Medicine (202)
  • Palliative Medicine (57)
  • Pathology (403)
  • Pediatrics (912)
  • Pharmacology and Therapeutics (381)
  • Primary Care Research (355)
  • Psychiatry and Clinical Psychology (2785)
  • Public and Global Health (5591)
  • Radiology and Imaging (1094)
  • Rehabilitation Medicine and Physical Therapy (635)
  • Respiratory Medicine (760)
  • Rheumatology (339)
  • Sexual and Reproductive Health (311)
  • Sports Medicine (289)
  • Surgery (343)
  • Toxicology (48)
  • Transplantation (159)
  • Urology (132)