Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Algorithmic Fairness and Bias Mitigation for Clinical Machine Learning: Insights from Rapid COVID-19 Diagnosis by Adversarial Learning

View ORCID ProfileJenny Yang, View ORCID ProfileAndrew A. S. Soltan, Yang Yang, David A. Clifton
doi: https://doi.org/10.1101/2022.01.13.22268948
Jenny Yang
1Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jenny Yang
  • For correspondence: jenny.yang@eng.ox.ac.uk
Andrew A. S. Soltan
2John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
3RDM Division of Cardiovascular Medicine, University of Oxford, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andrew A. S. Soltan
Yang Yang
1Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David A. Clifton
1Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Machine learning is becoming increasingly prominent in healthcare. Although its benefits are clear, growing attention is being given to how machine learning may exacerbate existing biases and disparities. In this study, we introduce an adversarial training framework that is capable of mitigating biases that may have been acquired through data collection or magnified during model development. For example, if one class is over-presented or errors/inconsistencies in practice are reflected in the training data, then a model can be biased by these. To evaluate our adversarial training framework, we used the statistical definition of equalized odds. We evaluated our model for the task of rapidly predicting COVID-19 for patients presenting to hospital emergency departments, and aimed to mitigate regional (hospital) and ethnic biases present. We trained our framework on a large, real-world COVID-19 dataset and demonstrated that adversarial training demonstrably improves outcome fairness (with respect to equalized odds), while still achieving clinically-effective screening performances (NPV>0.98). We compared our method to the benchmark set by related previous work, and performed prospective and external validation on four independent hospital cohorts. Our method can be generalized to any outcomes, models, and definitions of fairness.

Competing Interest Statement

DAC reports personal fees from Oxford University Innovation, personal fees from BioBeats, personal fees from Sensyne Health, outside the submitted work. No other authors report any conflicts of interest.

Funding Statement

This work was supported by the Wellcome Trust/University of Oxford Medical & Life Sciences Translational Fund (Award: 0009350) and the Oxford National Institute of Research (NIHR) Biomedical Research Campus (BRC). The funders of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the manuscript. JY is a Marie Sklodowska-Curie Fellow, under the European Union Horizon 2020 research and innovation programme (Grant agreement: 955681, MOIRA). AS is an NIHR Academic Clinical Fellow (Award: ACF-2020-13-015). The views expressed are those of the authors and not necessarily those of the NHS, NIHR, EU H2020 programme, or the Wellcome Trust.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

United Kingdom National Health Service (NHS) approval via the national oversight/regulatory body, the Health Research Authority (HRA), has been granted for this work (IRAS ID: 281832).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Data from OUH studied here are available from the Infections in Oxfordshire Research Database (https://oxfordbrc.nihr.ac.uk/research-themesoverview/antimicrobial- resistance-and-modernising-microbiology/infections-inoxfordshire- research-database-iord/), subject to an application meeting the ethical and governance requirements of the Database. Data from UHB, PUH and BH are available on reasonable request to the respective trusts, subject to HRA requirements. Code and supplementary information for this paper are available online alongside publication.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted January 14, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Algorithmic Fairness and Bias Mitigation for Clinical Machine Learning: Insights from Rapid COVID-19 Diagnosis by Adversarial Learning
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Algorithmic Fairness and Bias Mitigation for Clinical Machine Learning: Insights from Rapid COVID-19 Diagnosis by Adversarial Learning
Jenny Yang, Andrew A. S. Soltan, Yang Yang, David A. Clifton
medRxiv 2022.01.13.22268948; doi: https://doi.org/10.1101/2022.01.13.22268948
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Algorithmic Fairness and Bias Mitigation for Clinical Machine Learning: Insights from Rapid COVID-19 Diagnosis by Adversarial Learning
Jenny Yang, Andrew A. S. Soltan, Yang Yang, David A. Clifton
medRxiv 2022.01.13.22268948; doi: https://doi.org/10.1101/2022.01.13.22268948

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (164)
  • Allergy and Immunology (416)
  • Anesthesia (92)
  • Cardiovascular Medicine (867)
  • Dentistry and Oral Medicine (159)
  • Dermatology (98)
  • Emergency Medicine (251)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (397)
  • Epidemiology (8589)
  • Forensic Medicine (4)
  • Gastroenterology (390)
  • Genetic and Genomic Medicine (1772)
  • Geriatric Medicine (169)
  • Health Economics (375)
  • Health Informatics (1252)
  • Health Policy (625)
  • Health Systems and Quality Improvement (472)
  • Hematology (197)
  • HIV/AIDS (380)
  • Infectious Diseases (except HIV/AIDS) (10344)
  • Intensive Care and Critical Care Medicine (553)
  • Medical Education (193)
  • Medical Ethics (51)
  • Nephrology (214)
  • Neurology (1692)
  • Nursing (97)
  • Nutrition (252)
  • Obstetrics and Gynecology (330)
  • Occupational and Environmental Health (451)
  • Oncology (933)
  • Ophthalmology (265)
  • Orthopedics (104)
  • Otolaryngology (172)
  • Pain Medicine (115)
  • Palliative Medicine (40)
  • Pathology (255)
  • Pediatrics (539)
  • Pharmacology and Therapeutics (257)
  • Primary Care Research (210)
  • Psychiatry and Clinical Psychology (1785)
  • Public and Global Health (3871)
  • Radiology and Imaging (627)
  • Rehabilitation Medicine and Physical Therapy (322)
  • Respiratory Medicine (525)
  • Rheumatology (208)
  • Sexual and Reproductive Health (170)
  • Sports Medicine (158)
  • Surgery (191)
  • Toxicology (36)
  • Transplantation (101)
  • Urology (76)