medRxiv

Machine Learning Based Reanalysis of Clinical Scores for Distinguishing Between Ischemic and Hemorrhagic Stroke in Low Resource Setting

Aman Bhardwaj, MV Padma Srivastava, Pulikottil Wilson Vinny, Amit Mehndiratta, Venugopalan Y Vishnu, Rahul Garg
doi: https://doi.org/10.1101/2022.03.03.22271885
Aman Bhardwaj
1School of Information Technology, Indian Institute of Technology Delhi, India
  • For correspondence: aman.bhardwaj.iitd@gmail.com
MV Padma Srivastava
2Department of Neurology, All India Institute of Medical Sciences, New Delhi, India
Pulikottil Wilson Vinny
3Department of Internal Medicine, Armed Forces Medical College Pune, Maharashtra, India
Amit Mehndiratta
4Centre for Biomedical Engineering, Indian Institute of Technology Delhi, India
Venugopalan Y Vishnu
2Department of Neurology, All India Institute of Medical Sciences, New Delhi, India
Rahul Garg
5Department of Computer Science and Engineering, Indian Institute of Technology Delhi, India

Abstract

BACKGROUND Identifying stroke and classifying it as ischemic or hemorrhagic using clinical scores alone faces two unaddressed issues. The first is over-estimation of the scores' performance; the second is the class imbalance of stroke data, which biases accuracy. We conducted a quantitative comparison of existing scores after correcting for both issues, drawing on Machine Learning theory to address the over-estimation of performance and the class imbalance inherent in evaluations of these clinical scores.
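The effect of class imbalance on accuracy can be illustrated with a minimal sketch. The counts below are hypothetical, not taken from any included study, and the class-weighted measure shown (the mean of the per-class sensitivities, often called balanced accuracy) is an assumption; the abstract does not define its weighted accuracy.

```python
def accuracy(tp_is, fn_is, tp_hs, fn_hs):
    """Plain accuracy: correct predictions over all patients."""
    total = tp_is + fn_is + tp_hs + fn_hs
    return (tp_is + tp_hs) / total


def balanced_accuracy(tp_is, fn_is, tp_hs, fn_hs):
    """Mean of per-class sensitivities (assumed form of a class-weighted accuracy)."""
    sens_is = tp_is / (tp_is + fn_is)
    sens_hs = tp_hs / (tp_hs + fn_hs)
    return (sens_is + sens_hs) / 2


# Hypothetical cohort: 80 ischemic and 20 hemorrhagic patients.
# A trivial rule that labels every patient ischemic gets all 80 IS cases
# right and all 20 HS cases wrong.
acc = accuracy(tp_is=80, fn_is=0, tp_hs=0, fn_hs=20)
bal = balanced_accuracy(tp_is=80, fn_is=0, tp_hs=0, fn_hs=20)
print(f"accuracy={acc:.2f}, balanced accuracy={bal:.2f}")
```

In this hypothetical cohort the trivial rule scores 0.80 on plain accuracy but only 0.50 once each class is weighted equally, which is the kind of bias the background describes.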

METHODS We included validation studies of the Siriraj (SS), Guy's Hospital/Allen (GHS/AS), Greek (GS), and Besson (BS) scores for stroke classification, published between 2001 and 2021 and identified through a systematic search of PubMed, ERIC, ScienceDirect, and IEEE Xplore. From each included study we extracted the reported cross-tabulation and checked it for the two issues listed above. We then recalculated all performance metrics with those issues mitigated, to allow a comparative analysis of SS, GHS/AS, GS, and BS.
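The recalculation step might look like the sketch below. It assumes a cross-tabulation with three score outputs per reference diagnosis (ischemic, hemorrhagic, equivocal) and assumes that the mitigation consists of keeping equivocal predictions in the denominators, as the conclusion recommends. The `CrossTab` layout, the function name, and the counts are all hypothetical; the abstract does not give the authors' exact procedure.

```python
from dataclasses import dataclass


@dataclass
class CrossTab:
    """Counts by reference diagnosis and score output (hypothetical layout)."""
    is_as_is: int       # ischemic, scored ischemic
    is_as_hs: int       # ischemic, scored hemorrhagic
    is_equivocal: int   # ischemic, score equivocal
    hs_as_hs: int       # hemorrhagic, scored hemorrhagic
    hs_as_is: int       # hemorrhagic, scored ischemic
    hs_equivocal: int   # hemorrhagic, score equivocal


def metrics(t: CrossTab, include_equivocal: bool) -> dict:
    """Sensitivities and accuracies, with or without equivocal cases in the denominators."""
    eq_is = t.is_equivocal if include_equivocal else 0
    eq_hs = t.hs_equivocal if include_equivocal else 0
    n_is = t.is_as_is + t.is_as_hs + eq_is
    n_hs = t.hs_as_hs + t.hs_as_is + eq_hs
    sens_is = t.is_as_is / n_is
    sens_hs = t.hs_as_hs / n_hs
    return {
        "sens_is": sens_is,
        "sens_hs": sens_hs,
        "accuracy": (t.is_as_is + t.hs_as_hs) / (n_is + n_hs),
        # Assumed definition: mean of the two per-class sensitivities.
        "weighted_accuracy": (sens_is + sens_hs) / 2,
    }


# Hypothetical counts, for illustration only.
table = CrossTab(is_as_is=60, is_as_hs=10, is_equivocal=30,
                 hs_as_hs=30, hs_as_is=10, hs_equivocal=10)
excluding = metrics(table, include_equivocal=False)
including = metrics(table, include_equivocal=True)
print(excluding)
print(including)
```

With these made-up counts, dropping the equivocal cases gives an IS sensitivity of 60/70, while counting them gives 60/100, showing how exclusion alone can inflate a reported figure.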

RESULTS A total of 21 studies were included. Our calculated sensitivity range (IS-diagnosis) for SS is 40-90% (median 70% [IQR: 57-73%], aggregate 71% [SD: 15%]) against a reported 43-97% (78% [IQR: 65-88%]); for GHS/AS it is 35-93% (64% [IQR: 53-71%], 64% [SD: 17%]) against 35-94% (73% [IQR: 62-88%]); and for GS it is 60-74% (64% [IQR: 62-69%], 69% [SD: 7%]) against 74-94% (89% [IQR: 81-92%]). Calculated sensitivity ranges (HS-diagnosis) for SS, GHS/AS, and GS, respectively, are 34-86% (59% [IQR: 50-79%], 61% [SD: 17%]), 20-73% (46% [IQR: 34-64%], 44% [SD: 17%]), and 11-80% (43% [IQR: 27-62%], 51% [SD: 35%]), against reported 50-95% (71% [IQR: 64-82%]), 33-93% (63% [IQR: 39-73%]), and 41-80% (78% [IQR: 59-79%]). Calculated accuracy ranges, in the same order, are 37-86% (67% [IQR: 56-75%], 68% [SD: 13%]), 40-87% (58% [IQR: 47-61%], 59% [SD: 14%]), and 38-76% (51% [IQR: 45-63%], 61% [SD: 19%]), while the weighted accuracy ranges are 37-85% (64% [IQR: 54-73%], 66% [SD: 12%]), 43-80% (53% [IQR: 47-62%], 54% [SD: 13%]), and 38-77% (51% [IQR: 44-64%], 60% [SD: 20%]). Only one study evaluated BS.
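Each result above summarises one metric across studies as a range, a median with IQR, and an aggregate with SD. A minimal sketch of such a summary follows. The per-study values are hypothetical; treating the aggregate as the arithmetic mean and using the "inclusive" quartile method are assumptions, since the abstract specifies neither.

```python
import statistics


def summarise(values):
    """Range, median, IQR, mean, and sample SD of per-study metric values."""
    ordered = sorted(values)
    # Quartile conventions differ between tools; "inclusive" is one choice.
    q1, _, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return {
        "range": (ordered[0], ordered[-1]),
        "median": statistics.median(ordered),
        "iqr": (q1, q3),
        "mean": statistics.fmean(ordered),  # assumed meaning of "aggregate"
        "sd": statistics.stdev(ordered),
    }


# Hypothetical per-study sensitivities, for illustration only.
per_study_sensitivity = [0.45, 0.55, 0.65, 0.75, 0.85]
summary = summarise(per_study_sensitivity)
print(summary)
```

A pooled estimate computed from summed counts across studies would be another reasonable reading of "aggregate" and would generally give a different number from the mean of per-study values.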

CONCLUSION Quantitative comparison of existing scores indicated significantly lower ranges of performance metrics than those reported by the studies. We conclude that published validations of clinical scores for stroke classification over-estimate performance. We recommend including equivocal predictions when calculating performance metrics in such analyses. Further, the high variability in the performance of clinical scores for stroke identification and classification could be reduced by creating a global data pool with statistically important attributes. Scores derived by Machine Learning from such globally pooled data may perform better and generalise at scale.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study did not receive any funding.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data produced in the present work are contained in the manuscript.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Posted March 07, 2022.
medRxiv 2022.03.03.22271885; doi: https://doi.org/10.1101/2022.03.03.22271885

Subject Area

  • Neurology