Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A Novel Abnormality Annotation Database for COVID-19 Affected Frontal Lung X-rays

Surbhi Mittal, Vasantha Kumar Venugopal, Vikash Kumar Agarwal, Manu Malhotra, Jagneet Singh Chatha, Savinay Kapur, Ankur Gupta, Vikas Batra, Puspita Majumdar, Aakarsh Malhotra, Kartik Thakral, Saheb Chhabra, Mayank Vatsa, Richa Singh, Santanu Chaudhury
doi: https://doi.org/10.1101/2021.01.07.21249323
Surbhi Mittal
1Indian Institute of Technology Jodhpur, Rajasthan, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vasantha Kumar Venugopal
2Centre for Advanced Research in Imaging, Neuroscience & Genomics (CARING), New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: vasanthdrv@gmail.com
Vikash Kumar Agarwal
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manu Malhotra
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jagneet Singh Chatha
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Savinay Kapur
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ankur Gupta
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vikas Batra
3Mahajan Imaging, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Puspita Majumdar
4Indraprastha Institute of Information Technology, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Aakarsh Malhotra
4Indraprastha Institute of Information Technology, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kartik Thakral
1Indian Institute of Technology Jodhpur, Rajasthan, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Saheb Chhabra
4Indraprastha Institute of Information Technology, New Delhi, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mayank Vatsa
1Indian Institute of Technology Jodhpur, Rajasthan, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Richa Singh
1Indian Institute of Technology Jodhpur, Rajasthan, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Santanu Chaudhury
1Indian Institute of Technology Jodhpur, Rajasthan, India
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Purpose To advance the usage of CXRs as a viable solution for efficient COVID-19 diagnostics by providing large-scale annotations of the abnormalities in frontal CXRs in BIMCV-COVID19+ database, and to provide a robust evaluation mechanism to facilitate its usage.

Materials and Methods We provide the abnormality annotations in frontal CXRs by creating bounding boxes. The frontal CXRs are a part of the existing BIMCV-COVID19+ database. We also define four different protocols for robust evaluation of semantic segmentation and classification algorithms. Finally, we benchmark the defined protocols and report the results using popular deep learning models as a part of this study.

Results For semantic segmentation, Mask-RCNN performs the best among all the models with a DICE score of 0.43 ± 0.01. For classification, we observe that MobileNetv2 yields the best results for 2-class and 3-class classification. We also observe that deep models report a lower performance for classifying other classes apart from the COVID class.

Conclusion By making the annotated data and protocols available to the scientific community, we aim to advance the usage of CXRs as a viable solution for efficient COVID-19 diagnostics. This large-scale data will be useful for ML algorithms and can be used for learning radiological patterns observed in COVID-19 patients. Further, the protocols will facilitate ML practitioners for unified large-scale evaluation of their algorithms.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The study is funded by Rakshak Project by IIT-JODHPUR

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

IRB exempted

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

There are two annotation files in this release. Filename - Study-Level_Annotations.csv, contains two types of study level annotations of these X-rays (Normal/Abnormal status & Image Quality). Filename - BoundingBox.csv, conatins pixel level image annotations for 10 Pathologies - Atelectasis, Cardiomegaly, Consolidation/Ground Glass opacity, Edema, Nodule, Pleural Effusion, Pleural Other, Pneumothorax. The corresponding X-rays were released by the Medical Imaging Data Bank of the Valencia region (BIMCV). They can be downloaded at the following link - https://bimcv.cipf.es/bimcv-projects/bimcv-covid19.

http://covbase4all.igib.res.in/

https://osf.io/b35xu/

  • Abbreviations

    (COVID-19)
    Coronavirus Disease 2019
    (RT-PCR)
    real time polymerase chain reaction
    (AI)
    artificial intelligence
    (ROC)
    receiver operating characteristic
    (AUC)
    area under the ROC curve
    (CNN)
    convolutional neural network
    (CXR)
    chest x-ray
    (ML)
    machine learning
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted January 08, 2021.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    A Novel Abnormality Annotation Database for COVID-19 Affected Frontal Lung X-rays
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    A Novel Abnormality Annotation Database for COVID-19 Affected Frontal Lung X-rays
    Surbhi Mittal, Vasantha Kumar Venugopal, Vikash Kumar Agarwal, Manu Malhotra, Jagneet Singh Chatha, Savinay Kapur, Ankur Gupta, Vikas Batra, Puspita Majumdar, Aakarsh Malhotra, Kartik Thakral, Saheb Chhabra, Mayank Vatsa, Richa Singh, Santanu Chaudhury
    medRxiv 2021.01.07.21249323; doi: https://doi.org/10.1101/2021.01.07.21249323
    Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
    Citation Tools
    A Novel Abnormality Annotation Database for COVID-19 Affected Frontal Lung X-rays
    Surbhi Mittal, Vasantha Kumar Venugopal, Vikash Kumar Agarwal, Manu Malhotra, Jagneet Singh Chatha, Savinay Kapur, Ankur Gupta, Vikas Batra, Puspita Majumdar, Aakarsh Malhotra, Kartik Thakral, Saheb Chhabra, Mayank Vatsa, Richa Singh, Santanu Chaudhury
    medRxiv 2021.01.07.21249323; doi: https://doi.org/10.1101/2021.01.07.21249323

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Radiology and Imaging
    Subject Areas
    All Articles
    • Addiction Medicine (62)
    • Allergy and Immunology (142)
    • Anesthesia (46)
    • Cardiovascular Medicine (415)
    • Dentistry and Oral Medicine (70)
    • Dermatology (47)
    • Emergency Medicine (144)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (171)
    • Epidemiology (4855)
    • Forensic Medicine (3)
    • Gastroenterology (183)
    • Genetic and Genomic Medicine (676)
    • Geriatric Medicine (70)
    • Health Economics (192)
    • Health Informatics (629)
    • Health Policy (320)
    • Health Systems and Quality Improvement (203)
    • Hematology (85)
    • HIV/AIDS (156)
    • Infectious Diseases (except HIV/AIDS) (5339)
    • Intensive Care and Critical Care Medicine (330)
    • Medical Education (93)
    • Medical Ethics (24)
    • Nephrology (75)
    • Neurology (686)
    • Nursing (42)
    • Nutrition (115)
    • Obstetrics and Gynecology (126)
    • Occupational and Environmental Health (208)
    • Oncology (439)
    • Ophthalmology (140)
    • Orthopedics (36)
    • Otolaryngology (89)
    • Pain Medicine (35)
    • Palliative Medicine (16)
    • Pathology (129)
    • Pediatrics (194)
    • Pharmacology and Therapeutics (131)
    • Primary Care Research (84)
    • Psychiatry and Clinical Psychology (780)
    • Public and Global Health (1816)
    • Radiology and Imaging (324)
    • Rehabilitation Medicine and Physical Therapy (138)
    • Respiratory Medicine (255)
    • Rheumatology (86)
    • Sexual and Reproductive Health (69)
    • Sports Medicine (62)
    • Surgery (100)
    • Toxicology (23)
    • Transplantation (29)
    • Urology (37)