RT Journal Article SR Electronic T1 A Novel Abnormality Annotation Database for COVID-19 Affected Frontal Lung X-rays JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.01.07.21249323 DO 10.1101/2021.01.07.21249323 A1 Surbhi Mittal A1 Vasantha Kumar Venugopal A1 Vikash Kumar Agarwal A1 Manu Malhotra A1 Jagneet Singh Chatha A1 Savinay Kapur A1 Ankur Gupta A1 Vikas Batra A1 Puspita Majumdar A1 Aakarsh Malhotra A1 Kartik Thakral A1 Saheb Chhabra A1 Mayank Vatsa A1 Richa Singh A1 Santanu Chaudhury YR 2021 UL http://medrxiv.org/content/early/2021/01/08/2021.01.07.21249323.abstract AB Purpose To advance the usage of CXRs as a viable solution for efficient COVID-19 diagnostics by providing large-scale annotations of the abnormalities in frontal CXRs in BIMCV-COVID19+ database, and to provide a robust evaluation mechanism to facilitate its usage.Materials and Methods We provide the abnormality annotations in frontal CXRs by creating bounding boxes. The frontal CXRs are a part of the existing BIMCV-COVID19+ database. We also define four different protocols for robust evaluation of semantic segmentation and classification algorithms. Finally, we benchmark the defined protocols and report the results using popular deep learning models as a part of this study.Results For semantic segmentation, Mask-RCNN performs the best among all the models with a DICE score of 0.43 ± 0.01. For classification, we observe that MobileNetv2 yields the best results for 2-class and 3-class classification. We also observe that deep models report a lower performance for classifying other classes apart from the COVID class.Conclusion By making the annotated data and protocols available to the scientific community, we aim to advance the usage of CXRs as a viable solution for efficient COVID-19 diagnostics. This large-scale data will be useful for ML algorithms and can be used for learning radiological patterns observed in COVID-19 patients. Further, the protocols will facilitate ML practitioners for unified large-scale evaluation of their algorithms.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThe study is funded by Rakshak Project by IIT-JODHPURAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:IRB exemptedAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThere are two annotation files in this release. Filename - Study-Level_Annotations.csv, contains two types of study level annotations of these X-rays (Normal/Abnormal status & Image Quality). Filename - BoundingBox.csv, conatins pixel level image annotations for 10 Pathologies - Atelectasis, Cardiomegaly, Consolidation/Ground Glass opacity, Edema, Nodule, Pleural Effusion, Pleural Other, Pneumothorax. The corresponding X-rays were released by the Medical Imaging Data Bank of the Valencia region (BIMCV). They can be downloaded at the following link - https://bimcv.cipf.es/bimcv-projects/bimcv-covid19. http://covbase4all.igib.res.in/ https://osf.io/b35xu/ (COVID-19)Coronavirus Disease 2019(RT-PCR)real time polymerase chain reaction(AI)artificial intelligence(ROC)receiver operating characteristic(AUC)area under the ROC curve(CNN)convolutional neural network(CXR)chest x-ray(ML)machine learning