TY - JOUR T1 - A Comprehensive Typing System for Information Extraction from Clinical Narratives JF - medRxiv DO - 10.1101/19009118 SP - 19009118 AU - J. Harry Caufield AU - Yichao Zhou AU - Yunsheng Bai AU - David A. Liem AU - Anders O. Garlid AU - Kai-Wei Chang AU - Yizhou Sun AU - Peipei Ping AU - Wei Wang Y1 - 2019/01/01 UR - http://medrxiv.org/content/early/2019/10/22/19009118.abstract N2 - We have developed ACROBAT (Annotation for Case Reports using Open Biomedical Annotation Terms), a typing system for detailed information extraction from clinical text. This resource supports detailed identification and categorization of entities, events, and relations within clinical text documents, including clincal case reports (CCRs) and the free-text components of electronic health records. Using ACROBAT and the text of 200 CCRs, we annotated a wide variety of real-world clinical disease presentations. The resulting dataset, MACCROBAT2018, is a rich collection of annotated clinical language appropriate for training biomedical natural language processing systems.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was supported by National Institutes of Health grants U54GM114833 and R35HL135772.Author DeclarationsAll relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.Not ApplicableAny clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.Not ApplicableI have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.Not ApplicableAll data are available on Figshare as indicated in the manuscript. https://doi.org/10.6084/m9.figshare.c.4652765 ER -