ABSTRACT
Objective Patient data repositories often assemble medication data from multiple sources, necessitating standardization prior to analysis. We implemented and evaluated a medication standardization procedure for use with a wide range of pharmacy data inputs across all drug categories, which supports research queries at multiple levels of granularity.
Methods The GEMINI-RxNorm system automates the use of multiple RxNorm tools in tandem with other datasets to identify drug concepts from pharmacy orders. GEMINI-RxNorm was used to process 2,090,155 pharmacy orders from 245,258 hospitalizations between 2010 and 2017 at 7 hospitals in Ontario, Canada. The GEMINI-RxNorm system matches drug-identifying information from pharmacy data (including free-text fields) to RxNorm concept identifiers. A user interface allows researchers to search for drug terms and returns the relevant original pharmacy data through the matched RxNorm concepts. Users can then manually validate the predicted matches and discard false positives. We designed the system to maximize recall (sensitivity) and enable excellent precision (positive predictive value) with minimal manual validation. We compared the performance of this system to manual coding (by a physician and pharmacist) of 13 medication classes.
Results Manual coding was performed for 1,948,817 pharmacy orders and GEMINI-RxNorm successfully returned 1,941,389 (99.6%) orders. Recall was greater than 98.5% in all 13 drug classes, and the F-Measure and precision remained above 90.0% in all drug classes, facilitating efficient manual review to achieve 100.0% precision. GEMINI-RxNorm saved time substantially compared to manual standardization, reducing the time taken to review a pharmacy order row from an estimated 30 seconds to 5 seconds and reducing the number of rows needed to be reviewed by up to 99.99%.
Discussion and Conclusion GEMINI-RxNorm presents a novel combination of RxNorm tools and other datasets to enable accurate, efficient, flexible, and scalable standardization of pharmacy data. By facilitating efficient minimal manual validation, the GEMINI-RxNorm system can allow researchers to achieve near-perfect accuracy in medication data standardization.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
NDRIO-Portage COVID-19 Data Curation Funding
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was approved by Clinical Trials Ontario (CTO project ID: 1394) Research Ethics Board of Mount Sinai Hospital gave ethical approval for this work (15-0075-C) Research Ethics Board of Trillium Health Partners gave ethical approval for this work (ID#742)
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
↵* Co-Senior Authors
Funding Statement: This study was funded by the NDRIO-Portage COVID-19 Data Curation Funding. The development of the GEMINI data platform has been supported with funding from the Canadian Cancer Society, the Canadian Frailty Network, the Canadian Institutes of Health Research, the Canadian Medical Protective Association, Green Shield Canada Foundation, the Natural Sciences and Engineering Research Council of Canada, Ontario Health, the St. Michael’s Hospital Association Innovation Fund, and the University of Toronto Department of Medicine, with in-kind support from partner hospitals and Vector Institute. Funders had no role in the design, conduct or interpretation of this study.
Ethics Approval: Research ethics board approval was obtained from the University Health Network (Toronto), Sunnybrook Health Sciences Centre (Toronto) and St. Michael’s Hospital (Toronto) through the integrated Clinical Trials Ontario platform, with St. Michael’s Hospital as the Board of Record (CTO project ID: 1394). Research ethics board approval was also obtained from Trillium Health Partners (Mississauga; REB# 742) and Mount Sinai Hospital (Toronto; REB# 15-0075-C).
Data Availability
Data from this manuscript can be accessed upon request to the corresponding author, to the extent that is possible in compliance with local research ethics board requirements and data sharing agreements