Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Usability of a Machine-Learning Clinical Order Recommender System Interface for Clinical Decision Support and Physician Workflow

Andre Kumar, Jonathan Chiang, Jason Hom, Lisa Shieh, Rachael Aikens, Michael Baiocchi, David Morales, Divya Saini, Mark Musen, Russ Altman, Mary K Goldstein, Steven Asch, Jonathan H Chen
doi: https://doi.org/10.1101/2020.02.24.20025890
Andre Kumar
2Division of Hospital Medicine, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan Chiang
1Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jason Hom
2Division of Hospital Medicine, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lisa Shieh
2Division of Hospital Medicine, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rachael Aikens
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Baiocchi
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David Morales
3Department of Computer Science, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Divya Saini
3Department of Computer Science, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mark Musen
1Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA
2Division of Hospital Medicine, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Russ Altman
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mary K Goldstein
4Geriatrics Research Education and Clinical Center, Veteran Affairs Palo Alto Health Care System, Palo Alto, CA
5Primary Care and Outcomes Research (PCOR), Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Steven Asch
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jonathan H Chen
1Center for Biomedical Informatics Research, Department of Medicine, Stanford University, Stanford, CA
2Division of Hospital Medicine, Department of Medicine, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jonc101@stanford.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Objective To determine whether clinicians will use machine learned clinical order recommender systems for electronic order entry for simulated inpatient cases, and whether such recommendations impact the clinical appropriateness of the orders being placed.

Materials and Methods 43 physicians used a clinical order entry interface for five simulated medical cases, with each physician-case randomized whether to have access to a previously-developed clinical order recommendation system. A panel of clinicians determined whether orders placed were clinically appropriate. The primary outcome was the difference in clinical appropriateness scores of orders for cases randomized to the recommender system. Secondary outcomes included usage metrics and physician opinions.

Results Clinical appropriateness scores for orders were comparable for cases randomized to the recommender system (mean difference -0.1 order per score, 95% CI:[-0.4, 0.2]). Physicians using the recommender placed more orders (mean 17.3 vs. 15.7 orders; incidence ratio 1.09, 95% CI:[1.01-1.17]). Case times were comparable with the recommender system. Order suggestions generated from the recommender system were more likely to match physician needs than standard manual search options. Approximately 95% of participants agreed the system would be useful for their workflows.

Discussion Machine-learned clinical order options can meet physician needs better than standard manual search systems. This may increase the number of clinical orders placed per case, while still resulting in similar overall clinically appropriate choices.

Conclusions Clinicians can use and accept machine learned clinical order recommendations integrated into an electronic order entry interface. The clinical appropriateness of orders entered was comparable even when supported by automated recommendations.

Introduction

Physician compliance with evidence-based care often falls short, with overall compliance with clinical guideline recommendations ranging from 20 to 80%.1 Such variability may compromise care quality, cost effectiveness, or expedient healthcare delivery, especially when knowledge is inconsistently applied.2 The advent of the meaningful use era of electronic health records (EHRs)3 creates the opportunity for data-driven clinical decision support (CDS) that utilizes the collective expertise of many practitioners in a learning health system.4–8 It may additionally facilitate the acquisition of medical knowledge by enabling clinicians to adopt evolving evidence-based practice patterns.(Holroyd et al. 2007) Tools such as order sets already reinforce consistency and compliance with best practices,9,10 but maintainability is limited in scale by a top-down, knowledge-based approach requiring the manual effort of human experts.11 Moreover, the intended vs. actual usage of EHR order sets may not align with physician workflows,12 and it may impede physicians from learning appropriate alternatives toward patient care.(Kumar and Allaudeen 2016) A key challenge to fulfill a future vision for clinical decision support13,14 is the automatic production of content from the bottom-up by data-mining clinical data sources.15

Most prior studies in automated development of clinical decision support content have been strictly offline analytical evaluations5,16,21–25, with few studies assessing the response of human clinicians to such recommender tools and their ordering patterns. More broadly, the majority of physicians have significant distrust or negative attitudes toward the EHR,26–28 which may affect how well these tools could be adopted. As with many machine learning models designed to support clinical decision making, it is unknown if physicians will actually accept such suggestions into their clinical decision workflow.

Previously, we developed a clinical order recommender system by automatically data-mining hospital EHR data.16 The results of this approach align with established standards of care15,17,18 and is predictive of real physician behavior and patient outcomes.16 Our underlying vision is to seamlessly integrate a system into clinical order entry workflows that automatically infers the relevant clinical context based on data already in the EHR and provides actionable decision support in the form of clinical order suggestions, analogous to Netflix or Amazon.com’s “customers who bought A also bought B” system.19,20 It is unknown if these suggested orders would improve the quality of care and be readily accepted into clinical decision making.

This study seeks to address these issues by examining physicians’ behaviors while interacting with a clinical provider order entry (CPOE) interface that simulates an electronic health record for hospital clinical scenarios. We specifically examine whether the clinical recommender system impacted the number of clinically inappropriate/appropriate orders placed during the simulated encounters. We also add expanded results related to physician ordering patterns, user experience metrics, and survey responses when a clinical order recommender system is added to standard functionality.

Objective

To determine whether clinicians will use machine learned clinical order recommender systems for electronic order entry for simulated inpatient cases, and whether such recommendations impact the clinical appropriateness of the orders being placed and the system’s impact on physician workflow.

Methods

Participants and Setting

This study was conducted at a single academic institution from 10/2018-12/2019. We recruited physicians (n=43) with experience caring for medical inpatients within the past year using local mailing listservs. Participants included both medical residents (trainees who have a medical license but still require oversight) and supervising physicians. The study was approved by the Stanford University Institutional Review Board.

Study Design & Outcomes

Participants were offered a $195 incentive payment for a 1 hour usability testing session in a closed office setting where they were exposed to a series of five clinical cases that simulate common inpatient medical problems (see Cases & Grading below) on a digital interface that simulated their institution’s electronic health record. Upon recruitment to the study, a researcher guided participants through two demonstration cases (diabetic ketoacidosis and chest pain) to illustrate basic functions of the digital interface (data review, order entry, order sets). The subsequent five cases were presented to participants in a sequence randomly assigned for each user (Table 2).

All participants were randomized to undergo each of the five cases with either an available clinical recommender system that offered order suggestions vs. no recommender system. Conventional clinical order entry options including order set checklists and manual search of individual orders by name were available in all cases, making usage of the recommender system completely optional compared to their usual order-entry workflows. Participant activity was recorded through screen capture, audio, and user interface tracking software. Following the case series, all participants filled out a survey on their experiences with the system and their receptiveness to a clinical recommender system.

Outcome measures included the time to complete the case, the number of clinical orders selected from manual search vs. the automated recommender system, usage metrics (e.g. number of clicks), clinical quality of orders placed (see Cases & Grading), and survey results.

Clinical Recommender Development

As described previously,16 we extracted deidentified structured data for all inpatient hospitalizations from the 2009-2014 STRIDE clinical data warehouse.29 The data cover >74K patients with >11M instances of >27K items (medication, laboratory, imaging, and nursing orders, lab results and diagnosis codes). We built a clinical collaborative filtering (recommender) system based on this data, modeled on Amazon’s product algorithm19,20 using item co-occurrence statistics.

We built a simulated computerized physician order entry (CPOE) interface with open technologies including PostgreSQL, Python, Apache HTTP, and HTML/JavaScript. Our unique addition is an automated recommender (Figure 1), analogous to a “Customers Who Bought This Item Also Bought This…” service that anticipates other clinical orders that are likely to be relevant based on similar prior cases in prior electronic health records.

Figure 1
  • Download figure
  • Open in new tab
Figure 1

Simulated clinical order entry interface, notes and clinical order recommender. Standard functions include navigation links to review notes and results (top-left). Order entry includes a conventional search box for individual orders and pre-authored order sets (top-right). A recommender algorithm suggests clinical orders (right), in this example triggered by a presenting symptom code (Shortness of Breath, ICD9 786.05). Clinical orders predicted most likely to occur next are highlighted under Common Orders, while those under Related Orders are less likely but disproportionately associated with similar cases and thus may be more specifically relevant. As users enter additional orders, the recommender algorithm continually updates the suggested lists based on the accumulating information.

Figure 2
  • Download figure
  • Open in new tab
Figure 2

Simulated clinical order entry interface, results review and clinical order manual search. Users can order diagnostics such as cerebrospinal fluid (CSF) studies to review results (left). This may require simulated passage of time for results to be ready (e.g., CT Head requiring another 44 minutes for results to be ready). Conventional manual search for clinical orders via a text search box (top-right) yields clinical order options (right) identified by prefix. In this example, identifying all clinical orders with a word starting with “cef.”

Cases & Grading

A panel of board-certified internal medicine physicians (AK, JH, LS, and JHC) developed 5 clinical cases of common inpatient medical problems: unstable atrial fibrillation, neutropenic fever, variceal gastrointestinal hemorrhage, bacterial meningitis, and acute pulmonary embolism (Table 1). Each participant was exposed to the clinical interface (Figure 1), which included the patient’s history and physical examination. Depending on the interventions ordered, the case would progress across several decisional nodes (Table 1). For example, if a participant ordered a lumbar puncture and antibiotics for bacterial meningitis, the case would progress toward a different node (patient improvement). In contrast, if antibiotics were not ordered, the patient would deteriorate (updated vitals and clinical notes would appear in this node). Diagnostic test results are only visible and change state if respective orders are entered (e.g., low hemoglobin revealed only if a blood count is ordered and changes if a blood transfusion is ordered). With each order entered, the clinical order recommender lists updates based on the accumulating patient information.

View this table:
  • View inline
  • View popup
Table 1

summarizes key elements of the five case scenarios that participants were tested with.

Delphi method was used by the case designers to determine clinical appropriateness for all orders. (Hsu and Sandford 2007) Following the creation of this system, each physician (AK, JH, LS, and JHC) independently reviewed all orders placed for the case. Cases were classified according to their state (initial, subsequent, or resolution) and orders were considered in the context of each state. Based on prior consensus on the scoring procedure, each grader assigned an individual score to an order on a −10 (very harmful) to +10 (very beneficial) scale. Please see the Appendix for a full description of how grading was considered. The reviewers achieved an initial intraclass correlation coefficient (ICC) of 0.51 (95% CI: [0.47-0.53]) for all scored orders on a −10 to +10 scale (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4913118/). Following independent scoring of each order, the reviewers met as a group to review their scores. Appropriate research studies and clinical guidelines were considered when assigning a consensus score (Table 1). For items that did not have perfect interrater agreement, the group convened and deliberated to assign a consensus score.Hsu and Sandford 2007 In instances where the panel could not reach a final consensus, no score was assigned and the order was not included in the final analysis.

Case-Based Scenarios

Results

Participants

A total of 43 physicians participated in this study, with a total of 215 unique observations. The physicians had a median of 3.0 [IQR: 3.0-5.0] years since obtaining their medical degree. Approximately 30 (70.0%) identified their primary specialty as Internal Medicine, 8 (18.6%) identified Emergency Medicine, 25 (58.1%) were resident trainees, and 19 (44.1%) were board certified in their respective specialty.

Primary Outcome

The mean assigned score per order for all participants was 6.2 (95% CI: [4.8, 7.5]; Table 1). There was no significant difference detected in the mean score per order for physicians randomized to the clinical recommender (mean 0.1 decrease in score, 95% CI: [-0.4, 0.2]). Random effects modeling for physicians revealed a SD of 0.4 (95% CI: [0.1-0.6]), while random effects modeling for the clinical cases revealed an SD of 1.4 (95% CI: [0.7-2.7]), suggesting most variations in scores occurred due to the clinical cases rather than the participants.

Secondary Outcomes

A. Physician Experience

Overall, physicians spent an average time of 6.9 (standard error, 0.6) minutes per clinical module, with a mean 55.2 (standard error, 3.9) navigation clicks between sections (e.g., notes vs. results review) and 16.7 (standard error, 0.9) clinical orders per case. (Table 3). Physicians randomized to the recommender had approximately 10% less navigational clicks (mean clicks 53.1) compared to physicians without the recommender (mean clicks 58.4; incidence ratio 0.90, 95% CI: [0.83-0.99]). Physicians ordered a greater amount of orders when the recommender system was available (mean 17.3 vs. 15.7 orders per case; incidence ratio 1.09, 95% CI: [1.01-1.17]). Across different simulated case types, there was not a consistent trend in physicians taking more or less time to complete cases with the recommender system available (Table 3). In subgroup analysis of resident physicians in training vs. non-residents (Appendix Material), there appeared to be varying effects with residents spending less case time and ordering more with the recommender available, while non-residents spent more case time.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3a

Usage metrics when clinical order recommender system was available vs. not. Reported as totals, proportions, or means +/− standard error. Options reflects clinical order options that were presented to the user for consideration via either manual search results or automated recommender. Recommender precision (positive predictive value) reflects the proportion of clinical order options from the recommender that were actually used. Clinical benefit, harm or neutrality was based on the integer assigned by the expert panel consensus (e.g. positive, negative, or zero) for each order in the context of each clinical case.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3b

Usage metrics when clinical order recommender system was available vs. not, separated by Resident physicians (trainees) vs. non-Residents. Reported as totals, proportions, or means +/− standard error. Clinical benefit, harm or neutrality was based on the integer assigned by the expert panel consensus (e.g. positive, negative, or zero) for each order in the context of each clinical case.

All physicians used clinical order options from the recommender system at least once, including 127 (98%) of the 129 physician-cases where the recommender was available. This corresponded to much less need for manual searching for clinical orders with the number of available order options considered from manual searches being 54% less when the recommender system was available vs. unavailable (mean 44.5 vs. 81.5). The recall of the recommender options was consistently greater than manual search options (58% vs 42%), indicating users were more likely to find the clinical orders they wanted from the automated recommender lists than from options returned by manual search. The precision of the recommender options was similarly greater than manual search options (25% vs 16%), indicating users had to sift through fewer irrelevant options to find the clinical orders they wanted than the number of irrelevant options produced by manual searches.12

B. Scoring Outcomes

Clinician-cases randomized to the clinical recommender had a 6% increase in the total scores per case (incidence ratio 1.06 (95% CI: [1.01-1.12]). The number of clinically beneficial orders, defined as a positive integer on the grading scale for a given case, demonstrated a trend toward statistical significance when the recommender was available vs. unavailable (mean number of positively-graded orders 12.2 vs. 11.1; incidence ratio 1.080, 95% CI: [0.99-1.17]). There was no difference in the number of clinically neutral or harmful orders for when the recommender was available vs. unavailable (mean number of negative/neutral orders 3.0 vs. 2.7; incidence ratio 0.99, 95% CI: [0.81-1.22])..

View this table:
  • View inline
  • View popup
  • Download powerpoint
Appendix Table A

Usage metrics stratified per simulated case scenario for when the clinical order recommender system was available vs. not. Reported as totals, proportions, or means +/− standard error. Clinical harm, benefit, or irrelevance was based on the integer assigned by the expert panel consensus for each order in the context of each clinical case.

Survey Responses

Overall, the clinical decision tool was positively received by the study participants, where 96% agreed or strongly agreed that the tool would be useful for their position. Moreover, 90% agreed or strongly agreed that the system would make their job easier and 86% felt that it would increase their productivity. Thematic analysis revealed a dichotomy in how physicians viewed the system could be used. Approximately 58% of the physicians self-identified the system would be useful for patients who have a clear diagnosis or whose clinical problems could be guided by a step-wise, algorithmic approach. However, a sizable minority (23%) stated the system would be more useful for patients presenting to the emergency department without a clear diagnosis, as this would facilitate expedient ordering of several diagnostic tests to help differentiate the patient. Others mentioned the tool’s utility for diseases that may require several simultaneous orders (for example, diabetic ketoacidosis). Additional comments indicate physicians felt that the tool would be less useful for sub-specialized care or for patients that require few simultaneous orders.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 4

Physician Survey Responses. Responses were assessed based on a 5-point Likert scale (1 = Strongly Disagree, 2 = Disagree, 3 = Neutral, 4 = Agree, 5 = Strongly Agree)

Discussion

In this study, we found that the use of a clinical order recommender system for common clinical scenarios seen in hospital medicine and emergency medicine did not adversely affect patient care by suggesting more clinically adverse or irrelevant orders. The recommender system did not affect the amount of time physicians spent on an EHR interface, but did it reduce the number of clicks per case. Although physicians placed more orders with the recommender system, this effect was minimal (mean 1.6 more orders per encounter). Importantly, physicians placed less orders from manual searches as a result of the tool. The recommender demonstrated superior recall of orders, suggesting that users were more likely to find the orders they wanted from the recommender rather than from manual searches. The tool was positively received by the study participants, who identified clear benefits toward their workflow and productivity. This represents a key study to examine the use of clinical recommender decision support tools on physician ordering habits and patient care as applied toward inpatient emergency clinical scenarios.

There is wide variability in clinical practice, even in instances where are clear guideline-directed diagnostic and treatment algorithms for well-defined clinical problems.1 Such variability may compromise care quality, cost effectiveness, or expedient healthcare delivery.2 Healthcare systems have sought to improve both the quality of patient care and the EHR experience by providing standardized order sets.30–33 However, order sets may not align with individual cases with many extraneous, irrelevant, or contextual order options.(Lietal.2019;KumarandAllaudeen2016) They are also static: for instance, the manually-authored static order sets available in our own institution for deep venous thrombosis treatment still recommend warfarin therapy, despite direct oral anticoagulants largely becoming the current standard of practice.42 Guidance for up-to-date medical care clearly must come from multiple sources, and this points to one potential advantage of the collaborative filtering approach in that it can rapidly and automatically adapt to newly emerging practices.

Our recommender tool essentially functions as a dynamical clinical order set that continuously updates in response to new patient information, demonstrating increased accuracy and reduced need for conventional manual searches. While there are challenges to designing and maintaining such a system, there may be several benefits, including increased physician acceptance and usage (100% of physicians in this study who had the recommender available to them used despite being completely optional). The recommender system interface received largely positive views by our participants, suggesting that physicians will accept machine generated clinical order tools if they are embedded into clinical workflows. More importantly, our expert panel found no significant deterioration in the quality of the clinical orders, and physicians using the recommender system were not more likely to place clinically harmful orders. Notably, there remained substantial variability in the amount of clinically appropriate orders placed by different providers with or without the recommender system. These findings highlight the ongoing variability of clinical practice among physicians (even when additional point-of-care tools are given to them). While order sets have been shown to promote cost-effectiveness,38,39 further evaluation is needed to determine how much clinical recommender systems are promoting improved care with more useful orders vs. reducing cost-effectiveness with more unnecessary orders.

Time motion studies indicate that clinicians spend most of their time in the EHR,34,35 with many spending significant time searching for and entering orders.36 While this study showed a reduction on reliance of manual searches and navigational clicks, interestingly, it did not show a reduction in the amount of time that physicians spent per simulated case. The simulated test setting may have led participants to artificially fill the time within cases, or perhaps the reduction in manual search efforts freed their cognitive attention to attend more to the medical decision making tasks of each case. Additionally, other authors have shown that most of a clinicians time in the EHR is spent in data review (reviewing clinical notes, laboratory results, or diagnostic reports)(Chi et al. 2019; Wang et al. 2019; Ouyang et al. 2016; Ouyang et al. 2016), which was simplified in these clinical scenarios. For example, these patients had only a few notes, compared to many patients who may have hundreds. Future studies should consider the implementation of clinical recommender systems in real practice environments to assess whether they result in time savings for physicians navigating the EHR without compromising quality of care.

There are several limitations to this study. Our tool was based on a clinical data warehouse of electronic health records data that may not be available at all institutions. Similarly, the lack of a broadly accepted open architecture that allows for custom workflow integrations into common commercial EHRs, limits the ease of implementation of the system components studied. Our users were given an orientation of the recommender system and its purpose before engaging with the practice scenarios, which likely contributes a Hawthorne effect on how users interacted and viewed the system. Each testing session was pre-scheduled for a fixed time (1 hour for 5 test cases), which may have artificially constrained the variability in task completion time outcome. Finally, although our expert panel used previously validated methodology to devise a scoring system(Hsu and Sandford 2007), there was still moderate interrater agreement for some clinical orders, which limits the generalizability of these findings to other clinical scenarios or healthcare settings.

At a time when the EHR is met with distrust and negativity by clinicians from the burdens of documentation and data entry, clinical recommender systems represent a key opportunity to improve the quality, consistency, and experience of healthcare. This study represents an important step towards a future where EHRs anticipate clinical needs without even having to ask, so that clinicians can start to feel like the computers are working for them, instead of the other way around.

Conclusions

Clinical order suggestions from a data-driven recommender system were readily used and accepted by physicians across a variety of simulated inpatient clinical scenarios. This system mildly increased the number of clinical orders placed per case without compromising the number of clinically harmful or clinically irrelevant orders. Physicians were more likely to find the clinical orders they wanted using such tools as compared to manual search methods (i.e. superior precision and recall), but it did not change the overall amount of time they spent in a simulated EHR setting. Nonetheless, clinicians overall view such clinical recommender systems positively, perceiving a clear potential benefit toward their workflow.

Data Availability

Study data will be shared online.

Conflict of Interest Statements

JHC is co-founder of Reaction Explorer LLC that develops and licenses organic chemistry education software and has been paid consulting or speaker fees from the National Institute of Drug Abuse Clinical Trials Network, Tuolc Inc., and Roche Inc.

Appendix: Determination of Clinical Benefit/Harm of Orders

Instructions

The following instructions were given to each of the members of the expert panel to score orders for each clinical case. These instructions were given during the initial scoring phase of the project. Following independent grading by each member of the expert panel, the group convened to assign a consensus score for items that did not have complete interrater reliability. In instances where no consensus could be assigned, a score was not assigned.

Overview of Columns

Each column is separated by the following:

View this table:
  • View inline
  • View popup
  • Download powerpoint

Clinical_Item: Which corresponds to a particular order

Group: Corresponds to a category of orders (antibiotics, imaging, fluids, medications, diagnostics). Use your best judgement as to how you would group them.

Importance (Benefit vs Harm): This corresponds to whether an order is good, bad, or neutral. Graded on a −10 (extremely harmful) to +10 (extremely beneficial) scale to give a gradation.

Confidence: How confident an individual is with their score for an item (1-5 Scale). 1= not confident at all; 2-unconfident; 3=neutral; 4=confident; 5= very confident.

Time_Flag: Relates to the important decisional nodes for each case.

Notes: Make any notes about how you scored this case and why

Example

Imagine you had a male patient coming in with nonpurulent cellulitis of the lower extremity with severe sepsis/hypotension. Below are some sample orders:

View this table:
  • View inline
  • View popup
  • Download powerpoint

Some orders (Istat blood gases + lactate and IV Cefazolin) might be more relevant/beneficial.

  • You would score IV cefazolin as +10 (extremely beneficial), decide how important a lactate would be, and score a −10 for starting a new high-dose blood pressure medication in a patient with an active infection and severe sepsis.

  • A pregnancy test is neutral (score of 0) because it is clinically irrelevant.

Consider harm of procedures

If a patient with well-controlled atrial fibrillation presents to clinic and is currently in sinus rhythm, ordering a DCCV would be harmful without causing immediate benefit (hence the score should be negative).

Exclude cost considerations

when making a decision (including unnecessary labs)

Exclude reputational or cultural aspects

It might be ill-advised in real life to consult a pulmonologist for a patient with acute DKA without pulmonary findings, but if it was not immediately relevant to the case, assign 0 points (rather than negative for wasting a hospital resource).

Example 2

70 year old patient presenting with cough, fevers, and shortness of breath (e.g. presenting with suspected community acquired pneumonia). The patient gets worse if the provider doesn’t initiate initial antibiotics and fluids.

SAMPLE Orders for the Case

View this table:
  • View inline
  • View popup

In order to prevent participants who delayed appropriate care from getting scored the same as other individuals, half points should be awarded for delays in care (e.g. the case progresses to a new decisional node).

Certain orders may have consistent grading across all clinical nodes (they may be clinically irrelevant or always harmful).

lAcknowledgements

This research was supported in part by the NIH Big Data 2 Knowledge initiative via the National Institute of Environmental Health Sciences under Award Number K01ES026837, the Gordon and Betty Moore Foundation through Grant GBMF8040, and a Stanford Human-Centered Artificial Intelligence Seed Grant. This research used data or services provided by STARR, STAnford medicine Research data Repository,” a clinical data warehouse containing live Epic data from Stanford Health Care (SHC), the University Healthcare Alliance (UHA) and Packard Children’s Health Alliance (PCHA) clinics and other auxiliary data from Hospital applications such as radiology PACS. The STARR platform is developed and operated by Stanford Medicine Research IT team and is made possible by Stanford School of Medicine Research Office. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH, VA, or Stanford Healthcare.

Footnotes

  • Submission Guidelines Here: https://academic.oup.com/jamia/pages/General_Instructions, Objective, Materials and Methods, Results, Discussion, and Conclusion, The main text should, in addition to the sections corresponding to these headings, include a section describing Background and Significance.

References

  1. 1.↵
    Richardson, W. C. & Others. Crossing the quality chasm: a new health system for the 21st century. 2001, Institute of Medicine. National Academy Press
  2. 2.↵
    Tricoci, P., Allen, J. M., Kramer, J. M., Califf, R. M. & Smith, S. C., Jr.. Scientific evidence underlying the ACC/AHA clinical practice guidelines. JAMA 301, 831–841 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  3. 3.↵
    Health and Human Services Department. Health Information Technology: Standards, Implementation Specifications, and Certification Criteria for Electronic Health Record Technology, 2014 Edition; Revisions to the Permanent Certification Program for Health Information Technology. Federal Register 77, 54163–54292 (2012).
    OpenUrl
  4. 4.↵
    de Lissovoy, G. Big data meets the electronic medical record: a commentary on ‘identifying patients at increased risk for unplanned readmission’. Med. Care 51, 759–760 (2013).
    OpenUrlCrossRefPubMed
  5. 5.↵
    Frankovich, J., Longhurst, C. A. & Sutherland, S. M. Evidence-based medicine in the EMR era. N. Engl. J. Med. 365, 1758–1759 (2011).
    OpenUrlCrossRefPubMedWeb of Science
  6. 6.
    Longhurst, C. A., Harrington, R. A. & Shah, N. H. A ‘green button’ for using aggregate patient data at the point of care. Health Aff. 33, 1229–1235 (2014).
    OpenUrlAbstract/FREE Full Text
  7. 7.
    Smith, M. et al. A Continuously Learning Health Care System. (National Academies Press (US), 2013).
  8. 8.↵
    Krumholz, H. M. Big data and new knowledge in medicine: the thinking, training, and tools needed for a learning health system. Health Aff. 33, 1163–1170 (2014).
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    Kaushal, R., Shojania, K. G. & Bates, D. W. Effects of computerized physician order entry and clinical decision support systems on medication safety: a systematic review. Arch. Intern. Med. 163, 1409–1416 (2003).
    OpenUrlCrossRefPubMedWeb of Science
  10. 10.↵
    Overhage, J. M., Tierney, W. M., Zhou, X. H. & McDonald, C. J. A randomized trial of ‘corollary orders’ to prevent errors of omission. J. Am. Med. Inform. Assoc. 4, 364–375 (1997).
    OpenUrlCrossRefPubMed
  11. 11.↵
    Bates, D. W. et al. Ten commandments for effective clinical decision support: making the practice of evidence-based medicine a reality. J. Am. Med. Inform. Assoc. 10, 523–530 (2003).
    OpenUrlCrossRefPubMed
  12. 12.↵
    Li, R. C., Wang, J. K., Sharp, C. & Chen, J. H. When order sets do not align with clinician workflow: assessing practice patterns in the electronic health record. BMJ Qual. Saf. (2019). doi:10.1136/bmjqs-2018-008968
    OpenUrlAbstract/FREE Full Text
  13. 13.↵
    Sittig, D. F. et al. Grand challenges in clinical decision support. J. Biomed. Inform. 41, 387–392 (2008).
    OpenUrlCrossRefPubMedWeb of Science
  14. 14.↵
    Middleton, B., Sittig, D. F. & Wright, A. Clinical Decision Support: a 25 Year Retrospective and a 25 Year Vision. Yearb. Med. Inform. Suppl 1, S103–16 (2016).
  15. 15.↵
    Chen, J. H., Goldstein, M. K., Asch, S. M., Mackey, L. & Altman, R. B. Predicting inpatient clinical order patterns with probabilistic topic models vs conventional order sets. J. Am. Med. Inform. Assoc. 24, 472–480 (2017).
    OpenUrl
  16. 16.↵
    Chen, J. H., Podchiyska, T. & Altman, R. B. OrderRex: clinical order decision support and outcome predictions by data-mining electronic medical records. J. Am. Med. Inform. Assoc. 23, 339–348 (2016).
    OpenUrlCrossRefPubMed
  17. 17.↵
    Chen, J. H. & Altman, R. B. Data-Mining Electronic Medical Records for Clinical Order Recommendations: Wisdom of the Crowd or Tyranny of the Mob? AMIA Jt Summits Transl Sci Proc 2015, 435–439 (2015).
    OpenUrl
  18. 18.↵
    Wang, J. K. et al. An evaluation of clinical order patterns machine-learned from clinician cohorts stratified by patient mortality outcomes. J. Biomed. Inform. 86, 109–119 (2018).
    OpenUrl
  19. 19.↵
    Linden, G., Smith, B. & York, J. Amazon.com recommendations: item-to-item collaborative filtering. IEEE Internet Comput. 7, 76–80 (2003).
    OpenUrlCrossRef
  20. 20.↵
    Smith, B. & Linden, G. Two Decades of Recommender Systems at Amazon.com. IEEE Internet Comput. 21, 12–18 (2017).
    OpenUrl
  21. 21.↵
    Zhang, Y., Levin, J. E. & Padman, R. Data-driven order set generation and evaluation in the pediatric environment. AMIA Annu. Symp. Proc. 2012, 1469–1478 (2012).
    OpenUrlPubMed
  22. 22.
    Klann, J., Schadow, G. & McCoy, J. M. A recommendation algorithm for automating corollary order generation. AMIA Annu. Symp. Proc. 2009, 333–337 (2009).
    OpenUrlPubMed
  23. 23.
    Wright, A. P., Wright, A. T., McCoy, A. B. & Sittig, D. F. The use of sequential pattern mining to predict next prescribed medications. J. Biomed. Inform. 53, 73–80 (2015).
    OpenUrlCrossRef
  24. 24.
    Chen, J. H., Goldstein, M. K., Asch, S. M. & Altman, R. B. Usability of an Automated Recommender System for Clinical Order Entry. in AMIA (2016).
  25. 25.↵
    King, A. J. et al. Using Machine Learning to Predict the Information Seeking Behavior of Clinicians Using an Electronic Medical Record System. AMIA Annu. Symp. Proc. 2018, 673–682 (2018).
    OpenUrl
  26. 26.↵
    Emani, S. et al. Physician Beliefs about the Meaningful Use of the Electronic Health Record: A Follow-Up Study. Appl. Clin. Inform. 8, 1044–1053 (2017).
    OpenUrl
  27. 27.
    Verghese, A., Shah, N. H. & Harrington, R. A. What This Computer Needs Is a Physician: Humanism and Artificial Intelligence. JAMA 319, 19–20 (2018).
    OpenUrlCrossRefPubMed
  28. 28.↵
    Gawande, A. Why Doctors Hate Their Computers. The New Yorker (2018).
  29. 29.↵
    Lowe, H. J., Ferris, T. A., Hernandez, P. M. & Weber, S. C. STRIDE--An integrated standards-based translational research informatics platform. AMIA Annu. Symp. Proc. 2009, 391–395 (2009).
    OpenUrlPubMed
  30. 30.↵
    Brown, K. E., Johnson, K. J., DeRonne, B. M., Parenti, C. M. & Rice, K. L. Order Set to Improve the Care of Patients Hospitalized for an Exacerbation of Chronic Obstructive Pulmonary Disease. Ann. Am. Thorac. Soc. 13, 811–815 (2016).
    OpenUrlPubMed
  31. 31.
    Radosevich, M. A. et al. Implementation of a Goal-Directed Mechanical Ventilation Order Set Driven by Respiratory Therapists Improves Compliance With Best Practices for Mechanical Ventilation. J. Intensive Care Med. 34, 550–556 (2019).
    OpenUrl
  32. 32.
    Nichols, K. R., Petschke, A. L., Webber, E. C. & Knoderer, C. A. Comparison of Antibiotic Dosing Before and After Implementation of an Electronic Order Set. Appl. Clin. Inform. 10, 229–236 (2019).
    OpenUrl
  33. 33.↵
    Zeidan, A. M. et al. Impact of a venous thromboembolism prophylaxis ‘smart order set’: Improved compliance, fewer events. Am. J. Hematol. 88, 545–549 (2013).
    OpenUrlCrossRefPubMed
  34. 34.↵
    Desai, S. V. et al. Education Outcomes in a Duty-Hour Flexibility Trial in Internal Medicine. N. Engl. J. Med. 378, 1494–1508 (2018).
    OpenUrlPubMed
  35. 35.↵
    Kumar, A. & Chi, J. Duty-Hour Flexibility Trial in Internal Medicine. The New England journal of medicine 379, 300 (2018).
    OpenUrl
  36. 36.↵
    Ouyang, D., Chen, J. H., Hom, J. & Chi, J. Internal Medicine Resident Computer Usage: An Electronic Audit of an Inpatient Service. JAMA Intern. Med. 176, 252–254 (2016).
    OpenUrl
  37. 37.
    Tunkel, A. R. et al. Practice guidelines for the management of bacterial meningitis. Clin. Infect. Dis. 39, 1267–1284 (2004).
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.↵
    Fleming, N. S., Ogola, G. & Ballard, D. J. Implementing a standardized order set for community-acquired pneumonia: impact on mortality and cost. Jt. Comm. J. Qual. Patient Saf. 35, 414–421 (2009).
    OpenUrl
  39. 39.↵
    1. Henriksen, K.,
    2. Battles, J. B.,
    3. Keyes, M. A. &
    4. Grady, M. L.
    Ballard, D. J. et al. The Impact of Standardized Order Sets on Quality and Financial Outcomes. in Advances in Patient Safety: New Directions and Alternative Approaches (Vol. 2: Culture and Redesign) (eds. Henriksen, K., Battles, J. B., Keyes, M. A. & Grady, M. L.) (Agency for Healthcare Research and Quality (US), 2011).
Back to top
PreviousNext
Posted February 26, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Usability of a Machine-Learning Clinical Order Recommender System Interface for Clinical Decision Support and Physician Workflow
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Usability of a Machine-Learning Clinical Order Recommender System Interface for Clinical Decision Support and Physician Workflow
Andre Kumar, Jonathan Chiang, Jason Hom, Lisa Shieh, Rachael Aikens, Michael Baiocchi, David Morales, Divya Saini, Mark Musen, Russ Altman, Mary K Goldstein, Steven Asch, Jonathan H Chen
medRxiv 2020.02.24.20025890; doi: https://doi.org/10.1101/2020.02.24.20025890
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Usability of a Machine-Learning Clinical Order Recommender System Interface for Clinical Decision Support and Physician Workflow
Andre Kumar, Jonathan Chiang, Jason Hom, Lisa Shieh, Rachael Aikens, Michael Baiocchi, David Morales, Divya Saini, Mark Musen, Russ Altman, Mary K Goldstein, Steven Asch, Jonathan H Chen
medRxiv 2020.02.24.20025890; doi: https://doi.org/10.1101/2020.02.24.20025890

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (174)
  • Allergy and Immunology (419)
  • Anesthesia (97)
  • Cardiovascular Medicine (895)
  • Dentistry and Oral Medicine (167)
  • Dermatology (101)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (404)
  • Epidemiology (8757)
  • Forensic Medicine (4)
  • Gastroenterology (403)
  • Genetic and Genomic Medicine (1856)
  • Geriatric Medicine (176)
  • Health Economics (386)
  • Health Informatics (1283)
  • Health Policy (642)
  • Health Systems and Quality Improvement (490)
  • Hematology (206)
  • HIV/AIDS (389)
  • Infectious Diseases (except HIV/AIDS) (10535)
  • Intensive Care and Critical Care Medicine (564)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (217)
  • Neurology (1747)
  • Nursing (100)
  • Nutrition (265)
  • Obstetrics and Gynecology (342)
  • Occupational and Environmental Health (458)
  • Oncology (961)
  • Ophthalmology (279)
  • Orthopedics (107)
  • Otolaryngology (174)
  • Pain Medicine (117)
  • Palliative Medicine (41)
  • Pathology (263)
  • Pediatrics (556)
  • Pharmacology and Therapeutics (264)
  • Primary Care Research (218)
  • Psychiatry and Clinical Psychology (1840)
  • Public and Global Health (3971)
  • Radiology and Imaging (650)
  • Rehabilitation Medicine and Physical Therapy (341)
  • Respiratory Medicine (532)
  • Rheumatology (215)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (165)
  • Surgery (196)
  • Toxicology (37)
  • Transplantation (106)
  • Urology (79)