Systematic approach to outcome assessment from coded electronic healthcare records in the DaRe2THINK NHS-embedded randomised trial ================================================================================================================================== * Xiaoxia Wang * Alastair R Mobley * Otilia Tica * Kelvin Okoth * Rebecca E Ghosh * Puja Myles * Tim Williams * Sandra Haynes * Krishnarajah Nirantharakumar * David Shukla * Dipak Kotecha * the DaRe2THINK Trial Committees ## ABSTRACT **Background** Improving the efficiency of clinical trials is key to their continued importance in directing evidence-based patient care. Digital innovations, in particular the use of electronic healthcare records (EHR), allow for large-scale screening and follow-up of participants. However, it is critical these developments are accompanied by robust and transparent methods that can support high quality and high clinical value research. **Methods** The DaRe2THINK trial includes a series of novel processes, including nationwide pseudonymised pre-screening of the primary care EHR across England, digital enrolment, remote e-consent, and ‘no-visit’ follow-up by linking all primary and secondary care health data with patient-reported outcomes. **Findings** DaRe2THINK is a pragmatic, healthcare-embedded randomised trial testing whether earlier use of direct oral anticoagulants in patients with prior or current atrial fibrillation can prevent thromboembolic events and cognitive decline ([www.birmingham.ac.uk/dare2think](http://www.birmingham.ac.uk/dare2think)). This paper outlines the systematic approach and methodology employed to define patient information and outcome events. This includes transparency on all medical code lists and phenotypes used in the trial across a variety of national data sources, including Clinical Practice Research Datalink Aurum (primary care), Hospital Episode Statistics (secondary care) and the Office for National Statistics (mortality). **Interpretation** Co-designed by a patient and public involvement team, DaRe2THINK presents an opportunity to transform the approach to randomised trials in the setting of routine healthcare, providing high-quality evidence generation in populations representative of the community at-risk. Key Words * Randomised controlled trial * Electronic healthcare record * Coding * Atrial fibrillation * Anticoagulation * Primary care * Secondary care ## INTRODUCTION Rapid uptake of electronic healthcare record (EHR) systems across the world have led to the opportunity for large-scale, real-world, longitudinal clinical research.[1] In countries with national health systems such as the United Kingdom, there is also the potential to link all EHRs for an individual patient, across both primary and secondary care, and from birth to death. Standardised coding systems provide the basis for harnessing medical information from different healthcare providers, with linkage allowing for a complete picture of each patient’s history, healthcare utilisation and adverse events.[2] This is an attractive (and cost-efficient) prospect for clinical research, particular as the quality of EHR data is typically high due to the need for accurate billing and reimbursement purposes. [3-5] Observational clinical research has benefited for many decades from EHR data. Evolving technology and coding systems, coupled with more complete EHR coverage, has provided the opportunity for randomised controlled trials (RCT) to be embedded alongside EHR systems, taking advantage of existing clinical data and follow-up. The coronavirus pandemic has epitomised how innovations in the design and running of RCTs are critical to address unmet clinical need. However, RCTs remain too costly, tend to recruit selective patients within high-performing centres, and frequently do not match the real population at-risk in our communities. The DaRe2 approach (healthcare Data for pragmatic clinical Research in the NHS – primary 2 secondary) was designed to incorporate recent innovations in EHR and clinical research, combined with advances in mobile technology, to provide an end-to-end framework for RCTs embedded in Primary Care in England. The DaRe2THINK trial provides an exemplar of a “remote-RCT” to address a key public health concern, requiring no physical patient visits despite being a clinical trial of an investigational medicinal product (CTIMP).[6] In this paper, we outline the design features of DaRe2THINK, and the systematic approach used to define patient information and outcome events that will be accrued during the trial. All medical code lists and phenotypes are presented for full transparency, and for future use by other researchers using EHR data. ## THE DaRe2THINK TRIAL Using the digital potential of the EHR, DaRe2THINK is transformational project that will underpin a new trajectory for NHS-based research for patient benefit. DaRe2THINK is led by the University of Birmingham Institute of Cardiovascular Sciences, in collaboration with the Clinical Practice Research Datalink (CPRD; part of the Medicines and Healthcare products Regulatory Agency). The trial is supported by the University Hospitals Birmingham NHS Foundation Trust, University of Birmingham Institute of Applied Health Research and Centre for Patient Reported Outcomes Research, Aston University, London School of Economics, University of Oxford Nuffield Department of Population Health, NIHR Clinical Research Network (West Midlands Primary Care), and Health Data Research UK Midlands. ### DaRe2 platform features 1. Automated secure screening of inclusion and exclusion criteria across more than 13 million NHS patients, including around 1 in 4 primary care sites in England that are part of CPRD. This provides rapid and cost-efficient screening of a diverse and representative proportion of the UK population[7], yet maintains patient privacy using pseudonymised records that can only be re-identified by local NHS staff. 2. Targeted enrolment of primary care sites identified as having potentially-eligible patients, streamlining recruitment by focusing resources on high-value sites. 3. Dynamic adaptive screening of the population at each site, with updates on a weekly basis of newly-eligible patients, reducing burden on local NHS staff and simplifying recruitment. 4. Easy enrolment of patients, with remote e-consent using their own smartphone/tablet/computer, one-click randomisation by primary care staff, and drug prescription via usual clinical systems. 5. ‘No-visit’ follow-up, utilising NHS records linked across national primary and secondary care for capture of endpoints without the patient attending the healthcare site, and no need for NHS investigators to complete case report forms. 6. Regular scheduled patient-reported outcomes, with requests sent by automated text message to each participant’s telephone and email, and secure data acquisition through online completion. 7. Automated collation of safety outcomes from the primary care NHS record, avoiding the possibility of missed events unknown to the local investigator. 8. Co-design and management of all processes by a Patient and Public Involvement (PPI) team [8], supporting a patient-centric approach with sustainable and valued output. ### DaRe2THINK design DaRe2THINK will be the first exemplar of this system, and is focused on the intersection of key national priorities for health and social care services – the common heart rhythm disorder called atrial fibrillation (AF) and the impact this condition has on thromboembolic events, including long-term cognitive decline and vascular dementia. Patients with AF suffer from a high rate of morbidity, with one in four stroke patients having AF as a potential cause, half of AF patients developing heart failure that responds poorly to treatment, and the rate of death doubled at all ages.[9-11] Most patients with AF develop progressive subclinical cerebral damage over time [12, 13], with high rates of cognitive impairment and a 40% increased risk of dementia compared to normal sinus rhythm.[14, 15] As the prevalence of AF is expected to double in the coming decades [16], this will place an unsustainable burden on healthcare and social services. DaRe2THINK is a pragmatic, NHS-embedded, open-label, event-driven, parallel-group RCT. It will test whether earlier use of anticoagulants is effective and cost-effective at preventing thromboembolic events (primary outcome) and cognitive decline (key secondary outcome) in patients with AF at low or intermediate risk of stroke. Around 3,000 patients will be randomised to either starting a direct oral anticoagulant (DOAC; used due to their excellent safety and efficacy profile [17, 18]), or continuing standard-of-care where anticoagulation is instituted when the patient is older with clear risk factors for stroke.[19] A summary of the DaRe2THINK trial is presented in **Figure 1**. Due to the ability to access pseudonymised NHS records, DaRe2THINK will be able to adapt inclusion and exclusion criteria to reflect current changes in practice and extraneous factors (e.g. the coronavirus pandemic). Recruitment has commenced across England, and the current DaRe2THINK protocol is available at [https://www.birmingham.ac.uk/dare2think](https://www.birmingham.ac.uk/dare2think). The trial has received ethical/Health Research Authority approval (REC number: 21/NE/0021; IRAS project ID: 290420), regulatory approval (MHRA CTA 21761/0364) and is registered at [Clinicaltrials.gov](http://Clinicaltrials.gov) ([NCT04700826](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04700826&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom)), ISRCTN (21157803) and EudraCT (2020-005774-10). A plain English summary written by the PPI team is provided in **Table 1**. View this table: [Table 1:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/T1) Table 1: Plain English summary of DaRe2THINK ![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/05/25/2022.05.24.22275434/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/F1) Figure 1: DaRe2THINK key innovations Flowchart of the DaRe2 approach (healthcare Data for pragmatic clinical Research in the NHS – primary 2 secondary) and summary of the DaRe2THINK clinical trial. * Please see current protocol at [www.birmingham.ac.uk/dare2think](http://www.birmingham.ac.uk/dare2think) for full details of inclusion and exclusion criteria. CPRD = Clinical Practice Research Datalink; DOAC = direct oral anticoagulant; NHS = National Health Service. ## SYSTEMATIC CODING METHODOLOGY The DaRe2 approach links and combines NHS data from primary care, secondary care and other national databases in order to provide a full and complete picture of each participant’s health and healthcare utilisation. The UK public health system is free at the point of delivery to all citizens, with indirect reimbursement via local health authorities based on coded data for diseases and procedures performed. In this section, we detail the processes employed in the DaRe2THINK trial to systematically catalogue and incorporate all relevant codes, including for selection criteria, baseline variables, outcome endpoints. In addition, DaRe2THINK includes monthly automated searches of the primary care record to assess for safety events. For transparency, all codes used are presented in **Appendix 1** for other researchers to see, comment, update and re-use. The DaRe2THINK trial adheres to the CODE-EHR best practice framework for the use of structured electronic healthcare records in clinical research.[2] This study meets all five of the CODE-EHR minimum standards, and will in addition meet all 5 standards for preferred criteria once completed; further details are presented in **Appendix 2**. The DaRe2THINK investigators are committed to the FAIR principles (Findable, Accessible, Interoperable and Reusable).[20] ### Data sources Primary care data are obtained through CPRD Aurum, a prospectively-collected, population-based, pseudonymised medical record database that collects daily information directly from NHS primary care sites that are part of the CPRD network and use the Egton Medical Information Systems (EMIS) software system. As of March 2021 when DaRe2THINK was initiated, CPRD Aurum included 39,555,354 research acceptable patients of which 13,299,826 were actively registered (19.9% of the UK population) across 1,375 active primary care sites (15.3% of UK general practices). All baseline data for DaRe2THINK trial participants is extracted from CPRD Aurum, obviating the need for investigators to fill in case report forms. CPRD Aurum includes codes for diagnosis and non-prescription data (medcodeid), which is cross-mapped to SNOMED CT (UK edition) [21] and Read Version 2 [22], with prescriptions by product code (prodcodeid) mapped to the Dictionary of Medicines and Devices.[23] Secondary care Hospital Episode Statistics (HES) Admitted Patient Care data, which contains information on all admissions to NHS hospitals in England, as well as NHS-funded care at independent providers, are linked at the patient level to CPRD Aurum by NHS Digital. With around 99% of hospital activity in England funded by the NHS, this provides almost total coverage of healthcare utilisation for the DaRe2THINK trial, including admission and discharge date, admission type (pre-planned or emergency), primary diagnosis (reason for hospitalisation), secondary diagnoses, and any procedures performed during the hospital stay. HES uses the International Classification of Diseases (ICD) version 10 for diagnosis codes [24], and the Office of Population Censuses and Surveys Classification of Surgical Operation and Procedures (OPCS) version 4 for procedures.[25] Death and cause of death are obtained via linkage with the Office for National Statistics (ONS) mortality database [26], which includes all deaths in England occurring both within and outside any healthcare setting. Data in ONS is coded using ICD-10. ### Development of the medical code lists To achieve a comprehensive set of contemporary codes which are suitable for defining outcome and safety events in a clinical trial, we designed and employed a four-phase systematic framework to develop code lists for data extraction from the listed data sources. The approach is summarised in **Figure 2**. Throughout the four phases we used DExtER, an automated platform for clinical epidemiology that was developed to support clinical code list development and validation of phenotypes.[27] ![Figure 2:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2022/05/25/2022.05.24.22275434/F2.medium.gif) [Figure 2:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/F2) Figure 2: DaRe2THINK coding framework Systematic, iterative approach to defining code lists from national healthcare data sources. * Study-Within-A-Trial to be performed in a subset of patients with clinical events identified using primary and secondary healthcare records.[28] #### Phase 1: Phenotype specification Prior to searching for relevant medical codes, the DaRe2THINK trial selection criteria and outcome measures were first transformed into relevant phenotypes for each data source. This was an iterative process that accounted for the limitations of the data source and, where needed, combined phenotypes to derive a particular outcome. For example, ‘major bleeding’ is not recorded as such in any primary care dataset, hence we combined a phenotype list for bleeding events with a list generated to identify concurrent hospitalisation. This provided a code list for the safety outcome of ‘bleeding resulting in hospitalisation’ based solely on CPRD Aurum data. This process was facilitated by the Data Coordination Team, a specific DaRe2THINK committee which includes clinical expertise in primary and secondary care, data scientists with knowledge and experience in using medical codes, and representatives of CPRD. #### Phase 2: Systematic code searching A systematic search was carried out against the agreed phenotypes using the medical code dictionaries (SNOMED CT and Read codes for CPRD Aurum, and ICD-10 codes for HES and ONS). When searching medical code dictionaries, search terms and acronyms were agreed with clinical input. The hierarchy structure of each dictionary was used to find additional relevant children or sibling codes, and these were collated into a dynamic code list. The list was supplemented by any codes identified by other researchers after screening of previously-published and relevant research. To facilitate future use, or if there are updates to coding systems during the trial follow-up period, all search terms and searching scripts are saved as meta data. Medication codes required in the phenotyping algorithms were extracted from the CPRD Aurum product lists based on both British National Formulary (BNF) codes and drug substance names. CPRD provide code lists for demographics and measurements of ethnicity, height, weight, blood pressure, etc. #### Phase 3: Iterative and clinical review The codes found in the systematic search went through a multi-stage review process. This included review by data and clinical scientists. Code lists were reviewed independently by two individuals, marking codes for inclusion, exclusion or further review. Discrepancies between the reviewers and any queries were resolved by consensus agreement after adjudication by a senior investigator. All resultant code lists were then further reviewed by senior clinical experts to ensure that code lists were correct and consistent. #### Phase 4: Analysis of prevalence and validation For validation purposes, the prevalence of conditions based on the final code lists was then cross-checked with prevalence estimates from relevant publications and the Quality & Outcomes Framework (a national system designed to remunerate general practices for providing good quality care to their patients). Results were discussed with the DaRe2THINK Data Coordination Team, and then suggested changes iteratively fed back to phases 1 through 3 as appropriate. This process will be updated regularly through implementation so that phenotype definitions can be adjusted accordingly, for example if there is a change to coding systems. A Study-Within-A-Trial (SWAT) is also planned in a subset of participants to compare outcome events obtained from coded primary and secondary EHR data with in-depth and granular information extracted from one of the UK’s largest hospital trusts (University Hospitals Birmingham NHS Foundation Trust). Further details are available in the prospectively-registered Medical Research Council SWAT repository [28]; in brief, this will involve extraction of text using machine learning methods from letters, clinical notes, imaging and time-series data to determine accuracy and missingness compared to primary and secondary EHR data. ## RESULTS & FINAL CODE LISTS Using the systematic approach discussed above, we searched a total of 1,176,611 codes, including 1,159,849 from primary care, and 16,762 from secondary care and national databases. This led to the creation of a total of 77 code lists across the different sections of the trial. ### Participant selection criteria The DaRe2THINK trial uses automated pre-screening of the primary care EHR to determine inclusion and exclusion according to the defined selection criteria. **Table 2** lists the main components of the selection criteria and associated CPRD Aurum code lists. A total of 3,494 codes were included in these code lists using the systematic approach, including contribution from 14 published code lists.[29, 30] Individual codes for each list can be found in **Appendix 1 Tables S1 to S23**. View this table: [Table 2:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/T2) Table 2: Summary of code lists for selection criteria ### Outcome events and safety search Outcome events in the DaRe2THINK trial are collated from primary care, secondary care and national mortality data. Automated safety searches are also performed on a monthly basis in the primary care EHR. **Table 3** provides a summary of the code lists used for outcome events, including 4,434 codes from CPRD Aurum and 610 codes from HES and ONS. For safety events, 3,081 CPRD Aurum codes were used to determine ischaemic stroke, intracranial bleeding, gastrointestinal bleeding and other bleeding, with 483 codes to define associated hospitalisation. Code lists were supplemented after review of 27 code lists from existing published literature [31-44], with the final list of individual codes detailed in **Appendix 1 Tables S24 to S44**. View this table: [Table 3:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/T3) Table 3: Summary of code lists for outcome events ### Baseline characteristics To remove the need for case report forms, baseline characteristics in the DaRe2THINK trial are extracted from the primary care EHR. **Table 4** provides a summary of these code lists, with a total of 6,802 codes from CPRD Aurum. Measurements and blood results are extracted as the latest value in the past 12 months, medications during the last 12 months, and health conditions throughout the participant’s primary care EHR. For heart failure, hypertension and diabetes, diagnosis is classified in two ways; (1) use of one or more of the relevant codes identified in the EHR, and (2) a ‘secure’ diagnosis based on the code identified *in addition* to an associated medication in the last 12 months (loop diuretic therapy, antihypertensive medications or oral antidiabetic/insulin, respectively). Codes are provided in **Appendix 1 Tables S1 to S77**. View this table: [Table 4:](http://medrxiv.org/content/early/2022/05/25/2022.05.24.22275434/T4) Table 4: Summary of code lists for baseline characteristics ## DISCUSSION The DaRe2 framework was designed to link together a series of innovations for a digital clinical trial, creating a patient-centred approach with high quality output, yet low burden on healthcare staff. The ability to pre-screen from over 13 million NHS patient records allows for targeted and efficient recruitment, and simplifies enrolment of patients that are representative of the real-world population. Combining all health data from primary and secondary care sources nationally enables a ‘remote’ RCT, with no requirement for baseline or follow-up visits. These processes are supported by advances in digital technology, including remote e-consent and patient reported outcomes. The DaRe2THINK trial is using this sequence of innovations to test a hypothesis of critical importance to public health, with the aim of preventing the long-term consequences of atrial fibrillation, including cognitive decline and vascular dementia. This paper outlines our approach to defining phenotypes that support all aspects of the trial, from understanding the characteristics of recruited participants, through to determination of clinical endpoints. Unlike conventional trials with case report forms, DaRe2THINK extracts all relevant information from structured and coded healthcare data sources. Hence, it is critical for dissemination and transparency that all coding schema for this trial are pre-published and available for evaluation (plus re-use) by other researchers and clinicians. In a published review of 450 EHR-based studies, only 19 (5.1%) were accompanied by a full set of clinical codes [45], severely limiting the value of those studies and their implementation to routine care. The emergence of EHR systems across the world provides an important opportunity for healthcare-embedded clinical research [1], but only when this is accompanied by a clear pathway from data collection to interpretation.[2] The broad scope of coded healthcare data has been a limiting factor in prior attempts at EHR-embedded clinical trials. As evidenced by the large number of codes screened for this trial (over a million), a robust and systematic approach is required to ensure that patient events are not missed. The UK NHS is ideally suited to operationalise this sort of innovative trial, as structured healthcare data is used for quality and reimbursement purposes [4] and there is an ability to link across different healthcare sources.[46] Linkage is critical to understanding the full healthcare utilisation of each participant.[3] In this case allowing for the combination of historical and future primary care data, with that pertaining to hospitalisation and procedures across the NHS nationally, as well as information on cause of death from the national register. EHR-based trials may have an advantage in this regard compared to traditional trials, where investigators are relied on to document safety and outcome events. In the modern era of multiple heath providers for a multitude of different health conditions, sourcing safety and outcome events from national data has the potential to avoid missing endpoints, and potentially eliminate ascertainment bias in unblinded research. Although the value of trials based on routine EHR data is likely to vary depending on the disease being researched [47], it is clear that the direction of travel is to enhance existing clinical trial infrastructure with digital innovations.[48] In this regard, DaRe2THINK is the vanguard for cost-efficient RCTs embedded in clinical care. Previous trials have shown that although endpoints based on structured healthcare data may not replicate the traditional adjudication committee output, the estimated intervention effect is almost identical for adjudicated compared to EHR follow-up.[49, 50]. There remain substantial limitations to effective use of EHR data for clinical trials, including ongoing changes to coding systems. For example, ICD-10 includes 68,000 codes (a number of countries have their own editions, with ICD-11 launched this year), and SNOMED CT has over 300,000 clinical concepts. New codes can also be created within primary healthcare databases, highlighting the importance of iterative review and updates as exemplified by our coding framework. Universal limitations, such as missing data, variation in data quality, reliability of coding and imprecision in the choice of code used, are common across real-world data sources. These and specific areas of bias such as lack of representativeness and loss to follow-up are substantially minimised in DaRe2THINK due to the NHS being the de facto provider of almost all healthcare in the UK, and CPRD providing broad and representative coverage. In this setting, concordance between primary and secondary care health data is high. For example, in a random sample of 50,000 patients in CPRD Aurum, 94% of the 1,260 patients with a code for myocardial infarction had corroborating evidence in the secondary care HES records.[3] Similar to our transparent approach to coding, the process of integrating primary and secondary healthcare sources in DaRe2THINK will be documented in an open-access, prospectively-published Statical Analysis Plan. The DaRe2 approach was only possible with the support and guidance of our Patient and Public Involvement (PPI) team. We are indebted to the group for their constructive criticism throughout the design process, and their ongoing contribution to trial management to produce sustainable systems focused on the needs and wants of patients. Achieving a social licence for data-related clinical research is critical, and only achievable with high-level involvement by the public, for example using the PPI-POSITIVE approach.[8] ## CONCLUSION The DaRe2THINK trial is using a series of digital innovations to reshape the deployment of an adaptive randomised clinical trial embedded in routine healthcare, with minimal burden for staff and patients. Integrating national healthcare data from primary and secondary sources, the system will provide evidence for a key public health concern, the prevention of cognitive decline and dementia in the rapidly growing number of patients with atrial fibrillation. Systematic and transparent methodology in the use of structured healthcare data, together with a patient and public mandate, are critical for evolution of these novel approaches in order to improve healthcare in our communities. ## Supporting information Appendix 1 [[supplements/275434_file04.pdf]](pending:yes) Appendix 2 [[supplements/275434_file05.pdf]](pending:yes) ICMJE [[supplements/275434_file06.zip]](pending:yes) ## Data Availability No data is presented in the manuscript - this paper provides all codes used in the trial (see appendix) [https://www.birmingham.ac.uk/d2t](https://www.birmingham.ac.uk/d2t) ## Authors’ contributions The coding framework was designed and implemented by XW, OT, KO, KN, DS and DK. The manuscript was drafted by XW, ARM and DK, with all other authors editing the manuscript for intellectual content. DK provided supervision and was responsible for the decision to submit the manuscript. ## Data Sharing No additional data are available at this time. ## Competing interests All authors have completed the ICMJE uniform disclosure form ([www.icmje.org/coi\_disclosure.pdf](http://www.icmje.org/coi_disclosure.pdf)) and declare: Dr. Wang and Mr. Mobley’s posts are funded by the National Institute of Health Research (NIHR) (130280). Dr. ica’s post was funded by the EU/EFPIA Innovative Medicines Initiative (BigData@Heart 116074) and a grant from Amomed Pharma, outside the submitted work. Dr. Okoth has nothing to disclose. Dr. Ghosh reports grants from NIHR, during the conduct of the study. Dr. Myles has nothing to disclose. Dr. Williams has nothing to disclose. Mrs. Haynes reports a grants from NIHR (130280), during the conduct of the study. Prof. Nirantharakumar reports grants from NIHR, during the conduct of the study; and research grants from UKRI/MRC, Kennedy Trust for Rheumatology Research, Health Data Research UK, Wellcome Trust, European Regional Development Fund, Institute for Global Innovation, Boehringer Ingelheim, Action Against Macular Degeneration Charity, Midlands Neuroscience Teaching and Development Funds, South Asian Health Foundation, Vifor Pharma, College of Police, and CSL Behring, outside the submitted work; and consulting fees from Boehringer Ingelheim, Sanofi, CEGEDIM, MSD and holds a leadership/fiduciary role with NICST, a charity and OpenClinical, a Social Enterprise; outside the submitted work. Dr. Shukla reports grants from NIHR (130280), during the conduct of the study; and grants from Pfizer outside the submitted work. Prof. Kotecha reports grants from NIHR (NIHR CDF-2015-08-074 RATE-AF; NIHR130280 DaRe2THINK; NIHR132974 D2T-NeuroVascular), the British Heart Foundation (PG/17/55/33087, AA/18/2/34218 and FS/CDRF/21/21032), the EU/EFPIA Innovative Medicines Initiative (BigData@Heart 116074), and the European Society of Cardiology supported by educational grants from Boehringer Ingelheim/BMS-Pfizer Alliance/Bayer/Daiichi Sankyo/Boston Scientific, the NIHR/University of Oxford Biomedical Research Centre and British Heart Foundation/University of Birmingham Accelerator Award (STEEER-AF [NCT04396418](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04396418&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom)). In addition, he has received a research grant to his institution from Pfizer; and research grants and advisory board fees from Bayer, Amomed and Protherics Medicines Development; all outside the submitted work. ## Funding The DaRe2THINK trial is funded by the National Institute of Health Research (NIHR130280). Medications in the trial are funded by the National Health Service (UK Department of Health and Social Care). The opinions expressed in this paper are those of the authors and do not represent the NIHR or the UK Department of Health and Social Care. ## Acknowledgements In memory of Jaqueline Jones, member of the Patient and Public Involvement team. We would like to acknowledge all members of the DaRe2THINK team and trial committees: **Trial Management Group:** Susan Beatty (Clinical Practice Research Datalink, MHRA), Samir Mehta (Birmingham Clinical Trials Unit), Sophie Breeze (Clinical Practice Research Datalink, MHRA), Karen Lancaster, (Clinical Practice Research Datalink, MHRA), Stuart Fordyce (Clinical Practice Research Datalink, MHRA). **Data Coordination Team:** Naomi Allen (University of Oxford, UK Biobank), Melanie Calvert (Centre for Patient Reported Outcomes Research & Birmingham Health Partners Centre for Regulatory Science and Innovation), Alastair Denniston (University Hospitals Birmingham NHS Foundation Trust & University of Birmingham), George Gkoutos (University of Birmingham & University Hospitals Birmingham NHS Foundation Trust). **Health Economics Team:** Sahan Jayawardana (London School of Economics and Political Science), Elias Mossialos (London School of Economics and Political Science). **Expert Advisory Group:** Simon Ball (University Hospitals Birmingham NHS Foundation Trust & Health Data Research UK Midlands), Colin Baigent (University of Oxford), Peter Brocklehurst (Birmingham Clinical Trials Unit), Will Lester (University Hospitals Birmingham NHS Foundation Trust), Richard McManus (University of Oxford), Stefano Seri (Aston University), Janet Valentine (Innovate UK, UK Research & Innovation). **Independent Trial Steering Committee:** A John Camm (St. George’s University of London), Sandra Haynes MBE (Patient and Public Involvement team lead), Dame Julie Moore (University of Warwick), Amy Rogers (University of Dundee), Mary Stanbury (Patient & Public Involvement). **Independent Data Monitoring Committee:** Marcus Flather (University of East Anglia), Suzy Walker (General Practitioner, Fortrose Medical Practice, Highland), Duolao Wang (Liverpool School of Tropical Medicine). ## Footnotes * † See Acknowledgments. * Received May 24, 2022. * Revision received May 24, 2022. * Accepted May 25, 2022. * © 2022, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## REFERENCES 1. 1. R Studer, C Sartini, K Suzart-Woischnik, R Agrawal, H Natani, SK Gill, D Kotecha. Identification and Mapping Real-World Data Sources for Heart Failure, Acute Coronary Syndrome, and Atrial Fibrillation. Cardiology. 2022;147:98–106. 2. 2. D Kotecha, FW Asselbergs, European Society of Cardiology, BigData@Heart consortium. Best practice framework for the use of coded electronic healthcare records in clinical research (CoDe-EHR checklist): International stakeholder consensus from the European Society of Cardiology and the BigData@Heart consortium. 2021: [https://www.escardio.org/Research/Big-Data-Heart](https://www.escardio.org/Research/Big-Data-Heart) [Accessed 07 Apr 2021]. 3. 3. R Persson, T Sponholtz, C Vasilakis-Scaramozza, KW Hagberg, T Williams, D Kotecha, SS Jick. Quality and Completeness of Myocardial Infarction Recording in Clinical Practice Research Datalink Aurum. Clin Epidemiol. 2021;13:745–53. 4. 4. EM Burns, E Rigby, R Mamidanna, A Bottle, P Aylin, P Ziprin, OD Faiz. Systematic review of discharge coding accuracy. J Public Health (Oxf). 2012;34:138–48. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/pubmed/fdr054&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21795302&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000300722700021&link_type=ISI) 5. 5. R Persson, C Vasilakis-Scaramozza, KW Hagberg, T Sponholtz, T Williams, P Myles, SS Jick. CPRD Aurum database: Assessment of data quality and completeness of three important comorbidities. Pharmacoepidemiol Drug Saf. 2020;29:1456–64. 6. 6. D Kotecha, D Shukla, Clinical Practice Research Datalink. Preventing stroke, premature death and cognitive decline in a broader community of patients with atrial fibrillation using healthcare data for pragmatic research: A randomised controlled trial (DaRe2THINK). Sponsor: University of Birmingham; Funder: National Institute for Health Research. 2021;EudraCT 2020-005774-10; [http://ClinicalTrials.gov](http://ClinicalTrials.gov) [NCT04700826](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT04700826&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom): [https://www.birmingham.ac.uk/dare2think](https://www.birmingham.ac.uk/dare2think) [Accessed 30 Mar 2022]. 7. 7. A Wolf, D Dedman, J Campbell, H Booth, D Lunn, J Chapman, P Myles. Data resource profile: Clinical Practice Research Datalink (CPRD) Aurum. Int J Epidemiol. 2019;48:1740–g. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ije/dyz034&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 8. 8. KV Bunting, M Stanbury, O Tica, D Kotecha. Transforming clinical research by involving and empowering patients-the RATE-AF randomized trial. Eur Heart J. 2021;42:2411–4. 9. 9. P Kirchhof, S Benussi, D Kotecha, A Ahlsson, D Atar, B Casadei, P Vardas. 2016 ESC Guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. 2016;37:2893–962. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehw210&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27567408&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 10. 10. D Kotecha, J Holmes, H Krum, DG Altman, L Manzano, JG Cleland, Beta-Blockers in Heart Failure Collaborative Group. Efficacy of beta blockers in patients with heart failure plus atrial fibrillation: an individual-patient data meta-analysis. Lancet. 2014;384:2235–43. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(14)61373-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25193873&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 11. 11. O Tica, W Khamboo, D Kotecha. Breaking the cycle of HFpEF and AF. Card Fail Rev. 2022: in press (cfr.2022.03R1). 12. 12. D Kotecha, G Breithardt, AJ Camm, GYH Lip, U Schotten, A Ahlsson, P Kirchhof. Integrating new approaches to atrial fibrillation management: the 6th AFNET/EHRA Consensus Conference. Europace. 2018;20:395–407. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/europace/eux318&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 13. 13. D Conen, N Rodondi, A Muller, JH Beer, P Ammann, G Moschovitis, S Osswald. Relationships of Overt and Silent Brain Lesions With Cognitive Function in Patients With Atrial Fibrillation. J Am Coll Cardiol. 2019;73:989–99. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjg6IjczLzkvOTg5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDUvMjUvMjAyMi4wNS4yNC4yMjI3NTQzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 14. 14. MM Islam, TN Poly, BA Walther, HC Yang, CC Wu, MC Lin, YC Li. Association Between Atrial Fibrillation and Dementia: A Meta-Analysis. Front Aging Neurosci. 2019;11:305. 15. 15. P Kirchhof, KG Haeusler, B Blank, J De Bono, D Callans, A Elvan, L Di Biase. Apixaban in patients at risk of stroke undergoing atrial fibrillation ablation. Eur Heart J. 2018;39:2942–55. 16. 16. DA Lane, F Skjoth, GYH Lip, TB Larsen, D Kotecha. Temporal Trends in Incidence, Prevalence, and Mortality of Atrial Fibrillation in Primary Care. J Am Heart Assoc. 2017;6:e005155. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiYWhhb2EiO3M6NToicmVzaWQiO3M6MTE6IjYvNS9lMDA1MTU1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDUvMjUvMjAyMi4wNS4yNC4yMjI3NTQzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 17. 17.European Medicines Agency Committee for Medicinal Products for Human Use. Assessment report: Direct oral anticoagulants (DOACs). EMA/194375/2020; Procedure number: EMEA/H/A-5(3)/1487: European Medicines Agency, Amsterdam; 2020. 18. 18. D Kotecha, CV Pollack, Jr.., R De Caterina, G Renda, P Kirchhof. Direct Oral Anticoagulants Halve Thromboembolic Events After Cardioversion of AF Compared With Warfarin. J Am Coll Cardiol. 2018;72:1984–6. [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjEwOiI3Mi8xNi8xOTg0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDUvMjUvMjAyMi4wNS4yNC4yMjI3NTQzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 19. 19.National Institute for Health and Care Excellence. Atrial fibrillation: diagnosis and management. NICE clinical guideline 196. 2021: [http://www.nice.org.uk/guidance/ng196](http://www.nice.org.uk/guidance/ng196). 20. 20. J Wise, AG de Barron, A Splendiani, B Balali-Mood, D Vasant, E Little, V Hedley. Implementation and relevance of FAIR data principles in biopharmaceutical R&D. Drug Discov Today. 2019;24:933–8. 21. 21.NHS Digital. SNOMED CT. 2021: [https://digital.nhs.uk/services/terminology-and-classifications/snomed-ct](https://digital.nhs.uk/services/terminology-and-classifications/snomed-ct) [Accessed 11 May 2022]. 22. 22.NHS Digital. Read codes 2020: [https://digital.nhs.uk/services/terminology-and-classifications/read-codes](https://digital.nhs.uk/services/terminology-and-classifications/read-codes) [Accessed 11 May 2022]. 23. 23.NHS Business Services Authority. Dictionary of medicines and devices (dm+d). 2021: [https://www.nhsbsa.nhs.uk/pharmacies-gp-practices-and-appliance-contractors/dictionary-medicines-and-devices-dmd](https://www.nhsbsa.nhs.uk/pharmacies-gp-practices-and-appliance-contractors/dictionary-medicines-and-devices-dmd) [Accessed 11 May 2022]. 24. 24.World Health Organization. International classification of disease version 10. 2019: [https://icd.who.int/browse10/2019/en#/XXI](https://icd.who.int/browse10/2019/en#/XXI) [Accessed 11 May 22]. 25. 25.NHS. OPCS Classification of Interventions and Procedures. 2022: [https://www.datadictionary.nhs.uk/data\_elements/opcs-4\_code.html](https://www.datadictionary.nhs.uk/data_elements/opcs-4_code.html) [Accessed 11 May 2022]. 26. 26.Office for National Statistics. User guide to mortality statistics. 2021: [https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/userguidetomortalitystatisticsjuly2017](https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/userguidetomortalitystatisticsjuly2017) [Accessed 11 May 22]. 27. 27. KM Gokhale, JS Chandan, K Toulis, G Gkoutos, P Tino, K Nirantharakumar. Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies. European Journal of Epidemiology. 2021;36:165–78. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 28. 28. X Wang, GV Gkoutos, D Kotecha. SWAT 158: Validation of clinical events in the DaRe2THINK data-enabled clinical trial. MRC Northern Ireland Hub for Trials Methodology Research: Study-Within-A-Trial (SWAT) Repository Store. 2019: [https://www.qub.ac.uk/sites/TheNorthernIrelandNetworkforTrialsMethodologyResearch/FileStore/Filetoupload,1218963,en.pdf](https://www.qub.ac.uk/sites/TheNorthernIrelandNetworkforTrialsMethodologyResearch/FileStore/Filetoupload,1218963,en.pdf) [Accessed 12 May 2022]. 29. 29. SS Jick, KW Hagberg, R Persson, C Vasilakis-Scaramozza, T Williams, E Crellin, P Myles. Quality and completeness of diagnoses recorded in the new CPRD Aurum Database: evaluation of pulmonary embolism. Pharmacoepidemiol Drug Saf. 2020;29:1134–40. 30. 30. R Persson, C Vasilakis Scaramozza, KW Hagberg, T Sponholtz, T Williams, P Myles, SS Jick. CPRD Aurum database: Assessment of data quality and completeness of three important comorbidities. Pharmacoepidemiology and Drug Safety. 2020;29:1456–64. 31. 31. J Wu, MA Mamas, MO Mohamed, CS Kwok, C Roebuck, B Humberstone, CP Gale. Place and causes of acute cardiovascular mortality during the COVID-19 pandemic. Heart. 2021;107:113–9. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiaGVhcnRqbmwiO3M6NToicmVzaWQiO3M6OToiMTA3LzIvMTEzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDUvMjUvMjAyMi4wNS4yNC4yMjI3NTQzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 32. 32. N McCormick, D Lacaille, V Bhole, JA Avina-Zubieta. Validity of heart failure diagnoses in administrative databases: a systematic review and meta-analysis. PLoS One. 2014;9:e104519. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0104519&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25126761&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 33. 33. N McCormick, V Bhole, D Lacaille, JA Avina-Zubieta. Validity of Diagnostic Codes for Acute Stroke in Administrative Databases: A Systematic Review. PLoS One. 2015;10:e0135834. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0135834&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26292280&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 34. 34. LM Lix, S Sobhan, A St-Jean, JM Daigle, A Fisher, OHY Yu, KB Filion. Validity of an algorithm to identify cardiovascular deaths from administrative health records: a multi-database population-based cohort study. BMC Health Serv Res. 2021;21:758. 35. 35. L Nedkoff, D Lopez, J Hung, M Knuiman, TG Briffa, K Murray, FM Sanfilippo. Validation of ICD-10-AM Coding for Myocardial Infarction Subtype in Hospitalisation Data. Heart Lung Circ. 2022;31:849–58. 36. 36. T Ando, N Ooba, M Mochizuki, D Koide, K Kimura, SL Lee, K Kubota. Positive predictive value of ICD-10 codes for acute myocardial infarction in Japan: a validation study at a single center. BMC Health Serv Res. 2018;18:895. 37. 37. T Nagamine, B Gillette, A Pakhomov, J Kahoun, H Mayer, R Burghaus, M Saxena. Multiscale classification of heart failure phenotypes by unsupervised clustering of unstructured electronic medical record data. Sci Rep. 2020;10:21340. 38. 38. E Bosco, L Hsueh, KW McConeghy, S Gravenstein, E Saade. Major adverse cardiovascular event definitions used in observational analysis of administrative databases: a systematic review. BMC Med Res Methodol. 2021;21:241. 39. 39. B Miao, AV Hernandez, MJ Alberts, N Mangiafico, YM Roman, CI Coleman. Incidence and Predictors of Major Adverse Cardiovascular Events in Patients With Established Atherosclerotic Disease or Multiple Risk Factors. J Am Heart Assoc. 2020;9:e014402. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1161/JAHA.119.014402&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 40. 40. JW Groenendyk, AS Rivera, A Sinha, DM Lloyd-Jones, MJ Feinstein. Changes in proportionate cardiovascular mortality in patients with chronic infectious and inflammatory conditions in the United States, 1999-2018. Sci Rep. 2021;11:23985. 41. 41. JJ Carrero, DJ de Jager, M Verduijn, P Ravani, J De Meester, JG Heaf, KJ Jager. Cardiovascular and noncardiovascular mortality among men and women starting dialysis. Clin J Am Soc Nephrol. 2011;6:1722–30. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiY2xpbmphc24iO3M6NToicmVzaWQiO3M6ODoiNi83LzE3MjIiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wNS8yNS8yMDIyLjA1LjI0LjIyMjc1NDM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 42. 42. L McDonald, CJ Sammon, M Samnaliev, S Ramagopalan. Under-recording of hospital bleeding events in UK primary care: a linked Clinical Practice Research Datalink and Hospital Episode Statistics study. Clin Epidemiol. 2018;10:1155–68. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/CLEP.S170304&link_type=DOI) 43. 43. NK Mehta, LR Abrams, M Myrskyla. US life expectancy stalls due to cardiovascular disease, not drug deaths. Proc Natl Acad Sci U S A. 2020;117:6998–7000. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMToiMTE3LzEzLzY5OTgiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMi8wNS8yNS8yMDIyLjA1LjI0LjIyMjc1NDM0LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 44. 44. A Miniño, R Klein. Mortality From Major Cardiovascular Diseases: United States, 2007. National Center for Health Statistics. 2010:2020–07. 45. 45. DA Springate, E Kontopantelis, DM Ashcroft, I Olier, R Parisi, E Chamapiwa, D Reeves. ClinicalCodes: an online clinical codes repository to improve the validity and reproducibility of research using electronic medical records. PLoS One. 2014;9:e99825. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0099825&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24941260&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 46. 46. MR Cowie, JI Blomster, LH Curtis, S Duclaux, I Ford, F Fritz, A Zalewski. Electronic health records to facilitate clinical research. Clin Res Cardiol. 2017;106:1–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00392-016-1025-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 47. 47. KA Mc Cord, H Ewald, A Agarwal, D Glinz, S Aghlmandi, JPA Ioannidis, LG Hemkens. Treatment effects in randomised trials using routinely collected data for outcome assessment versus traditional trials: meta-research study. Bmj. 2021;372:n450. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE2OiIzNzIvbWFyMDNfNC9uNDUwIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjIvMDUvMjUvMjAyMi4wNS4yNC4yMjI3NTQzNC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 48. 48. D Kotecha, AD DeVore, F Asselbergs. Fit for the Future: Enhancing Clinical Trials with Digital Technology. Eur Heart J. 2022: in press. 49. 49. E Kjoller, J Hilden, P Winkel, S Galatius, NJ Frandsen, GB Jensen, CT Group. Agreement between public register and adjudication committee outcome in a cardiovascular randomized clinical trial. Am Heart J. 2014;168:197–204 e1-4. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ahj.2013.12.032&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25066559&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom) 50. 50. L Bowman, M Mafham, K Wallendszus, W Stevens, G Buck, J Barton, J Armitage. Effects of Aspirin for Primary Prevention in Persons with Diabetes Mellitus. N Engl J Med. 2018;379:1529–39. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa1804988&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2022%2F05%2F25%2F2022.05.24.22275434.atom)