Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

De-novo FAIRification via an Electronic Data Capture system by automated transformation of filled electronic Case Report Forms into machine-readable data

View ORCID ProfileMartijn G. Kersloot, View ORCID ProfileAnnika Jacobsen, View ORCID ProfileKarlijn H.J. Groenen, View ORCID ProfileBruna dos Santos Vieira, View ORCID ProfileRajaram Kaliyaperumal, View ORCID ProfileAmeen Abu-Hanna, View ORCID ProfileRonald Cornet, View ORCID ProfilePeter A.C. ‘t Hoen, View ORCID ProfileMarco Roos, View ORCID ProfileLeo Schultze Kool, View ORCID ProfileDerk L. Arts
doi: https://doi.org/10.1101/2021.03.04.21250752
Martijn G. Kersloot
aAmsterdam UMC, University of Amsterdam, Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
bCastor EDC, Amsterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Martijn G. Kersloot
  • For correspondence: m.g.kersloot@amsterdamumc.nl
Annika Jacobsen
cDepartment of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Annika Jacobsen
Karlijn H.J. Groenen
dDepartment of Medical Imaging, Radboud Institute for Health Sciences, Radboud university medical center, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Karlijn H.J. Groenen
Bruna dos Santos Vieira
dDepartment of Medical Imaging, Radboud Institute for Health Sciences, Radboud university medical center, Nijmegen, The Netherlands
eCenter for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud university medical center, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Bruna dos Santos Vieira
Rajaram Kaliyaperumal
cDepartment of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rajaram Kaliyaperumal
Ameen Abu-Hanna
aAmsterdam UMC, University of Amsterdam, Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ameen Abu-Hanna
Ronald Cornet
aAmsterdam UMC, University of Amsterdam, Department of Medical Informatics, Amsterdam Public Health Research Institute, Amsterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ronald Cornet
Peter A.C. ‘t Hoen
eCenter for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud university medical center, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Peter A.C. ‘t Hoen
Marco Roos
cDepartment of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marco Roos
Leo Schultze Kool
dDepartment of Medical Imaging, Radboud Institute for Health Sciences, Radboud university medical center, Nijmegen, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Leo Schultze Kool
Derk L. Arts
bCastor EDC, Amsterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Derk L. Arts
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Introduction Existing methods to make data Findable, Accessible, Interoperable, and Reusable (FAIR) are usually carried out in a post-hoc manner: after the research project is conducted and data are collected. De-novo FAIRification, on the other hand, incorporates the FAIRification steps in the process of a research project. In medical research, data is often collected and stored via electronic Case Report Forms (eCRFs) in Electronic Data Capture (EDC) systems. By implementing a de-novo FAIRification process in such a system, the reusability and, thus, scalability of FAIRification across research projects can be greatly improved. In this study, we developed and implemented a novel method for de-novo FAIRification via an EDC system. We evaluated our method by applying it to the Registry of Vascular Anomalies (VASCA).

Methods Our EDC and research project independent method ensures that eCRF data entered into an EDC system can be transformed into machine-readable, FAIR data using a semantic data model (a canonical representation of the data, based on ontology concepts and semantic web standards) and mappings from the model to questions on the eCRF. The FAIRified data are stored in a triple store and can, together with associated metadata, be accessed and queried through a FAIR Data Point. The method was implemented in Castor EDC, an EDC system, through a data transformation application. The FAIRness of the output of the method, the FAIRified data and metadata, was evaluated using the FAIR Evaluation Services.

Results We successfully applied our FAIRification method to the VASCA registry. Data entered on eCRFs is automatically transformed into machine-readable data and can be accessed and queried using SPARQL queries in the FAIR Data Point. Twenty-one FAIR Evaluator tests pass and one test regarding the metadata persistence policy fails, since this policy is not in place yet.

Conclusion In this study, we developed a novel method for de-novo FAIRification via an EDC system. Its application in the VASCA registry and the automated FAIR evaluation show that the method can be used to make clinical research data FAIR when they are entered in an eCRF without any intervention from data management and data entry personnel. Due to the generic approach and developed tooling, we believe that our method can be used in other registries and clinical trials as well.

Competing Interest Statement

MK is employed by Castor, the Electronic Data Capture platform that was used for data collection. DA is Castor’s CEO. The remaining authors state no conflicts of interest.

Funding Statement

MK’s and DA’s work is supported by funding from Castor. AJ, BV, RK, PAC’tH, RC and MR’s work is supported by the funding from the European Union’s Horizon 2020 research and innovation programme under the EJP RD COFUND-EJP No 825575. BV and LSK are members of the Vascular Anomalies Working Group (VASCA WG) of the European Reference Network for Rare Multisystemic Vascular Diseases (VASCERN) - Project ID: 769036. KG’s work is supported by the department of Medical Imaging, Radboud University Medical Center.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Not applicable

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Not applicable

  • Abbreviations

    AAI
    Authentication and Authorization Infrastructures
    API
    Application Programming Interface
    CDE
    Common Data Element
    DC
    Dublin Core
    DCAT2
    Data Catalogue Vocabulary version 2
    eCRF
    electronic Case Report Form
    EDC
    Electronic Data Capture
    ERN
    European Reference Network
    EU
    European Union
    FAIR
    Findable, Accessible, Interoperable, and Reusable
    JRC
    Joint Research Center
    NIH
    National Institute of Health
    NWO
    The Dutch Research Council
    RD
    rare disease
    RDF
    Resource Description Framework
    VASCA
    Vascular Anomalies
    VASCERN
    European Reference Network on rare vascular diseases
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted March 08, 2021.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    De-novo FAIRification via an Electronic Data Capture system by automated transformation of filled electronic Case Report Forms into machine-readable data
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    De-novo FAIRification via an Electronic Data Capture system by automated transformation of filled electronic Case Report Forms into machine-readable data
    Martijn G. Kersloot, Annika Jacobsen, Karlijn H.J. Groenen, Bruna dos Santos Vieira, Rajaram Kaliyaperumal, Ameen Abu-Hanna, Ronald Cornet, Peter A.C. ‘t Hoen, Marco Roos, Leo Schultze Kool, Derk L. Arts
    medRxiv 2021.03.04.21250752; doi: https://doi.org/10.1101/2021.03.04.21250752
    Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
    Citation Tools
    De-novo FAIRification via an Electronic Data Capture system by automated transformation of filled electronic Case Report Forms into machine-readable data
    Martijn G. Kersloot, Annika Jacobsen, Karlijn H.J. Groenen, Bruna dos Santos Vieira, Rajaram Kaliyaperumal, Ameen Abu-Hanna, Ronald Cornet, Peter A.C. ‘t Hoen, Marco Roos, Leo Schultze Kool, Derk L. Arts
    medRxiv 2021.03.04.21250752; doi: https://doi.org/10.1101/2021.03.04.21250752

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Health Informatics
    Subject Areas
    All Articles
    • Addiction Medicine (76)
    • Allergy and Immunology (195)
    • Anesthesia (54)
    • Cardiovascular Medicine (489)
    • Dentistry and Oral Medicine (89)
    • Dermatology (56)
    • Emergency Medicine (168)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (211)
    • Epidemiology (5674)
    • Forensic Medicine (3)
    • Gastroenterology (215)
    • Genetic and Genomic Medicine (863)
    • Geriatric Medicine (88)
    • Health Economics (230)
    • Health Informatics (761)
    • Health Policy (390)
    • Health Systems and Quality Improvement (250)
    • Hematology (105)
    • HIV/AIDS (182)
    • Infectious Diseases (except HIV/AIDS) (6467)
    • Intensive Care and Critical Care Medicine (390)
    • Medical Education (117)
    • Medical Ethics (28)
    • Nephrology (90)
    • Neurology (846)
    • Nursing (44)
    • Nutrition (141)
    • Obstetrics and Gynecology (162)
    • Occupational and Environmental Health (258)
    • Oncology (514)
    • Ophthalmology (163)
    • Orthopedics (44)
    • Otolaryngology (105)
    • Pain Medicine (48)
    • Palliative Medicine (21)
    • Pathology (149)
    • Pediatrics (250)
    • Pharmacology and Therapeutics (146)
    • Primary Care Research (113)
    • Psychiatry and Clinical Psychology (963)
    • Public and Global Health (2224)
    • Radiology and Imaging (376)
    • Rehabilitation Medicine and Physical Therapy (174)
    • Respiratory Medicine (312)
    • Rheumatology (109)
    • Sexual and Reproductive Health (80)
    • Sports Medicine (82)
    • Surgery (118)
    • Toxicology (25)
    • Transplantation (34)
    • Urology (42)