ABSTRACT
Background Traditional heart transplant registries often lack the granularity required for deep phenotyping and rely on labor-intensive manual abstraction. We describe the methodology and validation of a next-generation, automated, multi-source registry designed to address these limitations.
Methods Utilizing a High-Performance Computing environment, we integrated structured data from Epic data warehouses (Clarity and Caboodle), external molecular diagnostics, and verified UNOS survival records. A custom deterministic rule-based Natural Language Processing (NLP) engine was developed to extract echocardiographic measures, rejection grades, and vasculopathy scores from over 21,000 unstructured clinical reports.
Results The Houston Methodist J.C. Walter Jr. Transplant Center Precision Registry and Platform-Heart (TCPR-Heart) captures 1,687 heart transplants (1,636 patients) spanning the years 1984-2025. The TCPR-Heart comprises 1,054 transplants with active clinical follow-up: 555 transplants were extracted and abstracted from our modern electronic health record (EHR) in the decade since deployment, providing access to data throughout the patient’s course of heart transplant; 427 were legacy active transplants (transplanted pre-2016 with continued follow-up), and 72 were external transplants (transplanted elsewhere but followed at Methodist).
Additionally, the registry houses a historic cohort of 633 transplants (last follow-up < June 2016) with limited variables. Automated deep phenotyping successfully generated longitudinal data trends across clinical domains, including immunosuppression strategies, rejection, immunologic HLA data, renal function, metabolic profiles, vasculopathy, graft function, hospitalization burden and survival information.
Conclusion This automated framework unifies clinical, administrative, and molecular data streams. By leveraging an automated, regularly updated registry, we established a scalable, high-fidelity data source as a foundation for further innovations and novel applications based on an expertly curated and validated data source.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study did not receive any funding
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The Institutional Review Board of Houston Methodist Research Institute gave ethical approval for this work (Protocol no: Pro00000587)
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
Due to HIPAA regulations and patient privacy protections, individual-level data cannot be made publicly available.





