Abstract
Combined immunodeficiencies (CID) and common variable immunodeficiencies (CVID) are prevalent yet substantially underdiagnosed primary immunodeficiency disorders that call for improved early detection strategies. Leveraging large-scale electronic health record (EHR) data from four nationwide US cohorts, we developed a novel causal Bayesian Network (BN) model to unravel the complex interplay of antecedent clinical phenotypes associated with CID/CVID. We constructed consensus directed acyclic graphs (DAGs) and evaluated them in identifying CID/CVID across diverse patient populations defined by different inclusion criteria. The consensus DAGs demonstrated robust predictive performance (ROC AUC on unseen data within each cohort ranged from 0.61 to 0.77) and generalizability (ROC AUC across all unseen cohort evaluations ranged from 0.56 to 0.72). These consensus DAGs elucidate causal relationships between comorbidities preceding CID/CVID diagnosis, including autoimmune and blood disorders, lymphomas, organ damage or inflammation, respiratory conditions, genetic anomalies, recurrent infections, and allergies. Further evaluation through causal inference and by expert clinical immunologists substantiates the clinical relevance of the identified phenotypic trajectories within the consensus DAGs. These findings hold promise for translation into improved clinical practice, potentially leading to earlier identification and intervention for adults at risk of CID/CVID.
Competing Interest Statement
GP, KB, NVV and VI are full-time employees of Pfizer and hold stock/stock options. The other authors do not have any financial or non-financial competing interests to declare.
Funding Statement
This study is supported by Pfizer Inc.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The study was performed with the approval of the Pfizer US Medical Affairs Hospital Specialty Care Leadership Team. Data extraction, pre-processing, causal modeling and evaluation of the Optum data were performed in accordance with the Declaration of Helsinki. The Optum data were acquired in accordance with the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule, and all data were fully de-identified before being licensed by Pfizer.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
The datasets used for this study cannot be made publicly available due to a commercial data use agreement between Pfizer and Optum. However, the authors encourage collaborations and declare that the data can be made available to qualified investigators upon request, with evidence of institutional review board approval. These data have been previously presented in this publication: https://www.nature.com/articles/s43856-023-00412-8#data-availability. In our current work, we improve on the previous methodology for the early identification of these primary immunodeficiencies by developing a novel Causal Bayesian Network methodology.