Abstract
Using data from Ontario Canada, we previously developed machine learning-based algorithms incorporating newborn screening metabolites to estimate gestational age (GA). The objective of this study was to evaluate the use of these algorithms in a population of infants born in Siaya county, Kenya.
Cord and heel prick samples were collected from newborns in Kenya and metabolic analysis was carried out by Newborn Screening Ontario in Ottawa, Canada. Postnatal GA estimation models were developed with data from Ontario with multivariable linear regression using ELASTIC NET regularization. Model performance was evaluated by applying the models to the data collected from Kenya and comparing model-derived estimates of GA to reference estimates from early pregnancy ultrasound.
Heel prick samples were collected from 1,039 newborns from Kenya. Of these, 8.9% were born preterm and 8.5% were small for GA. Cord blood samples were also collected from 1,012 newborns. In data from heel prick samples, our best-performing model estimated GA within 9.5 days overall of reference GA [mean absolute error (MAE) 1.35 (95% CI 1.27, 1.43)]. In preterm infants and those small for GA, MAE was 2.62 (2.28, 2.99) and 1.81 (1.57, 2.07) weeks, respectively. In data from cord blood, model accuracy slightly decreased overall (MAE 1.44 (95% CI 1.36, 1.53)). Accuracy was not impacted by maternal HIV status and improved when the dating ultrasound occurred between 9 and 13 weeks of gestation, in both heel prick and cord blood data (overall MAE 1.04 (95% CI 0.87, 1.22) and 1.08 (95% CI 0.90, 1.27), respectively).
Compared to internal validation performance using Ontario data and to our previously published external validations, model performance was diminished in the Kenya cohort, suggesting that reference ultrasound timing is an important factor in model performance. Our study highlights the challenges in reliably estimating GA in low resource settings, even those with access to dating ultrasound, given that the timing of dating ultrasound is critical to develop algorithms for accurate estimation of GA based on metabolic analysis of blood obtained at birth.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Trial
NA
Funding Statement
This research was supported by the Bill & Melinda Gates Foundation (KW: OPP1184574 GLD: OPP1182996). The funders played no role in this project.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study was approved by the Ottawa Health Sciences Network Research Ethics Board (20180330-01H), Children’s Hospital of Eastern Ontario Research Ethics Board (18/58X), the Stanford University School of Medicine Institutional Review Board (44656) and the Kenya Medical Research Institute (KEMRI) Scientific and Ethics Review Unit (SSC 2880).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
De-identified data related to this manuscript will be made publicly available through Dryad at the time of publication. We have restricted public access until our manuscript is accepted for publication. For the purposes of reviewing our manuscript, our data can be accessed here: https://datadryad.org/stash/share/OXCJkq4Y7MJA_1zPwwa9r8sKEpPJD9eF_pLMgQ5ZvBg
https://datadryad.org/stash/share/OXCJkq4Y7MJA_1zPwwa9r8sKEpPJD9eF_pLMgQ5ZvBg