PT - JOURNAL ARTICLE AU - Waterhouse, Mary AU - Baxter, Catherine AU - Romero, Briony Duarte AU - McLeod, Donald S. A. AU - English, Dallas R. AU - Armstrong, Bruce K. AU - Clarke, Michael W. AU - Ebeling, Peter R. AU - Hartel, Gunter AU - Kimlin, Michael G. AU - O’Connell, Rachel L. AU - Pham, Hai AU - Harris, Rachael M. Rodney AU - van der Pols, Jolieke C. AU - Venn, Alison J. AU - Webb, Penelope M. AU - Whiteman, David C. AU - Neale, Rachel E. TI - Predicting deseasonalised serum 25 hydroxy vitamin D concentrations in the D-Health Trial: an analysis using boosted regression trees AID - 10.1101/2020.08.23.20180422 DP - 2020 Jan 01 TA - medRxiv PG - 2020.08.23.20180422 4099 - http://medrxiv.org/content/early/2020/08/25/2020.08.23.20180422.short 4100 - http://medrxiv.org/content/early/2020/08/25/2020.08.23.20180422.full AB - Background The D-Health Trial aims to determine whether monthly high-dose vitamin D supplementation can reduce the mortality rate and prevent cancer. We did not have adequate statistical power for subgroup analyses, so could not justify the high cost of collecting blood samples at baseline. To enable future exploratory analyses stratified by baseline vitamin D status, we developed a model to predict baseline serum 25 hydroxy vitamin D [25(OH)D] concentration.Methods We used data and serum 25(OH)D concentrations from participants who gave a blood sample during the trial for compliance monitoring and were randomised to placebo. Data were partitioned into training (80%) and validation (20%) datasets. Deseasonalised serum 25(OH)D concentrations were dichotomised using cut-points of 50 nmol/L, 60 nmol/L and 75 nmol/L. We fitted boosted regression tree models, based on 13 predictors, and evaluated model performance using the validation data.Results The training and validation datasets had 1788 (10.5% <50 nmol/L, 23.1% <60 nmol, 48.8 <75 nmol/L) and 447 (11.9% <50 nmol/L, 25.7% <60 nmol/L, and 49.2% <75 nmol/L) samples, respectively. Ambient UV radiation and total intake of vitamin D were the strongest predictors of ‘low’ serum 25(OH)D concentration. The area under the receiver operating characteristic curves were 0.71, 0.70, and 0.66 for cut-points of <50 nmol/L, <60 nmol/L and <75 nmol/L respectively.Conclusions We exploited compliance monitoring data to develop models to predict serum 25(OH)D concentration for D-Health participants at baseline. This approach may prove useful in other trial settings where there is an obstacle to exhaustive data collection.Competing Interest StatementThe authors have declared no competing interest.Clinical TrialAustralia New Zealand Clinical Trials Registry: ACTRN12613000743763Funding StatementThe D-Health Trial is funded by National Health and Medical Research Council (NHMRC) project grants (APP1046681, APP1120682). PM Webb and DC Whiteman are supported by fellowships from the NHMRC (GNT1173346, APP1155413). DSA McLeod is supported by a Metro North Clinician Research Fellowship and a Queensland Advancing Clinical Research Fellowship. H Pham is supported by a University of Queensland PhD Scholarship. MW Clarke is affiliated to Metabolomics Australia, University of Western Australia, Perth, Western Australia, Australia. Measurement of serum 25(OH)D concentration was supported by infrastructure funding from the Western Australian State Government in partnership with the Australian Federal Government, through Bioplatforms Australia and the National Collaborative Research Infrastructure Strategy (NCRIS).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study was approved by the QIMR Berghofer Medical Research Institute Human Research Ethics Committee (P1509).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data are not publicly available.25(OH)D25 hydroxy vitamin DAHSAustralian Health SurveyAUCarea under the ROC curveBMIbody mass indexBRTboosted regression treeCIconfidence intervalIUinternational unitsMETmetabolic equivalent of taskNPVnegative predictive valuePPVpositive predictive valueQoLquality of lifeRCTrandomised controlled trialRMSEroot-mean-squared errorROCreceiver operating characteristicSDstandard deviationUVultraviolet.