PT - JOURNAL ARTICLE AU - Jasmine M. McCammon AU - Sricharan Bandhakavi AU - Doreen Salek AU - Zhipeng Liu AU - Xianglian Ni AU - Natalie Benner AU - Ronda Rogers AU - Hollie Yoder AU - Shelley Riser AU - Farbod Rahmanian TI - Use of machine learning to predict hypertension-related complication outcomes of varying severity AID - 10.1101/2020.10.30.20169615 DP - 2020 Jan 01 TA - medRxiv PG - 2020.10.30.20169615 4099 - http://medrxiv.org/content/early/2020/11/03/2020.10.30.20169615.short 4100 - http://medrxiv.org/content/early/2020/11/03/2020.10.30.20169615.full AB - Objective A challenge in hypertension-related risk management is identifying which people are likely to develop future complications. To address this, we present administrative-claims based predictive models for hypertension-related complications.Materials and Methods We used a national database to select 1,767,559 people with hypertension and extracted 112 features from past claims data based on their ability to predict hypertension complications in the next year. Complications affecting kidney, brain, and heart were grouped by clinical severity into three stages. Extreme gradient boosting binary classifiers for each stage were trained and tuned on 75% of the data, and performance on predicting outcomes for the remaining data and an independent dataset was evaluated.Results In the cohort under study, 6%, 17%, and 7% of people experienced a hypertension-related complication of stage 1, stage 2, or stage 3 severity, respectively. On an independent dataset, models for all three stages performed competitively with other published algorithms by the most commonly reported metric, area under the receiver operating characteristic curve, which ranged from 0.82-0.89. Features that were important across all models for predictions included total medical cost, cost related to hypertension, age, and number of outpatient visits.Discussion The model for stage 1 complications, such as left ventricular hypertrophy and retinopathy, is in contrast to other offerings in the field, which focus on more serious issues such as heart failure and stroke, and affords unique opportunities to intervene during earlier stages.Conclusion Predictive analytics for hypertension outcomes can be leveraged to help mitigate the immense healthcare burden of uncontrolled hypertension.LAY SUMMARY As the leading preventable risk factor for morbidity and mortality in the world, identifying which people with hypertension are likely to exacerbate is critically important for development of effective intervention strategies. Here we present a suite of predictive models that can predict future risk of development of hypertension-related complications. To have utility for triaging as well as identifying mild cases before they progress to critical end phases, the models predict three different stages of severity of hypertension-related complications. Our algorithms utilize variables calculated for the most recent 12 months, and predict probability of a hypertension-related complication for the next 12 months using administrative claims as the data source. Because the types of complications that have been analyzed can also result from comorbidities besides hypertension, such as diabetes and hyperlipidemia, these diagnoses are included as variables. Other variables pertain to demographic characteristics, prescription information, relevant procedures, and utilization patterns. Overall, all three models exhibited strong predictive performance. The ability to use straightforward variables found in claims data to predict future risk of disease-related complications, complemented with targeted clinical intervention strategies, has the potential to reduce cost of care and improve health outcomes for the many people living with hypertension.Competing Interest StatementAll authors are employees of Geneia LLC.Funding StatementNo external funding was received for this study.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The Geneia Research Ethics Committee (Committee) has reviewed the manuscript and has determined that this work does not require review by the Committee.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data used for this manuscript are commercially licensed from the CCAE database and not publically available. The independent dataset contains sensitive personal health information and is therefore also not publically available.