Nomogram-Based Prognostic Model to predict the High blood pressure in Children and Adolescents: Findings from 342,736 individuals in China

Background: Predicting the potential risk factors of High blood pressure(HBP) among children and adolescents is still a knowledge gap. Our study aimed to establish and validate a nomogram-based model for identifying children and adolescents at risk of developing HBP based on a population-based prospective study. Methods: Hypertension was defined as systolic blood pressure or diastolic blood pressure above 95th percentile, using age, gender and height-specific cut-points. Penalized regression with Lasso was used to identify the strongest predictors of hypertension. Internal validation was conducted by 5-fold cross-validation and bootstrapping approach. The predictive variables were identified along with the advanced nomogram plot by conducting univariate and multivariate logistic regression analyses. A nomogram was constructed by training group comprised of 239,546(69.89%)participants and subsequently validated by externally group with 103,190(30.11%)participants. Results: Of 342,736 children and adolescents, a total of 55,480(16.19%) youths were identified with HBP with mean age 11.51{+/-}1.45 year and 183,487 were boys(53.5%). Nine significant relevant predictors were identified including: age, gender, weight status, birthweight, breastfeeding, gestational hypertension, family history of obesity, 46family history of hypertension and physical activity. An acceptable discrimination[Area under the receiver operating characteristic curve(AUC):0.742(Development group), 0.740(Validation group)] and good calibration(Hosmer and Lemeshow statistics, P ? 0.05) were observed in our models. An available web-based nomogram was built online. Conclusions: This model composed of age, gender, early life factors, family history of disease, and lifestyle factors may predict the risk of HBP among children and adolescents, which has developed a promising nomogram that may aid in more accurately for identifying the HBP among youths in primary care. Funding Sources: The work was supported by the National Natural Science Foundation of China (No. 81673193).

were randomly assigned to the derivation group and 30. 11%( N=103,190)to the internal 110 validation group. Of the development group, the average age of children and 111 adolescents was 11.51 years(2.73) with range from 6 to 18, and 11.51 years (2.74) with 112 range from 7 to 18 in validation group. More than half of the children were boys (53.5%, 113 n=183,487). Of 342,736 participants,287,256(83.81%) and 55,480(16.19%) children 114 were identified as having HBP and non-HBP, respectively.

115
There was existing a statistically significant difference among the majority of 116 demographic variables in either the HBP group and non-HBP group(P＜0.001), such 117 as age, gender, birthweight and so on. We only found no significant differences in one 118 parental education level. The demographic characteristics of the subjects was listed in 119 is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint (Shown in Figure.3), while there are only 9 of them(age, gender, birthweight, weight 133 status, feeding mode, gestational hypertension, family history of obesity, family history 134 of hypertension and average outdoor physical activity time) were independently 135 associated with a higher probability of HBP after the backward elimination procedure 136 of multivariable logistic analysis. Table.2 shows the results of multivariable predictors.

137
The nomogram plot based on all the 9 independently predictors calculated by the 138 generalized linear models was depicted in Figure.1 with single score of each predictor 139 which contribute to a total score for predicting HBP. The discrimination of model as 140 per the above 9 significantly risk factors were detected by the apparent C-statistic for  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; 6 Website of nomogram 151 An available web-based nomogram calculator of HBP was built to present the 152 diagnostic probability for helping guardians and physicians to identify the HBP among 153 children and adolescents in a user-friendly way.

156
As the high potential hazard predictors reported previously, age, gender, birthweight, 157 feeding mode, gestational hypertension, weight status, family history of obesity, family 158 history of hypertension and frequency of PA were also be found and presented as a 159 simple and reliable nomogram plot which designed for identifying the children and 160 adolescents who were at high risk of developing HBP.

161
Accumulating evidence supports the view that the roots contribution of essential 162 hypertension extends back to childhood and adolescence(Cosenzi, S Ac Erdote, Bocin, 163 Molino, & Bellini, 1996;Kawabe et al., 2000). Targeted identifying the risk factors of 164 developing HBP in children and adolescents may be practicable to avoid unnecessary 165 burden, overdiagnosis or costs. Our study provides a more interpretability, more precise 166 classification standard for HBP in youths, instead of using complicated model to predict 167 HBP.

168
The strongest risk factor for primary HBP in children and adolescents is elevated 169 body mass index (Giussani et al., 2013), which was also demonstrated in our study. A  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; https://doi.org/10.1101/2021.08.24.21262545 doi: medRxiv preprint 8 finding also suggested that an excess of Sedentary behavior time(SBT) is associated 211 with an increased risk of hypertension compared to the recommended levels of SBT.

212
Overall, efforts to encourage youths increase their PA, and reduce their time of SBT 213 are warranted. For fired food intake, based on the initial regression shrinkage and 214 selection of Lasso, there were no statistically significant differences in youths for 215 hypertension. This discrepancy could be linked to the fact that salt intake in this study 216 did not include due to the lack of data, even though we use fired food instead of salt 217 intake roughly.

218
Previous studies using a similar methodology supported the hypothesis that is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; https://doi.org/10.1101/2021.08.24.21262545 doi: medRxiv preprint 9 health, but also for economizing the facilities and resources. Furthermore, such 241 identifications can help introducing earlier interventions for those at risk, which 242 include, but are not limited to, help children disengage themselves from the concern 243 of diseases early as possible, and prevent the transfer to other potential hazards such  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint In this study, a feasible nomogram was derived from the ongoing study to visually 273 predict the probability of HBP in children and adolescents for clinicians, policy-maker 274 even the family members on enhancement of the screening and subsequent diagnosis.  The dataset was independently divided by the cross-sectional study conducted in 2020 298 for identifying the first-ever incidence of HBP among children and adolescents. All 299 participants were students aged from 5 to 25 years old, and they were examined by 300 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 12 BP measurement which was conducted in a quiet platform thenceforth. A standard BP 331 cuff was positioned nearby the right arm elbow, 2 centimeters below the antecubital 332 fossa, and BP for each child was recorded 3 consecutive times in seated position on   Here, all analyses were conducted using the R language (X64 Version 4.1.0, R 358 Foundation for Statistical Computing, Vienna, Austria, https://www.r-project.org/) with 359 several packages including "Foreign", "Hmisc", "Glmnet", "Caret", "Rms" and 360 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; https://doi.org/10.1101/2021.08.24.21262545 doi: medRxiv preprint 13 "pROC". The bilateral P＜0.05 was recognized as statistically significant.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; Figure.1 Clinical nomogram for predicting probability of developing HBP among children and adolescents, and its predictive performance. To use the nomogram, an individual HBP contact's values is located on each variable axis, and a line is drawn downward to the risk of HBP axes to detect the hypertension probability. As an example of how this nomogram can be calculated, we can take an 18 years old obese boy who was received a bottle feeding in early life, with gestational hypertension, family of hypertension and obesity, and less than one-hour physical activity expenditure. By drawing a line up towards the points for each of the variables this student will have 100 points(Age), 32 points(Gender), 74 points(BMI status), 42 points(Feeding mode), 20 points(Gestational hypertension), 21 points(Family obesity), 59 points(Family hypertension) and 26 points(Physical activity), giving a total of 374 points(at the bottom of the figure), and a probability of HBP of 80%.

Fgure.2 Receiver operating characteristic (ROC) curves for the prediction of high blood pressure in the training group and validation group.
(A) ROC curves of the factors and nomogram in the development group; (B) ROC curves of the factors and nomogram in the training group; (C) Calibration plot of nomogram prediction in the development group; (D) Calibration plot of nomogram prediction in the validation group. ROC curves from the prediction model and other predictive strategies(Model1:Age, gender, gestational hypertension, weight status, family history of hypertension, family history of obesity, average outdoor physical activity time; Model2: Age, gender, gestational hypertension, weight status, family history of hypertension, family history of obesity, average outdoor physical activity time, birthweight, feeding mode; Model3: Age, gender, gestational hypertension, weight status, family history of hypertension, family history of obesity, average outdoor physical activity time, birthweight, feeding mode; parental smoking status, parental education level, household monthly income, average screen-based time, fried food intake.) for comparison. Calibration curve represents the calibration of the nomogram, which shows the consistency between the predicted probability of conversion and actual conversion probability of HBP patients. The x-axis is the predicted probability by nomogram and the y-axis is the actual conversion rate of HBP patients. The grey line represents a perfect prediction by an ideal model, and the black-dotted line shows the performance of the nomogram, of which a closer fit to the grey line means a better prediction. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; https://doi.org/10.1101/2021.08.24.21262545 doi: medRxiv preprint Figure.3 Predictor selection using the LASSO binary logistic regression model. A LASSO coefficient of the total 14 predictors. (A) LASSO coefficient profiles of all predictors, a coefficient profile plot was provided against the log (Lambda) sequence. (B) Predictors selection by LASSO via minimum criteria, predictor selection in the LASSO model used tenfold cross-validation via minimum criteria. Red-dotted vertical lines were drawn at the optimal values by using the minimum criteria (minimize the meansquared error), the value 9 represents those 14 predictors were reduced to 9 nonzero features by LASSO. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted August 26, 2021. ; https://doi.org/10.1101/2021.08.24.21262545 doi: medRxiv preprint