RT Journal Article SR Electronic T1 Predicting ovarian/breast cancer pathogenic risks of BRCA1 gene variants of unknown significance JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.06.04.20120055 DO 10.1101/2020.06.04.20120055 A1 Hui-Heng Lin A1 Hongyan Xu A1 Hongbo Hu A1 Zhanzhong Ma A1 Jie Zhou A1 Qingyun Liang YR 2020 UL http://medrxiv.org/content/early/2020/06/05/2020.06.04.20120055.abstract AB The difficulty of early diagnosis for ovarian cancer is an important cause of the high mortal rates of ovarian cancer patients. Instead of symptom-based diagnostic methods, modern sequencing technologies enable the access of human’s genetic information via reading DNA/RNA molecules’ nucleotide base sequences. In such way, genes’ mutations and variants could be identified and hence a better clinical diagnosis in molecular level could be expected. However, as sequencing technologies gain more popularity, novel gene variants with unknown clinical significance are found, giving difficulties to interpretations of patients’ genetic data, precise disease diagnoses as well as the making of therapeutic strategies and decisions. In order to solve these issues, it is of critical importance to figure out ways to analyze and interpret such variants. In this work, BRCA1 gene variants with unknown clinical significance were identified from clinical sequencing data, and then we developed machine learning models so as to predict the pathogenicity for variants with unknown clinical significance. Amongst, in performance benchmarking, our optimized random forest model scored 0.85 in area under receiver-operating characteristic curve, which outperformed other models. Finally, we applied the optimized random forest model to predict the pathogenic risks of 7 BRCA1 variants of unknown clinical significances identified from our sequencing data, and 6315 variants of unknown clinical significance in ClinVar database. As a result, our model predicted 4724 benign and 1591 pathogenic variants, which helped the interpretation of these variants of unknown significance and diagnosis.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis project was supported from xxxAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This research project was screened and approved by the Ethics Committee of Yuebei people's hospital (Project approval ID:KY-2017-007). All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesFor BRCA1 gene VUS data, they are accessible and stored in NCBI ClinVar database. For the sequencing data, they will be available upon reasonable requests. https://www.ncbi.nlm.nih.gov/clinvar/