TY - JOUR T1 - Integration of multi-omics data improves prediction of cervicovaginal microenvironment in cervical cancer JF - medRxiv DO - 10.1101/2020.08.27.20183426 SP - 2020.08.27.20183426 AU - Nicholas A. Bokulich AU - Paweł Łaniewski AU - Dana M. Chase AU - J. Gregory Caporaso AU - Melissa M. Herbst-Kralovetz Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/09/01/2020.08.27.20183426.abstract N2 - Emerging evidence suggests that a complex interplay between human papillomavirus (HPV), microbiota, and the cervicovaginal microenvironment contribute to HPV persistence and carcinogenesis. Integration of multiple omics datasets is predicted to provide unique insight into HPV infection and cervical cancer progression. Cervicovaginal specimens were collected from a cohort (n=100) of Arizonan women with cervical cancer, cervical dysplasia, as well as HPV-positive and HPV-negative controls. Microbiome, immunoproteome and metabolome analyses were performed using 16S rRNA gene sequencing, multiplex cytometric bead arrays, and liquid chromatography-mass spectrometry, respectively. Multi-omics integration methods, including neural networks (mmvec) and Random Forest supervised learning, were utilized to explore potential interactions and develop predictive models. Our integrated bioinformatic analyses revealed that cancer biomarker concentrations were reliably predicted by Random Forest regressors trained on microbiome and metabolome features, suggesting close correspondence between the vaginal microbiome, metabolome, and genital inflammation involved in cervical carcinogenesis. Furthermore, we show that features of the microbiome and host microenvironment, including metabolites, microbial taxa, and immune biomarkers are predictive of genital inflammation status, but only weakly to moderately predictive of cervical cancer state. Different feature classes were important for prediction of different phenotypes. Lipids (e.g. sphingolipids and long-chain unsaturated fatty acids) were strong predictors of genital inflammation, whereas predictions of vaginal microbiota and vaginal pH relied mostly on alterations in amino acid metabolism. Finally, we identified key immune biomarkers associated with the vaginal microbiota composition and vaginal pH (MIF and TNFα), as well as genital inflammation (IL-6, IL-10, leptin and VEGF). Integration of multiple different microbiome “omics” data types resulted in modest increases in classifier performance over classifiers trained on the best performing individual omics data type. However, since the most predictive features cannot be known a priori, a multi-omics approach can still yield insights that might not be possible with a single data type. Additionally, integrating multiple omics datasets provided insight into different features of the cervicovaginal microenvironment and host response. Multi-omics is therefore likely to remain essential for realizing the advances promised by microbiome research.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study was supported by the Flinn Foundation Grant #1974 to D.M.C. and M.M.H-K., Flinn Foundation Grant #2244 to M.M.H-K. and the National Institutes of Health NCI awards for the Partnership of Native American Cancer Prevention U54CA143924 (UACC) to M.M.H-K and U54CA143925 (NAU) to G.J.C.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:All participants provided informed written consent and all research and related activities involving human subjects were approved by the Institutional Review Boards at St. Joseph's Hospital and Medical Center, University of Arizona Cancer Center and Maricopa Integrated Health Systems, all located in Phoenix, AZ.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesBacterial 16s RNA gene sequence data analyzed in this study were deposited in SRA (PRJNA518153). Immunoproteome and metabolome data are available online as supplementary materials accompanying our previous reports (17, 18, 31, 32). ER -