TY - JOUR T1 - Repeated measures ASCA+ for analysis of longitudinal intervention studies with multivariate outcome data JF - medRxiv DO - 10.1101/2020.12.03.20243097 SP - 2020.12.03.20243097 AU - Torfinn S. Madssen AU - Guro F. Giskeødegård AU - Age K. Smilde AU - Johan A. Westerhuis Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/12/04/2020.12.03.20243097.abstract N2 - Longitudinal intervention studies with repeated measurements over time are an important type of experimental design in biomedical research. Due to the advent of “omics”-sciences (genomics, transcriptomics, proteomics, metabolomics), longitudinal studies generate increasingly multivariate outcome data. Analysis of such data must take both the longitudinal intervention structure and multivariate nature of the data into account. The ASCA+-framework combines general linear models with principal component analysis, and can be used to separate and visualize the multivariate effect of different experimental factors. However, this methodology has not yet been developed for the more complex designs often found in longitudinal intervention studies, which may be unbalanced, involve randomized interventions, and have substantial missing data. Here we describe a new methodology, repeated measures ASCA+ (RM-ASCA+), and show how it can be used to model metabolic changes over time, and compare metabolic changes between groups, in both randomized and non-randomized intervention studies. Tools for both visualization and model validation are discussed. This approach can facilitate easier interpretation of data from longitudinal clinical trials with multivariate outcomes.Author summary Clinical trials are increasingly generating large amounts of complex biological data. Examples can include measuring metabolism or gene expression in tissue or blood sampled repeatedly over the course of a treatment. In such cases, one might wish to compare changes in not one, but hundreds, or thousands of variables simultaneously. In order to effectively analyze such data, both the study design and the multivariate nature of the data should be considered during data analysis. ANOVA simultaneous component analysis+ (ASCA+) is a statistical method which combines general linear models with principal component analysis, and provides a way to separate and visualize the effects of different factors on complex biological data. In this work, we describe how repeated measures linear mixed models, a class of models commonly used when analyzing changes over time and treatment effects in longitudinal studies, can be used together with ASCA+ for analyzing clinical trials in a novel method called repeated measures-ASCA+ (RM-ASCA+).Competing Interest StatementThe authors have declared no competing interest.Clinical TrialNCT02480322 NCT00773695Funding StatementThis work was supported by a grant from the Norwegian Research School in Bioinformatics, Biostatistics and Systems Biology (NORBIS), and a grant from the Norwegian Cancer Society. No authors received payment or services from a third party for any aspect of the submitted work.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This work is based on publicly available datasets, and we therefore did not apply to any oversight body to perform this study.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe metabolomics-datasets used in this manuscript are publicly available, and can be found in the supplementary section at (https://doi.org/10.1007/s11306-017-1168-0), and on metabolomics data repository MetaboLights (MTBLS242). The data analysis code is available at (https://github.com/ntnu-mr-cancer/RM_ASCA) https://www.ebi.ac.uk/metabolights/MTBLS242/descriptors https://link.springer.com/article/10.1007/s11306-017-1168-0#Sec16 ER -