RT Journal Article
SR Electronic
T1 Data-driven research on eczema: systematic characterization of the field and recommendations for the future
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2022.01.14.22269294
DO 10.1101/2022.01.14.22269294
A1 A. Duverdier
A1 A. Custovic
A1 R.J. Tanaka
YR 2022
UL http://medrxiv.org/content/early/2022/01/18/2022.01.14.22269294.abstract
AB Background: The past decade has seen a substantial rise in the employment of modern data-driven methods to study atopic dermatitis (AD) / eczema. Objective: To summarise the past and future of data-driven AD research, and identify areas in the field that would benefit from the application of these methods. Methods: We retrieved the publications that applied multivariate statistics (MS), artificial intelligence (AI, including machine learning, ML), and Bayesian statistics (BS) to AD and eczema research from the SCOPUS database over the last 50 years. We conducted a bibliometric analysis to highlight the publication trends and conceptual knowledge structure of the field, and applied topic modelling to retrieve the key topics in the literature. Results: Five key themes of data-driven research on AD and eczema were identified: (1) allergic co-morbidities, (2) image analysis and classification, (3) disaggregation, (4) quality of life, and (5) risk factors and prevalence. ML&AI methods mapped to studies investigating quality of life, prevalence, risk factors, allergic co-morbidities and disaggregation of AD/eczema, but seldom in studies of therapies. MS was employed evenly between the topics, particularly in studies on risk factors and prevalence. BS was focused on three key topics: treatment, risk factors and allergy. The use of AD or eczema terms was not uniform, with studies applying ML&AI methods using the term eczema more often. Within MS, papers using cluster and factor analysis were often only identified with the term AD.
In contrast, those using logistic regression and latent class/transition models were “eczema” papers. Conclusions: Research areas that could benefit from the application of data-driven methods include the study of the pathogenesis of the condition and related risk factors, its disaggregation into validated subtypes, and personalized severity management and prognosis. We highlight Bayesian statistics as a new and promising approach in AD and eczema research. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This work was supported by the UKRI CDT in AI for Healthcare http://ai4health.io (Grant No. EP/S023283/1). Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. All data and code for the analysis are available at https://github.com/arianeduverdier/systematic_data-driven_eczema