RT Journal Article
SR Electronic
T1 Linking abdominal imaging traits to electronic health record phenotypes
JF medRxiv
FD Cold Spring Harbor Laboratory Press
SP 2020.09.08.20190330
DO 10.1101/2020.09.08.20190330
A1 Matthew T. MacLean
A1 Qasim Jehangir
A1 Marijana Vujkovic
A1 Yi-An Ko
A1 Harold Litt
A1 Arijitt Borthakur
A1 Hersh Sagraiya
A1 Mark Rosen
A1 David A. Mankoff
A1 Mitchell D. Schnall
A1 Haochang Shou
A1 Julio Chirinos
A1 Scott M. Damrauer
A1 Drew A. Torigian
A1 Rotonya Carr
A1 Daniel J. Rader
A1 Walter R. Witschey
YR 2020
UL http://medrxiv.org/content/early/2020/09/09/2020.09.08.20190330.abstract
AB Quantitative traits obtained from computed tomography (CT) scans performed in routine clinical practice have the potential to enhance translational research and genomic discovery when linked to electronic health record (EHR) and genomic data. For example, both liver fat and abdominal adipose mass are highly relevant to human disease; non-alcoholic fatty liver disease (NAFLD) is present in 30% of the US adult population, is strongly associated with obesity, and can progress to hepatic inflammation, cirrhosis, and hepatocellular carcinoma. We built a fully automated image curation and organ labeling technique using deep learning to identify liver, spleen, subcutaneous and visceral fat compartments in the abdomen and extract 12 quantitative imaging traits from 161,748 CT scans in 19,624 patients enrolled in the Penn Medicine Biobank (PMBB). The average liver fat, as defined by a difference in attenuation between spleen and liver, was −6.4 ± 9.1 Hounsfield units (HU). In 135 patients who had undergone both liver biopsy and imaging, receiver operating characteristic (ROC) analysis revealed an area under the curve (AUC) of 0.81 for hepatic steatosis. The mean fat volume within the abdominal compartment for subcutaneous fat was 4.9 ± 3.1 L and for visceral fat was 2.9 ± 2.1 L.
We performed integrative analyses of liver fat with the phenome extracted from the EHR and found highly significant associations with chronic liver disease/cirrhosis, chronic non-alcoholic liver disease, diabetes mellitus, obesity, hypertension, renal failure, alcoholism, hepatitis C, use of therapeutic adrenal cortical steroids, respiratory failure and pancytopenia. Liver fat was significantly associated with two of the most robust NAFLD-associated genetic variants, namely rs738409 in PNPLA3 and rs58542926 in TM6SF2. Finally, we performed multivariate principal component analysis (PCA) to show the importance of each of the quantitative imaging traits to NAFLD and their interrelationships with the phenome. This work demonstrates the power of automated image quantitative trait analyses applied to routine clinical imaging studies to fuel translational scientific discovery.
Competing Interest Statement: The authors have declared no competing interest.
Funding Statement: This work was supported by the Sarnoff Cardiovascular Research Foundation (MM), NIH NCATS UL1TR001878, NIH/NHLBI R01 HL137984, R01 AA026302-02, P30 DK0503060 (RC), and the Penn Center for Precision Medicine.
Author Declarations:
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below: This protocol was approved by the Institutional Review Board of the University of Pennsylvania.
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived. Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes
Data is available upon reasonable request.