Abstract
Kidney disease affects 50% of all diabetic patients; however, prediction of disease progression has been challenging due to inherent disease heterogeneity. We use deep learning to identify novel genetic signatures prognostically associated with outcomes. Using autoencoders and unsupervised clustering of electronic health record data on 1,372 diabetic kidney disease patients, we establish two clusters with differential prevalence of end-stage kidney disease. Exome-wide associations identify a novel variant in ARHGEF18, a Rho guanine exchange factor specifically expressed in glomeruli. Overexpression of ARHGEF18 in human podocytes leads to impairments in focal adhesion architecture, cytoskeletal dynamics, cellular motility, and RhoA/Rac1 activation. Mutant GEF18 is resistant to ubiquitin mediated degradation leading to pathologically increased protein levels. Our findings uncover the first known disease-causing genetic variant that affects protein stability of a cytoskeletal regulator through impaired degradation, a potentially novel class of expression quantitative trait loci that can be therapeutically targeted.
Competing Interest Statement
Girish N Nadkarni reports grants, personal fees and non-financial support from Renalytix AI; non-financial support from Pensieve Health, personal fees from AstraZeneca, personal fees from BioVie, personal fees from Siemens Healthineers, personal fees from Reata, personal fees from GLG Consulting, from outside the submitted work. The remaining authors have nothing to disclose.
Funding Statement
We acknowledge funding from NIH R01DK118222 (E. U. Azeloglu). In addition, J. C. He is supported by NIH R01DK109683, R01DK122980, R01DK129467, P01DK56492, and VA Merit Award I01BX000345; G. N. Nadkarni is supported by NIH R01DK127139 and R01HL155915; G. L. Gusella is supported by NIH R01DK131047; K. Lee is supported by NIH R01DK117913 and R01DK129467; J. Fu is supported by NIH K01DK125614; J. C. Haydak is supported by NIH T32HD075735 NICHD-Interdisciplinary Training in Systems and Developmental Biology and Birth Defects; A. Mendoza is supported by NIH R01DK131047 Diversity Supplement. Confocal imaging was performed at the Microscopy CoRE at the Icahn School of Medicine at Mount Sinai. We acknowledge the Dean's Flow Cytometry CORE at Mount Sinai for their assistance.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
For the Mount Sinai BioMe dataset, consent was obtained from all participants prior to clinical and genetic data collection. This study was approved by the Institutional Review Board at the Icahn School of Medicine.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
BioMe data, including clinical and genetic sequencing data is not publicly available due to patient privacy concerns and institutional policy. Transcriptomic data from CKD biopsy specimens is available on the Nephroseq website.