RT Journal Article SR Electronic T1 Integrative genetic and genomic networks identify microRNA associated with COPD and ILD JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2022.08.07.22278496 DO 10.1101/2022.08.07.22278496 A1 Pavel, Ana B. A1 Garrison, Carly A1 Luo, Lingqi A1 Liu, Gang A1 Taub, Daniel A1 Xiao, Ji A1 Juan-Guardela, Brenda A1 Tedrow, John A1 Alekseyev, Yuriy O. A1 Yang, Ivana V. A1 Geraci, Mark W. A1 Sciurba, Frank A1 Schwartz, David A. A1 Kaminski, Naftali A1 Beane, Jennifer A1 Spira, Avrum A1 Lenburg, Marc E. A1 Campbell, Joshua D. YR 2022 UL http://medrxiv.org/content/early/2022/08/09/2022.08.07.22278496.abstract AB Chronic obstructive pulmonary disease (COPD) and interstitial lung disease (ILD) are clinically and molecularly heterogeneous diseases. We utilized clustering and integrative network analyses to elucidate roles for microRNAs (miRNAs) and miRNA isoforms (isomiRs) in COPD and ILD pathogenesis. Short RNA sequencing was performed on 351 lung tissue samples of COPD (n=145), ILD (n=144) and controls (n=64). Five distinct subclusters of samples were identified, including 1 COPD-predominant cluster and 2 ILD-predominant clusters, which were associated with different clinical measurements of disease severity. Utilizing 262 samples with gene expression and SNP microarrays, we built disease-specific genetic and expression networks to predict key miRNA regulators of gene expression. Members of the miR-449/34 family, known to promote airway differentiation by repressing the Notch pathway, were among the top connected miRNAs in both COPD and ILD networks. Genes associated with miR-449/34 members in the disease networks were enriched among genes that increase in expression with airway differentiation at an air-liquid interface. A highly expressed isomiR containing a novel seed sequence was identified at the miR-34c-5p locus. 47% of the anticorrelated predicted targets for this isomiR were distinct from those of the canonical seed sequence for miR-34c-5p.
Overexpression of the canonical miR-34c-5p and the miR-34c-5p isomiR with an alternative seed sequence down-regulated NOTCH1 and NOTCH4. However, only overexpression of the isomiR down-regulated genes involved in Ras signaling such as CRKL and GRB2. Overall, these findings elucidate molecular heterogeneity inherent across COPD and ILD patients and further suggest roles for miR-34c in regulating disease-associated gene expression. Competing Interest Statement: The authors have declared no competing interest. Funding Statement: This work was supported by the National Institutes of Health/National Heart, Lung, and Blood Institute with funding from the Lung Genomics Research Consortium (RC2-HL101715) and R01HL118542. Author Declarations: I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes. The details of the IRB/oversight body that provided approval or exemption for the research described are given below: N/A. I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes. I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov.
I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes. I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable. Yes. The microRNA expression datasets generated and analyzed during the current study, both raw and normalized, are available at the Gene Expression Omnibus (GEO) under the accession number GSE201121.