Key point
Growing evidence has shown that the risk of NAFLD occurrence and progression varies among individuals and has highlighted the role of inherited genetic variants.
Non-alcoholic fatty liver disease (NAFLD) is the leading cause of liver disease in high-income countries, affecting more than 25% of the population.1 The inflammatory form of this condition, namely non-alcoholic steatohepatitis (NASH), is responsible for an increasing proportion of cases of cirrhosis and hepatocellular carcinoma (HCC).2,3 As for other complex traits (e.g. circulating lipids or cardiovascular diseases), the risk of developing NAFLD and NASH varies among individuals; it is ultimately determined by the combination of environmental factors, such as adiposity or the presence of type 2 diabetes (T2D), and inherited genetic variations.
Historically, genetic susceptibility to NAFLD has been evaluated using candidate gene studies – a study design evaluating the association of variants in a given gene and a phenotype of interest. However, most candidate gene studies have only identified and validated a handful of loci associated with the risk of NAFLD prevalence or progression. This can be explained by the limited number of genes selected a priori based on their plausible biological relevance, but also by various methodological drawbacks (e.g. limited statistical power).4 Conversely, genome-wide association studies (GWAS) test the association of millions of variants throughout the genome in an unbiased fashion. GWAS can be performed using various technologies, like single nucleotide polymorphism (SNP) arrays followed by imputation, a statistical method for inferring genotypes that are not directly measured using large reference panels (e.g. 1000G or the Haplotype Reference Consortium).5,6 More recently, next-generation sequencing approaches have been implemented, including whole-exome sequencing (WES), targeting the fraction of the genome that encodes proteins, or whole-genome sequencing (WGS). GWAS have identified robust and reproducible associations linked with the natural history of NAFLD, including variants in the patatin-like phospholipase domain-containing 3 (PNPLA3), the transmembrane 6 superfamily member 2 (TM6SF2) and more recently in the 17-beta hydroxysteroid dehydrogenase 13 (HSD17B13) genes.7 GWAS have uncovered novel NAFLD susceptibility genes and biological pathways and fostered improved understanding of NAFLD pathophysiology.
Herein, we provide an update on our current knowledge of NAFLD genetics and discuss the benefits and limitations of recent GWAS findings including biological understanding, risk prediction and drug development. Based on the available evidence, we propose suggestions for interpretation, and design of new studies in the field. We will frequently use the more general term fatty liver disease (FLD), as at risk alcohol intake was not an exclusion criterium in most studies, and it is always difficult to differentiate the role of alcohol and metabolic risk factors.8 Furthermore, genetic studies have highlighted shared inherited determinants of metabolic and alcohol-related FLD.9 Growing evidence has shown that the risk of NAFLD occurrence and progression varies among individuals and has highlighted the role of inherited genetic variants.Key point
Heritability is defined as the proportion of phenotypic variation in a trait that is due to genetic variation.10 Unlike monogenic diseases, such as hereditary haemochromatosis and Wilson's disease, the heritability of complex traits involves thousands of common genetic variants (minor-allele frequency [MAF] ≥5%) distributed throughout the genome, which are usually characterised by small effect sizes (e.g. relative risks or odds ratios [ORs]).11
A large fraction of hepatic fat and FLD variability
Recognition of the validity of GWAS for providing new insights into complex traits, as recently demonstrated for NAFLD, relies on the high reproducibility and robustness of their findings. However, this success comes at the cost of a stringent statistical significance threshold (p <5×10−8) used in most GWAS to correct for the burden of multiple testing, as millions of SNPs throughout the genome are assessed.83 The sample size required to detect a given genetic variant with suitable statistical
Based on the evidence reviewed in this manuscript, we propose a checklist for genetic studies aimed at further advancing the field of NAFLD, to ensure methodological robustness, reproducibility, and clinical relevance (Table S1). This is not an alternative to the STrengthening the REporting of Genetic Association Studies (STREGA) reporting guidelines,97 which should be taken into consideration, but provides specific suggestions relevant for NAFLD.
Specific genetic risk variants have now been robustly confirmed to exert a large impact on NAFLD, with an effect size comparable and synergic to that of the main metabolic risk factors, namely obesity and type 2 diabetes. This risk increase extends to the development and progression of the full spectrum of NAFLD, extending to liver-related and overall mortality. Furthermore, genetic risk variants may be able to profile subsets of patients with different pathophysiology and response to
LV was supported by MyFirst Grant AIRC n.16888, Ricerca Finalizzata Ministero della Salute RF-2016-02364358, Ricerca corrente Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico; Fondazione Sviluppo Ca' Granda for the project Liver-Bible (PR-0361); the European Union (EU) Programme Horizon 2020 (under grant agreement No. 777377) for the project LITMUS-“Liver Investigation: Testing Marker Utility in Steatohepatitis”.
ET was supported by a Marie Skłodowska-Curie Individual Fellowship, the
Authors contributed equally to the design, conceptualisation, writing of the manuscript, LV supervised the process.
LV reports having received speaking fees from: MSD, Gilead, AlfaSigma, AbbVie; consulting fees from: Gilead, Pfizer, Astra Zeneca, Novo Nordisk, Intercept pharmaceuticals, Diatech Pharmacogenetics; unrestricted research grants from: Gilead.
ET received unrestricted research grant from Gilead.
Please refer to the accompanying ICMJE disclosure forms for further details.
Author names in bold designate shared co-first authors