TY - JOUR T1 - Advancing Human Genetics Research and Drug Discovery through Exome Sequencing of the UK Biobank JF - medRxiv DO - 10.1101/2020.11.02.20222232 SP - 2020.11.02.20222232 AU - Joseph D. Szustakowski AU - Suganthi Balasubramanian AU - Ariella Sasson AU - Shareef Khalid AU - Paola G. Bronson AU - Erika Kvikstad AU - Emily Wong AU - Daren Liu AU - J. Wade Davis AU - Carolina Haefliger AU - A. Katrina Loomis AU - Rajesh Mikkilineni AU - Hyun Ji Noh AU - Samir Wadhawan AU - Xiaodong Bai AU - Alicia Hawes AU - Olga Krasheninina AU - Ricardo Ulloa AU - Alex Lopez AU - Erin N. Smith AU - Jeff Waring AU - Christopher D. Whelan AU - Ellen A. Tsai AU - John Overton AU - William Salerno AU - Howard Jacob AU - Sandor Szalma AU - Heiko Runz AU - Greg Hinkle AU - Paul Nioi AU - Slavé Petrovski AU - Melissa R. Miller AU - Aris Baras AU - Lyndon Mitnaul AU - Jeffrey G. Reid AU - on behalf of the UKB-ESC Research Team Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/11/04/2020.11.02.20222232.abstract N2 - The UK Biobank Exome Sequencing Consortium (UKB-ESC) is a unique private/public partnership between the UK Biobank and eight biopharma companies that will sequence the exomes of all ∼500,000 UK Biobank participants. Here we describe early results from the exome sequence data generated by this consortium for the first ∼200,000 UKB subjects and the key features of this project that enabled the UKB-ESC to come together and generate this data.Exome sequencing data from the first 200,643 UKB enrollees are now accessible to the research community. Approximately 10M variants were observed within the targeted regions, including: 8,086,176 SNPs, 370,958 indels and 1,596,984 multi-allelic variants. Of the ∼8M variants observed, 84.5% are coding variants and include 2,139,318 (25.3%) synonymous, 4,549,694 (53.8%) missense, 453,733 (5.4%) predicted loss-of-function (LOF) variants (initiation codon loss, premature stop codons, stop codon loss, splicing and frameshift variants) affecting at least one coding transcript. This open access data provides a rich resource of coding variants for rare variant genetic studies, and is particularly valuable for drug discovery efforts that utilize rare, functionally consequential variants.Over the past decade, the biopharma industry has increasingly leveraged human genetics as part of their drug discovery and development strategies. This shift was motivated by technical advances that enabled cost-effective human genetics research at scale, the emergence of electronic health records and biobanks, and a maturing understanding of how human genetics can increase the probability of successful drug development. Recognizing the need for large-scale human genetics data to drive drug discovery, and the unique value of the open data access policies and contribution terms of the UK Biobank, the UKB-ESC was formed. This precompetitive collaboration has further strengthened the ties between academia and industry and provided teams an unprecedented opportunity to interact with and learn from the wider research community.Competing Interest StatementThe authors have declared no competing interest.Funding StatementFunding for this work was provided by the authors' institutions, and no external funding was recieved for the generation and analysis of the 200k exome data results presented here. UK Biobank generally recieves external funding from a variety of sources.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This work was reviewed and approved by UK Biobank, for details see the UKB ethics and governance framework: https://www.ukbiobank.ac.uk/wp-content/uploads/2011/05/EGF20082.pdfAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll data is avaiable through the UK Biobank (https://www.ukbiobank.ac.uk/) to bona fide researchers who have submitted compliant research applications. https://www.ukbiobank.ac.uk/ ER -