Elsevier

Human Immunology

Volume 68, Issue 9, September 2007, Pages 779-788
Human Immunology

High-resolution HLA alleles and haplotypes in the United States population

https://doi.org/10.1016/j.humimm.2007.04.005Get rights and content

Summary

We extract and present high-resolution HLA allele and haplotype frequency data available from the National Marrow Donor Program databases from four major U.S. census categories of race and ethnicity. Population-based high-resolution HLA frequencies defined on the basis of from one to five loci are presented and made available online (http://bioinformatics.nmdp.org/haplotype2006). In addition, a discriminatory classification of HLA allelic variation on the basis of observed population allele frequencies (common, rare and unseen) for HLA A, C, B, DRB1, DQA1, and DQB1 is introduced. The electronic availability of this information will be useful for projects central to the typing and use of population data in HLA applications.

Introduction

Progress in the discovery of new allelic variability in the histocompatibility loci, with up to hundreds of alleles defined at individual HLA loci, is evident [1], yet an accurate and statistically adequate characterization of the distribution of this variation at the population level has been lagging. Knowledge of HLA data at the level of a specific population or ethnic group level is especially important because of the considerable differentiation in HLA frequencies that has occurred among human populations since our species’ first appearance and subsequent spread across Africa and other continents. The International HLA Workshop Anthropology components have played a very useful role in defining HLA polymorphism at the population level with surveys of HLA variation across a variety of human populations [2]. The sample sizes in these studies have been sufficient to sketch the broad outlines of the dominant HLA specificities and haplotypes present in a group. In contrast the value of a full picture of HLA variation for a given human population gained from samples sufficient for a complete and thorough characterization of the genetic diversity of HLA has a number of important applications which are not possible from studies involving less thorough typing and less sampling. We mention here the daily utility accrued from such information in ascertaining potential patient matches from a donor registry, the background information essential to determining the development of optimal donor registry size according to ethnic group, and the degree of HLA differentiation at the population level accrued during the course of human population evolution.

The National Marrow Donor Program (NMDP) has the mission of developing, maintaining and coordinating a repository of HLA-typed donors to facilitate hematopoietic stem cell transplantation among unrelated individuals in the United States. To this end a donor pool of several million individuals has been developed for which HLA typing has been performed (www.marrow.org). We have extracted the existing high-resolution HLA typings from the NMDP databanks and present this information according to the broad ethnic classifications of individuals in the United States defined by the United States census. The census categories themselves are only rough and sometimes poor categorization of original geographically based human ethnicities. In addition, the NMDP HLA high-resolution typings can stray from a random sample of even the census population categories. Nonetheless, we feel that this set of HLA population based frequency data can prove extremely valuable at this time. The large sizes of the available population samples and the uniform consistency and quality control procedures employed in the HLA typing make this data a unique resource.

Section snippets

Subjects and methods

The samples used in this analysis are from two NMDP sources: (1) prospective high-resolution (SBT-based) typing for the three minority population samples, and (2) donors from donor–recipient pairs typed at high resolution (SBT-, SSOP-, or SSP-based) for the European American sample. The minority prospective high-resolution typing was performed by quality-controlled NMDP contract laboratories and all data was validated against prior typings. The donor–recipient pair donor typing was subjected to

Results

HLA frequency data are presented on the basis of the four predominant US census categories for categorizing ethnic and racial groups: African Americans, Asians (Asians and Pacific Islanders), European Americans (white, or Caucasian) and Hispanics. These categories from NMDP input questionnaires define the self-described ethnic groups. High resolution HLA allele frequency data on the five most commonly typed loci (A, C, B, DRB1, and DQB1) and 10 haplotype categories (including two-, three-,

Discussion

Well-defined samples of HLA data for populations, such as those presented in this report, have several strengths. While many population-based reports of HLA typings have been published, these have not been uniformly consistent in mode and thoroughness of HLA typing methodologies and number of loci included, nor have they been conducted with sample sizes adequate for a statistically complete characterization of the variation present. The combination of these inadequacies in currently available

Acknowledgments

We are indebted to the network of NMDP contract laboratories that have performed HLA typing used in this study. American Red Cross National Histocompatibility Laboratory (Marcelo Fernandez-Vina, Ph.D.), American Red Cross New England Region (Marcela Saluzar, M.D, Neng Yu, M.D.), American Red Cross Penn-Jersey Region (Susan Hsu, Ph.D.), Children’s Hospital Oakland Research Institute (Elizabeth Trachtenberg, Ph.D.), Dana-Farber Cancer Institute (Edmond Yunis, M.D.), Histogenetics, Inc. (Soo Young

References (17)

There are more references available in the full text version of this article.

Cited by (369)

View all citing articles on Scopus
View full text