Rapid evolution of major histocompatibility complex class I genes in primates generates new disease alleles in humans via hitchhiking diversity.
ABSTRACT: A plausible explanation for many MHC-linked diseases is lacking. Sequencing of the MHC class I region (coding units or full contigs) in several human and nonhuman primate haplotypes allowed an analysis of single nucleotide variations (SNV) across this entire segment. This diversity was not evenly distributed. It was rather concentrated within two gene-rich clusters. These were each centered, but importantly not limited to, the antigen-presenting HLA-A and HLA-B/-C loci. Rapid evolution of MHC-I alleles, as evidenced by an unusually high number of haplotype-specific (hs) and hypervariable (hv) (which could not be traced to a single species or haplotype) SNVs within the classical MHC-I, seems to have not only hitchhiked alleles within nearby genes, but also hitchhiked deleterious mutations in these same unrelated loci. The overrepresentation of a fraction of these hvSNV (hv1SNV) along with hsSNV, as compared to those that appear to have been maintained throughout primate evolution (trans-species diversity; tsSNV; included within hv2SNV) tends to establish that the majority of the MHC polymorphism is de novo (species specific). This is most likely reminiscent of the fact that these hsSNV and hv1SNV have been selected in adaptation to the constantly evolving microbial antigenic repertoire.
Project description:Rapid advances in biochemical technologies have enabled several strategies for typing candidate HLA alleles, but linking them into a single MHC haplotype structure remains challenging. Here we have developed a multi-loci haplotype phasing technique and demonstrate its utility towards phasing of MHC and KIR loci in human samples. We accurately (~99%) reconstruct the complete haplotypes for over 90% of sequence variants spanning the 4-megabase region of these two loci. By haplotyping a majority of coding and non-coding alleles at the MHC and KIR loci in a single assay, this method has the potential to assist transplantation matching and facilitate investigation of the genetic basis of human immunity and disease. Complete haplotype phasing of 2 loci (MHC and KIR) in 1 human cell line.
Project description:The association of classic human leukocyte antigen (HLA) alleles with risk of cervical cancer has been extensively studied, and a protective effect has consistently been found for DRB1*1301, DQA1*0103, and/or DQB1*0603 (these three alleles are in perfect linkage disequilibrium [LD] and often occur on the same haplotype in Europeans), while reports have differed widely with respect to the effect of HLA-B*07, DRB1*1501, and/or DQB1*0602 (the last two alleles are also in perfect LD in Europeans). It is not clear whether the reported HLA alleles are responsible for the differences in cervical cancer susceptibility, or if functional variants at other locations within the major histocompatibility complex (MHC) region may explain the effect. In order to assess the relative contribution of both classic HLA alleles and single-nucleotide polymorphisms (SNPs) within the MHC region to cervical cancer susceptibility, we have imputed classic HLA alleles in 1034 cervical cancer patients and 3948 controls in a Swedish population for an integrated analysis. We found that the protective haplotype DRB1*1301-DQA1*0103-DQB1*0603 has a direct effect on cervical cancer and always occurs together with the C allele of a HLA-DRB1 cis-eQTL (rs9272143), which increases the expression of HLA-DRB1. The haplotype rs9272143C-DRB1*1301-DQA1*0103-DQB1*0603 conferred the strongest protection against cervical cancer (odds ratio [OR] = 0.41, 95% confidence interval [CI] = 0.32-0.52, P = 6.2 × 10(-13)). On the other hand, the associations with HLA-B*0702 and DRB1*1501-DQB1*0602 are attributable to the joint effects of both the HLA-DRB1 cis-eQTL (rs9272143) and a frameshift mutation (G inserion of rs67841474, also known as A5.1) of the MHC class I polypeptide-related sequence A gene (MICA). Variation in LD between the classic HLA loci, rs9272143 and rs67841474 between populations may explain the different associations of HLA-B*07 and DRB1*1501-DQB1*0602 with cervical cancer between studies. The mechanism suggested may also explain similar inconsistent results for other HLA-associated diseases.
Project description:The major histocompatibility complex (MHC) contains the most polymorphic genetic system in humans, the human leukocyte antigen (HLA) genes of the adaptive immune system. High allelic diversity in HLA is argued to be maintained by balancing selection, such as negative frequency-dependent selection or heterozygote advantage. Selective pressure against immune escape by pathogens can maintain appreciable frequencies of many different HLA alleles. The selection pressures operating on combinations of HLA alleles across loci, or haplotypes, have not been extensively evaluated since the high HLA polymorphism necessitates very large sample sizes, which have not been available until recently. We aimed to evaluate the effect of selection operating at the HLA haplotype level by analyzing HLA A~C~B~DRB1~DQB1 haplotype frequencies derived from over six million individuals genotyped by the National Marrow Donor Program registry. In contrast with alleles, HLA haplotype diversity patterns suggest purifying selection, as certain HLA allele combinations co-occur in high linkage disequilibrium. Linkage disequilibrium is positive (Dij'>0) among frequent haplotypes and negative (Dij'<0) among rare haplotypes. Fitting the haplotype frequency distribution to several population dynamics models, we found that the best fit was obtained when significant positive frequency-dependent selection (FDS) was incorporated. Finally, the Ewens-Watterson test of homozygosity showed excess homozygosity for 5-locus haplotypes within 23 US populations studied, with an average Fnd of 28.43. Haplotype diversity is most consistent with purifying selection for HLA Class I haplotypes (HLA-A, -B, -C), and was not inferred for HLA Class II haplotypes (-DRB1 and-DQB1). We discuss our empirical results in the context of evolutionary theory, exploring potential mechanisms of selection that maintain high linkage disequilibrium in MHC haplotype blocks.
Project description:The MHC and KIR loci are clinically relevant regions of the genome. Typing the sequence of these loci has a wide range of applications including organ transplantation, drug discovery, pharmacogenomics and furthering fundamental research in immune genetics. Rapid advances in biochemical and next-generation sequencing (NGS) technologies have enabled several strategies for precise genotyping and phasing of candidate HLA alleles. Nonetheless, as typing of candidate HLA alleles alone reveals limited aspects of the genetics of MHC region, it is insufficient for the comprehensive utility of the aforementioned applications. For this reason, we believe phasing the entire MHC and KIR locus onto a single locus-spanning haplotype can be a critical improvement for better understanding transplantation biology.Generating long-range (>1 Mb) phase information is traditionally very challenging. As proximity-ligation based methods of DNA sequencing preserves chromosome-span phase information, we have utilized this principle to demonstrate its utility towards generating full-length phasing of MHC and KIR loci in human samples. We accurately (~99%) reconstruct the complete haplotypes for over 90% of sequence variants (coding and non-coding) within these two loci that collectively span 4-megabases.By haplotyping a majority of coding and non-coding alleles at the MHC and KIR loci in a single assay, this method has the potential to assist transplantation matching and facilitate investigation of the genetic basis of human immunity and disease.
Project description:African non-human primates are SIV natural hosts and do not develop disease following infection. Understanding disease avoidance mechanisms in these species is important for HIV vaccine development. The largest captive population of sooty mangabeys, a SIV natural host species, resides at the Yerkes National Primate Research Center.Thirteen primer sets that amplify polymorphic microsatellite loci within the MHC region were used to genotype 144 animals. Immunogenetic Management Software (IMS) was used to identify MHC haplotypes and organize data.Seventy-three haplotypes were identified. Limited haplotype diversity was observed in this population with 88.2% of included animals carrying one of 18 haplotypes. Differences in haplotype frequency were observed between SIV (+) and SIV (-) populations.We have developed a novel tool for others to use in the analysis of the role of the MHC in a natural host non-human primate model species used for SIV research.
Project description:To determine the relationship between highly-conserved extended-haplotypes (CEHs) in the major histocompatibility complex (MHC) and MS-susceptibility.Among the ~200 MS-susceptibility regions, which are known from genome-wide analyses of single nucleotide polymorphisms (SNPs), the MHC accounts for roughly a third of the currently explained variance and the strongest MS-associations are for certain Class II alleles (e.g., HLA-DRB1*15:01; HLA-DRB1*03:01; and HLA-DRB1*13:03), which frequently reside on CEHs within the MHC.Autosomal SNPs (441,547) from 11,376 MS cases and 18,872 controls in the WTCCC dataset were phased. The most significant MS associated SNP haplotype was composed of 11 SNPs in the MHC Class II region surrounding the HLA-DRB1 gene. We also phased alleles at the HLA-A, HLA-C, HLA-B, HLA-DRB1, and HLA-DQB1 loci. This data was used to probe the relationship between CEHs and MS susceptibility.We phased a total of 59,884 extended haplotypes (HLA-A, HLA-C, HLA-B, HLA-DRB1, HLA-DQB1 and SNP haplotypes) from 29,942 individuals. Of these, 10,078 unique extended haplotypes were identified. The 10 most common CEHs accounted for 22% (13,302) of the total. By contrast, the 8,446 least common extended haplotypes also accounted for approximately 20% (12,298) of the total. This extreme frequency-disparity among extended haplotypes necessarily complicates interpretation of reported disease-associations with specific HLA alleles. In particular, the HLA motif HLA-DRB1*15:01~HLA-DQB1*06:02 is strongly associated with MS risk. Nevertheless, although this motif is almost always found on the a1 SNP haplotype, it can rarely be found on others (e.g., a27 and a36), and, in these cases, it seems to have no apparent disease-association (OR = 0.7; CI = 0.3-1.3 and OR = 0.7; CI = 0.2-2.2, respectively). Furthermore, single copy carriers of the a1 SNP-haplotype without this HLA motif still have an increased disease risk (OR = 2.2; CI = 1.2-3.8). In addition, even among the set of CEHs, which carry the Class II motif of HLA-DRB1*15:01~HLA-DQB1*06:02~a1, different CEHs have differing strengths in their MS-associations.The MHC in diverse human populations consists, primarily, of a very small collection of very highly-selected CEHs. Our findings suggest that the MS-association with the HLA-DRB1*15:01~HLA-DQB1*06:02 haplotype may be due primarily to the combined attributes of the CEHs on which this particular HLA-motif often resides.
Project description:The common marmoset (Callithrix jacchus) is a New World primate species that is highly susceptible to fatal infections caused by various strains of bacteria. We present here a first step in the molecular characterization of the common marmoset's Mhc class II genes by nucleotide sequence analysis of the polymorphic exon 2 segments. For this study, genetic material was obtained from animals bred in captivity as well as in the wild. The results demonstrate that the common marmoset has, like other primates, apparently functional Mhc-DR and -DQ regions, but the Mhc-DP region has been inactivated. At the -DR and -DQ loci, only a limited number of lineages were detected. On the basis of the number of alleles found, the -DQA and -B loci appear to be oligomorphic, whereas only a moderate degree of polymorphism was observed for two of three Mhc-DRB loci. The contact residues in the peptide-binding site of the Caja-DRB1*03 lineage members are highly conserved, whereas the -DRB*W16 lineage members show more divergence in that respect. The latter locus encodes five oligomorphic lineages whose members are not observed in any other primate species studied, suggesting rapid evolution, as illustrated by frequent exchange of polymorphic motifs. All common marmosets tested were found to share one monomorphic type of Caja-DRB*W12 allele probably encoded by a separate locus. Common marmosets apparently lack haplotype polymorphism because the number of Caja-DRB loci present per haplotype appears to be constant. Despite this, however, an unexpectedly high number of allelic combinations are observed at the haplotypic level, suggesting that Caja-DRB alleles are exchanged frequently between chromosomes by recombination, promoting an optimal distribution of limited Mhc polymorphisms among individuals of a given population. This peculiar genetic make up, in combination with the limited variability of the major histocompatability complex class II repertoire, may contribute to the common marmoset's susceptibility to particular bacterial infections.
Project description:Patient-derived HIV-1 subtype B Nef clones downregulate HLA-A more efficiently than HLA-B. However, it remains unknown whether this property is common to Nef proteins across primate lentiviruses and how antiviral immune responses may be affected. We examined 263 Nef clones from diverse primate lentiviruses including different pandemic HIV-1 group M subtypes for their ability to downregulate major histocompatibility complex class A (MHC-A) and MHC-B from the cell surface. Though lentiviral Nef proteins differed markedly in their absolute MHC-A and MHC-B downregulation abilities, all lentiviral Nef lineages downregulated MHC-A, on average, 11 to 32% more efficiently than MHC-B. Nef genotype/phenotype analyses in a cohort of HIV-1 subtype C-infected patients (n = 168), together with site-directed mutagenesis, revealed Nef position 9 as a subtype-specific determinant of differential HLA-A versus HLA-B downregulation activity. Nef clones harboring nonconsensus variants at codon 9 downregulated HLA-B (though not HLA-A) significantly better than those harboring the consensus sequence at this site, resulting in reduced recognition of infected target cells by HIV-1-specific CD8+ effector cells in vitro Among persons expressing protective HLA class I alleles, carriage of Nef codon 9 variants was also associated with reduced ex vivo HIV-specific T cell responses. Our results demonstrate that Nef's inferior ability to downregulate MHC-B compared to that of MHC-A is conserved across primate lentiviruses and suggest that this property influences antiviral cellular immune responses.IMPORTANCE Primate lentiviruses encode the Nef protein that plays an essential role in establishing persistent infection in their respective host species. Nef interacts with the cytoplasmic region of MHC-A and MHC-B molecules and downregulates them from the infected cell surface to escape recognition by host cellular immunity. Using a panel of Nef alleles isolated from diverse primate lentiviruses including pandemic HIV-1 group M subtypes, we demonstrate that Nef proteins across all lentiviral lineages downregulate MHC-A approximately 20% more effectively than MHC-B. We further identify a naturally polymorphic site at Nef position 9 that contributes to the MHC-B downregulation function in HIV-1 subtype C and show that carriage of Nef variants with enhanced MHC-B downregulation ability is associated with reduced breadth and magnitude of MHC-B-restricted cellular immune responses in HIV-infected individuals. Our study underscores an evolutionarily conserved interaction between lentiviruses and primate immune systems that may contribute to pathogenesis.
Project description:The control of gene regulation within the major histocompatibility complex (MHC) remains poorly understood, despite several expression quantitative trait loci (eQTL) studies revealing an association of MHC gene expression with independent tag-single nucleotide polymorphisms (SNPs). MHC haplotype variation may exert a greater effect on gene expression phenotype than specific single variants. To explore the effect of MHC haplotype sequence diversity on gene expression phenotypes across the MHC, we examined the MHC transcriptomic landscape at haplotype-specific resolution for three prominent MHC haplotypes (A2-B46-DR9, A33-B58-DR3, and A1-B8-DR3) derived from MHC-homozygous B-lymphoblastoid cell lines (B-LCLs). We demonstrate that MHC-wide gene expression patterns are dictated by underlying haplotypes, and identify 36 differentially expressed genes. By mapping these haplotype sequence variations to known eQTL, we provide evidence that unique allelic combinations of eQTL, embedded within haplotypes, are correlated with the level of expression of 17 genes. Interestingly, the influence of haplotype sequence on gene expression is not homogenous across the MHC. We show that haplotype sequence polymorphisms within or proximate to HLA-A, HLA-C, C4A, and HLA-DRB regions exert haplotype-specific gene regulatory effects, whereas the expression of genes in other parts of the MHC region are not affected by the haplotype sequence. Overall, we demonstrate that MHC haplotype sequence diversity can impact phenotypic outcome via the alteration of transcriptional variability, indicating that a haplotype-based approach is fundamental for the assessment of trait associations in the MHC.
Project description:DNA variants in the tumor necrosis factor-? (TNF) and linked lymphotoxin-? genes, and specific alleles of the highly polymorphic human leukocyte antigen B (HLA-B) gene have been implicated in a plethora of immune and infectious diseases. However, the tight linkage disequilibrium characterizing the central region of the human major histocompatibility complex (MHC) containing these gene loci has made difficult the unequivocal interpretation of genetic association data. To alleviate these difficulties and facilitate the design of more focused follow-up studies, we investigated the structure and distribution of HLA-B-specific MHC haplotypes reconstructed in a European population from unphased genotypes at a set of 25 single nucleotide polymorphism sites spanning a 66-kilobase long region across TNF. Consistent with the published data, we found limited genetic diversity across the so-called TNF block, with the emergence of seven common MHC haplotypes, termed TNF block super-haplotypes. We also found that the ancestral haplotype 8.1 shares a TNF block haplotype with HLA-B*4402. HLA-B*5701, a known protective allele in HIV-1 pathogenesis, occurred in a unique TNF block haplotype.