Genomics

Dataset Information

0

Novel Transcriptional Activity and Extensive Allelic Imbalance in the Human MHC Region


ABSTRACT: The MHC region encodes HLA genes and is the most complex region in the human genome. The extensive polymorphic nature of the HLA hinders accurate localization and functional assessment of disease risk loci within this region. Using targeted capture sequencing and constructing individualized genomes for transcriptome alignment, we identified 908 novel transcripts within the human MHC region. These include 593 novel isoforms of known genes, 137 antisense strand RNAs, 119 novel long intergenic noncoding RNAs, and 5 transcripts of 3 novel putative protein-coding human endogenous retrovirus genes. We revealed allele-dependent expression imbalance involving 88% of all heterozygous transcribed single nucleotide polymorphisms throughout the MHC transcriptome. Among these variants, we show that the genetic variant associated with Behc ̧et’s disease in the HLA-B/MICA region, which tags HLA-B*51, is within novel long intergenic noncoding RNA transcripts that are exclusively expressed from the haplotype with the protective but not the disease risk allele. Further, we showed that the transcriptome within the MHC region can be defined by 14 distinct coexpression clusters, with evidence of coregulation by unique transcription factors in at least 9 of these clusters. Our data suggest a very complex regulatory map of the human MHC, and can help uncover functional consequences of disease risk loci in this region.

ORGANISM(S): Homo sapiens

PROVIDER: GSE108663 | GEO | 2018/05/02

REPOSITORIES: GEO

Similar Datasets

2023-11-28 | GSE235106 | GEO
2018-09-30 | GSE119367 | GEO
2016-02-25 | E-GEOD-78246 | biostudies-arrayexpress
2015-10-15 | GSE65424 | GEO
2016-02-25 | GSE78246 | GEO
2020-05-12 | PXD013892 | Pride
| PRJEB21619 | ENA
2020-12-22 | GSE163605 | GEO
2016-02-01 | E-GEOD-65726 | biostudies-arrayexpress
2007-04-06 | E-CBIL-27 | biostudies-arrayexpress