Genomics

Dataset Information

0

Improved prediction of endogenous HLA-associated epitopes based on mono-allelic mass spectrometry profiling


ABSTRACT: LC-MS/MS-based identification of HLA-peptides is poised to provide a deep understanding of the rules underlying antigen presentation. However, a key obstacle limiting the utility of MS data is the ambiguity arising from the co-expression of multiple HLA alleles. Here, we introduce a strategy for profiling the HLA ligandome one allele at a time. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing a novel spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of novel binding motifs, and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a scalable strategy for systematically learning the rules of endogenous antigen presentation.

ORGANISM(S): Homo sapiens

PROVIDER: GSE93315 | GEO | 2017/02/21

SECONDARY ACCESSION(S): PRJNA360601

REPOSITORIES: GEO

Similar Datasets

2017-02-01 | MSV000080527 | MassIVE
2019-12-16 | GSE131267 | GEO
2019-08-06 | MSV000084172 | MassIVE
2019-10-09 | MSV000084442 | MassIVE
2008-09-03 | E-GEOD-12606 | biostudies-arrayexpress
2021-12-03 | PXD028874 | Pride
2008-09-03 | GSE12606 | GEO
2021-05-29 | PXD024412 | Pride
2021-06-15 | PXD023064 | Pride
2024-01-26 | PXD045796 | Pride