Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

A large peptidome dataset improves HLA class I epitope prediction across most of the human population

ABSTRACT: Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.

ORGANISM(S): Homo sapiens

PROVIDER: GSE131267 | GEO | 2019/12/16

REPOSITORIES: GEO

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Similar Datasets

Mono-allelic datasets for: A large peptidome dataset improves HLA class I epitope prediction across most of the human population

Project description:Sarkizova S, Klaeger S, Le PM, Li LW, Oliveira G, Keshishian H, Hartigan CH, Zhang W, Braun DA, Ligon KL, Bachireddy P, Zervantonakis IK, Rosenbluth JM, Ouspenskaia T, Law T, Justeson S, Stevens J, Lane WJ, Eisenhaure T, Zhang GL, Clauser KR, Hacohen N, Carr SA, Wu CJ, Keskin DB. Nature Biotechnology 2019. Prediction of HLA epitopes is important for the development of cancer immunotherapies and vaccines. However, current prediction algorithms have limited predictive power, in part because they were not trained on high-quality epitope datasets covering a broad range of HLA alleles. To enable prediction of endogenous HLA class I-associated peptides across a large fraction of the human population, we used mass spectrometry to profile >185,000 peptides eluted from 95 HLA-A, -B, -C and -G mono-allelic cell lines. We identified canonical peptide motifs per HLA allele, unique and shared binding submotifs across alleles and distinct motifs associated with different peptide lengths. By integrating these data with transcript abundance and peptide processing, we developed HLAthena, providing allele-and-length-specific and pan-allele-pan-length prediction models for endogenous peptide presentation. These models predicted endogenous HLA class I-associated ligands with 1.5-fold improvement in positive predictive value compared with existing tools and correctly identified >75% of HLA-bound peptides that were observed experimentally in 11 patient-derived tumor cell lines.

2019-08-06 | MSV000084172 | MassIVE

Patient datasets for: A large peptidome dataset improves HLA class I epitope prediction across most of the human population

2019-10-09 | MSV000084442 | MassIVE

Improved prediction of endogenous HLA-associated epitopes based on mono-allelic mass spectrometry profiling

Project description:LC-MS/MS-based identification of HLA-peptides is poised to provide a deep understanding of the rules underlying antigen presentation. However, a key obstacle limiting the utility of MS data is the ambiguity arising from the co-expression of multiple HLA alleles. Here, we introduce a strategy for profiling the HLA ligandome one allele at a time. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing a novel spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of novel binding motifs, and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a scalable strategy for systematically learning the rules of endogenous antigen presentation.

2017-02-21 | GSE93315 | GEO

Unsupervised mining of HLA-I peptidomes reveals unsuspected false positives and new binding motifs

Project description:Modern antigen vaccine designs and studies of human leukocyte antigen (HLA)-mediated immune responses rely heavily on the knowledge of HLA allele-specific binding motifs and computational prediction of antigen-HLA binding affinity. Breakthroughs in HLA peptidomics have considerably expanded the databases of natural HLA antigens and enabled detailed characterizations of antigen-HLA binding specificity. However, cautions must be made when analyzing HLA peptidomics data because identified peptides may be contaminants or may weakly bind to the HLA molecules. Here, a hybrid de novo peptide sequencing approach was applied to large-scale mono-allelic HLA peptidomics datasets to uncover new antigens and refine current knowledge of HLA binding motifs. Up to 12-40% contaminations in the form of tryptic peptides were identified in the peptidomics data of HLA alleles whose binding motifs do not involve an arginine or a lysine at the C-terminus. Thousands of these peptides were reported in a community database as positive antigens and might be erroneously used to train prediction models. Furthermore, unsupervised clustering of identified antigens not only revealed additional binding motifs for several HLA class I alleles but also effectively isolated outliers which were confirmed to be false positives in a binding experiment. Overall, our findings expanded the knowledge of HLA binding specificity and indicated that a more careful HLA peptidomics data interpretation protocol is needed to ensure the high quality of community antigen databases.

2021-09-17 | PXD028088 | Pride

the landscape of phosphorylated HLA-I ligands

Project description:The identification and prediction of HLA-I–peptide interactions play an important role in our understanding of antigen recognition in infected or malignant cells. In cancer, non-self HLA-I ligands can arise from many different alterations, including non-synonymous mutations, gene fusion, cancer-specific alternative mRNA splicing or aberrant post-translational modifications. In this study, we collected in-depth phosphorylated HLA-I peptidomics data (1,920 unique phosphorylated peptides) from several studies covering 67 HLA-I alleles and expanded our motif deconvolution tool to identify precise binding motifs of phosphorylated HLA-I ligands for several alleles. In addition to the previously observed preferences for phosphorylation at P4, for proline next to the phosphosite and for arginine at P1, we could detect a clear enrichment of phosphorylated peptides among HLA-C ligands and among longer peptides. Binding assays were used to validate and interpret these observations. We then used these data to develop the first predictor of HLA-I– phosphorylated peptide interactions and demonstrated that combining phosphorylated and unmodified HLA-I ligands in the training of the predictor led to highest accuracy.

2019-12-18 | PXD013831 | Pride

Sensitive direct detection of cancer antigens enabled by user-defined peptide libraries

Project description:Data dependent mass spectrometry (MS) is routinely used to identify HLA-bound peptides but it can have limitations in sensitivity and reproducibility. We introduce Pepyrus, a method that uses E. coli to generate large-scale user-defined peptide libraries that can be utilized to improve the confidence in identification of HLA-bound peptides, including lowly abundant neoantigens. Using a Pepyrus peptide library paired with an HLA-specific data independent acquisition (DIA) MS strategy, we recovered >75% of the expected sequences per single injection for libraries of >10,000 peptides. Pepyrus peptide libraries also enabled the identification of 0.1 fmol spiked-in peptides in a complex background. Application of Pepyrus to create personalized peptide libraries facilitated the identification of clinically relevant HLA antigens from melanoma and renal cell carcinoma patient derived cell lines, several of which were previously undetected. Pepyrus customization enables rapid creation of patient- or disease-specific peptide libraries facilitating the confident identification of rare HLA peptides from immunopeptidomics data and the generation of large training datasets to improve spectrum, retention time, and IM prediction tools.

2025-11-20 | GSE309497 | GEO

Mass spectrometry profiling of HLA-associated peptidomes in mono-allelic cells enables more accurate epitope prediction

Project description:Abelin JG, Keskin DB, Sarkizova S, Hartigan CR, Zhang W, Sidney J, Stevens J, Lane W, Zhang GL, Eisenhaure T, Clauser KR, Hacohen N, Rooney MS, Carr SA, and Wu, CJ. Immunity, 2017. Identification of human leukocyte antigen (HLA)-bound peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS) is poised to provide a deep understanding of rules underlying antigen presentation. However, a key obstacle is the ambiguity that arises from the co-expression of multiple HLA alleles. Here, we have implemented a scalable mono-allelic strategy for profiling the HLA-peptidome. By using cell lines expressing a single HLA allele, optimizing immunopurifications, and developing an application-specific spectral search algorithm, we identified thousands of peptides bound to 16 different HLA class I alleles. These data enabled the discovery of subdominant binding motifs and an integrative analysis quantifying the contribution of factors critical to epitope presentation, such as protein cleavage and gene expression. We trained neural network prediction algorithms with our large dataset (>24,000 peptides) and outperformed algorithms trained on datasets of peptides with measured affinities. We thus demonstrate a strategy for systematically learning the rules of endogenous antigen presentation.

2017-02-01 | MSV000080527 | MassIVE

Secreted HLA Fc-Fusion Profiles Immunopeptidome in Hypoxic PDAC and Cellular Senescence

Project description:Here, we describe a secreted HLA (sHLA) Fc-fusion construct for simple single HLA allele profiling in hypoxic pancreatic ductal adenocarcinoma (PDAC) and cellular senescence. This method streamlines sample preparation, enables temporal control, and provides allele-restricted target identification. Over 30,000 unique HLA-associated peptides were identified across two different HLA alleles and seven cell lines, with ~9,300 peptides newly discovered. The sHLA Fc-fusion capture technology holds potential to expedite immunopeptidomics and advance therapeutic interest in peptide-HLA complexes.

2024-01-26 | PXD045796 | Pride

HLA-DRB1 allelic combinations differentially shape dendritic cell antigen presentation enhanced by tumor cell line lysate-pulsing

Project description:The study explores the anti-tumor immune response through the analysis of the HLA-II immunopeptidome of dendritic cells (DCs) from HLA-heterozygous donors pulsed with a protein extract from the MCF-7 tumor cell line. The objective was to characterize how different HLA-DRB1 allele combinations influence peptide presentation and how DC pulsing affects the immunopeptidome profile. Our findings demonstrate that HLA-DR heterozygosity significantly shapes the peptide repertoire in an allele-specific manner, with allele dominance modulated by specific allelic combinations. Notably, tumor-derived peptides such as annexin A2, galectin-1, and elongation factor 1-alpha 2 were identified in association with particular HLA-DRB1 alleles. These insights provide valuable information for the design of personalized immunotherapies based on peptide-pulsed DCs aimed at enhancing CD4+ tumor-infiltrating lymphocyte responses.

2026-02-09 | PXD064951 | Pride

HLA-DRB1 allelic combinations differentially shape dendritic cell antigen presentation enhanced by tumor cell line lysate-pulsing

2026-02-09 | PXD064999 | Pride