<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Hudson D</submitter><funding>Rosalind Franklin Institute</funding><funding>National Institute for Health and Care Research</funding><funding>Wellcome Trust</funding><funding>Biotechnology and Biological Sciences Research Council</funding><funding>MRC</funding><pagination>None</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC10955519</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>13</volume><pubmed_abstract>The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.</pubmed_abstract><journal>Immunoinformatics (Amsterdam, Netherlands)</journal><pubmed_title>A comparison of clustering models for inference of T cell receptor antigen specificity.</pubmed_title><pmcid>PMC10955519</pmcid><funding_grant_id>203141_Z_16_Z</funding_grant_id><funding_grant_id>MC_UU_12010/3</funding_grant_id><funding_grant_id>203141</funding_grant_id><funding_grant_id>BB_T008784_1</funding_grant_id><pubmed_authors>Basham M</pubmed_authors><pubmed_authors>Hudson D</pubmed_authors><pubmed_authors>Lubbock A</pubmed_authors><pubmed_authors>Koohy H</pubmed_authors></additional><is_claimable>false</is_claimable><name>A comparison of clustering models for inference of T cell receptor antigen specificity.</name><description>The vast potential sequence diversity of TCRs and their ligands has presented an historic barrier to computational prediction of TCR epitope specificity, a holy grail of quantitative immunology. One common approach is to cluster sequences together, on the assumption that similar receptors bind similar epitopes. Here, we provide the first independent evaluation of widely used clustering algorithms for TCR specificity inference, observing some variability in predictive performance between models, and marked differences in scalability. Despite these differences, we find that different algorithms produce clusters with high degrees of similarity for receptors recognising the same epitope. Our analysis strengthens the case for use of clustering models to identify signals of common specificity from large repertoires, whilst highlighting scope for improvement of complex models over simple comparators.</description><dates><release>2024-01-01T00:00:00Z</release><publication>2024 Mar</publication><modification>2024-11-14T21:30:02.905Z</modification><creation>2024-11-14T21:30:02.905Z</creation></dates><accession>S-EPMC10955519</accession><cross_references><pubmed>38525047</pubmed><doi>10.1016/j.immuno.2024.100033</doi></cross_references></HashMap>