Ontology highlight
ABSTRACT:
SUBMITTER: Capel H
PROVIDER: S-EPMC9512797 | biostudies-literature | 2022 Sep
REPOSITORIES: biostudies-literature
Capel Henriette H Weiler Robin R Dijkstra Maurits M Vleugels Reinier R Bloem Peter P Feenstra K Anton KA
Scientific reports 20220926 1
Self-supervised language modeling is a rapidly developing approach for the analysis of protein sequence data. However, work in this area is heterogeneous and diverse, making comparison of models and methods difficult. Moreover, models are often evaluated only on one or two downstream tasks, making it unclear whether the models capture generally useful properties. We introduce the ProteinGLUE benchmark for the evaluation of protein representations: a set of seven per-amino-acid tasks for evaluati ...[more]