Unknown

Dataset Information

0

A roadmap for the functional annotation of protein families: a community perspective.


ABSTRACT: Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3-4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.

SUBMITTER: de Crecy-Lagard V 

PROVIDER: S-EPMC9374478 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A roadmap for the functional annotation of protein families: a community perspective.

de Crécy-Lagard Valérie V   Amorin de Hegedus Rocio R   Arighi Cecilia C   Babor Jill J   Bateman Alex A   Blaby Ian I   Blaby Ian I   Blaby-Haas Crysten C   Bridge Alan J AJ   Burley Stephen K SK   Cleveland Stacey S   Colwell Lucy J LJ   Conesa Ana A   Dallago Christian C   Danchin Antoine A   de Waard Anita A   Deutschbauer Adam A   Dias Raquel R   Ding Yousong Y   Fang Gang G   Friedberg Iddo I   Gerlt John J   Goldford Joshua J   Gorelik Mark M   Gyori Benjamin M BM   Henry Christopher C   Hutinet Geoffrey G   Jaroch Marshall M   Karp Peter D PD   Kondratova Liudmyla L   Lu Zhiyong Z   Marchler-Bauer Aron A   Martin Maria-Jesus MJ   McWhite Claire C   Moghe Gaurav D GD   Monaghan Paul P   Morgat Anne A   Mungall Christopher J CJ   Natale Darren A DA   Nelson William C WC   O'Donoghue Seán S   Orengo Christine C   O'Toole Katherine H KH   Radivojac Predrag P   Reed Colbie C   Roberts Richard J RJ   Rodionov Dmitri D   Rodionova Irina A IA   Rudolf Jeffrey D JD   Saleh Lana L   Sheynkman Gloria G   Thibaud-Nissen Francoise F   Thomas Paul D PD   Uetz Peter P   Vallenet David D   Carter Erica Watson EW   Weigele Peter R PR   Wood Valerie V   Wood-Charlson Elisha M EM   Xu Jin J  

Database : the journal of biological databases and curation 20220801


Over the last 25 years, biology has entered the genomic era and is becoming a science of 'big data'. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biologi  ...[more]

Similar Datasets

| S-EPMC540069 | biostudies-literature
| S-EPMC1891723 | biostudies-literature
| S-EPMC3243672 | biostudies-literature
| S-EPMC3205580 | biostudies-literature
| S-EPMC6064168 | biostudies-literature
| S-EPMC6166095 | biostudies-other
| S-EPMC6978412 | biostudies-literature
| S-EPMC4194055 | biostudies-other
| S-EPMC3598525 | biostudies-other
| S-EPMC6025185 | biostudies-literature