Ontology highlight
ABSTRACT:
SUBMITTER: Flamholz ZN
PROVIDER: S-EPMC11311208 | biostudies-literature | 2024 Feb
REPOSITORIES: biostudies-literature
Flamholz Zachary N ZN Biller Steven J SJ Kelly Libusha L
Nature microbiology 20240129 2
Viral genomes are poorly annotated in metagenomic samples, representing an obstacle to understanding viral diversity and function. Current annotation approaches rely on alignment-based sequence homology methods, which are limited by the paucity of characterized viral proteins and divergence among viral sequences. Here we show that protein language models can capture prokaryotic viral protein function, enabling new portions of viral sequence space to be assigned biologically meaningful labels. Wh ...[more]