ABSTRACT:
SUBMITTER: Yang Y
PROVIDER: S-EPMC10370184 | biostudies-literature | 2023 Jun
REPOSITORIES: biostudies-literature
Yiyan Yang, Keith Dufault-Thompson, Wei Yan, Tian Cai, Lei Xie, Xiaofang Jiang
bioRxiv: the preprint server for biology, 2023-06-16
Phage tailspike proteins are depolymerases that target diverse bacterial surface glycans with high specificity, determining the host-specificity of numerous phages. To address the challenge of identifying tailspike proteins due to their sequence diversity, we developed SpikeHunter, an approach based on the ESM-2 protein language model. Using SpikeHunter, we successfully identified 231,965 tailspike proteins from a dataset comprising 8,434,494 prophages found within 165,365 genomes of five common ...