Dataset Information

Phylogenomic analysis of the GIY-YIG nuclease superfamily.


ABSTRACT: The GIY-YIG domain was initially identified in homing endonucleases and later in other selfish mobile genetic elements (including restriction enzymes and non-LTR retrotransposons) and in enzymes involved in DNA repair and recombination. However, to date no systematic search for novel members of the GIY-YIG superfamily or comparative analysis of these enzymes has been reported. We carried out database searches to identify all members of known GIY-YIG nuclease families. Multiple sequence alignments together with predicted secondary structures of identified families were represented as Hidden Markov Models (HMM) and compared by the HHsearch method to the uncharacterized protein families gathered in the COG, KOG, and PFAM databases. This analysis allowed for extending the GIY-YIG superfamily to include members of COG3680 and a number of proteins not classified in COGs and to predict that these proteins may function as nucleases, potentially involved in DNA recombination and/or repair. Finally, all old and new members of the GIY-YIG superfamily were compared and analyzed to infer the phylogenetic tree. An evolutionary classification of the GIY-YIG superfamily is presented for the very first time, along with the structural annotation of all (sub)families. It provides a comprehensive picture of sequence-structure-function relationships in this superfamily of nucleases, which will help to design experiments to study the mechanism of action of known members (especially the uncharacterized ones) and will facilitate the prediction of function for the newly discovered ones.

SUBMITTER: Dunin-Horkawicz S 

PROVIDER: S-EPMC1564403 | BioStudies | 2006-01-01

REPOSITORIES: biostudies

Similar Datasets

2011-01-01 | S-EPMC3045582 | BioStudies
2010-01-01 | S-EPMC2955809 | BioStudies
2008-01-01 | S-EPMC2630997 | BioStudies
2013-01-01 | S-EPMC3664794 | BioStudies
2005-01-01 | S-EPMC1189080 | BioStudies
2011-01-01 | S-EPMC3161791 | BioStudies
1000-01-01 | S-EPMC2761285 | BioStudies
2012-01-01 | S-EPMC4335191 | BioStudies
2008-01-01 | S-EPMC2519379 | BioStudies
2006-01-01 | S-EPMC1421500 | BioStudies