Dataset Information


EST2Prot: mapping EST sequences to proteins.

ABSTRACT: BACKGROUND: EST libraries are used in various biological studies, from microarray experiments to proteomic and genetic screens. These libraries usually contain many uncharacterized ESTs that are typically ignored since they cannot be mapped to known genes. Consequently, new discoveries are possibly overlooked. RESULTS: We describe a system (EST2Prot) that uses multiple elements to map EST sequences to their corresponding protein products. EST2Prot uses UniGene clusters, substring analysis, information about protein coding regions in existing DNA sequences and protein database searches to detect protein products related to a query EST sequence. Gene Ontology terms, Swiss-Prot keywords, and protein similarity data are used to map the ESTs to functional descriptors. CONCLUSION: EST2Prot extends and significantly enriches the popular UniGene mapping by utilizing multiple relations between known biological entities. It produces a mapping between ESTs and proteins in real-time through a simple web-interface. The system is part of the Biozon database and is accessible at http://biozon.org/tools/est/.


PROVIDER: S-EPMC1456965 | BioStudies | 2006-01-01T00:00:00Z

REPOSITORIES: biostudies

Similar Datasets

2009-01-01 | S-EPMC2703627 | BioStudies
1000-01-01 | S-EPMC2447738 | BioStudies
2012-01-01 | S-EPMC3297614 | BioStudies
2015-01-01 | S-EPMC4557752 | BioStudies
2003-01-01 | S-EPMC149192 | BioStudies
2007-01-01 | S-EPMC1924502 | BioStudies
2010-01-01 | S-EPMC3091719 | BioStudies
1000-01-01 | S-EPMC2722630 | BioStudies
2007-01-01 | S-EPMC1994688 | BioStudies
2009-01-01 | S-EPMC2697633 | BioStudies