Dataset Information


Long W tracts are over-represented in the Escherichia coli and Haemophilus influenzae genomes.

ABSTRACT: The occurrence of DNA tracts of the three binary base combinations: R.Y, K.M and W.S has been mapped in the complete genomes of Haemophilus influenzae and Escherichia coli. A highly significant over-representation of W tracts is observed in both bacteria. The excess of W tracts is particularly striking in the 10% intercoding regions. Subdivision of intercoding regions into divergent (promoting), convergent (terminating) and sequential subregions shows that the excess of W tracts is most concentrated in the promoter regions. A particularly high excess of W tracts is observed in the first 200 bases 5' upstream of coding start sites. The data suggest that W tracts have a role in promoter function. A function as unwinding centers, analogous to the role of R.Y tracts in eukaryotes, is proposed. R.Y and K.M tracts are only modestly over-represented in the two bacteria.
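
As an illustration of what mapping a tract involves, the following is a minimal Python sketch that locates maximal runs of a single binary base class (for example W, meaning A or T) on one strand. It is not the authors' method: the function name find_tracts, the BINARY_CLASSES table and the min_len threshold are hypothetical choices made for this example, and the abstract does not state the tract length cutoff that was actually used.

```python
import re

# IUPAC two-base ambiguity codes: each class covers two of the four bases.
BINARY_CLASSES = {
    "W": "AT", "S": "CG",
    "R": "AG", "Y": "CT",
    "K": "GT", "M": "AC",
}

def find_tracts(seq, cls, min_len=10):
    """Return (start, length) for each maximal run of bases in class `cls`.

    `min_len` is an assumed cutoff for illustration only; positions are
    0-based offsets into `seq`.
    """
    bases = BINARY_CLASSES[cls]
    # A character class repeated at least min_len times; greedy matching
    # makes every reported run maximal.
    pattern = re.compile("[%s]{%d,}" % (bases, min_len))
    return [(m.start(), m.end() - m.start())
            for m in pattern.finditer(seq.upper())]

# Toy sequence with one A/T-only run flanked by G/C bases.
example = "GCGCAATTATATAAGCGC"
w_tracts = find_tracts(example, "W", min_len=5)
```

A run of W on one strand is also a run of W on the complementary strand, so a single-strand scan is enough for W tracts. For R.Y tracts, a run of Y on the scanned strand corresponds to a run of R on the complement, so both classes would need to be counted.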


PROVIDER: S-EPMC148734 | BioStudies | 1999-01-01T00:00:00Z


REPOSITORIES: biostudies

Similar Datasets

2003-01-01 | S-EPMC169031 | BioStudies
1000-01-01 | S-EPMC407849 | BioStudies
2008-01-01 | S-EPMC2567471 | BioStudies
2004-01-01 | S-EPMC400628 | BioStudies
2011-01-01 | S-EPMC3201770 | BioStudies
2009-01-01 | S-EPMC2673466 | BioStudies
2018-01-01 | S-EPMC5780452 | BioStudies
2020-01-01 | S-EPMC6939312 | BioStudies
2014-01-01 | S-EPMC4212969 | BioStudies
2019-01-01 | S-EPMC6820245 | BioStudies