
Dataset Information


Systematic overrepresentation of DNA termini and underrepresentation of subterminal regions among sequencing templates prepared from hydrodynamically sheared linear DNA molecules.


ABSTRACT:

Background

Analysis of fungal genome sequence assemblies reveals that telomeres are poorly represented even though telomeric reads tend to be superabundant. We surmised that the problem might lie in the DNA shearing conditions used to create clone libraries for genome sequencing.

Results

A shotgun strategy was used to sequence and assemble circular and linear cosmid DNAs sheared using conditions typical for a genome project. The DNA sheared in circular form assembled into a single sequence contig. However, the linearized cosmid produced an incomplete assembly because the two DNA termini, though greatly overrepresented in the clone library used for sequencing, were separated from neighboring sequences by gaps of approximately 1.4 and 1.8 kb. These gap sizes were reduced, but not eliminated, by shearing the linear cosmid into smaller fragments. Mapping of shearing breakpoints revealed a paucity of breaks in the subterminal regions of the linearized cosmid and also near chromosome ends of the fungus Neurospora crassa.

Conclusion

Together, our data indicate that the ends of linear DNA molecules are recalcitrant to hydrodynamic shearing. We propose that this causes DNA termini to be overrepresented in the resulting fragment population but ultimately prevents their incorporation into sequence assemblies.
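The breakpoint mapping described in the Results can be illustrated with a minimal sketch: bin the positions of fragment ends along a linear molecule and flag bins with unusually few ends. The function names, the bin size, the 0.5 threshold, and the toy positions below are all assumptions chosen for illustration. They are not taken from the study's data or methods.

```python
def end_histogram(end_positions, molecule_length, bin_size):
    """Count fragment-end positions per fixed-width bin along a linear molecule."""
    n_bins = -(-molecule_length // bin_size)  # ceiling division
    counts = [0] * n_bins
    for pos in end_positions:
        if not 0 <= pos < molecule_length:
            raise ValueError(f"position {pos} lies outside the molecule")
        counts[pos // bin_size] += 1
    return counts


def depleted_bins(counts, fraction=0.5):
    """Return indices of bins whose count is below `fraction` of the mean count."""
    mean = sum(counts) / len(counts)
    return [i for i, c in enumerate(counts) if c < fraction * mean]


# Hypothetical toy data for a 10 kb linear molecule (not real measurements):
# many fragment ends coincide with the two termini, none fall in the
# subterminal bins, and the interior is evenly covered.
toy_ends = (
    [0] * 5
    + [9999] * 5
    + [2100, 2600, 3300, 3900, 4200, 4800, 5100, 5700, 6400, 6900, 7300, 7800]
)

counts = end_histogram(toy_ends, molecule_length=10_000, bin_size=1_000)
print(counts)
print(depleted_bins(counts))
```

In this toy input the terminal bins hold the most ends while the bins immediately inside them are flagged as depleted, which is the qualitative pattern the abstract reports. Real data would need a statistical test rather than a fixed threshold.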

SUBMITTER: Schwartz SL 

PROVIDER: S-EPMC2824731 | biostudies-literature | 2010 Feb

REPOSITORIES: biostudies-literature


Publications

Systematic overrepresentation of DNA termini and underrepresentation of subterminal regions among sequencing templates prepared from hydrodynamically sheared linear DNA molecules.

Schwartz SL, Farman ML

BMC Genomics, 2010-02-02



Similar Datasets

| S-EPMC10368619 | biostudies-literature
| S-EPMC1909752 | biostudies-literature
| S-EPMC2764453 | biostudies-literature
2024-09-09 | GSE233599 | GEO
| S-EPMC8989226 | biostudies-literature
| S-EPMC7011048 | biostudies-literature
| S-EPMC6251336 | biostudies-literature
| S-EPMC3538198 | biostudies-literature
2024-09-09 | GSE270850 | GEO