Unknown

Dataset Information

0

Extraordinary expansion of a Sorangium cellulosum genome from an alkaline milieu.


ABSTRACT: Complex environmental conditions can significantly affect bacterial genome size by unknown mechanisms. The So0157-2 strain of Sorangium cellulosum is an alkaline-adaptive epothilone producer that grows across a wide pH range. Here, we show that the genome of this strain is 14,782,125 base pairs, 1.75-megabases larger than the largest bacterial genome from S. cellulosum reported previously. The total 11,599 coding sequences (CDSs) include massive duplications and horizontally transferred genes, regulated by lots of protein kinases, sigma factors and related transcriptional regulation co-factors, providing the So0157-2 strain abundant resources and flexibility for ecological adaptation. The comparative transcriptomics approach, which detected 90.7% of the total CDSs, not only demonstrates complex expression patterns under varying environmental conditions but also suggests an alkaline-improved pathway of the insertion and duplication, which has been genetically testified, in this strain. These results provide insights into and a paradigm for how environmental conditions can affect bacterial genome expansion.

SUBMITTER: Han K 

PROVIDER: S-EPMC3696898 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extraordinary expansion of a Sorangium cellulosum genome from an alkaline milieu.

Han Kui K   Li Zhi-feng ZF   Peng Ran R   Zhu Li-ping LP   Zhou Tao T   Wang Lu-guang LG   Li Shu-guang SG   Zhang Xiao-bo XB   Hu Wei W   Wu Zhi-hong ZH   Qin Nan N   Li Yue-zhong YZ  

Scientific reports 20130101


Complex environmental conditions can significantly affect bacterial genome size by unknown mechanisms. The So0157-2 strain of Sorangium cellulosum is an alkaline-adaptive epothilone producer that grows across a wide pH range. Here, we show that the genome of this strain is 14,782,125 base pairs, 1.75-megabases larger than the largest bacterial genome from S. cellulosum reported previously. The total 11,599 coding sequences (CDSs) include massive duplications and horizontally transferred genes, r  ...[more]

Similar Datasets

| PRJNA36817 | ENA
| S-EPMC6037015 | biostudies-literature
| S-EPMC2752531 | biostudies-literature
| PRJNA179502 | ENA