Project description:We announce the availability of the 5.023-Mbp high-quality draft assembly of the Escherichia coli strain Nissle 1917 (serovar O6:K5:H1) genome. Short genomic segments from this important probiotic strain have been available in public databases, but the full genome sequence has remained inaccessible. Thus, high-coverage, whole genome sequencing of E. coli Nissle 1917 is presented herein. Reannotation and metabolic reconstruction will enable comparative genomics analysis and model-guided predictions of genetic manipulations leading to increased production of the K5 capsular polysaccharide known as N-acetyl heparosan, a precursor to the anticoagulant pharmaceutical heparin.
Project description:Large-scale bacterial genome sequencing efforts to date have provided limited information on the most prevalent category of disease: sporadically acquired infections caused by common pathogenic bacteria. Here, we performed whole-genome sequencing and de novo assembly of 312 blood- or urine-derived isolates of extraintestinal pathogenic (ExPEC) Escherichia coli, a common agent of sepsis and community-acquired urinary tract infections, obtained during the course of routine clinical care at a single institution. We find that ExPEC E. coli are highly genomically heterogeneous, consistent with pan-genome analyses encompassing the larger species. Investigation of differential virulence factor content and antibiotic resistance phenotypes reveals markedly different profiles among lineages and among strains infecting different body sites. We use high-resolution molecular epidemiology to explore the dynamics of infections at the level of individual patients, including identification of possible person-to-person transmission. Notably, a limited number of discrete lineages caused the majority of bloodstream infections, including one subclone (ST131-H30) responsible for 28% of bacteremic E. coli infections over a 3-yr period. We additionally use a microbial genome-wide-association study (GWAS) approach to identify individual genes responsible for antibiotic resistance, successfully recovering known genes but notably not identifying any novel factors. We anticipate that in the near future, whole-genome sequencing of microorganisms associated with clinical disease will become routine. Our study reveals what kind of information can be obtained from sequencing clinical isolates on a large scale, even well-characterized organisms such as E. coli, and provides insight into how this information might be utilized in a healthcare setting.
Project description:Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid.
Project description:Escherichia coli is a common pathogen recovered from cystitis infections. In this report, we announce the draft genome sequence of strain E2 isolated from the urine specimen from a female patient in South Brazil. The genome assembly has 5,081,209 bp, a G+C content of 50.57%, and virulence factors associated with both enteroaggregative and uropathogenic E. coli strains.
Project description:In total, 12 quinolone-resistant <i>Escherichia coli</i> (QREC) strains containing <i>qnrS1</i> were submitted to long-read sequencing using a FLO-MIN106 flow cell on a MinION device. The long reads were assembled with short reads (Illumina) and analyzed using the MOB-suite pipeline. Six of these QREC genome sequences were closed after hybrid assembly.
Project description:Escherichia coli C is a commonly used strain in the bioprocessing industry, but despite its utility, the publicly available sequence of the E. coli C genome has gaps and 4,180 ambiguous base calls. Here, we present an updated, high-quality, unambiguous genome sequence with no assembly gaps.
Project description:A draft genome sequence for Escherichia coli ATCC 29425 was investigated. The size of the genome was 4,608,319 bp, with an observed G+C content of 50.68%. This assembly consisted of 80 contigs, with an average coverage of 122.2×, including one contig representative of the complete genome for the temperate phage P1.
Project description:BACKGROUND:Classification of pathogenic Escherichia coli (E. coli) has traditionally relied on detecting specific virulence associated genes (VAGs) or combinations thereof. For E. coli isolated from faecal samples, the presence of specific genes associated with different intestinal pathogenic pathovars will determine their classification and further course of action. However, the E. coli genome is not a static entity, and hybrid strains are emerging that cross the pathovar definitions. Hybrid strains may show gene contents previously associated with several distinct pathovars making the correct diagnostic classification difficult. We extended the analysis of routinely submitted faecal isolates to include known virulence associated genes that are usually not examined in faecal isolates to detect the frequency of possible hybrid strains. METHODS:From September 2012 to February 2013, 168 faecal isolates of E. coli routinely submitted to the Norwegian Institute of Public Health (NIPH) from clinical microbiological laboratories throughout Norway were analysed for 33 VAGs using multiplex-PCR, including factors associated with extraintestinal pathogenic E. coli (ExPEC) strains. The strains were further typed by Multiple Locus Variable-Number Tandem-Repeat Analysis (MLVA), and the phylogenetic grouping was determined. One isolate from the study was selected for whole genome sequencing (WGS) with a combination of Oxford Nanopore's MinION and Illumina's MiSeq. RESULTS:The analysis showed a surprisingly high number of strains carrying ExPEC associated VAGs and strains carrying a combination of both intestinal pathogenic E. coli (IPEC) and ExPEC VAGs. In particular, 93.5% (101/108) of isolates classified as belonging to an IPEC pathovar additionally carried ExPEC VAGs. WGS analysis of a selected hybrid strain revealed that it could, with present classification criteria, be classified as belonging to all of the Enteropathogenic Escherichia coli (EPEC), Uropathogenic Escherichia coli (UPEC), Neonatal meningitis Escherichia coli (NMEC) and Avian pathogenic Escherichia coli (APEC) pathovars. CONCLUSION:Hybrid ExPEC/IPEC E. coli strains were found at a very high frequency in faecal samples and were in fact the predominant species present. A sequenced hybrid isolate was confirmed to be a cross-pathovar strain possessing recognised hallmarks of several pathovars, and a genome heavily influenced by horizontal gene transfer.
Project description:Escherichia coli EDL933 is the prototypic strain for enterohemorrhagic E. coli serotype O157:H7, associated with deadly food-borne outbreaks. Because the publicly available sequence of the EDL933 genome has gaps and >6,000 ambiguous base calls, we here present an updated high-quality, unambiguous genome sequence with no assembly gaps.