Dataset Information

CGH analysis of 39 Lactococcus lactis strains.


ABSTRACT: Pangenome arrays contain DNA oligomers targeting several sequenced reference genomes from the same species. In microbiology, these can be employed to investigate the often high genetic variability within a species by comparative genome hybridization (CGH). The biological interpretation of pangenome CGH data depends on the ability to compare strains at a functional level, particularly by comparing the presence or absence of orthologous genes. Because of this high genetic variability, available genotype-calling algorithms cannot be applied to pangenome CGH data. Therefore, we developed the algorithm PanCGH, which incorporates orthology information about genes to predict the presence or absence of orthologous genes in a query organism using CGH arrays that target the genomes of sequenced representatives of a group of microorganisms. PanCGH was tested and applied in the analysis of genetic diversity among 39 Lactococcus lactis strains from three different subspecies (lactis, cremoris, hordniae) isolated from two different niches (dairy and plant). Clustering of these strains using the presence/absence data of gene orthologs revealed a clear separation between different subspecies and reflected the niche of the strains.

Keywords: CGH, CGH analysis, orthology, Lactococcus lactis

We analyzed 39 CGH arrays, each hybridized with a different strain of L. lactis.

ORGANISM(S): Lactococcus lactis subsp. cremoris

SUBMITTER: Douwe Molenaar 

PROVIDER: E-GEOD-12638 | biostudies-arrayexpress

REPOSITORIES: biostudies-arrayexpress

Publications

PanCGH: a genotype-calling algorithm for pangenome CGH data.

Bayjanov JR, Wels M, Starrenburg M, van Hylckama Vlieg JE, Siezen RJ, Molenaar D

Bioinformatics (Oxford, England), 2009-01-07, issue 3


Motivation: Pangenome arrays contain DNA oligomers targeting several sequenced reference genomes from the same species. In microbiology, these can be employed to investigate the often high genetic variability within a species by comparative genome hybridization (CGH). The biological interpretation of pangenome CGH data depends on the ability to compare strains at a functional level, particularly by comparing the presence or absence of orthologous genes. Due to the high genetic variability ...

Similar Datasets

2008-12-31 | GSE12638 | GEO
2012-09-07 | E-GEOD-24015 | biostudies-arrayexpress
| PRJNA112819 | ENA
2021-11-02 | PXD028721 | Pride
2016-01-12 | E-GEOD-76764 | biostudies-arrayexpress
2012-09-07 | E-GEOD-23990 | biostudies-arrayexpress
2012-09-12 | E-GEOD-40780 | biostudies-arrayexpress
2012-09-07 | E-GEOD-23987 | biostudies-arrayexpress
2010-03-23 | E-GEOD-19005 | biostudies-arrayexpress
2017-06-29 | PXD006551 | Pride