Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics [Cell lines]

ABSTRACT: Compositional statistics and random gene-sets were used to assign the tumor site of origin and histopathology of 18 epithelial ovarian cancer cell lines

ORGANISM(S): Homo sapiens

PROVIDER: GSE73637 | GEO | 2016/11/08

SECONDARY ACCESSION(S): PRJNA297467

REPOSITORIES: GEO

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Similar Datasets

Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics [Tumor]

Project description:An expert-pathologist-reviewed epithelial ovarian cancer reference library (n = 50) used to assign the histopathology of epithelial ovarian cell lines using compositional statistics and random gene-sets

2016-11-08 | GSE73551 | GEO

Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics

Project description:Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics

| PRJNA297464 | ENA

Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics

Project description:This SuperSeries is composed of the SubSeries listed below.

2016-11-08 | GSE73638 | GEO

Prior knowledge transfer across transcriptional data sets and technologies using compositional statistics yields new mislabelled ovarian cell line.

Project description:Here, we describe gene expression compositional assignment (GECA), a powerful, yet simple method based on compositional statistics that can validate the transfer of prior knowledge, such as gene lists, into independent data sets, platforms and technologies. Transcriptional profiling has been used to derive gene lists that stratify patients into prognostic molecular subgroups and assess biomarker performance in the pre-clinical setting. Archived public data sets are an invaluable resource for subsequent in silico validation, though their use can lead to data integration issues. We show that GECA can be used without the need for normalising expression levels between data sets and can outperform rank-based correlation methods. To validate GECA, we demonstrate its success in the cross-platform transfer of gene lists in different domains including: bladder cancer staging, tumour site of origin and mislabelled cell lines. We also show its effectiveness in transferring an epithelial ovarian cancer prognostic gene signature across technologies, from a microarray to a next-generation sequencing setting. In a final case study, we predict the tumour site of origin and histopathology of epithelial ovarian cancer cell lines. In particular, we identify and validate the commonly-used cell line OVCAR-5 as non-ovarian, being gastrointestinal in origin. GECA is available as an open-source R package.

| S-EPMC5041471 | biostudies-literature

Homo sapiens

Project description:Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics [Tumor]

| PRJNA297280 | ENA

Homo sapiens

Project description:Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics [Cell lines]

| PRJNA297467 | ENA

Compositional stability of sediment communities

Project description:Compositional stability of sediment microbial communities during a seagrass meadow decline

| PRJEB48899 | ENA

Model-based joint visualization of multiple compositional omics datasets.

Project description:The integration of multiple omics datasets measured on the same samples is a challenging task: data come from heterogeneous sources and vary in signal quality. In addition, some omics data are inherently compositional, e.g. sequence count data. Most integrative methods are limited in their ability to handle covariates, missing values, compositional structure and heteroscedasticity. In this article we introduce a flexible model-based approach to data integration to address these current limitations: COMBI. We combine concepts, such as compositional biplots and log-ratio link functions with latent variable models, and propose an attractive visualization through multiplots to improve interpretation. Using real data examples and simulations, we illustrate and compare our method with other data integration techniques. Our algorithm is available in the R-package combi.

| S-EPMC7671331 | biostudies-literature

Microbiome Datasets Are Compositional: And This Is Not Optional.

Project description:Datasets collected by high-throughput sequencing (HTS) of 16S rRNA gene amplimers, metagenomes or metatranscriptomes are commonplace and being used to study human disease states, ecological differences between sites, and the built environment. There is increasing awareness that microbiome datasets generated by HTS are compositional because they have an arbitrary total imposed by the instrument. However, many investigators are either unaware of this or assume specific properties of the compositional data. The purpose of this review is to alert investigators to the dangers inherent in ignoring the compositional nature of the data, and point out that HTS datasets derived from microbiome studies can and should be treated as compositions at all stages of analysis. We briefly introduce compositional data, illustrate the pathologies that occur when compositional data are analyzed inappropriately, and finally give guidance and point to resources and examples for the analysis of microbiome datasets using compositional data analysis.

| S-EPMC5695134 | biostudies-literature

Time-course gene expression profiles to understand compositional changes of the E. coli K-12 MG1655 transcriptiome during the transition from the exponential growth to the stationary phase

Project description:Time-course gene expression profiles to understand compositional changes of the E. coli K-12 MG1655 transcriptiome during the transition from the exponential growth to the stationary phase

2023-08-07 | GSE226643 | GEO

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data