Dataset Information

The suitability of statistical tests according to characteristics in variation of expression microarray data

ABSTRACT: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at earlier steps of the analyses, and statistical tests are frequently used for this purpose. Since a microarray measures a huge number of genes and the results are processed simultaneously, concerns regarding problems of multiplicity have been raised to the tests. To deal with these problems, several methodologies have been proposed, making the tests very conservative. Indeed, arbitrary tuning of the test threshold has also been introduced to relax the test conditions. However, the appropriateness of the multiplicity problems as well as the compensation methods has not been confirmed. The appropriateness of the compensation methods was checked by means of coincidence of the premises of the methodologies with the observed characteristics found in real data of two typical platforms of microarray analysis. Normality was observed in within-group data variations, supporting applications of parametric tests. However, genes displayed their own tendencies in the magnitude of variations, and the distributions of P-values were rather complex and varied; these characteristics are inconsistent with the premises of the compensation methodologies. Additionally, the appropriateness of the proposed multiplicities is reconsidered. When we observed differences in the transcriptome, the family-wise error rate should not be considered, since analyses at higher levels would not be influenced by a few false positives among the huge numbers of true information. Likely, concerns for a false discovery rate are not suitable for the point null hypotheses on expression levels, since the rate of true null hypotheses should be rare in contradiction to the premise of the methodology. Although compensation methods have been recommended to deal with the problem of multiplicity, the compensations are actually inappropriate for many of the applications of transcriptome analyses. Compensations are not only unnecessary, but will increase the occurrence of false negative errors, and arbitrary adjustment of the threshold damages the objectivity of the tests. Rather, the results of parametric tests should be evaluated directly. This SuperSeries is composed of the SubSeries listed below.

ORGANISM(S): Mus musculus

PROVIDER: GSE25410 | GEO | 2010/11/17

SECONDARY ACCESSION(S): PRJNA134153

REPOSITORIES: GEO

ACCESS DATA

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Dataset Information

The suitability of statistical tests according to characteristics in variation of expression microarray data

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Reframing gene essentiality in terms of adaptive flexibility
2018-07-19 | GSE117303 | GEO

Differentially expressed non-coding RNAs in mouse islets during pregnancy
2017-06-30 | GSE100645 | GEO

Chromosomal duplications increase gene dosage and mRNA level in C. elegans
2022-03-22 | GSE198682 | GEO

Establishment of objective criteria for selecting relevant intestinal cell-based models
2012-06-24 | GSE30292 | GEO

Comparative proteomic of Trypanosoma cruzi strains
2023-07-12 | PXD032748 | Pride

A simple method for generating high-resolution maps of genome wide protein binding
2015-06-12 | E-GEOD-67454 | biostudies-arrayexpress

Expression data from head and neck squamous cell carcinoma
2007-01-20 | GSE6631 | GEO

Resolving systematic errors in widely-used enhancer activity assays in human cells enables genome-wide functional enhancer characterization.
2017-07-18 | GSE100432 | GEO