Genomics

Dataset Information

0

Renormalized GSE8191 by using the parametric method


ABSTRACT: Although principal component analysis is frequently used in multivariate/ analysis, it has disadvantages when applied to experimental or diagnostic data. First, the identified principal components have poor generality; since the size and directions of the components are dependent on the particular data set, the components are valid only within the set. Second, the method is sensitive to experimental noise and bias between sample groups, since it cannot reflect the design of experiments; rather, it estimates the same weight and independence of all the samples in the matrix. Third, the resulting components are often difficult to interpret. To address these issues, several options were introduced to the methodology. The resulting components were scaled to unify their size unit. Also, the principal axes were identified using training data sets and shared among experiments. This training data reflects the design of experiments, and its preparation allows noise to be reduced and group bias to be removed. The effects of these options were observed in microarray experiments, and showed an improvement in the separation of groups and robustness to noise. Additionally, unknown samples were appropriately classified using pre-arranged axes, and principal axes well reflected the characteristics of groups in the experiments. The summarized levels the genes are presented in the Matrix form.

ORGANISM(S): Mus musculus

PROVIDER: GSE31291 | GEO | 2011/08/12

SECONDARY ACCESSION(S): PRJNA154131

REPOSITORIES: GEO

Similar Datasets

2011-08-15 | GSE31375 | GEO
2011-08-12 | GSE31289 | GEO
2006-06-06 | GSE4982 | GEO
2011-06-09 | GSE23682 | GEO
| PRJEB41928 | ENA
2020-07-09 | GSE144604 | GEO
2011-05-01 | E-GEOD-27071 | biostudies-arrayexpress
2008-10-25 | E-GEOD-8342 | biostudies-arrayexpress
2011-05-01 | GSE27071 | GEO
| PRJNA145901 | ENA