Unknown

Dataset Information

0

Sequential projection pursuit principal component analysis--dealing with missing data associated with new -omics technologies.


ABSTRACT: Principal Component Analysis (PCA) is a common exploratory tool used to evaluate large complex data sets. The resulting lower-dimensional representations are often valuable for pattern visualization, clustering, or classification of the data. However, PCA cannot be applied directly to many -omics data sets generated by newer technologies such as label-free mass spectrometry due to large numbers of non-random missing values. Here we present a sequential projection pursuit PCA (sppPCA) method for defining principal components in the presence of missing data. Our results demonstrate that this approach generates robust and informative low-dimensional data representations compared to commonly used imputation approaches.

SUBMITTER: Webb-Robertson BJ 

PROVIDER: S-EPMC6191041 | biostudies-literature | 2013 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sequential projection pursuit principal component analysis--dealing with missing data associated with new -omics technologies.

Webb-Robertson Bobbie-Jo M BJ   Matzke Melissa M MM   Metz Thomas O TO   McDermott Jason E JE   Walker Hyunjoo H   Rodland Karin D KD   Pounds Joel G JG   Waters Katrina M KM  

BioTechniques 20130301 3


Principal Component Analysis (PCA) is a common exploratory tool used to evaluate large complex data sets. The resulting lower-dimensional representations are often valuable for pattern visualization, clustering, or classification of the data. However, PCA cannot be applied directly to many -omics data sets generated by newer technologies such as label-free mass spectrometry due to large numbers of non-random missing values. Here we present a sequential projection pursuit PCA (sppPCA) method for  ...[more]

Similar Datasets

| S-EPMC5905607 | biostudies-literature
| S-EPMC6140545 | biostudies-literature
| S-EPMC3006395 | biostudies-literature
| S-EPMC6825836 | biostudies-literature
| S-EPMC6986895 | biostudies-literature
| S-EPMC5912177 | biostudies-literature
2011-08-15 | GSE31375 | GEO