Dataset Information

Descriptive statistics and visualization of data from the R datasets package with implications for clusterability.

ABSTRACT: The manuscript describes and visualizes datasets from the datasets package in the R statistical software, focusing on descriptive statistics and visualizations that provide insight into the clusterability of these datasets. These publicly available datasets are included with the R software system, which can be downloaded at https://www.r-project.org/; documentation is provided at https://stat.ethz.ch/R-manual/R-devel/library/datasets/html/00Index.html. Further information on clusterability is found in the companion to this article, "To Cluster or Not to Cluster: An Analysis of Clusterability Methods" (https://doi.org/10.1016/j.patcog.2018.10.026). Brief descriptions and graphs of the variables contained in each dataset are provided, along with summary statistics: means, extrema, quartiles, standard deviations, and standard errors. Two-dimensional plots for each pair of variables are provided. Original references to the datasets are included when available. Further, each dataset is reduced to a single dimension by each of two methods: pairwise distances and principal component analysis. For the latter, only the first component is used. Histograms of the reduced data are included for every dataset using both methods.
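
The two one-dimensional reductions named in the abstract can be sketched as follows. This is a minimal illustration in Python with NumPy, not the authors' code: the Euclidean metric, the centering without scaling before principal component analysis, the bin count, and the synthetic two-blob data are all assumptions made for the example, and the helper names are hypothetical.

```python
import numpy as np


def pairwise_distances_1d(data):
    """Reduce a dataset to the distances between all distinct pairs of rows.

    Assumption: Euclidean distance; the abstract does not name the metric.
    """
    data = np.asarray(data, dtype=float)
    diffs = data[:, None, :] - data[None, :, :]
    dist_matrix = np.sqrt((diffs ** 2).sum(axis=-1))
    # Keep each unordered pair once (strict upper triangle).
    upper = np.triu_indices(len(data), k=1)
    return dist_matrix[upper]


def first_principal_component_scores(data):
    """Project each row onto the first principal component.

    Assumption: columns are centered but not rescaled.
    """
    data = np.asarray(data, dtype=float)
    centered = data - data.mean(axis=0)
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[0]


# Synthetic stand-in data (NOT one of the R datasets): two separated blobs.
rng = np.random.default_rng(0)
toy = np.vstack([
    rng.normal(loc=0.0, scale=0.5, size=(50, 2)),
    rng.normal(loc=5.0, scale=0.5, size=(50, 2)),
])

distances = pairwise_distances_1d(toy)
scores = first_principal_component_scores(toy)

# Histogram counts of each reduced dataset (20 bins is an arbitrary choice).
dist_counts, dist_edges = np.histogram(distances, bins=20)
pca_counts, pca_edges = np.histogram(scores, bins=20)
```

A dataset of n rows yields n(n-1)/2 pairwise distances but only n first-component scores, so the two histograms summarize different numbers of values. Visibly multimodal histograms of the reduced data are the kind of feature that hints at cluster structure.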

SUBMITTER: Brownstein NC

PROVIDER: S-EPMC6612012 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

Publications

Descriptive statistics and visualization of data from the R datasets package with implications for clusterability.

Brownstein NC, Adolfsson A, Ackerman M

Data in Brief, 2019-05-24

PMID: 31317060

Similar Datasets

geneHapR: an R package for gene haplotypic statistics and visualization.
| S-EPMC10186671 | biostudies-literature

AlignStatPlot: An R package and online tool for robust sequence alignment statistics and innovative visualization of big data.
| S-EPMC10511070 | biostudies-literature

bubbleHeatmap: an R package for visualization of Nightingale Health metabolomics datasets.
| S-EPMC10518075 | biostudies-literature

smplot: An R Package for Easy and Elegant Data Visualization.
| S-EPMC8714909 | biostudies-literature

qPCRtools: An R package for qPCR data processing and visualization.
| S-EPMC9513427 | biostudies-literature

ceas: an R package for Seahorse data analysis and visualization.
| S-EPMC11349193 | biostudies-literature

BMDExpress Data Viewer - a visualization tool to analyze BMDExpress datasets.
| S-EPMC5064610 | biostudies-literature

Prior Knowledge Transfer Across Transcriptional Datasets Using Compositional Statistics
2016-11-08 | GSE73638 | GEO

cerebroViz: an R package for anatomical visualization of spatiotemporal brain data.
| S-EPMC5870797 | biostudies-literature

bigPint: A Bioconductor visualization package that makes big data pint-sized.
| S-EPMC7347224 | biostudies-literature