Genomics

Dataset Information

0

Synthetic data - Genome in a Bottle


ABSTRACT: In May, the National Institute of Standards and Technology (NIST) released its first genome in a bottle, a reference sample of DNA for validating human genome sequences. This so-called truth sequence comes from a decades-old sample donated by a Utah woman for (other) research purposes (NA12878 cell line), which, over the years, has been one of the most studied, and hence best-characterized, human samples. Seeing genomic medicine moving toward mainstream healthcare, researchers at NIST recognized the need for a reference human genome and assembled a private-public consortium in 2012 to create one. As detailed in a 2014 Nature Biotechnology paper (Nat. Biotechnol.32, 246–251, 2014), the group integrated and arbitrated among sequences from 14 data sets, five sequencing technologies, seven read mappers and three variant callers.

PROVIDER: EGAS00001005591 | EGA |

REPOSITORIES: EGA

Similar Datasets

2014-02-22 | GSE55239 | GEO
2025-02-21 | GSE290209 | GEO
2016-01-21 | E-MTAB-4344 | biostudies-arrayexpress
2023-09-06 | GSE222054 | GEO
2023-12-07 | GSE227218 | GEO
2023-12-07 | GSE227214 | GEO
| 2573355 | ecrin-mdr-crc
2016-03-31 | PXD003500 | Pride
2019-09-20 | E-MTAB-8349 | biostudies-arrayexpress
2010-01-13 | GSE19651 | GEO