Unknown

Dataset Information

0

Extensive sequencing of seven human genomes to characterize benchmark reference materials.


ABSTRACT: The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.

SUBMITTER: Zook JM 

PROVIDER: S-EPMC4896128 | biostudies-literature | 2016 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Zook Justin M JM   Catoe David D   McDaniel Jennifer J   Vang Lindsay L   Spies Noah N   Sidow Arend A   Weng Ziming Z   Liu Yuling Y   Mason Christopher E CE   Alexander Noah N   Henaff Elizabeth E   McIntyre Alexa B R AB   Chandramohan Dhruva D   Chen Feng F   Jaeger Erich E   Moshrefi Ali A   Pham Khoa K   Stedman William W   Liang Tiffany T   Saghbini Michael M   Dzakula Zeljko Z   Hastie Alex A   Cao Han H   Deikus Gintaras G   Schadt Eric E   Sebra Robert R   Bashir Ali A   Truty Rebecca M RM   Chang Christopher C CC   Gulbahce Natali N   Zhao Keyan K   Ghosh Srinka S   Hyland Fiona F   Fu Yutao Y   Chaisson Mark M   Xiao Chunlin C   Trow Jonathan J   Sherry Stephen T ST   Zaranek Alexander W AW   Ball Madeleine M   Bobe Jason J   Estep Preston P   Church George M GM   Marks Patrick P   Kyriazopoulou-Panagiotopoulou Sofia S   Zheng Grace X Y GX   Schnall-Levin Michael M   Ordonez Heather S HS   Mudivarti Patrice A PA   Giorda Kristina K   Sheng Ying Y   Rypdal Karoline Bjarnesdatter KB   Salit Marc M  

Scientific data 20160607


The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one o  ...[more]

Similar Datasets

| S-EPMC4659454 | biostudies-literature
| S-EPMC2928504 | biostudies-literature
| S-EPMC3486893 | biostudies-literature
| S-EPMC3404096 | biostudies-literature
| S-EPMC9988948 | biostudies-literature
| S-EPMC5287235 | biostudies-literature
| S-EPMC6256668 | biostudies-literature
| S-EPMC6362088 | biostudies-literature
| S-EPMC8147415 | biostudies-literature
| S-EPMC7953489 | biostudies-literature