Genomics

Dataset Information

De novo re-construction of the core genome from ChIP-Seq for large-genome organisms


ABSTRACT: Genetic diversity in plants is remarkably high. Recent whole genome sequencing (WGS) of 67 rice accessions recovered 10,872 novel genes. Comparing the genetic architecture among divergent populations, or between crops and their wild relatives, is essential for identifying the functional components that determine crucial traits. However, many major crops have gigabase-scale genomes, which are not well suited to WGS. Existing cost-effective sequencing approaches, including re-sequencing, exome sequencing and restriction enzyme-based methods, all have difficulty obtaining long novel genomic sequences from highly divergent populations with large genomes. The present study presents a reference-independent core genome targeted sequencing approach, CGT-seq, which uses epigenomic information from both active and repressive epigenetic marks to guide assembly of the core genome, composed mainly of promoter and intragenic regions. The method is relatively easy to implement and displays high accuracy, sensitivity and specificity in capturing the core genome of bread wheat: CGT-seq reads covered 95% of intragenic regions and 89% of promoter regions in wheat. We further demonstrate in rice that CGT-seq captures hundreds of novel genes and regulatory sequences from a previously unsequenced ecotype. Together, by specifically enriching and sequencing regions within and near genes, CGT-seq is a time- and resource-effective approach to profiling functionally relevant regions in sequenced and non-sequenced populations with large genomes.

ORGANISM(S): Oryza sativa, Triticum aestivum, Oryza sativa Japonica Group, Oryza sativa Indica Group

PROVIDER: GSE107827 | GEO | 2018/05/27

REPOSITORIES: GEO

Similar Datasets

2021-02-04 | GSE139019 | GEO
2007-10-02 | GSE8064 | GEO
2023-03-31 | E-MTAB-11161 | biostudies-arrayexpress
2007-10-02 | E-GEOD-8064 | biostudies-arrayexpress
2023-04-24 | GSE198515 | GEO
2011-10-05 | GSE23889 | GEO
2017-11-13 | PXD004870 | Pride
2023-08-30 | GSE185875 | GEO
2011-12-06 | E-GEOD-27048 | biostudies-arrayexpress
2011-12-06 | GSE27048 | GEO