Unknown

Dataset Information

0

Genome assembly of two diploid and one auto-tetraploid Cyclocarya paliurus genomes.


ABSTRACT: Cyclocarya paliurus, an endemic species in the genus Juglandaceae with the character of heterodichogamy, is one of triterpene-rich medicinal plants in China. To uncover the genetic mechanisms behind the special characteristics, we sequenced the genomes of two diploid (protandry, PA-dip and protogyny, PG-dip) and one auto-tetraploid (PA-tetra) C. paliurus genomes. Based on 134.9 (~225x), 75.5 (~125x) and 271.8 Gb (~226x) subreads of PacBio platform sequencing data, we assembled 586.62 Mb (contig N50 = 1.9 Mb), 583.45 Mb (contig N50 = 1.4 Mb), and 2.38 Gb (contig N50 = 430.9 kb) for PA-dip, PG-dip and PA-tetra genome, respectively. Furthermore, 543.53, 553.87, and 2168.65 Mb in PA-dip, PG-dip, and PA-tetra, were respectively anchored to 16, 16, and 64 pseudo-chromosomes using over 65.4 Gb (~109x), 68 Gb (~113x), and 264 (~220x) Hi-C sequencing data. Annotation of PA-dip, PG-dip, and PA-tetra genome assembly identified 34,699, 35,221, and 34,633 protein-coding genes (90,752 gene models) or allele-defined genes, respectively. In addition, 45 accessions from nine locations were re-sequenced, and more than 10 × coverage reads were generated.

SUBMITTER: Qu Y 

PROVIDER: S-EPMC10397226 | biostudies-literature | 2023 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome assembly of two diploid and one auto-tetraploid Cyclocarya paliurus genomes.

Qu Yinquan Y   Shang Xulan X   Fang Shengzuo S   Zhang Xingtan X   Fu Xiangxiang X  

Scientific data 20230802 1


Cyclocarya paliurus, an endemic species in the genus Juglandaceae with the character of heterodichogamy, is one of triterpene-rich medicinal plants in China. To uncover the genetic mechanisms behind the special characteristics, we sequenced the genomes of two diploid (protandry, PA-dip and protogyny, PG-dip) and one auto-tetraploid (PA-tetra) C. paliurus genomes. Based on 134.9 (~225x), 75.5 (~125x) and 271.8 Gb (~226x) subreads of PacBio platform sequencing data, we assembled 586.62 Mb (contig  ...[more]

Similar Datasets

| PRJNA933771 | ENA
| PRJNA1239021 | ENA
| PRJNA1057569 | ENA
| PRJNA542760 | ENA
| PRJNA783768 | ENA
| PRJNA912370 | ENA
| PRJNA931978 | ENA
| PRJNA783953 | ENA
| PRJNA1058579 | ENA
| PRJNA1058593 | ENA