Dataset Information

The first high-quality chromosome-level genome of the Sipuncula Sipunculus nudus using HiFi and Hi-C data.


ABSTRACT: Sipuncula is a class of exocoelomic unsegmented animals whose evolutionary relationships are unresolved. The peanut worm Sipunculus nudus is a globally distributed, economically important species belonging to the class Sipuncula. Herein, we present the first high-quality chromosome-level assembly of S. nudus based on HiFi reads and high-resolution chromosome conformation capture (Hi-C) data. The assembled genome was 1,427 Mb, with a contig N50 length of 29.46 Mb and scaffold N50 length of 80.87 Mb. Approximately 97.91% of the genome sequence was anchored to 17 chromosomes. A BUSCO assessment showed that 97.7% of the expectedly conserved genes were present in the genome assembly. The genome was composed of 47.91% repetitive sequences, and 28,749 protein-coding genes were predicted. A phylogenetic tree demonstrated that Sipuncula belongs to Annelida and diverged from the common ancestor of Polychaeta. The high-quality chromosome-level genome of S. nudus will serve as a valuable reference for studies of the genetic diversity and evolution of Lophotrochozoa.
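The contig and scaffold N50 values quoted in the abstract follow the usual assembly-statistics definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly. The sketch below illustrates that definition only; it is not the authors' pipeline. The function name `n50` and the toy lengths are hypothetical and are not taken from the S. nudus assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sort the lengths from longest to shortest and accumulate them;
    the N50 is the length at which the running total first reaches
    at least half of the summed length of all sequences.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical toy lengths (arbitrary units), not real assembly data.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 70: 80 + 70 = 150 reaches half of the 300 total
```

Applied to real data, the same function would take the contig lengths for the contig N50, and the scaffold lengths after Hi-C anchoring for the scaffold N50.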

SUBMITTER: Zheng Z 

PROVIDER: S-EPMC10212961 | biostudies-literature | 2023 May

REPOSITORIES: biostudies-literature

Publications

The first high-quality chromosome-level genome of the Sipuncula Sipunculus nudus using HiFi and Hi-C data.

Zheng Zhe, Lai Zhuoxin, Wu Bin, Song Xinlin, Zhao Wei, Zhong Ruzhuo, Zhang Jiawei, Liao Yongshan, Yang Chuangye, Deng Yuewen, Mei Junpu, Yue Zhen, Jian Jianbo, Wang Qingheng

Scientific Data, 2023-05-25, 1


Similar Datasets

| S-EPMC10492850 | biostudies-literature
| S-EPMC2639372 | biostudies-literature
| S-EPMC7800096 | biostudies-literature
| S-EPMC9633807 | biostudies-literature
| S-EPMC11271555 | biostudies-literature
| S-EPMC10776742 | biostudies-literature
| S-EPMC6213608 | biostudies-literature
| PRJNA592829 | ENA
| PRJNA997755 | ENA
| PRJNA543569 | ENA