Dataset Information

Genome assembly of the ectoparasitoid wasp Theocolax elegans.


ABSTRACT: The ectoparasitoid wasp Theocolax elegans is a cosmopolitan and generalist pteromalid parasitoid of several major storage insect pests, and can effectively suppress a host population in warehouses. However, little molecular information about this wasp is currently available. In this study, we assembled the genome of T. elegans using PacBio long-read sequencing, Illumina sequencing, and Hi-C methods. The genome assembly is 662.73 Mb in length with contig and scaffold N50 values of 1.15 Mb and 88.8 Mb, respectively. The genome contains 56.4% repeat sequences and 23,212 protein-coding genes were annotated. Phylogenomic analyses revealed that T. elegans diverged from the lineage leading to subfamily Pteromalinae (Nasonia vitripennis and Pteromalus puparum) approximately 110.5 million years ago. We identified 130 significantly expanded gene families, 34 contracted families, 248 fast-evolving genes, and 365 positively selected genes in T. elegans. Additionally, 260 olfactory receptors and 285 venom proteins were identified. This genome assembly provides valuable genetic bases for future investigations on evolution, molecular biology and application of T. elegans.
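The contig and scaffold N50 values quoted in the abstract follow the conventional definition of N50: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The sketch below is a minimal illustration of that definition only; it is not the authors' pipeline, the function name `n50` is hypothetical, and the toy lengths are made up rather than taken from this dataset.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Conventional definition: sort lengths from longest to shortest and
    return the length at which the running sum first reaches at least
    half of the total length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running sum to avoid fractional halves.
        if running * 2 >= total:
            return length


# Toy contig lengths, purely illustrative (not from the T. elegans assembly).
toy_lengths = [80, 70, 50, 40, 30, 20]
toy_n50 = n50(toy_lengths)
print(toy_n50)
```

With real data, the lengths would typically be read from the assembly FASTA, once for contigs and once for scaffolds, which is why the two N50 values differ.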

SUBMITTER: Xiao S 

PROVIDER: S-EPMC10033727 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

Publications

Genome assembly of the ectoparasitoid wasp Theocolax elegans.

Xiao Shan, Ye Xinhai, Wang Shuping, Yang Yi, Fang Qi, Wang Fang, Ye Gongyin

Scientific Data, 2023 Mar 22, issue 1


Similar Datasets

| S-EPMC6954513 | biostudies-literature
| S-EPMC7783055 | biostudies-literature
| S-EPMC4776022 | biostudies-literature
| S-EPMC7290573 | biostudies-literature
| S-EPMC7349765 | biostudies-literature
| S-EPMC3544295 | biostudies-literature
| S-EPMC6927652 | biostudies-literature
| S-EPMC10482854 | biostudies-literature
| S-EPMC7822920 | biostudies-literature
| S-EPMC4018385 | biostudies-literature