Dataset Information

Genome assembly of Hibiscus sabdariffa L. provides insights into metabolisms of medicinal natural products.


ABSTRACT: Hibiscus sabdariffa L. is a widely cultivated herbaceous plant with diverse applications in food, tea, fiber, and medicine. In this study, we present a high-quality genome assembly of H. sabdariffa using more than 33 Gb of high-fidelity (HiFi) long-read sequencing data, corresponding to ∼20× depth of the genome. We obtained 3 genome assemblies of H. sabdariffa: 1 primary and 2 partially haplotype-resolved genome assemblies. These genome assemblies exhibit N50 contig lengths of 26.25, 11.96, and 14.50 Mb, with genome coverage of 141.3, 86.0, and 88.6%, respectively. We also utilized 26 Gb of total RNA sequencing data to predict 154k, 79k, and 87k genes in the respective assemblies. The completeness of the primary genome assembly and its predicted genes was confirmed by the benchmarking universal single-copy ortholog analysis with a completeness rate of 99.3%. Based on our high-quality genomic resources, we constructed genetic networks for phenylpropanoid and flavonoid metabolism and identified candidate biosynthetic genes, which are responsible for producing key intermediates of roselle-specific medicinal natural products. Our comprehensive genomic and functional analysis opens avenues for further exploration and application of valuable natural products in H. sabdariffa.
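The N50 contig lengths quoted in the abstract are a standard contiguity statistic: the length of the contig at which the cumulative length of the contigs, sorted from longest to shortest, first reaches half of the total assembly length. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `n50` and the toy contig lengths are hypothetical and are not taken from the H. sabdariffa assemblies.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    account for at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)  # longest contig first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths, for illustration only.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))
```

For a real assembly, the input would be the list of contig lengths read from the assembly FASTA file.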

SUBMITTER: Kim T 

PROVIDER: S-EPMC11304979 | biostudies-literature | 2024 Aug

REPOSITORIES: biostudies-literature

Publications

Genome assembly of Hibiscus sabdariffa L. provides insights into metabolisms of medicinal natural products.

Kim T, Lee JH, Seo HH, Moh SH, Choi SS, Kim J, Kim SG

G3 (Bethesda, Md.), 2024 Aug 1, issue 8


Similar Datasets

| PRJNA987859 | ENA
| PRJNA1072616 | ENA
| PRJNA1072565 | ENA
| PRJNA1077780 | ENA
| PRJNA1072611 | ENA
| PRJNA1072564 | ENA
| PRJNA987858 | ENA
| PRJNA1077779 | ENA
| PRJNA416201 | ENA
| PRJNA1077781 | ENA