Dataset Information


Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality.

ABSTRACT: Tea, one of the world's most important beverage crops, provides numerous secondary metabolites that account for its rich taste and health benefits. Here we present a high-quality sequence of the genome of tea, Camellia sinensis var. sinensis (CSS), using both Illumina and PacBio sequencing technologies. At least 64% of the 3.1-Gb genome assembly consists of repetitive sequences, and the rest yields 33,932 high-confidence predictions of encoded proteins. Divergence between two major lineages, CSS and Camellia sinensis var. assamica (CSA), is calculated to ?0.38 to 1.54 million years ago (Mya). Analysis of genic collinearity reveals that the tea genome is the product of two rounds of whole-genome duplications (WGDs) that occurred ?30 to 40 and ?90 to 100 Mya. We provide evidence that these WGD events, and subsequent paralogous duplications, had major impacts on the copy numbers of secondary metabolite genes, particularly genes critical to producing three key quality compounds: catechins, theanine, and caffeine. Analyses of transcriptome and phytochemistry data show that amplification and transcriptional divergence of genes encoding a large acyltransferase family and leucoanthocyanidin reductases are associated with the characteristic young leaf accumulation of monomeric galloylated catechins in tea, while functional divergence of a single member of the glutamine synthetase gene family yielded theanine synthetase. This genome sequence will facilitate understanding of tea genome evolution and tea metabolite pathways, and will promote germplasm utilization for breeding improved tea varieties.


PROVIDER: S-EPMC5939082 | BioStudies | 2018-01-01

REPOSITORIES: biostudies

Similar Datasets

2015-01-01 | S-EPMC4527363 | BioStudies
2016-01-01 | S-EPMC4811933 | BioStudies
2019-01-01 | S-EPMC6359021 | BioStudies
2017-01-01 | S-EPMC5554289 | BioStudies
2018-01-01 | S-EPMC6094456 | BioStudies
2020-01-01 | S-EPMC7140895 | BioStudies
2016-01-01 | S-EPMC5003824 | BioStudies
2019-01-01 | S-EPMC6384637 | BioStudies
1000-01-01 | S-EPMC6186127 | BioStudies
2017-01-01 | S-EPMC5383724 | BioStudies