Genomics

Dataset Information

0

Model-driven generation of artificial yeast promoters


ABSTRACT: Promoters play a central role in controlling gene regulation; however, a small set of promoters is used for most genetic construct design in the yeast Saccharomyces cerevisiae. The ability to generate and utilize models that accurately predict protein expression from promoter sequence may enable rapid generation of novel useful promoters, facilitating synthetic biology efforts in this model organism. We measured the activity of over 675,000 unique sequences in a constitutive promoter library, and over 327,000 sequences in a library of inducible promoters. Training an ensemble of convolutional neural networks jointly on the two datasets enabled very high (R2 > 0.79) predictive accuracies on multiple prediction tasks. We developed model-guided design strategies which yielded large, sequence-diverse sets of novel promoters exhibiting activities similar to current best-in-class sequences. In addition to providing large sets of new promoters, our results show the value of model-guided design as an approach for generating DNA parts.

ORGANISM(S): synthetic construct

PROVIDER: GSE135464 | GEO | 2019/12/01

REPOSITORIES: GEO

Similar Datasets

2017-09-27 | GSE104252 | GEO
2015-11-13 | GSE74948 | GEO
2024-01-05 | PXD048221 | Pride
2013-09-01 | E-GEOD-48860 | biostudies-arrayexpress
2021-10-25 | GSE184426 | GEO
| PRJEB36433 | ENA
| PRJNA752141 | ENA
2019-11-27 | GSE140218 | GEO
2019-11-27 | GSE140215 | GEO
2019-11-27 | GSE140214 | GEO