Unknown

Dataset Information

0

Voršilák2020 - Bayesian estimation of synthetic accessibility of organic compounds


ABSTRACT:

SYBA uses a fragment-based approach to classify whether a molecule is easy or hard to synthesize, and it can also be used to analyze the contribution of individual fragments to the total synthetic accessibility. The easy-to-synthesize dataset is an extract of the ZINC purchasable compounds, and the hard-to-synthesize dataset is generated using a Nonpher approach (introducing small molecular perturbations to transform molecules into more complex compounds). The fragments are calculated with ECFP8 descriptors, and independence between fragments is assumed.

Model Type: Predictive machine learning model.
Model Relevance: Prediction of synthetic accessibility
Model Encoded by: Miquel Duran-Frigola (Ersilia)
Metadata Submitted in BioModels by: Zainab Ashimiyu-Abdusalam

Implementation of this model code by Ersilia is available here:
https://github.com/ersilia-os/eos7pw8

Ersilia Logo

SUBMITTER: Zainab Ashimiyu-Abdusalam 

PROVIDER: MODEL2407180002 | biostudies-other |

SECONDARY ACCESSION(S): 33431015

REPOSITORIES: biostudies-other

altmetric image

Publications

SYBA: Bayesian estimation of synthetic accessibility of organic compounds.

Voršilák Milan M   Kolář Michal M   Čmelo Ivan I   Svozil Daniel D  

Journal of cheminformatics 20200520 1


SYBA (SYnthetic Bayesian Accessibility) is a fragment-based method for the rapid classification of organic compounds as easy- (ES) or hard-to-synthesize (HS). It is based on a Bernoulli naïve Bayes classifier that is used to assign SYBA score contributions to individual fragments based on their frequencies in the database of ES and HS molecules. SYBA was trained on ES molecules available in the ZINC15 database and on HS molecules generated by the Nonpher methodology. SYBA was compared with a ran  ...[more]

Similar Datasets

| S-EPMC7238540 | biostudies-literature
| MODEL2407180004 | biostudies-other
2024-07-18 | MODEL2407180002 | BioModels
| S-EPMC10369256 | biostudies-literature
2024-07-18 | MODEL2407180004 | BioModels
| S-EPMC9692375 | biostudies-literature
| S-EPMC8486869 | biostudies-literature