Unknown

Dataset Information

0

EnviRule: an end-to-end system for automatic extraction of reaction patterns from environmental contaminant biotransformation pathways.


ABSTRACT:

Motivation

Transformation products (TPs) of man-made chemicals, formed through microbially mediated transformation in the environment, can have serious adverse environmental effects, yet the analytical identification of TPs is challenging. Rule-based prediction tools are successful in predicting TPs, especially in environmental chemistry applications that typically have to rely on small datasets, by imparting the existing knowledge on enzyme-mediated biotransformation reactions. However, the rules extracted from biotransformation reaction databases usually face the issue of being over/under-generalized and are not flexible to be updated with new reactions.

Results

We developed an automatic rule extraction tool called enviRule. It clusters biotransformation reactions into different groups based on the similarities of reaction fingerprints, and then automatically extracts and generalizes rules for each reaction group in SMARTS format. It optimizes the genericity of automatic rules against the downstream TP prediction task. Models trained with automatic rules outperformed the models trained with manually curated rules by 30% in the area under curve (AUC) scores. Moreover, automatic rules can be easily updated with new reactions, highlighting enviRule's strengths for both automatic extraction of optimized reactions rules and automated updating thereof.

Availability and implementation

enviRule code is freely available at https://github.com/zhangky12/enviRule.

SUBMITTER: Zhang K 

PROVIDER: S-EPMC10322654 | biostudies-literature | 2023 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

enviRule: an end-to-end system for automatic extraction of reaction patterns from environmental contaminant biotransformation pathways.

Zhang Kunyang K   Fenner Kathrin K  

Bioinformatics (Oxford, England) 20230701 7


<h4>Motivation</h4>Transformation products (TPs) of man-made chemicals, formed through microbially mediated transformation in the environment, can have serious adverse environmental effects, yet the analytical identification of TPs is challenging. Rule-based prediction tools are successful in predicting TPs, especially in environmental chemistry applications that typically have to rely on small datasets, by imparting the existing knowledge on enzyme-mediated biotransformation reactions. However,  ...[more]

Similar Datasets

| S-EPMC5919007 | biostudies-literature
| S-EPMC10544838 | biostudies-literature
| S-EPMC9072332 | biostudies-literature
| S-EPMC2901371 | biostudies-literature
| S-EPMC5915274 | biostudies-literature
| S-EPMC7924173 | biostudies-literature
| S-EPMC8860887 | biostudies-literature
| S-EPMC7994862 | biostudies-literature
| PRJEB37921 | ENA
| PRJNA1143158 | ENA