Unknown

Dataset Information

0

ShinyTPs: Curating Transformation Products from Text Mining Results.


ABSTRACT: Transformation product (TP) information is essential to accurately evaluate the hazards compounds pose to human health and the environment. However, information about TPs is often limited, and existing data is often not fully Findable, Accessible, Interoperable, and Reusable (FAIR). FAIRifying existing TP knowledge is a relatively easy path toward improving access to data for identification workflows and for machine-learning-based algorithms. ShinyTPs was developed to curate existing transformation information derived from text-mined data within the PubChem database. The application (available as an R package) visualizes the text-mined chemical names to facilitate the user validation of the automatically extracted reactions. ShinyTPs was applied to a case study using 436 tentatively identified compounds to prioritize TP retrieval. This resulted in the extraction of 645 reactions (associated with 496 compounds), of which 319 were not previously available in PubChem. The curated reactions were added to the PubChem Transformations library, which was used as a TP suspect list for identification of TPs using the open-source workflow patRoon. In total, 72 compounds from the library were tentatively identified, 18% of which were curated using ShinyTPs, showing that the app can help support TP identification in non-target analysis workflows.

SUBMITTER: Palm EH 

PROVIDER: S-EPMC10569035 | biostudies-literature | 2023 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

ShinyTPs: Curating Transformation Products from Text Mining Results.

Palm Emma H EH   Chirsir Parviel P   Krier Jessy J   Thiessen Paul A PA   Zhang Jian J   Bolton Evan E EE   Schymanski Emma L EL  

Environmental science & technology letters 20230929 10


Transformation product (TP) information is essential to accurately evaluate the hazards compounds pose to human health and the environment. However, information about TPs is often limited, and existing data is often not fully Findable, Accessible, Interoperable, and Reusable (FAIR). FAIRifying existing TP knowledge is a relatively easy path toward improving access to data for identification workflows and for machine-learning-based algorithms. ShinyTPs was developed to curate existing transformat  ...[more]

Similar Datasets

| S-EPMC5845379 | biostudies-literature
| S-EPMC6594987 | biostudies-literature
| S-EPMC7083782 | biostudies-literature
| S-EPMC5664974 | biostudies-literature
| S-EPMC7206865 | biostudies-literature
| S-EPMC2374703 | biostudies-literature
| S-EPMC2217579 | biostudies-literature
| S-EPMC5975701 | biostudies-literature
| S-EPMC4674139 | biostudies-literature
| S-EPMC3939821 | biostudies-literature