Unknown

Dataset Information

0

ESPERANTO: a GLP-field sEmi-SuPERvised toxicogenomics metadAta curatioN TOol.


ABSTRACT:

Summary

Biological data repositories are an invaluable source of publicly available research evidence. Unfortunately, the lack of convergence of the scientific community on a common metadata annotation strategy has resulted in large amounts of data with low FAIRness (Findable, Accessible, Interoperable and Reusable). The possibility of generating high-quality insights from their integration relies on data curation, which is typically an error-prone process while also being expensive in terms of time and human labour. Here, we present ESPERANTO, an innovative framework that enables a standardized semi-supervised harmonization and integration of toxicogenomics metadata and increases their FAIRness in a Good Laboratory Practice-compliant fashion. The harmonization across metadata is guaranteed with the definition of an ad hoc vocabulary. The tool interface is designed to support the user in metadata harmonization in a user-friendly manner, regardless of the background and the type of expertise.

Availability and implementation

ESPERANTO and its user manual are freely available for academic purposes at https://github.com/fhaive/esperanto. The input and the results showcased in Supplementary File S1 are available at the same link.

SUBMITTER: Di Lieto E 

PROVIDER: S-EPMC10313344 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

ESPERANTO: a GLP-field sEmi-SuPERvised toxicogenomics metadAta curatioN TOol.

Di Lieto Emanuele E   Serra Angela A   Inkala Simo Iisakki SI   Saarimäki Laura Aliisa LA   Del Giudice Giusy G   Fratello Michele M   Hautanen Veera V   Annala Maria M   Federico Antonio A   Greco Dario D  

Bioinformatics (Oxford, England) 20230601 6


<h4>Summary</h4>Biological data repositories are an invaluable source of publicly available research evidence. Unfortunately, the lack of convergence of the scientific community on a common metadata annotation strategy has resulted in large amounts of data with low FAIRness (Findable, Accessible, Interoperable and Reusable). The possibility of generating high-quality insights from their integration relies on data curation, which is typically an error-prone process while also being expensive in t  ...[more]

Similar Datasets

| S-EPMC11850926 | biostudies-literature
2019-11-13 | GSE140262 | GEO
| S-EPMC5047769 | biostudies-literature
| S-EPMC3515863 | biostudies-literature
| S-EPMC3956069 | biostudies-literature
| S-EPMC4245907 | biostudies-other
| S-EPMC5868185 | biostudies-literature
| S-EPMC2768719 | biostudies-literature
| S-EPMC8101547 | biostudies-literature
| S-EPMC4649850 | biostudies-other