Unknown

Dataset Information

0

OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations.


ABSTRACT:

Background

Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and visualization, which most biomedical researchers lack. Our aim was to develop an automated tool providing all necessary algorithms, benefiting both bioinformaticians and researchers without bioinformatic training.

Results

We developed overlapping annotated genomic regions (OGRE) as a comprehensive tool to associate and visualize input regions with genomic annotations. It does so by parsing regions of interest, mining publicly available annotations, and calculating possible overlaps between them. The user can thus identify location, type, and number of associated regulatory elements. Results are presented as easy to understand visualizations and result tables. We applied OGRE to recent studies and could show high reproducibility and potential new insights. To demonstrate OGRE's performance in terms of running time and output, we have conducted a benchmark and compared its features with similar tools.

Conclusions

OGRE's functions and built-in annotations can be applied as a downstream overlap association step, which is compatible with most genomic sequencing outputs, and can thus enrich pre-existing analyses pipelines. Compared to similar tools, OGRE shows competitive performance, offers additional features, and has been successfully applied to two recent studies. Overall, OGRE addresses the lack of tools for automatic analysis, local genomic overlap calculation, and visualization by providing an easy to use, end-to-end solution for both biologists and computational scientists.

SUBMITTER: Berres S 

PROVIDER: S-EPMC10369718 | biostudies-literature | 2023 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

OGRE: calculate, visualize, and analyze overlap between genomic input regions and public annotations.

Berres Sven S   Gromoll Jörg J   Wöste Marius M   Sandmann Sarah S   Laurentino Sandra S  

BMC bioinformatics 20230726 1


<h4>Background</h4>Modern genome sequencing leads to an ever-growing collection of genomic annotations. Combining these elements with a set of input regions (e.g. genes) would yield new insights in genomic associations, such as those involved in gene regulation. The required data are scattered across different databases making a manual approach tiresome, unpractical, and prone to error. Semi-automatic approaches require programming skills in data parsing, processing, overlap calculation, and vis  ...[more]

Similar Datasets

| S-EPMC8128468 | biostudies-literature
| S-EPMC6171338 | biostudies-literature
| S-EPMC6336819 | biostudies-literature
| S-EPMC8609247 | biostudies-literature
| S-EPMC5155393 | biostudies-literature
| S-EPMC8903798 | biostudies-literature
| S-EPMC8728235 | biostudies-literature
| S-EPMC2731905 | biostudies-literature
| S-EPMC4991238 | biostudies-literature
| S-EPMC7083320 | biostudies-literature