Unknown

Dataset Information

0

Genomic data integration tutorial, a plant case study.


ABSTRACT:

Background

The ongoing evolution of the Next Generation Sequencing (NGS) technologies has led to the production of genomic data on a massive scale. While tools for genomic data integration and analysis are becoming increasingly available, the conceptual and analytical complexities still represent a great challenge in many biological contexts.

Results

To address this issue, we describe a six-steps tutorial for the best practices in genomic data integration, consisting of (1) designing a data matrix; (2) formulating a specific biological question toward data description, selection and prediction; (3) selecting a tool adapted to the targeted questions; (4) preprocessing of the data; (5) conducting preliminary analysis, and finally (6) executing genomic data integration.

Conclusion

The tutorial has been tested and demonstrated on publicly available genomic data generated from poplar (Populus L.), a woody plant model. We also developed a new graphical output for the unsupervised multi-block analysis, cimDiablo_v2, available at https://forgemia.inra.fr/umr-gdec/omics-integration-on-poplar , and allowing the selection of master drivers in genomic data variation and interplay.

SUBMITTER: Mardoc E 

PROVIDER: S-EPMC10792847 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genomic data integration tutorial, a plant case study.

Mardoc Emile E   Sow Mamadou Dia MD   Déjean Sébastien S   Salse Jérôme J  

BMC genomics 20240117 1


<h4>Background</h4>The ongoing evolution of the Next Generation Sequencing (NGS) technologies has led to the production of genomic data on a massive scale. While tools for genomic data integration and analysis are becoming increasingly available, the conceptual and analytical complexities still represent a great challenge in many biological contexts.<h4>Results</h4>To address this issue, we describe a six-steps tutorial for the best practices in genomic data integration, consisting of (1) design  ...[more]

Similar Datasets

| S-EPMC11490075 | biostudies-literature
| S-EPMC6192561 | biostudies-literature
| S-EPMC10879751 | biostudies-literature
| S-EPMC4833864 | biostudies-other
| S-EPMC10508576 | biostudies-literature
| S-EPMC4539887 | biostudies-literature
| S-EPMC7158209 | biostudies-literature
| S-EPMC2762073 | biostudies-literature
| S-EPMC5198844 | biostudies-literature
| S-EPMC3866686 | biostudies-literature