Dataset Information

A global cancer data integrator reveals principles of synthetic lethality, sex disparity and immunotherapy.


ABSTRACT:

Background

Advances in cancer biology are increasingly dependent on integration of heterogeneous datasets. Large-scale efforts have systematically mapped many aspects of cancer cell biology; however, it remains challenging for individual scientists to effectively integrate and understand these data.

Results

We have developed a new data retrieval and indexing framework that allows us to integrate publicly available data from different sources and to combine publicly available data with new or bespoke datasets. Our approach, which we have named the cancer data integrator (CanDI), is straightforward to implement, well documented, and continuously updated, which should enable individual users to take full advantage of efforts to map cancer cell biology. We show that CanDI enabled the generation of testable hypotheses about new synthetic lethal gene pairs, genes associated with sex disparity, and immunotherapy targets in cancer.

Conclusions

CanDI provides a flexible approach for large-scale data integration in cancer research, enabling rapid generation of hypotheses. The CanDI data integrator is available at https://github.com/GilbertLabUCSF/CanDI .

SUBMITTER: Yogodzinski C 

PROVIDER: S-EPMC8524992 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

Publications

A global cancer data integrator reveals principles of synthetic lethality, sex disparity and immunotherapy.

Yogodzinski C, Arab A, Pritchard JR, Goodarzi H, Gilbert LA

Genome Medicine, 2021-10-18, issue 1


Similar Datasets

| S-EPMC7217969 | biostudies-literature
2013-02-28 | E-GEOD-14217 | biostudies-arrayexpress
2013-02-28 | GSE14217 | GEO
| S-EPMC10691477 | biostudies-literature
| S-EPMC4848401 | biostudies-literature
| S-EPMC2592713 | biostudies-literature
| S-EPMC4172077 | biostudies-literature
2004-11-05 | GSE1758 | GEO
2004-11-05 | GSE1757 | GEO
2004-11-05 | GSE1756 | GEO