Dataset Information

Bayesian Correlation is a robust similarity measure for single cell RNA-seq data

ABSTRACT: Single-cell analysis of the transcriptome deepens our understanding of an individual cell's contribution to its microenvironment. Using single-cell analysis to study complex biological processes requires state-of-the-art computational tools. Assessing similarity is central to bioinformatics algorithms that determine correlations between biological entities. Similarity can appear by chance, particularly for lowly expressed entities. This is especially relevant in single-cell RNA-seq (scRNA-seq), because the read counts obtained are lower than in bulk RNA-sequencing, and classic bioinformatics tools are therefore insufficient to obtain reproducible results. Recently, a Bayesian correlation scheme that assigns low correlation values to correlations coming from lowly expressed genes has been proposed to assess similarity for bulk RNA-seq and miRNA. This Bayesian method starts from a prior distribution before incorporating empirical evidence. Our goal was to extend the properties of this Bayesian correlation scheme to scRNA-seq data. We assessed three ways to compute similarity. First, we computed the similarity of each pair of genes over all cells. Second, we identified specific cell populations and computed the correlation within those cells. Third, we computed the similarity of each pair of genes over all clusters, by including the total mRNA expression in those cells. To study the effect of the number of cells on the method, we did not rely on simulated data; instead, we generated four scRNA-seq mouse liver cell libraries with a varying number of input cells.
Results: We show that Bayesian correlations are more reproducible than Pearson correlations in all the scenarios studied. Compared to Pearson correlations, Bayesian correlations depend less on the number of input cells. We demonstrate that the Bayesian correlation algorithm assigns high similarity values to genes with biological relevance in a specific population.
Significance: Our results demonstrate that Bayesian correlation is a robust similarity measure for scRNA-seq datasets. The Bayesian method allows researchers to study similarity between pairs of genes without discarding lowly expressed entities, while minimizing the bias introduced by spurious correlations. Taken together, our Bayesian correlation method significantly increases the reproducibility of scRNA-seq experiments.
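
The idea that a prior can damp correlations driven by lowly expressed genes can be illustrated with a minimal sketch. It is not taken from this record and is not the authors' implementation: it assumes a Beta prior on each gene's expression fraction per cell, and the function name `bayesian_correlation` and the parameters `alpha0` and `beta0` are hypothetical. The covariance is computed from posterior means across cells, while each variance also includes the average posterior uncertainty, so genes with few reads receive a larger denominator.

```python
import numpy as np

def bayesian_correlation(counts_i, counts_j, totals, alpha0=1.0, beta0=1.0):
    """Illustrative Bayesian-style correlation between two genes.

    counts_i, counts_j: read counts of each gene per cell.
    totals: total read count per cell.
    alpha0, beta0: Beta prior parameters (assumed values, not from the source).
    """
    counts_i = np.asarray(counts_i, dtype=float)
    counts_j = np.asarray(counts_j, dtype=float)
    totals = np.asarray(totals, dtype=float)

    def posterior(counts):
        # Beta posterior for the fraction of a cell's reads assigned to the gene.
        a = alpha0 + counts
        b = beta0 + totals - counts
        mean = a / (a + b)
        var = a * b / ((a + b) ** 2 * (a + b + 1.0))
        return mean, var

    mean_i, var_i = posterior(counts_i)
    mean_j, var_j = posterior(counts_j)

    # Covariance of posterior means across cells.
    cov = np.mean((mean_i - mean_i.mean()) * (mean_j - mean_j.mean()))
    # Total variance = spread of posterior means + average posterior uncertainty.
    total_var_i = np.mean((mean_i - mean_i.mean()) ** 2) + var_i.mean()
    total_var_j = np.mean((mean_j - mean_j.mean()) ** 2) + var_j.mean()
    return cov / np.sqrt(total_var_i * total_var_j)

# Toy data: two pairs of genes with identical on/off patterns over six cells.
totals = np.full(6, 10000.0)
low = np.array([0, 1, 0, 1, 0, 1])            # lowly expressed pair
high = np.array([0, 1000, 0, 1000, 0, 1000])  # highly expressed pair

pearson_low = np.corrcoef(low, low)[0, 1]
pearson_high = np.corrcoef(high, high)[0, 1]
bayes_low = bayesian_correlation(low, low, totals)
bayes_high = bayesian_correlation(high, high, totals)
```

In this toy setting Pearson correlation cannot distinguish the two pairs, whereas the sketch returns a much smaller value for the lowly expressed pair because the posterior uncertainty dominates its variance.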

ORGANISM(S): Mus musculus

PROVIDER: GSE134134 | GEO | 2019/12/31

REPOSITORIES: GEO

Similar Datasets

Project description: Kilian2024 - Immune cell dynamics in Cue-Induced Extended Human Colitis Model. Single-cell technologies such as scRNA-seq and flow cytometry provide critical insights into immune cell behavior in inflammatory bowel disease (IBD). However, integrating these datasets into computational models for dynamic analysis remains challenging. Here, Kilian et al. (2024) developed a deterministic ODE-based model that incorporates these technologies to study immune cell population changes in murine colitis. The model parameters were optimized to fit experimental data, ensuring an accurate representation of immune cell behavior over time. It was then validated by comparing simulations with experimental data using Pearson’s correlation and further tested on independent datasets to confirm its robustness. Additionally, the model was applied to clinical bulk RNA-seq data from human IBD patients, providing valuable insights into immune system dynamics and potential therapeutic strategies. Figure 4c, obtained from simulation of the human colitis model, is highlighted here. This model is described in the article: Kilian, C., Ulrich, H., Zouboulis, V.A. et al. Longitudinal single-cell data informs deterministic modelling of inflammatory bowel disease. npj Syst Biol Appl 10, 69 (2024). https://doi.org/10.1038/s41540-024-00395-9 Abstract: Single-cell-based methods such as flow cytometry or single-cell mRNA sequencing (scRNA-seq) allow deep molecular and cellular profiling of immunological processes. Despite their high throughput, however, these measurements represent only a snapshot in time. Here, we explore how longitudinal single-cell-based datasets can be used for deterministic ordinary differential equation (ODE)-based modelling to mechanistically describe immune dynamics. We derived longitudinal changes in cell numbers of colonic cell types during inflammatory bowel disease (IBD) from flow cytometry and scRNA-seq data of murine colitis using ODE-based models. Our mathematical model generalised well across different protocols and experimental techniques, and we hypothesised that the estimated model parameters reflect biological processes. We validated this prediction of cellular turnover rates with KI-67 staining and with gene expression information from the scRNA-seq data not used for model fitting. Finally, we tested the translational relevance of the mathematical model by deconvolution of longitudinal bulk mRNA-sequencing data from a cohort of human IBD patients treated with olamkicept. We found that neutrophil depletion may contribute to IBD patients entering remission. The predictive power of IBD deterministic modelling highlights its potential to advance our understanding of immune dynamics in health and disease. This model was curated during the Hackathon hosted by BioMed X GmbH in 2024.

Project description: The importance of brain-body interactions in disease progression is increasingly recognized, and understanding the associated processes could vastly improve the health of the whole organism. A comprehensive, whole-organism analysis of organ dynamics to identify the molecular processes within and between organs after brain injury caused by ischemic stroke has been lacking. Mouse models of 24-hour ischemic brain injury were constructed. Proteomics and metabolomics were used to detect changes in the organs' proteomes and metabolomes, respectively, and multiple bioinformatics analyses were performed to investigate the changing signatures across organs after stroke. The organ proteomics data were then combined with aging mouse organ transcriptome data and mouse organ scRNA-seq data to reveal, respectively, the biological age of organs and the cellular source of differentially expressed proteins (DEPs) in organs after stroke. Finally, we cross-referenced stroke-related DEPs in plasma with their corresponding protein expression in each organ, using Spearman’s correlation analysis, to investigate the original source of plasma proteins. Bioinformatics analysis shows that synchronization and asynchronization of the stroke-related proteins involved in multiple regulatory pathways occur universally across organs. Although significant inter-organ correlations exist independently of stroke, the protein expression changes may better reflect the inter-organ correlation after brain injury by sharing some common changing signatures. Hundreds of DEPs are revealed in organs, with the greatest number in heart, and they include both DEPs unique to one organ and DEPs shared between organs. Notably, an integrated analysis of these DEPs with mouse aging RNA-seq data reveals ageing-like changes in organs, suggesting increased biological ages of organs. Furthermore, a conjoint analysis of our proteomic data with scRNA-seq data confirms that the DEPs originate from intrinsic cells and immune cells, the latter being widely located and activated in organs. Finally, we find that some DEPs in plasma are highly correlated with corresponding protein levels in distinct organs, potentially resulting in the stroke of the systemic circulation. Together, this study demonstrates a similar yet asynchronous inter- and intra-organ progression of stroke, providing a fundamental resource for understanding the molecular mechanisms underlying brain-body interaction and potential interventions for brain injury.
