Unknown

Dataset Information

0

Constrained Fourier estimation of short-term time-series gene expression data reduces noise and improves clustering and gene regulatory network predictions.


ABSTRACT:

Background

Biological data suffers from noise that is inherent in the measurements. This is particularly true for time-series gene expression measurements. Nevertheless, in order to to explore cellular dynamics, scientists employ such noisy measurements in predictive and clustering tools. However, noisy data can not only obscure the genes temporal patterns, but applying predictive and clustering tools on noisy data may yield inconsistent, and potentially incorrect, results.

Results

To reduce the noise of short-term (< 48 h) time-series expression data, we relied on the three basic temporal patterns of gene expression: waves, impulses and sustained responses. We constrained the estimation of the true signals to these patterns by estimating the parameters of first and second-order Fourier functions and using the nonlinear least-squares trust-region optimization technique. Our approach lowered the noise in at least 85% of synthetic time-series expression data, significantly more than the spline method ([Formula: see text]). When the data contained a higher signal-to-noise ratio, our method allowed downstream network component analyses to calculate consistent and accurate predictions, particularly when the noise variance was high. Conversely, these tools led to erroneous results from untreated noisy data. Our results suggest that at least 5-7 time points are required to efficiently de-noise logarithmic scaled time-series expression data. Investing in sampling additional time points provides little benefit to clustering and prediction accuracy.

Conclusions

Our constrained Fourier de-noising method helps to cluster noisy gene expression and interpret dynamic gene networks more accurately. The benefit of noise reduction is large and can constitute the difference between a successful application and a failing one.

SUBMITTER: Bar N 

PROVIDER: S-EPMC9364503 | biostudies-literature | 2022 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Constrained Fourier estimation of short-term time-series gene expression data reduces noise and improves clustering and gene regulatory network predictions.

Bar Nadav N   Nikparvar Bahareh B   Jayavelu Naresh Doni ND   Roessler Fabienne Krystin FK  

BMC bioinformatics 20220809 1


<h4>Background</h4>Biological data suffers from noise that is inherent in the measurements. This is particularly true for time-series gene expression measurements. Nevertheless, in order to to explore cellular dynamics, scientists employ such noisy measurements in predictive and clustering tools. However, noisy data can not only obscure the genes temporal patterns, but applying predictive and clustering tools on noisy data may yield inconsistent, and potentially incorrect, results.<h4>Results</h  ...[more]

Similar Datasets

| S-EPMC3219611 | biostudies-literature
| S-EPMC7518521 | biostudies-literature
| S-EPMC11681241 | biostudies-literature
| S-EPMC9942547 | biostudies-literature
| S-EPMC2920277 | biostudies-literature
| S-EPMC4586264 | biostudies-literature
| S-EPMC9736003 | biostudies-literature
| S-EPMC3772096 | biostudies-literature
| S-EPMC3843499 | biostudies-literature
| S-EPMC3022823 | biostudies-literature