Dataset Information

Functional classification of long non-coding RNAs by k-mer content.


ABSTRACT: The functions of most long non-coding RNAs (lncRNAs) are unknown. In contrast to proteins, lncRNAs with similar functions often lack linear sequence homology; thus, the identification of function in one lncRNA rarely informs the identification of function in others. We developed a sequence comparison method to deconstruct linear sequence relationships in lncRNAs and evaluate similarity based on the abundance of short motifs called k-mers. We found that lncRNAs of related function often had similar k-mer profiles despite lacking linear homology, and that k-mer profiles correlated with protein binding to lncRNAs and with their subcellular localization. Using a novel assay to quantify Xist-like regulatory potential, we directly demonstrated that evolutionarily unrelated lncRNAs can encode similar function through different spatial arrangements of related sequence motifs. K-mer-based classification is a powerful approach to detect recurrent relationships between sequence and function in lncRNAs.
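The idea of comparing sequences by k-mer abundance rather than by linear alignment can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `kmer_profile` and `pearson`, the choice of k = 3, the per-kilobase length normalization, and the use of Pearson correlation as the similarity measure are all assumptions made here for illustration only.

```python
from itertools import product
from math import sqrt


def kmer_profile(seq, k=3):
    """Count every k-mer in seq and return counts per kilobase.

    The profile is a list ordered alphabetically over all 4**k DNA k-mers.
    Hypothetical helper for illustration; not taken from the paper.
    """
    seq = seq.upper().replace("U", "T")  # treat RNA and DNA input alike
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with N or other symbols
            counts[window] += 1
    scale = 1000.0 / max(len(seq), 1)  # assumed length normalization
    return [counts[km] * scale for km in kmers]


def pearson(x, y):
    """Pearson correlation between two equal-length numeric lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # correlation is undefined for a constant profile
    return cov / sqrt(var_x * var_y)


# Two toy sequences built from the same motifs in a different order.
seq_a = "ACGACGACGTTTTTT"
seq_b = "TTTTTTACGACGACG"
similarity = pearson(kmer_profile(seq_a), kmer_profile(seq_b))
print(similarity)
```

The two toy sequences would not align end to end, yet their profiles correlate strongly because they share most of their 3-mers; only the k-mers spanning the junction between the two motif blocks differ.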

SUBMITTER: Kirk JM 

PROVIDER: S-EPMC6262761 | biostudies-literature | 2018 Oct

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4177586 | biostudies-literature
| S-EPMC6128939 | biostudies-literature
| S-EPMC6378714 | biostudies-literature
| S-EPMC8508152 | biostudies-literature
| S-EPMC2859052 | biostudies-literature
| S-EPMC6924075 | biostudies-literature
| S-EPMC8730722 | biostudies-literature
| S-EPMC8649637 | biostudies-literature
| S-EPMC6779387 | biostudies-literature