Unknown

Dataset Information

0

ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes.


ABSTRACT:

Motivation

The annotation of small open reading frames (smORFs) of <100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome.

Results

In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smORFs and classifies them according to their position within transcripts. We identified a total of 5744 unique smORFs in datasets from mouse B and T lymphocytes and systematically characterized them using ORFLine. We further searched smORFs for the presence of a signal peptide, which predicted known secreted chemokines as well as novel micropeptides. Four novel micropeptides show evidence of secretion and are therefore candidate mediators of immunoregulatory functions.

Availability and implementation

Freely available on the web at https://github.com/boboppie/ORFLine.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Hu F 

PROVIDER: S-EPMC8504629 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes.

Hu Fengyuan F   Lu Jia J   Matheson Louise S LS   Díaz-Muñoz Manuel D MD   Saveliev Alexander A   Xu Jinbo J   Turner Martin M  

Bioinformatics (Oxford, England) 20211001 19


<h4>Motivation</h4>The annotation of small open reading frames (smORFs) of <100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome.<h4>Results</h4>In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smORFs and classifies them according to their position within transcripts. We identified a total of 5744 unique smORFs in datasets from mouse B and T lymphocytes and systematically characterized t  ...[more]

Similar Datasets

2021-04-28 | GSE154491 | GEO
| PRJNA646476 | ENA
2019-07-03 | GSE125218 | GEO
2014-09-11 | E-GEOD-60384 | biostudies-arrayexpress
| S-EPMC10152738 | biostudies-literature
| S-EPMC3334604 | biostudies-literature
| S-EPMC7085969 | biostudies-literature
2014-09-11 | GSE60384 | GEO
2020-03-14 | GSE131650 | GEO
| S-EPMC7856248 | biostudies-literature