Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Identification of microproteins in Mycobacterium smegmatis by proteogenomics

ABSTRACT: We employed a proteogenomics workflow to identify microproteins encoded by small Open Reading Frames (ORFs) in the genome of Mycobacterium smegmatis strain mc²155.

ORGANISM(S): Mycolicibacterium Smegmatis Mc2 155

SUBMITTER: Pedro Ferrari Dalberto

PROVIDER: PXD025604 | JPOST Repository | Thu Apr 21 00:00:00 BST 2022

REPOSITORIES: jPOST

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
	20200901_ASPD-1.raw	Raw
	20200901_ASPD-2.raw	Raw
	20200901_ASPD-3.raw	Raw
	20200901_ASPD-AA-1.raw	Raw
	20200901_ASPD-AA-2.raw	Raw

Items per page:

1 - 5 of 20

Similar Datasets

MS-based identification of small proteins in Campylobacter jejuni

Project description:The Gram-negative bacterium Campylobacter jejuni is a widespread food-borne pathogen. Knowing the proteins encoded by the bacterial pathogen and how their expression is regulated is essential to understand how it survives, colonizes, and causes diseases. The present study focusses on small proteins (≤ 50-100 amino acids) translated from small open reading frames (sORFs). These poorly annotated components of the genome show emerging roles in bacterial physiology and virulence. Here, the proteome of C. jejuni during exponential growth in a complex medium was analyzed with the proteogenomics workflow presented in Fuchs et al., 2021 with the special focus on small proteins . In combination with different Ribo-Seq approaches new insights into the coding potential of the genome of C. jejuni are provided.

2024-05-23 | PXD036790 | Pride

Identification and quantification of small proteins and peptides in Staphylococcus aureus

Project description:Very small proteins with less than 100 amino acids (SP100) of Staphylococcus aureuse are underrepresented in proteomic analyses so far. However, in the last few years a variety of such small proteins with regulatory and virulence associated functions have been detected in several bacteria. The introduction of a new open source command line tool (Pepper) that provides a fully automated proteogenomic workflow enabled us to identify proteins encoded by non-annotated open reading frames based on identified peptides. Altogether, 185 soluble proteins with up to 100 amino acids have been detected of which 69 were not covered by the used gene annotation. Of these, 83 % were identified by at least two methods.

2021-05-26 | PXD017932 | Pride

Identification of small open reading frame encoded peptides of Thermosynechococcus vestitus BP-1(2133)

Project description:Identification of small open reading frame encoded peptides of Thermosynechococcus vestitus BP-1(2133)

2025-01-09 | PXD059620 |

Identification of small open reading frame encoded peptides of Nostoc sp. PCC 7120 (7120)

Project description:Identification of small open reading frame encoded peptides of Nostoc sp. PCC 7120 (7120)

2025-01-09 | PXD059618 |

Identification of small open reading frame encoded peptides of Synechocystis sp. PCC 6803 (6803)

Project description:Identification of small open reading frame encoded peptides of Synechocystis sp. PCC 6803 (6803)

2025-01-08 | PXD059576 |

Identification of small open reading frame encoded peptides of Picosynechococcus sp. PCC 7002 (7002)

Project description:Identification of small open reading frame encoded peptides of Picosynechococcus sp. PCC 7002 (7002)

2025-01-10 | PXD059623 |

Large-scale proteogenomics characterization of the Mycobacterium tuberculosis hidden proteome

Project description:Traditional genome annotation methods exclude Open Reading Frames shorter than 300 codons (smORFs), which leaves a substantial portion of the proteome overlooked. Proteogenomics is a multi-omics approach that merges genomics, transcriptomics and proteomics to identify proteoforms and unannotated proteins from Mass Spectrometry data. Here, we employed our recently developed proteogenomics pipeline to aid genome annotation and identify hundreds of novel microproteins encoded by smORFs in the genome of Mycobacterium tuberculosis (Mtb). To avoid limitations regarding sensitivity, we used 680 Mass Spectrometry experiments in a large-scale approach, which let us classify the findings by different degrees of confidence using our machine learning model. After integrating the results with RNA-Seq datasets, we explore the biological relevance of the novel sequences and show they are differentially expressed upon starvation and antibiotic treatment, and are co-expressed with many annotated genes that are vital for bacterial virulence. Moreover, some smORFs are located inside essential genomic segments and could be attractive targets for the development of new drugs. Altogether, our results should improve the current annotation of the proteome of Mtb and guide the following studies focusing on studying these microproteins thoroughly.

2025-05-06 | PXD042958 | Pride

Identification of small open reading frame encoded peptides of Synechococcus elongatus PCC 7942 = FACHB-805

Project description:identification of small open reading frame encoded peptides of Synechococcus elongatus PCC 7942 = FACHB-805

2025-01-08 | PXD059575 |

identification of small open reading frame encoded peptides of several cyanobacteria (2133, 6803, 7002, 7120, 7942)

Project description:Identification of small open reading frame encoded peptides of several cyanobacteria (2133, 6803, 7002, 7120, 7942)

2025-02-05 | PXD060504 |

Systematic identification of small Open reading frames-encoded peptides during the life cycle of Drosophila melanogaster

Project description:SEPs (Small open reading frames-encoded peptides) identified from 11 time-points, which were chosen during drosophila’s complete life cycle.

2021-04-08 | PXD025249 |

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data