Proteomics

Dataset Information

0

Codon Optimization Depletes Stop Codons in Alternative Reading Frames of Protein-Coding Nucleic Acid Therapeutics


ABSTRACT: Out-of-frame translation events, arising from ribosomal frameshifting or non-canonical initiation, are an intrinsic property of the translational machinery that cannot be fully prevented. The consequences of such events depend on the distribution of stop codons in alternative reading frames, which determine the permissiveness of those frames: whether out-of-frame translation terminates quickly or generates extended protein products. Here, using quantitative dual-fluorescence reporters, we demonstrate that stop codons in alternative reading frames function as molecular checkpoints that terminate out-of-frame translation and prevent product accumulation. Genome-wide analysis across ten organisms reveals that natural coding sequences maintain dense stop codon distributions in alternative frames, with a median spacing of approximately 20 amino acids. We then show that codon optimization, the standard method for enhancing translation, systematically depletes this safeguard. Because all three stop codons (UAA, UAG, UGA) begin with uridine, and optimal codons exclude uridine from third positions, stop codons in the −1 reading frame become structurally impossible in codon-optimized sequences. Analysis of 120 therapeutic sequences, including FDA-approved COVID-19 mRNA vaccines, confirms widespread −1 frame stop codon depletion: out-of-frame products average 164 amino acids, six-fold longer than in natural human genes. We demonstrate that strategic restoration of stop codons in alternative reading frames through synonymous substitutions eliminates detectable out-of-frame products by mass spectrometry, while preserving the intended protein. This approach requires only informed sequence design, with no changes to manufacturing, or regulatory framework. Our findings establish stop codon distribution in alternative reading frames as a critical design parameter for protein-coding nucleic acid therapeutics.

ORGANISM(S): Homo Sapiens

SUBMITTER: Weirui Ma  

PROVIDER: PXD074868 | iProX | Tue Feb 24 00:00:00 GMT 2026

REPOSITORIES: iProX

Similar Datasets

2026-01-18 | PXD073231 |
2016-04-12 | GSE80156 | GEO
2025-07-31 | GSE303542 | GEO
2023-03-01 | GSE184825 | GEO
2026-02-26 | GSE298635 | GEO
2026-02-26 | GSE298634 | GEO
2021-04-28 | GSE154491 | GEO
2023-12-31 | GSE245041 | GEO
2017-04-13 | GSE97717 | GEO
2019-10-18 | GSE138599 | GEO