Project description:Purpose: The goal of this study is to compare RNA-seq libraries of wildtype Caulobacter crescentus with two lon deletion strains (delta lon and delta lon clpX*). Methods: See Methods section of "Plasticity in AAA+ proteases reveals substrate specificity niches" for information regarding methods or contact lead correspondence. Briefly, Samples for RNAseq were extracted from wt and lon deletion strains grown to stationary phase. Conclusions: Our study represents the first detailed analysis of lon deletions (delta lon and delta lon clpX*) comparison to wt caulobacter transcriptomes, with biologic replicates, generated by RNA-seq technology in stationary phas
Project description:We present a method that employs two genetically encoded substrate phage display libraries coupled with next generation sequencing (SPD-NGS) that allows up to 10,000-fold deeper sequence coverage of the typical 6 to 8 residue protease cleavage sites compared to state-of-the-art synthetic peptide libraries or proteomics. We applied SPD-NGS to two classes of proteases, the intracellular caspases 2, 3, 6, 7 and 8, and the ectodomains of the membrane sheddases, ADAMs 10 and 17. The first library (Lib 10AA) was used to determine substrate cleavage motifs. Lib 10AA contains a highly diverse randomized 10-mer substrate peptide sequences (10^9 unique members) that was displayed mono-valently on filamentous phage and bound to magnetic beads via an N-terminal biotin. The protease was allowed to cleave the SPD beads, and the released phage subjected to up to three total rounds of positive selection followed by next generation sequencing (NGS). This allowed us to identify from 10^4 to 10^5 unique cleavage sites over a broad dynamic range of NGS counts (ranging from 3-5000), and produced consensus and optimal cleavage motifs based positional sequencing scoring matrices and state-of the-art machine learning algorithm that closely matched synthetic peptide data. A second SPD-NGS library (Lib hP) was constructed that allowed us to identify candidate human proteome sequences. Lib hP displayed virtually the entire human proteome tiled in contiguous 49AA sequences with 25AA overlaps (nearly 1 million members). After three rounds of positive selection we identified up to 10^4 natural linear cut sites depending on the protease and captured most of the examples previously identified by proteomics (ranging from 30 to 1000) and predicted 10 to 100-fold more.
Project description:The Saccharomyces cerevisiae TRAMP4 and TRAMP5 complexes, which consist of the poly(A) polymerase Trf4 or Trf5, respectively, the zinc knuckle proteins Air1 or Air2, and the RNA helicase Mtr4, play a critical role in nuclear RNA surveillance. Although it is known to enhance the nuclease activity of the exosome, relatively little is known about the exact mechanism and specificity of TRAMP. To better define the specificities of the TRAMP complexes, we used phenotypic analysis and RNA deep-sequencing technology to measure differences in global RNA polyadenylation in air mutants, revealing specific requirements for each Air protein in the regulation of the levels of non-coding and coding RNAs. These findings reveal differential functions for Air proteins in eukaryotic RNA metabolism and indicate that they control the substrate specificity of the RNA exosome. Poly(A)+ RNA from WT, rrp6-M-NM-^T, air1-M-NM-^T rrp6-M-NM-^T and air2-M-NM-^T rrp6-M-NM-^T was sequenced using ABI SOLiD platform, in duplicate.
Project description:Calpains are intracellular Ca2+-regulated cysteine proteases that are essential for various cellular functions. Mammalian conventional calpains (calpain-1 and calpain-2) modulate the structure and function of their substrates by limited proteolysis; however, their substrate specificity remains unclear because the amino acid (aa) sequences around their cleavage sites are very diverse. To clarify calpains’ substrate specificities, 84 20-mer oligopeptides, corresponding to P10-P10’ of reported cleavage site sequences, were proteolyzed by calpains, and the catalytic efficiencies (kcat/Km) were globally determined by LC/MS. This analysis revealed 483 cleavage site sequences, including 360 novel ones. The kcat/Kms for 119 sites ranged from 12.5~1,710 M-1s-1. Most sites were cleaved by both calpain-1 and -2 with a similar kcat/Km. The aa compositions of the novel sites were not significantly different from the 420 previously reported sites, suggesting calpains have a strict implicit rule for sequence specificity, and that the limited proteolysis of intact substrates is due to the substrates’ higher-order structures. Cleavage position frequencies indicated that longer sequences N-terminal to the cleavage site (P-sites) than C-terminal (P’-sites) were preferred for proteolysis. Quantitative structure-activity relationship (QSAR) analyses using partial least-squares regression and >1,300 aa descriptors achieved kcat/Km prediction with r=0.834, and binary-QSAR modeling attained 64.8% prediction accuracy for 132 reported calpain cleavage sites independent of our model construction. These results outperformed previous calpain cleavage predictors, and revealed the importance of the P2, P3’, P4’, and P1-P2 contexts. This study increases our understanding of calpain substrate specificities, and opens calpains to “next-generation,” i.e., activity-related quantitative and context-dependent analyses.