Project description:Viruses cause significant yield and quality losses in a wide variety of cultivated crops. Hence, the detection and identification of viruses is a crucial facet of successful crop production and of great significance in terms of world food security. Whilst the adoption of molecular techniques such as RT-PCR has increased the speed and accuracy of viral diagnostics, such techniques only allow the detection of known viruses, i.e., each test is specific to one or a small number of related viruses. Therefore, unknown viruses can be missed and testing can be slow and expensive if molecular tests are unavailable. Methods for simultaneous detection of multiple viruses have been developed, and (NGS) is now a principal focus of this area, as it enables unbiased and hypothesis-free testing of plant samples. The development of NGS protocols capable of detecting multiple known and emergent viruses present in infected material is proving to be a major advance for crops, nuclear stocks or imported plants and germplasm, in which disease symptoms are absent, unspecific or only triggered by multiple viruses. Researchers want to answer the question "how many different viruses are present in this crop plant?" without knowing what they are looking for: RNA-sequencing (RNA-seq) of plant material allows this question to be addressed. As well as needing efficient nucleic acid extraction and enrichment protocols, virus detection using RNA-seq requires fast and robust bioinformatics methods to enable host sequence removal and virus classification. In this review recent studies that use RNA-seq for virus detection in a variety of crop plants are discussed with specific emphasis on the computational methods implemented. The main features of a number of specific bioinformatics workflows developed for virus detection from NGS data are also outlined and possible reasons why these have not yet been widely adopted are discussed. The review concludes by discussing the future directions of this field, including the use of bioinformatics tools for virus detection deployed in analytical environments using cloud computing.
Project description:Novel coronavirus(SARS-CoV2) replicates the host cell's genome by interacting with the host proteins. Due to this fact, the identification of virus and host protein-protein interactions could be beneficial in understanding the disease transmission behavior of the virus as well as in potential COVID-19 drug identification. International Committee on Taxonomy of Viruses (ICTV) has declared that nCoV is highly genetically similar to the SARS-CoV epidemic in 2003 (∼89% similarity). With this hypothesis, the present work focuses on developing a computational model for the nCoV-Human protein interaction network, using the experimentally validated SARS-CoV-Human protein interactions. Initially, level-1 and level-2 human spreader proteins are identified in the SARS-CoV-Human interaction network, using Susceptible-Infected-Susceptible (SIS) model. These proteins are considered potential human targets for nCoV bait proteins. A gene-ontology-based fuzzy affinity function has been used to construct the nCoV-Human protein interaction network at a ∼99.98% specificity threshold. This also identifies 37 level-1 human spreaders for COVID-19 in the human protein-interaction network. 2474 level-2 human spreaders are subsequently identified using the SIS model. The derived host-pathogen interaction network is finally validated using six potential FDA-listed drugs for COVID-19 with significant overlap between the known drug target proteins and the identified spreader proteins.
Project description:RNA-binding proteins are functionally diverse within cells, being involved in RNA-metabolism, translation, DNA damage repair, and gene regulation at both the transcriptional and post-transcriptional levels. Much has been learnt about their interactions with RNAs through structure determination techniques and computational modeling. This review gives an overview of the structural data currently available for protein-RNA complexes, and discusses the technical issues facing structural biologists working to solve their structures. The review focuses on three techniques used to solve the 3-dimensional structure of protein-RNA complexes at atomic resolution, namely X-ray crystallography, solution nuclear magnetic resonance (NMR) and cryo-electron microscopy (cryo-EM). The review then focuses on the main computational modeling techniques that use these atomic resolution data: discussing the prediction of RNA-binding sites on unbound proteins, docking proteins, and RNAs, and modeling the molecular dynamics of the systems. In conclusion, the review looks at the future directions this field of research might take.
Project description:Protein kinases represent a family of enzymes that are critical in cell signaling. One mechanism by which protein kinases are regulated is via autophosphorylation. In the studies described here, we have examined the mechanism of autophosphosphorylation at serine 338 in the regulation of protein kinase A (PKA). Expressed protein ligation allowed for the covalent linkage of an ATP moiety to a Ser mimic at this phosphorylation site. Using a combination of size exclusion chromatography, fluorescence nucleotide binding, kinase measurements, and limited proteolysis assays on this semisynthetic ATP-linked protein, we have obtained unique evidence for an intramolecular autophosphorylation mechanism in PKA regulation. Computational analysis provided a plausible model for a PKA conformation consistent with intramolecular phosphoryl transfer. This approach could be applied to other autoprocessing enzymes by exploiting appropriate transition state analogue motifs in the context of protein semisynthesis.
Project description:The determination of membrane protein (MP) structures has always trailed that of soluble proteins due to difficulties in their overexpression, reconstitution into membrane mimetics, and subsequent structure determination. The percentage of MP structures in the protein databank (PDB) has been at a constant 1-2% for the last decade. In contrast, over half of all drugs target MPs, only highlighting how little we understand about drug-specific effects in the human body. To reduce this gap, researchers have attempted to predict structural features of MPs even before the first structure was experimentally elucidated. In this review, we present current computational methods to predict MP structure, starting with secondary structure prediction, prediction of trans-membrane spans, and topology. Even though these methods generate reliable predictions, challenges such as predicting kinks or precise beginnings and ends of secondary structure elements are still waiting to be addressed. We describe recent developments in the prediction of 3D structures of both ?-helical MPs as well as ?-barrels using comparative modeling techniques, de novo methods, and molecular dynamics (MD) simulations. The increase of MP structures has (1) facilitated comparative modeling due to availability of more and better templates, and (2) improved the statistics for knowledge-based scoring functions. Moreover, de novo methods have benefited from the use of correlated mutations as restraints. Finally, we outline current advances that will likely shape the field in the forthcoming decade.
Project description:The enormous number of redundant sequenced genomes has hindered efforts to analyze and functionally annotate proteins. As the taxonomy of viruses is not uniformly defined, viral proteomes pose special challenges in this regard. Grouping viruses based on the similarity of their proteins at proteome scale can normalize against potential taxonomic nomenclature anomalies.We present Viral Reference Proteomes (Viral RPs), which are computed from complete virus proteomes within UniProtKB. Viral RPs based on 95, 75, 55, 35 and 15% co-membership in proteome similarity based clusters are provided. Comparison of our computational Viral RPs with UniProt's curator-selected Reference Proteomes indicates that the two sets are consistent and complementary. Furthermore, each Viral RP represents a cluster of virus proteomes that was consistent with virus or host taxonomy. We provide BLASTP search and FTP download of Viral RP protein sequences, and a browser to facilitate the visualization of Viral RPs.http://proteininformationresource.org/rps/viruses/chenc@udel.eduSupplementary data are available at Bioinformatics online.
Project description:Viral respiratory infections (VRIs) cause seasonal epidemics and pandemics, with their transmission influenced by climate conditions. Despite the risks posed by novel VRIs, the relationships between climate change and VRIs remain poorly understood. In this review, we synthesized existing literature to explore the connections between changes in meteorological conditions, extreme weather events, long-term climate warming, and seasonal outbreaks, epidemics, and pandemics of VRIs from an interdisciplinary perspective. We proposed a comprehensive conceptual framework highlighting the potential biological, socioeconomic, and ecological mechanisms underlying the impact of climate change on VRIs. Our findings suggested that climate change increases the risk of VRI emergence and transmission by affecting the biology of viruses, host susceptibility, human behavior, and environmental conditions of both society and ecosystems. Further interdisciplinary research is needed to address the dual challenge of climate change and pandemics.
Project description:We review recent theoretical and algorithmic advances for the modeling of protein ligand binding free energies. We first describe a statistical mechanics theory of noncovalent association, with particular focus on deriving the fundamental formulas on which computational methods are based. The second part reviews the main computational models and algorithms in current use or development, pointing out the relations with each other and with the theory developed in the first part. Particular emphasis is given to the modeling of conformational reorganization and entropic effect. The methods reviewed are free energy perturbation, double decoupling, the Binding Energy Distribution Analysis Method, the potential of mean force method, mining minima and MM/PBSA. These models have different features and limitations, and their ranges of applicability vary correspondingly. Yet their origins can all be traced back to a single fundamental theory.
Project description:The molecular chaperone Hsp70 plays a central role in the import of cytoplasmic proteins into organelles, driving their translocation by binding them from the organellar interior. Starting from the experimentally-determined structure of the E. coli Hsp70, we computed, by means of molecular simulations, the effective free-energy profile for substrate translocation upon chaperone binding. We then used the resulting free energy to quantitatively characterize the kinetics of the import process, whose comparison with unassisted translocation highlights the essential role played by Hsp70 in importing cytoplasmic proteins.
Project description:The Hippo signaling pathway, which is implicated in the regulation of organ size, has emerged as a potential target for the development of cancer therapeutics. YAP, TAZ (transcription co-activators) and TEAD (transcription factor) are the downstream transcriptional machinery and effectors of the pathway. Formation of the YAP/TAZ-TEAD complex leads to transcription of growth-promoting genes. Conversely, disrupting the interactions of the complex decreases cell proliferation. Herein, we screened a 1000-member fragment library using Thermal Shift Assay and identified a hit fragment. We confirmed its binding at the YAP/TAZ-TEAD interface by X-ray crystallography, and showed that it occupies the same hydrophobic pocket as a conserved phenylalanine of YAP/TAZ. This hit fragment serves as a scaffold for the development of compounds that have the potential to disrupt YAP/TAZ-TEAD interactions. Structure-activity relationship studies and computational modeling were also carried out to identify more potent compounds that may bind at this validated druggable binding site.