Project description:Non-sewered sanitary systems (NSS) are emerging as one of the solutions to poor sanitation because of the limitations of the conventional flush toilet. These new sanitary systems are expected to safely treat faecal waste and operate without external connections to a sewer, water supply or energy source. The Nano Membrane Toilet (NMT) is a unique domestic-scale sanitary solution currently being developed to treat human waste on-site. This toilet will employ a small-scale gasifier to convert human faeces into products of high energy value. This study investigated the suitability of human faeces as a feedstock for gasification. It quantified the recoverable exergy potential from human faeces and explored the optimal routes for thermal conversion, using a thermodynamic equilibrium model. Fresh human faeces were found to have approximately 70-82 wt.% moisture and 3-6 wt.% ash. Product gas resulting from a typical dry human faeces (0 wt.% moisture) had LHV and exergy values of 17.2 MJ/kg and 24 MJ/kg respectively at optimum equivalence ratio of 0.31, values that are comparable to wood biomass. For suitable conversion of moist faecal samples, near combustion operating conditions are required, if an external energy source is not supplied. This is however at 5% loss in the exergy value of the gas, provided both thermal heat and energy of the gas are recovered. This study shows that the maximum recoverable exergy potential from an average adult moist human faeces can be up to 15 MJ/kg, when the gasifier is operated at optimum equivalence ratio of 0.57, excluding heat losses, distribution or other losses that result from operational activities.
Project description:Specific gene expression regulation strategy using antisense oligonucleotides occupy significant space in recent clinical trials. The therapeutical potential of oligos lies in the identification and prediction of accurate oligonucleotides against specific target mRNA. In this work we present a computational method that is built on Artificial Neural Network (ANN) which could recognize and predict oligonucleotides effectively. In this study first we identified 11 major parameters associated with oligo:mRNA duplex linkage. A feed forward multilayer perceptron ANN classifier is trained with a set of experimentally proven feature vectors. The classifier gives an exact prediction of the input sequences under 2 classes - oligo or non-oligo. On validation, our tool showed comparatively significant accuracy of 92.48% with 91.7% sensitivity and 92.09% specificity. This study was also able to reveal the relative impact of individual parameters we considered on antisense oligonucleotide predictions.
Project description:The quantitative analysis of biochemical reactions and metabolites is at frontier of biological sciences. The recent availability of high-throughput technology data sets in biology has paved the way for new modelling approaches at various levels of complexity including the metabolome of a cell or an organism. Understanding the metabolism of a single cell and multi-cell organism will provide the knowledge for the rational design of growth conditions to produce commercially valuable reagents in biotechnology. Here, we demonstrate how equations representing steady state mass conservation, energy conservation, the second law of thermodynamics, and reversible enzyme kinetics can be formulated as a single system of linear equalities and inequalities, in addition to linear equalities on exponential variables. Even though the feasible set is non-convex, the reformulation is exact and amenable to large-scale numerical analysis, a prerequisite for computationally feasible genome scale modelling. Integrating flux, concentration and kinetic variables in a unified constraint-based formulation is aimed at increasing the quantitative predictive capacity of flux balance analysis. Incorporation of experimental and theoretical bounds on thermodynamic and kinetic variables ensures that the predicted steady state fluxes are both thermodynamically and biochemically feasible. The resulting in silico predictions are tested against fluxomic data for central metabolism in Escherichia coli and compare favourably with in silico prediction by flux balance analysis.
Project description:The quantitative analysis of biochemical reactions and metabolites is at frontier of biological sciences. The recent availability of high-throughput technology data sets in biology has paved the way for new modelling approaches at various levels of complexity including the metabolome of a cell or an organism. Understanding the metabolism of a single cell and multi-cell organism will provide the knowledge for the rational design of growth conditions to produce commercially valuable reagents in biotechnology. Here, we demonstrate how equations representing steady state mass conservation, energy conservation, the second law of thermodynamics, and reversible enzyme kinetics can be formulated as a single system of linear equalities and inequalities, in addition to linear equalities on exponential variables. Even though the feasible set is non-convex, the reformulation is exact and amenable to large-scale numerical analysis, a prerequisite for computationally feasible genome scale modelling. Integrating flux, concentration and kinetic variables in a unified constraint-based formulation is aimed at increasing the quantitative predictive capacity of flux balance analysis. Incorporation of experimental and theoretical bounds on thermodynamic and kinetic variables ensures that the predicted steady state fluxes are both thermodynamically and biochemically feasible. The resulting in silico predictions are tested against fluxomic data for central metabolism in Escherichia coli and compare favourably with in silico prediction by flux balance analysis.
Project description:BackgroundMicroRNAs (miRNAs) are small regulatory RNA that mediate RNA interference by binding to various mRNA target regions. There have been several computational methods for the identification of target mRNAs for miRNAs. However, these have considered all contributory features as scalar representations, primarily, as thermodynamic or sequence-based features. Further, a majority of these methods solely target canonical sites, which are sites with "seed" complementarity. Here, we present a machine-learning classification scheme, titled Avishkar, which captures the spatial profile of miRNA-mRNA interactions via smooth B-spline curves, separately for various input features, such as thermodynamic and sequence features. Further, we use a principled approach to uniformly model canonical and non-canonical seed matches, using a novel seed enrichment metric.ResultsWe demonstrate that large number of seed-match patterns have high enrichment values, conserved across species, and that majority of miRNA binding sites involve non-canonical matches, corroborating recent findings. Using spatial curves and popular categorical features, such as target site length and location, we train a linear SVM model, utilizing experimental CLIP-seq data. Our model significantly outperforms all established methods, for both canonical and non-canonical sites. We achieve this while using a much larger candidate miRNA-mRNA interaction set than prior work.ConclusionsWe have developed an efficient SVM-based model for miRNA target prediction using recent CLIP-seq data, demonstrating superior performance, evaluated using ROC curves, specifically about 20% better than the state-of-the-art, for different species (human or mouse), or different target types (canonical or non-canonical). To the best of our knowledge we provide the first distributed framework for microRNA target prediction based on Apache Hadoop and Spark.AvailabilityAll source code and data is publicly available at https://bitbucket.org/cellsandmachines/avishkar.
Project description:BackgroundThe binding of peptide fragments of extracellular peptides to class II MHC is a crucial event in the adaptive immune response. Each MHC allotype generally binds a distinct subset of peptides and the enormous number of possible peptide epitopes prevents their complete experimental characterization. Computational methods can utilize the limited experimental data to predict the binding affinities of peptides to class II MHC.ResultsWe have developed the Regularized Thermodynamic Average, or RTA, method for predicting the affinities of peptides binding to class II MHC. RTA accounts for all possible peptide binding conformations using a thermodynamic average and includes a parameter constraint for regularization to improve accuracy on novel data. RTA was shown to achieve higher accuracy, as measured by AUC, than SMM-align on the same data for all 17 MHC allotypes examined. RTA also gave the highest accuracy on all but three allotypes when compared with results from 9 different prediction methods applied to the same data. In addition, the method correctly predicted the peptide binding register of 17 out of 18 peptide-MHC complexes. Finally, we found that suboptimal peptide binding registers, which are often ignored in other prediction methods, made significant contributions of at least 50% of the total binding energy for approximately 20% of the peptides.ConclusionsThe RTA method accurately predicts peptide binding affinities to class II MHC and accounts for multiple peptide binding registers while reducing overfitting through regularization. The method has potential applications in vaccine design and in understanding autoimmune disorders. A web server implementing the RTA prediction method is available at http://bordnerlab.org/RTA/.
Project description:Accurate predictions of RNA secondary structures can help uncover the roles of functional non-coding RNAs. Although machine learning-based models have achieved high performance in terms of prediction accuracy, overfitting is a common risk for such highly parameterized models. Here we show that overfitting can be minimized when RNA folding scores learnt using a deep neural network are integrated together with Turner's nearest-neighbor free energy parameters. Training the model with thermodynamic regularization ensures that folding scores and the calculated free energy are as close as possible. In computational experiments designed for newly discovered non-coding RNAs, our algorithm (MXfold2) achieves the most robust and accurate predictions of RNA secondary structures without sacrificing computational efficiency compared to several other algorithms. The results suggest that integrating thermodynamic information could help improve the robustness of deep learning-based predictions of RNA secondary structure.
Project description:We present a simple theory that uses thermodynamic parameters to predict the probability that a protein retains the wild-type structure after one or more random amino acid substitutions. Our theory predicts that for large numbers of substitutions the probability that a protein retains its structure will decline exponentially with the number of substitutions, with the severity of this decline determined by properties of the structure. Our theory also predicts that a protein can gain extra robustness to the first few substitutions by increasing its thermodynamic stability. We validate our theory with simulations on lattice protein models and by showing that it quantitatively predicts previously published experimental measurements on subtilisin and our own measurements on variants of TEM1 beta-lactamase. Our work unifies observations about the clustering of functional proteins in sequence space, and provides a basis for interpreting the response of proteins to substitutions in protein engineering applications.
Project description:Coal gasification fine slag (FS), a kind of by-product of coal chemical industry, was recovered for the preparation of functional adsorbents by acid leaching process, which was orthogonally optimized by HCl, HNO3, HF, HAc, and H2SO4. Methylene blue (MB) was used to evaluate the performance of functional adsorbents. The results demonstrated that 57.6% of the leaching efficiency (RLE) and 162.94 mg/g of adsorption capacity (CAC) of MB were achieved under the optimal conditions of HNO3 of 2.0 mol/L, acid leaching time of 2.0 h, and acid leaching temperature of 293K. The detections on X-ray Diffraction (XRD), Scanning Electron Microscope (SEM), Fourier Transform Infrared Spectroscopy (FTIR), and BET surface area (SBET) indicated that the synthesized functional adsorbents were characterized by mesoporous materials. The good fitting of adsorption process using pseudo-second-order and Langmuir models demonstrated that the chemisorption contributed to MB removal. The results of thermodynamics further revealed that the adsorption process of MB occurred spontaneously due to the exothermic properties. The work is expected to develop a novel and cost-effective strategy for the safe disposal of FS, and potentially offer an alternative pathway to increase the additional value for the coal chemical industry.
Project description:Future genomic evaluation models to be used routinely in breeding programs for pigs and poultry need to be able to optimally use information of crossbred (CB) animals to predict breeding values for CB performance of purebred (PB) selection candidates. Important challenges in the commonly used single-step genomic best linear unbiased prediction (ssGBLUP) model are the definition of relationships between the different line compositions and the definition of the base generation per line. The use of metafounders (MFs) in ssGBLUP has been proposed to overcome these issues. When relationships between lines are known to be different from 0, the use of MFs generalizes the concept of genetic groups relying on the genotype data. Our objective was to investigate the effect of using MFs in genomic prediction for CB performance on estimated variance components, and accuracy and bias of GEBV. This was studied using stochastic simulation to generate data representing a three-way crossbreeding scheme in pigs, with the parental lines being either closely related or unrelated. Results show that using MFs, the variance components should be scaled appropriately, especially when basing them on estimates obtained with, for example a pedigree-based model. The accuracies of GEBV that were obtained using MFs were similar to accuracies without using MFs, regardless whether the lines involved in the CB were closely related or unrelated. The use of MFs resulted in a model that had similar or somewhat better convergence properties compared to other models. We recommend the use of MFs in ssGBLUP for genomic evaluations in crossbreeding schemes.