Untargeted Mass Spectrometry Metabolomic Profiles of iPSC-derived Dopaminergic Neurons from Clinically Discordant Brothers with Identical PRKN Deletions
Project description:Metabolomics is the science of characterizing and quantifying small molecule metabolites in biological systems. These metabolites give organisms their biochemical characteristics, providing a link between genotype, environment, and phenotype. With these opportunities also come data challenges, such as compound annotation, missing values, and batch effects. We present the steps of a general pipeline to process untargeted mass spectrometry data to alleviate the latter two challenges. We assume to have a matrix with metabolite abundances, with metabolites in rows and samples in columns. The steps in the pipeline include summarizing technical replicates (if available), filtering, imputing, transforming, and normalizing the data. In each of these steps, a method and parameters should be chosen based on assumptions one is willing to make, the question of interest, and diagnostic tools. Besides giving a general pipeline that can be adapted by the reader, our goal is to review diagnostic tools and criteria that are helpful when making decisions in each step of the pipeline and assessing the effectiveness of normalization and batch correction. We conclude by giving a list of useful packages and discuss some alternative approaches that might be more appropriate for the reader's data.
Project description:Matrix-assisted laser desorption/ionization mass spectrometry imaging allows for the study of metabolic activity in the tumor microenvironment of brain cancers. The detectable metabolites within these tumors are contingent upon the choice of matrix, deposition technique, and polarity setting. In this study, we compared the performance of three different matrices, two deposition techniques, and the use of positive and negative polarity in two different brain cancer types and across two species. Optimal combinations were confirmed by a comparative analysis of lipid and small-molecule abundance by using liquid chromatography-mass spectrometry and RNA sequencing to assess differential metabolites and enzymes between normal and tumor regions. Our findings indicate that in the tumor-bearing brain, the recrystallized α-cyano-4-hydroxycinnamic acid matrix with positive polarity offered superior performance for both detected metabolites and consistency with other techniques. Beyond these implications for brain cancer, our work establishes a workflow to identify optimal matrices for spatial metabolomics studies.
Project description:The diverse characteristics and large number of entities make metabolite separation challenging in metabolomics. To date, there is not a singular instrument capable of analyzing all types of metabolites. In order to achieve a better separation for higher peak capacity and accurate metabolite identification and quantification, we integrated GC × GC-MS and parallel 2DLC-MS for analysis of polar metabolites. To test the performance of the developed system, 13 rats were fed different diets to form two animal groups. Polar metabolites extracted from rat livers were analyzed by GC × GC-MS, parallel 2DLC-MS (-) and parallel 2DLC-MS (+), respectively. By integrating all data together, 58 metabolites were detected with significant change in their abundance levels between groups (p≤ 0.05). Of the 58 metabolites, three metabolites were detected in two platforms and two in all three platforms. Manual examination showed that discrepancy of metabolite regulation measured by different platforms was mainly caused by the poor shape of chromatographic peaks resulting from low instrument response. Pathway analysis demonstrated that integrating the results from multiple platforms increased the confidence of metabolic pathway assignment.
Project description:The broad coverage of untargeted metabolomics poses fundamental challenges for the harmonization of measurements along time, even if they originate from the very same instrument. Internal isotopic standards can hardly cover the chemical complexity of study samples. Therefore, they are insufficient for normalizing data a posteriori as done for targeted metabolomics. Instead, it is crucial to verify instrument's performance a priori, that is, before samples are injected. Here, we propose a system suitability testing platform for time-of-flight mass spectrometers independent of liquid chromatography. It includes a chemically defined quality control mixture, a fast acquisition method, software for extracting ca. 3,000 numerical features from profile data, and a simple web service for monitoring. We ran a pilot for 21 months and present illustrative results for anomaly detection or learning causal relationships between the spectral features and machine settings. Beyond mere detection of anomalies, our results highlight several future applications such as 1) recommending instrument retuning strategies to achieve desired values of quality indicators, 2) driving preventive maintenance, and 3) using the obtained, detailed spectral features for posterior data harmonization.
Project description:Many solutes have been reported to remain at higher plasma levels relative to normal than the standard index solute urea in hemodialysis patients. Untargeted mass spectrometry was employed to compare solute levels in plasma and plasma ultrafiltrate of hemodialysis patients and normal subjects. Quantitative assays were employed to check the accuracy of untargeted results for selected solutes and additional measurements were made in dialysate and urine to estimate solute clearances and production. Comparison of peak areas indicated that many solutes accumulated to high levels in hemodialysis patients, with average peak areas in plasma ultrafiltrate of dialysis patients being more than 100 times greater than those in normals for 123 features. Most of these mass spectrometric features were identified only by their mass values. Untargeted analysis correctly ranked the accumulation of 5 solutes which were quantitatively assayed but tended to overestimate its extent. Mathematical modeling showed that the elevation of plasma levels for these solutes could be accounted for by a low dialytic to native kidney clearance ratio and a high dialytic clearance relative to the volume of the accessible compartment. Numerous solutes accumulate to high levels in hemodialysis patients because dialysis does not replicate the clearance provided by the native kidney. Many of these solutes remain to be chemically identified and their pathogenic potential elucidated.
Project description:Mapping the chemical space of compounds to chemical structures remains a challenge in metabolomics. Despite the advancements in untargeted liquid chromatography-mass spectrometry (LC-MS) to achieve a high-throughput profile of metabolites from complex biological resources, only a small fraction of these metabolites can be annotated with confidence. Many novel computational methods and tools have been developed to enable chemical structure annotation to known and unknown compounds such as in silico generated spectra and molecular networking. Here, we present an automated and reproducible Metabolome Annotation Workflow (MAW) for untargeted metabolomics data to further facilitate and automate the complex annotation by combining tandem mass spectrometry (MS2) input data pre-processing, spectral and compound database matching with computational classification, and in silico annotation. MAW takes the LC-MS2 spectra as input and generates a list of putative candidates from spectral and compound databases. The databases are integrated via the R package Spectra and the metabolite annotation tool SIRIUS as part of the R segment of the workflow (MAW-R). The final candidate selection is performed using the cheminformatics tool RDKit in the Python segment (MAW-Py). Furthermore, each feature is assigned a chemical structure and can be imported to a chemical structure similarity network. MAW is following the FAIR (Findable, Accessible, Interoperable, Reusable) principles and has been made available as the docker images, maw-r and maw-py. The source code and documentation are available on GitHub ( https://github.com/zmahnoor14/MAW ). The performance of MAW is evaluated on two case studies. MAW can improve candidate ranking by integrating spectral databases with annotation tools like SIRIUS which contributes to an efficient candidate selection procedure. The results from MAW are also reproducible and traceable, compliant with the FAIR guidelines. Taken together, MAW could greatly facilitate automated metabolite characterization in diverse fields such as clinical metabolomics and natural product discovery.
Project description:Cyanobacteria are notorious for their potential to produce hepatotoxic microcystins (MCs), but other bioactive compounds synthesized in the cells could be as toxic, and thus present interest for characterization. Ultra performance liquid chromatography and high-resolution accurate mass spectrometry (UPLC-QTOF-MS/MS) combined with untargeted analysis was used to compare the metabolomes of five different strains of the common bloom-forming cyanobacterium, Microcystis aeruginosa. Even in microcystin-producing strains, other classes of oligopeptides including cyanopeptolins, aeruginosins, and aerucyclamides, were often the more dominant compounds. The distinct and large variation between strains of the same widespread species highlights the need to characterize the metabolome of a larger number of cyanobacteria, especially as several metabolites other than microcystins can affect ecological and human health.
Project description:Methanococcus maripaludis is a methanogenic archaeon. Within its genome, there are two operons for membrane associated hydrogenases, eha and ehb. To investigate the regulation of ehb on the cell, an S40 mutant was constructed in such a way that a portion of the ehb operon was replaced by pac cassette in the wild type parental strain S2 (done by Whitman's group at the University of Georgia). The S40 and S2 strains were grown in 14N and 15N media with acetate separately. A biological replicate was made by switching the media. Mass spectrometry based quantitative proteomics were done on the mixtures to investigate the differences in expression patterns between S40 and S2. Keywords: isotope labeling mass spectrometry, quantitative proteomics
Project description:Neuropeptides are a chemically diverse class of cell-to-cell signaling molecules that are widely expressed throughout the central nervous system, often in a cell-specific manner. While cell-to-cell differences in neuropeptides is expected, it is often unclear how exactly neuropeptide expression varies among neurons. Here we created a microscopy-guided, high-throughput single cell matrix-assisted laser desorption/ionization mass spectrometry approach to investigate the neuropeptide heterogeneity of individual neurons in the central nervous system of the neurobiological model Aplysia californica, the California sea hare. In all, we analyzed more than 26,000 neurons from 18 animals and assigned 866 peptides from 66 prohormones by mass matching against an in silico peptide library generated from known Aplysia prohormones retrieved from the UniProt database. Louvain-Jaccard (LJ) clustering of mass spectra from individual neurons revealed 40 unique neuronal populations, or LJ clusters, each with a distinct neuropeptide profile. Prohormones and their related peptides were generally found in single cells from ganglia consistent with the prohormones' previously known ganglion localizations. Several LJ clusters also revealed the cellular colocalization of behaviorally related prohormones, such as an LJ cluster exhibiting achatin and neuropeptide Y, which are involved in feeding, and another cluster characterized by urotensin II, small cardiac peptide, sensorin A, and FRFa, which have shown activity in the feeding network or are present in the feeding musculature. This mass spectrometry-based approach enables the robust categorization of large cell populations based on single cell neuropeptide content and is readily adaptable to the study of a range of animals and tissue types.
Project description:Differential mobility spectrometry (DMS) has been gaining popularity in small molecule analysis over the last few years due to its selectivity towards a variety of isomeric compounds. While DMS has been utilized in targeted liquid chromatography-mass spectrometry (LC-MS), its use in untargeted discovery workflows has not been systematically explored. In this contribution, we propose a novel workflow for untargeted metabolomics based solely on DMS separation in a clinically relevant chronic kidney disease (CKD) patient population. We analyzed ten plasma samples from early- and late-stage CKD patients. Peak finding, alignment, and filtering steps performed on the DMS-MS data yielded a list of 881 metabolic features (unique mass-to-charge and migration time combinations). Differential analysis by CKD patient group revealed three main features of interest. One of them was putatively identified as bilirubin based on high-accuracy MS data and comparison of its optimum compensation voltage (COV) with that of an authentic standard. The DMS-MS analysis was four times faster than a typical HPLC-MS run, which suggests a potential for the utilization of this technique in screening studies. However, its lower separation efficiency and reduced signal intensity make it less suitable for low-abundant features. Fewer features were detected by the DMS-based platform compared with an HPLC-MS-based approach, but importantly, the two approaches resulted in different features. This indicates a high degree of orthogonality between HPLC- and DMS-based approaches and demonstrates the need for larger studies comparing the two techniques. The workflow described here can be adapted for other areas of metabolomics and has a value as a prescreening method to develop semi-targeted workflows and as a faster alternative to HPLC in large biomedical studies.