Project description:Purpose: A method for mapping chromatin accessibility genome-wide, to reveal chromatin accessibility in Intestinal stem cells. Methods: Intestinal stem cells(Lgr5-high cells) were sorted by flow cytometry from wild type mice. The samples were prepared in duplicate. HISAT2 was used to align the sequences to the mouse genome and generate bam files. bamCoverage was used to generate bigwig files from bam files. MACS2 (v2.2.5) was used for peak calling and to generate bed files from aligned reads. Conclusions: ATAC-seq analysis confirmed that Fosb binding sites in Chip-seq assay were correlated with the chromatin accessibility .
Project description:Purpose: A method for identifying genome-wide DNA binding sites for Fosb. Methods: Alive cells were sorted from retro-Fosb-OE(over-expression) organoids. The samples were incubated with anti-Fosb antibody (Abcam, ab184938). Purified DNA was subjected to Tru-seq library construction using NEBNext Ultra II DNA Library Prep Kit and sequenced as paired-end with Illumina Novaseq 6000. HISAT2 was used to align the sequences to the mouse genome and generate bam files. bamCoverage (CPM normalized and extended reads) was used to generate bigwig files from bam files. MACS2 was used for peak calling and to generate bed files from aligned reads. HOMER annotatePeaks.pl was used to annotate the peaks. Conclusions: Target genes of Fosb through ChIP assay were consistent with predicted target genes. Thus, we concluded that Fosb, which is a key TF could regulate most of ISC signature genes to maintain Lgr5+ ISCs.
Project description:Combinatorial promoter expression level estimation via cell sorting The purpose of this experiment was to determine the expression level of a library of synthetic promoters. The promoters were cloned in front of a GFP reporter and the resulting library transformed into yeast, sorted by FACS into six fluorescence bins, and the contents of the bins sequenced to determine the distribution of each promoter among each fluorescence bin. This was then used to calculate an expression level for each promoter with enough data. The promoters were sorted into six bins and these, along with the unsorted library were barcoded and sequenced on a single lane of an Illumina HiSeq. The following Series supplementary files are provided: allPromoters.fsa.txt: the sequences of the promoters corresponding to the names in allPromoters.txt, not actually fasta format. Promoter sequence starts at pos 155 (0 indexed). allPromoters.txt: the names of all the promoters, corresponding to the sequences in allPromoters.fsa.txt barcodes.txt: the sequncing barcodes, corresponding to read2 from the sequencing files.
Project description:Generation of a new library of targeted mass spectrometry assays for accurate protein quantification in triple negative breast cancer (TNBC) tissues. Primary tumor tissue lysates from 105 TNBC patients treated at Masaryk Memorial Cancer Institute (MMCI) in Brno, Czech Republic, were used to generate the spectral library. This project covers raw files from data-dependent acquisition (DDA) – parallel accumulation-serial fragmentation (PASEF) measurements of 12 hydrophilic chromatography (HILIC) fractions of aliquot pool from complete set of 105 samples measured on timsTOF Pro; raw files of 16 individual samples measured in data-independent acquisition (DIA) – PASEF mode and used for hybrid library generation and for demonstrative quantitative DIA data extraction; Pulsar archive generated in Spectronaut 16.0 from 12 DDA-PASEF measurements of HILIC fractions and from 16 data-independent acquisition DIA-PASEF measurements of individual samples. The 16 DIA-PASEF runs of individual samples used for library generation were analyzed using newest versions of Spectronaut (version 18.5) and DIA-NN (version 1.8.1) software tools in library-based setting using the newly generated library as well as in library-free setting showing library-based method to outperform the use of predicted libraries in the terms of identification numbers.
Project description:The submitted dataset contains raw files from 96 synthetic peptide libraries, using either HCD or ETD as fragmentation technique. The synthesized 96 tryptic peptide libraries containing >100,000 unmodified peptides plus their corresponding >100,000 phosphorylated counterparts with precisely known sequences and modification sites. All these libraries were subjected to LC-MS/MS on an Orbitrap mass spectrometer using HCD and ETD fragmentation. The generated mass spectrometric data deposited in this database can be used in numerous ways to develop, evaluate and improve experimental and computational proteomic strategies. Raw MS data files were converted into Mascot generic format files (MGF) using Mascot Distiller (2.4.2.0, www.matrixscience.com). Important parameters included: i) signal to noise ratio of 20 for MS/MS and ii) time domain off (no merging of spectra of the same precursor). The MGF files were searched against human IPI v3.72 including the sequences of all 96 libraries,using the Mascot search engine (2.3.1, 24). Search settings: Decoy search using a randomized version of the human IPI v3.72 including the sequences of all 96 libraries was enabled; monoisotopic peptide mass (considering up to two 13C isotopes); trypsin/P as protease; a maximum of four missed cleavages; peptide charge +2 and +3; peptide tol. +/- 5 ppm; MS/MS tol. +/- 0.02 Da; instrument type ESI-Trap (for HCD data) or ETD-Trap (for ETD data) respectively; variable modifications: oxidation (M), phospho (ST), phospho (Y). The result files were exported to pepXML and Mascot XML with default options provided by Mascot.
Project description:Mycorrhizae, symbiotic interactions between soil fungi and tree roots, are ubiquitous in terrestrial ecosystems. The fungi contribute phosphorous, nitrogen and mobilized nutrients from organic matter in the soil and in return the fungus receives photosynthetically-derived carbohydrates. This union of plant and fungal metabolisms is the mycorrhizal metabolome. Understanding this symbiotic relationship at a molecular level provides important contributions to the understanding of forest ecosystems and global carbon cycling. We generated next generation short-read transcriptomic sequencing data from fully-formed ectomycorrhizae between Laccaria bicolor and aspen (Populus tremuloides) roots. The transcriptomic data was used to identify statistically significantly expressed gene models using a bootstrap-style approach, and these expressed genes were mapped to specific metabolic pathways. Integration of expressed genes that code for metabolic enzymes and the set of expressed membrane transporters generates a predictive model of the ectomycorrhizal metabolome. Results indicate the specific compounds glycine, glutamate, and allantoin are synthesized by L. bicolor and that these compounds or their metabolites may be used for the benefit of aspen in exchange for the photosynthetically-derived sugars fructose and glucose.The analysis illustrates an approach to generate testable biological hypotheses to investigate the complex molecular interactions that drive ectomycorrhizal symbiosis. These models are consistent with experimental environmental data and provide insight into the molecular exchange processes for organisms in this complex ecosystem. The method used here for predicting metabolomic models of mycorrhizal systems from deep RNA sequencing data can be generalized and is broadly applicable to transcriptomic data derived from complex systems. Fully formed L.bicolor::P.trichocapra mycorrhizae in duplicate
Project description:The EMBL Data Library was the first internationally supported central resource for nucleic acid sequence data. Working in close collaboration with its American counterpart, GenBank (1), the library prepares and makes available to the scientific community a comprehensive collection of the published nucleic acid sequences. This paper describes briefly the contents of the database, how it is available, and possible future enhancements of Data Library services.