Project description:A new genome of Fraxinus excelsior (PRJNA865134) was assembled using a hybrid approach combining Nanopore and Illumina data. The gene expression of a 182 Danish tree panel (Harper et al. 2016) was assessed using the new genome as reference (BioProject PRJNA865134, SAMN30100368, genome JANJPF000000000 ).Manuscript title: Fraxinus excelsior updated long-read genome reveals the importance of MADS-box genes in tolerance mechanisms against ash dieback, G3:Genes|Genomes|Genetics
Project description:Nucleosomes are barriers to RNA Polymerase II (Pol II) transcribing along gene bodies in eukaryotes. We found that a fragile “tailless nucleosome” could be generated to resolve the Pol II pausing caused by the +1 nucleosome in metazoans 1-4. We hypothesized that “pan-acetylated nucleosomes” along gene bodies are fragile and are generated to resolve the resistance of nucleosomes for the elongation of Pol II 5. To confirm this hypothesis, we systematically analyzed pan-acetylation and pan-phosphorylation of the entire human genome in HEK293T cells. Herein, we show that both pan-acetylation and pan-phosphorylation of histones in nucleosomes along gene bodies are coupled with the activities of Pol II. Our results show that there are four major types of fragile nucleosomes required for Pol II to transcribe along gene bodies: nucleosomes without tails, nucleosomes with dominant acetylation, nucleosomes with dominant phosphorylation, and nucleosomes with both acetylation and phosphorylation.
Project description:Full title: Probing the pan genome of a foodborne bacterial pathogen Listeria monocytogenes: Implications for its niche adaptation, pathogenesis, and evolution Listeria monocytogenes is a foodborne bacterial pathogen well known for adaptability to diverse environmental and host niches, and a high fatality rate among infected, immuno-compromised individuals. Three genetic lineages have been identified within this species. Strains of genetic lineages I and II account for more than ninety percent of foodborne disease outbreaks worldwide, whereas strains from genetic lineage III are rarely implicated in human infectious for unknown, yet intriguing, reasons. Here we have probed the genomic diversity of 26 L. monocytogenes strains using both whole-genome sequences and a novel 385,000 probe pan-genome microarray, fully tiling the genomes of 20 representative strains. Using these methods to identify genes highly conserved in lineages I and II but rare in lineage III, we have identified 86 genes and 8 small RNAs that play roles in bacterial stress resistance, pathogenicity, and niche, potentially explaining the predominance of L. monocytogenes lineages I and II in foodborne disease outbreaks. Extending gene content analysis to all lineages revealed a L. monocytogenes core genome of approximately 2,350 genes (80% of each individual genome) and a pan-genomic reservoir of >4,000 unique genes. Combined gene content data from both sequences and arrays was used to reconstruct an informative phylogeny for the L. monocytogenes species that confirms three distinct lineages and describes the relationship of 9 new lineage III genomes. Comparative analysis of 18 fully sequenced L. monocytogenes lineage I and II genomes shows a high level of genomic conservation and synteny, indicative of a closed pan-genome, with moderate domain shuffling and sequence drift associated with bacteriophages is present in all lineages. In contrast with lineages I and II, notable genomic diversity and characteristics of an open pan-genome were observed in the lineage III genomes, including many strain-specific genes and a more complex conservation pattern. This indicates that the L. monocytogenes pan-genome has not yet been fully sampled by genome sequencing, and additional sequencing of lineage III genomes is necessary to survey the full diversity of this intriguing species and reveal its mechanisms for adaptability and virulence. This is a Listeria monocytogenes pan-genome tilling array designed using PanArray algorithm. 9 experimental strains (F2-569, M1-002, F2-208, J2-071, J1-208, W1-111, W1-110, F2-524, F2-501) vs reference (EGD-e) strain.
Project description:Variants on the Y chromosome for 62 danish males in VCF format from the GenomeDenmark Phase 2 cohort. Variants were called using reference based approaches such as the haplotype-caller module from GATK and using alignment of denovo assemblies to the reference using ASMvar.
Project description:Full title: Probing the pan genome of a foodborne bacterial pathogen Listeria monocytogenes: Implications for its niche adaptation, pathogenesis, and evolution Listeria monocytogenes is a foodborne bacterial pathogen well known for adaptability to diverse environmental and host niches, and a high fatality rate among infected, immuno-compromised individuals. Three genetic lineages have been identified within this species. Strains of genetic lineages I and II account for more than ninety percent of foodborne disease outbreaks worldwide, whereas strains from genetic lineage III are rarely implicated in human infectious for unknown, yet intriguing, reasons. Here we have probed the genomic diversity of 26 L. monocytogenes strains using both whole-genome sequences and a novel 385,000 probe pan-genome microarray, fully tiling the genomes of 20 representative strains. Using these methods to identify genes highly conserved in lineages I and II but rare in lineage III, we have identified 86 genes and 8 small RNAs that play roles in bacterial stress resistance, pathogenicity, and niche, potentially explaining the predominance of L. monocytogenes lineages I and II in foodborne disease outbreaks. Extending gene content analysis to all lineages revealed a L. monocytogenes core genome of approximately 2,350 genes (80% of each individual genome) and a pan-genomic reservoir of >4,000 unique genes. Combined gene content data from both sequences and arrays was used to reconstruct an informative phylogeny for the L. monocytogenes species that confirms three distinct lineages and describes the relationship of 9 new lineage III genomes. Comparative analysis of 18 fully sequenced L. monocytogenes lineage I and II genomes shows a high level of genomic conservation and synteny, indicative of a closed pan-genome, with moderate domain shuffling and sequence drift associated with bacteriophages is present in all lineages. In contrast with lineages I and II, notable genomic diversity and characteristics of an open pan-genome were observed in the lineage III genomes, including many strain-specific genes and a more complex conservation pattern. This indicates that the L. monocytogenes pan-genome has not yet been fully sampled by genome sequencing, and additional sequencing of lineage III genomes is necessary to survey the full diversity of this intriguing species and reveal its mechanisms for adaptability and virulence.
Project description:Nucleosomes are barriers to RNA Polymerase II (Pol II) transcribing along gene bodies in eukaryotes. We found that a fragile “tailless nucleosome” could be generated to resolve the Pol II pausing caused by the +1 nucleosome in metazoans 1-4. We hypothesized that “pan-acetylated nucleosomes” along gene bodies are fragile and are generated to resolve the resistance of nucleosomes for the elongation of Pol II 5. To confirm this hypothesis, we systematically analyzed pan-acetylation and pan-phosphorylation of the entire human genome in HEK293T cells. Herein, we show that both pan-acetylation and pan-phosphorylation of histones in nucleosomes along gene bodies are coupled with the activities of Pol II. Our results show that there are four major types of fragile nucleosomes required for Pol II to transcribe along gene bodies: nucleosomes without tails, nucleosomes with dominant acetylation, nucleosomes with dominant phosphorylation, and nucleosomes with both acetylation and phosphorylation.
Project description:Nucleosomes are barriers to RNA Polymerase II (Pol II) transcribing along gene bodies in eukaryotes. We found that a fragile “tailless nucleosome” could be generated to resolve the Pol II pausing caused by the +1 nucleosome in metazoans 1-4. We hypothesized that “pan-acetylated nucleosomes” along gene bodies are fragile and are generated to resolve the resistance of nucleosomes for the elongation of Pol II 5. To confirm this hypothesis, we systematically analyzed pan-acetylation and pan-phosphorylation of the entire human genome in HEK293T cells. Herein, we show that both pan-acetylation and pan-phosphorylation of histones in nucleosomes along gene bodies are coupled with the activities of Pol II. Our results show that there are four major types of fragile nucleosomes required for Pol II to transcribe along gene bodies: nucleosomes without tails, nucleosomes with dominant acetylation, nucleosomes with dominant phosphorylation, and nucleosomes with both acetylation and phosphorylation.
Project description:Sequencing of Danish parent-offspring trios to determine genomic variation within the Danish population. Phase II comprises of 50 trios sequenced to 80X using Illumina with libraries of insert sizes from 180bp-20kb.
Project description:Alignment of Genome Denmark Phase II dataset to GRCh38. The dataset consists of 150 Danish individuals (50 trios) sequenced to 80X. The BAM-file contains data from multiple libraries created from one individual with libraries of 180, 500, 800, 2000, 5000, 10000 and 20000 bp. The libraries were created using standard Illumina protocols for paired end reads (180-800bp libraries) and mate pair libraries (2kb-20kb).