Project description: We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIP-Seq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and to characterize the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct array subtypes. Sequences bound to CENP-A in MDCK (dog) cell line
Project description: Phylogenetic, microbiological and comparative genomic analyses were used to examine the diversity among members of the genus Caldicellulosiruptor with an eye towards the capacity of these extremely thermophilic bacteria for degrading the complex carbohydrate content of plant biomass. Seven species from this genus (C. saccharolyticus, C. bescii (formerly Anaerocellum thermophilum), C. hydrothermalis, C. owensensis, C. kronotskyensis, C. lactoaceticus, and C. kristjanssonii) were compared on the basis of 16S rRNA phylogeny and cross-species DNA-DNA hybridization to a whole-genome C. saccharolyticus oligonucleotide microarray. Growth physiology of the seven Caldicellulosiruptor species on a range of carbohydrates showed that, while all could be cultivated on acid pre-treated switchgrass, only C. saccharolyticus, C. bescii, C. kronotskyensis, and C. lactoaceticus were capable of hydrolyzing Whatman No. 1 filter paper. Two-dimensional gel electrophoresis of the secretomes from cells grown on microcrystalline cellulose revealed that species capable of crystalline cellulose hydrolysis also had diverse secretome fingerprints. The two-dimensional secretome of C. saccharolyticus revealed a prominent S-layer protein that also appears to be indicative of highly cellulolytic Caldicellulosiruptor species, suggesting a possible role in cell-substrate interaction. These growth physiology results were also linked to glycoside hydrolase and carbohydrate-binding module inventories for the seven bacteria, deduced from draft genome sequence information. These preliminary inventories indicated that the absence of a single glycoside hydrolase family and carbohydrate-binding motif family appears to be responsible for some Caldicellulosiruptor species’ diminished cellulolytic capabilities. Overall, the genus Caldicellulosiruptor appears to contain more genomic and physiological diversity than previously reported, and is well suited for biomass deconstruction applications.
Project description: We present genome-scale maps of DNA methylation in early human development and perform a comparative analysis with mouse that confirms a conserved global erasure of the paternal genome. We find that while many global features of the early embryo are consistent between the two species, the target sequences in which DNA methylation is maintained are distinct. Repetitive elements show a broader range of class-specific behaviors in the human embryo and a larger degree of methylation escape in human sperm. We identify thousands of differentially methylated regions (DMRs) that are likely of maternal origin and find that these gamete-contributed DMRs are far more species-specific than expected given the conservation of canonical imprint control regions (ICRs). Finally, we extend our studies to the derivation of new human embryonic stem cell (ESC) lines and find notable divergences in DNA methylation signatures from those found in the human embryo and in different mouse ESC derivation conditions. Comparison of DNA methylation patterns in human early development, human ESC derivation and mouse ESC derivation