Project description:The glucocorticoid receptor (GR), is a hormone-activated transcription factor which binds to GR binding sequences (GBS) encoded in the genome. Interestingly, of the many genomic sequences matching the GR consensus sequence only a small subset is actually bound. This indicates that the presence of a GBS alone is insufficient to specify where in the genome GR binds. We identified sequences that can locally prevent GR binding and mass-spec analysis was used to identify proteins that bind to these sequences.
Project description:The published finished human genome contained 340 gaps including 250 gaps in the euchromatic region. The reasons for these gaps were not fully understood, although subsequent analysis revealed that presence of segmentally duplicated sequences were a good predictor for the presence of gaps. However, not all segmentally duplicated regions contained gaps. We made a systemic effort to close euchromatic gaps and understand the nature of gap closing sequences. Our studies clearly demonstrate that the gap closing sequences analyzed were over 2.3-fold more enriched in segmental duplications and that about 40% of the gap closing sequences were structurally variant. The structural variant nature of gap closing sequences was verified by aCGH analysis, and by paired-end-sequence and fingerprint analysis of gap spanning clones from recently available human genome fosmid libraries from eight individuals. Identification and characterization of gap closing sequences provides an effective approach for closing the remaining euchromatic gaps in the human genome. Keywords: comparative genomic hybridization
Project description:We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIPSeq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and characterizing the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct arrays subtypes.
Project description:We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIPSeq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and characterizing the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct arrays subtypes. Sequences bound to CENP-A in MDCK (dog) cell line