Project description:C2H2 zinc finger proteins represent the largest and most enigmatic class of human transcription factors. Their C2H2 arrays are highly variable, indicating that most will have unique DNA binding motifs. However, most of the binding motifs have not been directly determined. We have determined the binding sites and motifs of 119 C2H2 zinc finger proteins and the expression pattern of 80 cell lines overexpressing C2H2 zinc finger proteins in order to study the role of C2H2 zinc finger proteins in gene regulation.
Project description:C2H2 zinc finger proteins represent the largest and most enigmatic class of human transcription factors. Their C2H2 arrays are highly variable, indicating that most will have unique DNA binding motifs. However, most of the binding motifs have not been directly determined. We have determined the binding sites and motifs of 119 C2H2 zinc finger proteins and the expression pattern of 80 cell lines overexpressing C2H2 zinc finger proteins in order to study the role of C2H2 zinc finger proteins in gene regulation. We expressed GFP-tagged C2H2-ZF proteins in stable transgenic HEK293 cells. Total RNA was isolated using Trizol and sequencing libraries were constructed using TruSeq Stranded Total RNA Library Prep Kit with Ribo-Zero Gold or TruSeq RNA Library Preparation Kit v2.
Project description:We have identified protein protein interactions of 118 C2H2 zinc finger proteins by AP-MS using GFP-tagged constructs in order to understand their role in transcription regulation.
Project description:To define the sequence preference of SALL4 C2H2 zinc finger domains, we performed SELEX coupled with high-throughput sequencing (HT-SELEX) using the purified SALL4 ZFC1, ZFC2 and ZFC4 domains combined with no protein control experiment.
Project description:To define the sequence preference of SALL4 C2H2 zinc finger domains, we performed SELEX coupled with high-throughput sequencing (HT-SELEX) using the purified SALL4 ZFC1, ZFC2 and ZFC4 domains combined with no protein control experiment. We re-sequenced the libraries from E-MTAB-9236 with very high coverage to estimate the minimum number of reads required per sample for accurate results.
Project description:The largest and most diverse class of eukaryotic transcription factors contain Cys2-His2 zinc fingers (C2H2-ZFs), each of which typically binds a DNA nucleotide triplet within a larger binding site. Frequent recombination and diversification of their DNA-contacting residues suggests that these zinc fingers play a prevalent role in adaptive evolution. Very little is known about the function and evolution of the vast majority of C2H2-ZFs, including whether they even bind DNA. Using the bacterial 1-hybrid (B1H) system, we determined DNA-binding motifs for thousands of individual natural C2H2-ZFs, and correlated them with C2H2-ZF specificity residues. The data reported here includes results of protein-binding microarray (PBM) assays for 146 of these natural C2H2-ZFs, performed in order to validate B1H assays and to explore the DNA-binding specificity of C2H2-ZFs. Protein binding microarray (PBM) experiments were performed for a set of 185 variants of mouse Egr1 in which the third zinc finger was replaced by different C2H2-ZFs from different organisms. Briefly, the PBMs involved binding GST-tagged DNA-binding proteins to two double-stranded 44K Agilent microarrays, each containing a different DeBruijn sequence design, in order to determine their sequence preferences. Details of the PBM protocol are described in Berger et al., Nature Biotechnology 2006. Among the 185 variants examined, 146 variants yielded motifs in PBMs, which are included here.
Project description:The largest and most diverse class of eukaryotic transcription factors contain Cys2-His2 zinc fingers (C2H2-ZFs). Here, we have explored the diversity of sequence preferences of C2H2-ZF proteins in non-metazoan organisms using PBM experiments.
Project description:C2H2 zinc fingers (C2H2-ZFs) are the most prevalent type of vertebrate DNA-binding domain, and typically appear in tandem arrays (ZFAs), with sequential C2H2-ZFs each contacting 3 (or more) sequential bases. C2H2-ZFs can be assembled in a modular fashion, providing one explanation for their remarkable evolutionary success. Given a set of modules with defined 3-base specificities, modular assembly also presents a way to construct artificial proteins with specific DNA-binding preferences. However, a recent survey of a large number of three-finger ZFAs engineered by modular assembly reported high failure rates (~70%), casting doubt on the generality of modular assembly. Here, we used protein-binding microarrays to analyze 28 ZFAs that failed in the aforementioned study. Most (17) preferred specific sequences, which in all but one case resembled the intended target sequence. Like natural ZFAs, the engineered ZFAs typically yielded degenerate motifs, binding dozens to hundreds of related individual sequences. Thus, the failure of these proteins in previous assays is not due to lack of sequence-specific DNA-binding activity. Our findings underscore the relevance of individual C2H2-ZF sequence specificities within tandem arrays, and support the general ability of modular assembly to produce ZFAs with sequence-specific DNA-binding activity. Protein binding microarray (PBM) experiments were performed for a set of 20 artificial zinc finger arrays (ZFAs). Briefly, the PBMs involved binding GST-tagged DNA-binding proteins to two double-stranded 44K Agilent microarrays, each containing a different DeBruijn sequence design, in order to determine their sequence preferences. The method is described in Berger et al., Nature Biotechnology 2006.