Dataset Information

Genetics of High HDL Cholesterol

ABSTRACT:

PROVIDER: phs000512 | dbGaP |

SECONDARY ACCESSION(S): PRJNA170220PRJNA170221

REPOSITORIES: dbGaP

ACCESS DATA

Dataset's files

Source:

			Action	DRS
	Study_Report.phs000512.HDL_Etiology.v1.p1.MULTI.pdf	Pdf
	manifest_phs000512.HDL_Etiology.v1.p1.c1.CD.pdf	Pdf
	datadict_v2.xsl	Other
	phs000512.v1.pht002860.v1.Etiology_HDL_Cholesterol_Subject.data_dict.xml	Xml
	phs000512.v1.pht002860.v1.p1.Etiology_HDL_Cholesterol_Subject.var_report.xml	Xml

Items per page:

1 - 5 of 13

Similar Datasets

Project description:Purpose: Chromosomal microarray analysis (CMA) to assess copy number variation (CNV) content is now used as a first tier genetic diagnostic test for individuals with unexplained neurodevelopmental disorders (NDD) or multiple congenital anomalies (MCA). Over 100 cytogenetic labs worldwide are using the Affymetrix CytoScan HD 2.7M array to genotype >15,000 clinical samples per month. The aim of this study is to develop a CNV resource from a population control cohort that can be used as a community resource for interpretation of clinical and research samples. Methods: We have genotyped a large population control set (1,000 individuals from our Ontario Population Genomics Platform (OPGP)) using the Affymetrix CytoScan HD microarray comprising 2.7 million probes. Four independent algorithms were applied to detect and assess high confidence CNVs. Reproducibility and validations were quantified using sample replicates and Quantitative-PCR (QPCR), respectively. Results: DNA from 873 individuals from the OPGP cohort passed quality control and we have identified 71,178 CNVs (81 CNVs/individual) distributed across 796 different cytogenetic regions in the genome; 9.8% of the CNVs were previously unreported. After applying three layers of filtering criteria, from our high confidence CNVs dataset, we obtained a >95% reproducibility and >90% validation rate. Due to the array's high probe density within genic regions, our high confidence CNV data set show 73% of the detected CNVs overlapped at least one gene. Conclusion: The genotype data and annotated CNVs presented in this study will represent a valuable public resource enabling clinical genetics research and diagnostics. For array quality control, CEL files were processed using modules from the Affymetrix power tools and genotypes were extracted from the CHP file. Samples passing the median of the absolute pairwise differences (MAPD) < 0.20 and waviness-sd < 0.11 were retained for further analysis. After multiple checks, we excluded 52 samples that do not meet quality control (QC) cutoffs. To confirm the sample's self-reported gender, we have matched the sex chromosome information from the array and identified six samples with gender mismatch, which were excluded from the analysis. We also excluded 47 samples due to excessive CNV calls. A final set of 895 samples were used for further analysis. This number included 22 sample replicates (indicated by _1 following the Sample title), which were used to determine reproducibility of the array calls. The CNV data for this study is available from dbVar (NCBI), DGVa (EBI) accession number estd212, and DGV.

Project description:BackgroundThe three trypanosomatids pathogenic to men, Trypanosoma cruzi, Trypanosoma brucei and Leishmania major, are etiological agents of Chagas disease, African sleeping sickness and cutaneous leishmaniasis, respectively. The complete sequencing of these trypanosomatid genomes represented a breakthrough in the understanding of these organisms. Genome sequencing is a step towards solving the parasite biology puzzle, as there are a high percentage of genes encoding proteins without functional annotation. Also, technical limitations in protein expression in heterologous systems reinforce the evident need for the development of a high-throughput reverse genetics platform. Ideally, such platform would lead to efficient cloning and compatibility with various approaches. Thus, we aimed to construct a highly efficient cloning platform compatible with plasmid vectors that are suitable for various approaches.ResultsWe constructed a platform with a flexible structure allowing the exchange of various elements, such as promoters, fusion tags, intergenic regions or resistance markers. This platform is based on Gateway® technology, to ensure a fast and efficient cloning system. We obtained plasmid vectors carrying genes for fluorescent proteins (green, cyan or yellow), and sequences for the c-myc epitope, and tandem affinity purification or polyhistidine tags. The vectors were verified by successful subcellular localization of two previously characterized proteins (TcRab7 and PAR 2) and a putative centrin. For the tandem affinity purification tag, the purification of two protein complexes (ribosome and proteasome) was performed.ConclusionsWe constructed plasmids with an efficient cloning system and suitable for use across various applications, such as protein localization and co-localization, protein partner identification and protein expression. This platform also allows vector customization, as the vectors were constructed to enable easy exchange of its elements. The development of this high-throughput platform is a step closer towards large-scale trypanosome applications and initiatives.

Dataset Information

Genetics of High HDL Cholesterol

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets