ABSTRACT: Genome-wide identification of copy number variations in Holstein cattle from Baja California, Mexico, using high-density SNP genotyping arrays
Project description:Copy number variations (CNVs) are an important source of genomic structural variation, and can be used as markers to investigate phenotypic and economic traits. CNVs also have functional effects on gene expression and can contribute to disease susceptibility in mammals. Currently, single nucleotide polymorphism genotyping arrays (SNP chips) are the technology of choice for identifying CNV variations. Microarray technologies have recently been used to study the bovine genome. The objective of the present study was to develop CNVs in Holstein cows from the Northwest of Mexico using the Affymetrix Axiom Genome-Wide BOS 1 Array, which assays 648,315 SNPs and provides a wide coverage for genome-wide studies. We applied the two most widely used algorithms for the discovery of CNVs (PennCNV and QuantiSNP) and found 56 CNV regions (CNVRs) representing 0.33% of the bovine genome (8.46 Mb). These CNVRs ranged from 1.5 to 970.8 kb with an average length of 151 kb. They involved 103 genes and showed a 28% overlap with CNVRs already reported. Of the 56 CNVRs found, 20 were novel. In this study we present the first genomic analysis of CNVs in Mexican cattle using highdensity SNP data. Our results provide a new reference basis for future genomic variation and association studies between CNVs and phenotypes, especially in Mexican cattle.
Project description:Goal: To identify copy number variation in normal individuals using high density, non-polymorphic oligonucleotide probes Background DNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1kb to over 3Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay. Results In this study, we have designed and evaluated a high density array predicated on the use of non-polymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3M independent NspI restriction enzyme fragments in the 200bp to 1100bp size range, which is a several fold increase in marker density as compared to the 500K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries. Conclusions Using a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization. Keywords: Copy number variation (CNV) detection
Project description:Goal: To identify copy number variation in normal individuals using high density, non-polymorphic oligonucleotide probes Background DNA sequence diversity within the human genome may be more greatly affected by copy number variations (CNVs) than single nucleotide polymorphisms (SNPs). Although the importance of CNVs in genome wide association studies (GWAS) is becoming widely accepted, the optimal methods for identifying these variants are still under evaluation. We have previously reported a comprehensive view of CNVs in the HapMap DNA collection using high density 500K EA (Early Access) SNP genotyping arrays which revealed greater than 1,000 CNVs ranging in size from 1kb to over 3Mb. Although the arrays used most commonly for GWAS predominantly interrogate SNPs, CNV identification and detection does not necessarily require the use of DNA probes centered on polymorphic nucleotides and may even be hindered by the dependence on a successful SNP genotyping assay. Results In this study, we have designed and evaluated a high density array predicated on the use of non-polymorphic oligonucleotide probes for CNV detection. This approach effectively uncouples copy number detection from SNP genotyping and thus has the potential to significantly improve probe coverage for genome-wide CNV identification. This array, in conjunction with PCR-based, complexity-reduced DNA target, queries over 1.3M independent NspI restriction enzyme fragments in the 200bp to 1100bp size range, which is a several fold increase in marker density as compared to the 500K EA array. In addition, a novel algorithm was developed and validated to extract CNV regions and boundaries. Conclusions Using a well-characterized pair of DNA samples, close to 200 CNVs were identified, of which nearly 50% appear novel yet were independently validated using quantitative PCR. The results indicate that non-polymorphic probes provide a robust approach for CNV identification, and the increasing precision of CNV boundary delineation should allow a more complete analysis of their genomic organization. A set of five genomic DNA samples containing different numbers of X chromosomes (1X to 5X sample set, including NA15510 and NA10851) were hybridized to Nsp copy number (CN) arrays in triplicate to evaluate detection of copy number variation using high density, non-polymorphic oligonucleotide probes. 6 Hapmap samples were hybridized to Nsp CN arrays to evaluate Mendelian inheritance of CNVs.
Project description:Renal tumors with complex morphology require extensive workup for accurate classification. Chromosomal aberrations that define subtypes of renal epithelial neoplasms have been reported. We explored if whole-genome chromosome copy number and loss-of-heterozygosity analysis with single nucleotide polymorphism (SNP) arrays can be used to identify these aberrations. Keywords: Chromosome copy number and LOH analysis with SNP Genotyping Arrays
Project description:Renal tumors with complex morphology require extensive workup for accurate classification. Chromosomal aberrations that define subtypes of renal epithelial neoplasms have been reported. We explored if whole-genome chromosome copy number and loss-of-heterozygosity analysis with single nucleotide polymorphism (SNP) arrays can be used to identify these aberrations in cases where morphology was unable to definitively classify these tumors. Keywords: Chromosome copy number and LOH analysis (virtual karyotyping) with SNP Genotyping Arrays Keywords: Genome variation profiling by SNP array
Project description:In this study, we generated 47 RNA-seq data for 14 tissues in Holstein cattle. We analyzed the variations of gene expression among tissues.
Project description:In this study, we generated whole genome bisulfite sequencing data 3 tissues in Holstein cattle. We analyzed the variations of DNA methylation among tissues.
Project description:Copy number variations (CNVs) affect a wide range of phenotypic traits; however, CNVs in or near segmental duplication regions are often intractable. Using a read depth approach based on next-generation sequencing, we examined genome-wide copy number differences among five taurine (three Angus, one Holstein and one Hereford) and one indicine (Nelore) cattle. Within mapped chromosomal sequence, we identified 1,265 CNV regions comprising ~55.6 Mbp sequence-476 of which (~38%) have not previously been reported. We validated this sequence-based CNV call set with aCGH, qPCR and FISH, achieving a validation rate of 82% and a false positive rate of 8%. We further estimated absolute copy numbers for genomic segments and annotated genes in each individual. Surveys of the top 25 most variable genes revealed that the Nelore individual had the lowest copy numbers in 13 cases (~52%, chi squared test; p value <0.05). In contrast, genes related to pathogen- and parasite-resistance, such as CATHL4 and ULBP17, were highly duplicated in the Nelore individual relative to the taurine cattle, while genes involved in lipid transport and metabolism, including APOL3 and FABP2, were highly duplicated in the beef breeds. These CNV regions also harbor genes like BPIFA2A (BSP30A) and WC1, suggesting that some CNVs may be associated with breed-specific differences in adaptation, health, and production traits. By providing the first individualized cattle CNV and segmental duplication maps and genome-wide gene copy number estimates, we enable future CNV studies into highly duplicated regions in the cattle genome. 5 NimbleGen Bos Taurus UMD3 custom 2.1M whole genome high density aCGH
Project description:In this study, we generated whole genome bisulfite sequencing data of 19 samples for 13 tissues in Holstein cattle. We analyzed the variations of DNA methylation among tissues. In this study, we generated whole genome bisulfite sequencing data of 6 samples for 5 tissues in Hereford cattle. We analyzed the variations of DNA methylation among tissues.
Project description:Screening for gene copy-number alterations (CNAs) has improved by applying genome-wide microarrays, where SNP arrays also allow analysis of loss of heterozygozity (LOH). We here analyzed 10 chronic lymphocytic leukemia (CLL) samples using four different high-resolution platforms: BAC arrays (32K), oligonucleotide arrays (185K, Agilent), and two SNP arrays (250K, Affymetrix and 317K, Illumina). Cross-platform comparison revealed 29 concordantly detected CNAs, including known recurrent alterations, which confirmed that all platforms are powerful tools when screening for large aberrations. However, detection of 32 additional regions present in 2-3 platforms illustrated a discrepancy in detection of small CNAs, which often involved reported copy-number variations. LOH analysis revealed concordance of mainly large regions, but showed numerous, small nonoverlapping regions and LOH escaping detection. Evaluation of baseline variation and copy-number ratio response showed the best performance for the Agilent platform and confirmed the robustness of BAC arrays. Accordingly, these platforms demonstrated a higher degree of platform-specific CNAs. The SNP arrays displayed higher technical variation, although this was compensated by high density of elements. Affymetrix detected a higher degree of CNAs compared to Illumina, while the latter showed a lower noise level and higher detection rate in the LOH analysis. Large-scale studies of genomic aberrations are now feasible, but new tools for LOH analysis are requested.