Dataset Information

Whole genome detection of sequences of six horses from diverse breeds.

ABSTRACT: Completed in 2009, the reference genome assembly of the domesticated horse (EquCab 2.0) produced the majority of publically available annotations of genetic variations in this species. Following that effort, a few other projects that focused at variant discovery of a particular breed or two. In this project we aim to identify and annotate single nucleotide polymorphisms (SNPs), insertions and deletions (INDELs), copy number variations (CNVs) and structural variations (SVs) in the genomes six horses of diverse genetic background using next generation sequencing. Analysis ERZ780739 uses EquCab 2.0 as the reference sequence, while analysis ERZ1195829 uses EquCab 3.0.

INSTRUMENT(S): Illumina HiSeq 2500

ORGANISM(S): Equus Caballus

SUBMITTER: UNIVERSITY OF FLORIDA

PROVIDER: PRJEB9799 | EVA | 2016-09-01

REPOSITORIES: EVA

ACCESS DATA

Dataset's files

Source:

Items per page:

1 - 5 of 5

Similar Datasets

Project description:Genomic structural variation is an important and abundant source of genetic and phenotypic variation. Here we describe the first systematic and genome-wide analysis of copy number variations (CNVs) in modern domesticated cattle using array comparative genomic hybridization (array CGH), quantitative PCR (qPCR) and fluorescent in situ hybridization (FISH). The array CGH panel included 90 animals from 11 Bos taurus, 3 Bos indicus and 3 composite breeds for beef, dairy or dual purpose. We identified over 200 candidate CNV regions (CNVRs) in total and 177 within known chromosomes, which harbor or are adjacent to gains or losses. These 177 high-confidence CNVRs cover 28.1 mega bases or ~1.07% of the genome. Over 50% of the CNVRs (89/177) were found in multiple animals or breeds and analysis revealed breed-specific frequency differences and reflected aspects of the known ancestry of these cattle breeds. Selected CNVs were further validated by independent methods using qPCR and FISH. Approximately 67% of the CNVRs (119/177) completely or partially span cattle genes and 61% of the CNVRs (108/177) directly overlap with segmental duplications. The CNVRs span about 400 annotated cattle genes that are significantly enriched for specific biological functions such as immunity, lactation, reproduction and rumination. Multiple gene families, including ULBP, have gone through ruminant lineage-specific gene amplification. We detected and confirmed marked differences in their CNV frequencies across diverse breeds, indicating that some cattle CNVs are likely to arise independently in breeds and contribute to breed differences. Our results provide a valuable resource beyond microsatellites and single nucleotide polymorphisms to explore the full dimension of genetic variability for future cattle genomic research. The custom aCGH chips that interrogated the whole genome CNVs were build for 90 cattles from diverse breeds, with Hereford L1 Dominette 01449 as refference sample.

			Action	DRS
	fixed2.accessioned.vcf.gz	Vcf
	fixed2.vcf.gz	Vcf
	fixed2.vcf.gz.tbi	Vcf
	updatedfixed2.vcf.gz	Vcf
	updatedfixed2.vcf.gz.tbi	Vcf

Dataset Information

Whole genome detection of sequences of six horses from diverse breeds.

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets