Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

ABSTRACT: Variant calling against pig reference genome (GCA_000003025.6) using pig strain reads

PROVIDER: PRJEB93975 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Similar Datasets

Genome-wide single nucleotide polymorphism array and whole-genome sequencing reveal the inbreeding progression of Banna minipig inbred line [Seq]

Project description:We sequenced and analyzed the genome of a highly inbred miniature Chinese pig strain, the Banna Minipig Inbred Line (BMI). we conducted whole genome screening using next generation sequencing (NGS) technology and performed SNP calling using Sus Scrofa genome assembly Sscrofa11.1.

2020-12-31 | GSE157688 | GEO

Anthracyclines induce global changes in chromatin accessibility in cardiomyocytes that overlap with cardiovascular disease loci

Project description:We performed chromatin immunoprecipitation followed by sequencing (ChIP-seq) in human iPSC-derived cardiomyocytes to map genome-wide binding sites of TOP2B. Libraries were prepared from ChIP and matched input DNA, sequenced using 40 bp paired-end reads. After quality control and alignment to the human reference genome (GRCh38), high-confidence reads were filtered and duplicate reads removed. Peak calling using MACS2 in broad mode identified 5,410 enriched genomic regions, enabling investigation of TOP2B-associated regulatory elements in human cardiomyocytes.

2025-09-15 | GSE303207 | GEO

Genotyping of E14 mouse embryonic stem cells by sequencing

Project description:More than 2x10E9 sequences made on Illumina platform derived from the genome of E14 embryonic stem cells cultured in our laboratory were used to build a database of about 2.7x10E6 single nucleotide variant. The database was validated using other two sequencing datasets from other laboratory and high overlap was observed. The identified variant are enriched on intergenic regions, but several thousands reside on gene exons and regulatory regions, such as promoters, enhancers, splicing site and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of this cells. We created a new E14 genome assembly including the new identified variants and used it to map reads from next generation sequencing data generated in our laboratory or in others on E14 cell line. We observed an increase in the number of mapped reads of about 5%. CpG dinucleotide showed the higher variation frequency, probably because of it could be target of DNA methylation. We performed a reduced representation bisulfite sequencing on E14 cell line to test our new genome assembly with respect to the mm9 genome reference. After mapping and methylation status calling, we obtained an increase of about 120,000 called CpG and we avoided about 20,000 wrong CpG calling. genotyping of E14 embryonic stem cells (ESCs) and Reduced representation Bisulfite Sequencing (RRBS) of E14 ESCs.

2014-07-10 | E-GEOD-53149 | biostudies-arrayexpress

Genotyping of E14 mouse embryonic stem cells by sequencing

2014-07-10 | GSE53149 | GEO

VCF files - samples

Project description:This dataset includes somatic small variant calling files derived from fifteen metastatic samples from cutaneous squamous cell carcinoma matched to normal blood samples. These samples were whole-genome sequenced by HiSeq X Ten and the resulting reads were mapped against the human genome (hg37) using BWA-MEM 0.7.10-r789. Somatic variant calling was then performed using strelka 1 (version 2.0.17).

| EGAD00001004530 | EGA

Bulk RNA-seq of adult pig and rat central canal–associated spinal cord cells

Project description:Bulk RNA sequencing was performed on central canal–associated spinal cord cells isolated from adult pig (Sus scrofa) and rat (Rattus norvegicus) spinal cords and expanded under ex vivo culture conditions. Libraries were sequenced on the Illumina NovaSeq 6000 platform using 150 bp single-end reads. Transcript abundances were quantified using Salmon (v1.x) against species-specific reference transcriptomes (pig: ss11; rat: rn6). This dataset enables cross-species analysis of transcriptional programs associated with proliferative activation and progenitor-associated states in the mammalian spinal cord.

2026-02-01 | GSE316901 | GEO

Exomes of human leukemic JMML (Juvenile MyeloMonocytic Leukemia) cells and paired fibroblasts (germilne controls) when available

Project description:The study included 15 patients (7 males, 8 females) with JMML. Peripheral blood and/or bone marrow aspirates were collected on EDTA at diagnosis. Non-hematopoietic tissues (fibroblasts) was derived from skin biopsy for each patient. Exome sequencing was performed in several distinct series between 2012 and 2017, which explains the differences in capture kit versions and reference genome version.Targeted enrichment and massive parallel sequencing were performed on paired genomic DNA from leukocytes and fibroblasts. Exome capture was carried out using the SureSelect Human All Exon V4+UTRs or V5 or V5+UTRs or SureSelect Clinical Research (Agilent Technologies, Santa Clara, CA, USA) according to manufacturer’s instruction and protocols by IntegraGen (Evry, France). Paired-end 75 bases sequencing was performed on a HiSeq2000 or HiSeq4000 instrument (Illumina, San Diego, CA, USA). Image analysis and base calling were performed using the Real Time Analysis (RTA) pipeline v. 1.14 (Illumina) with default parameters. The alignment of paired-end reads to the reference human genome (UCSC GRCh37/hg19 or UCSC GRCh38), variant calling and generation of Quality variants scores were carried out using the CASAVA v.1.8 pipeline (Illumina).

2018-03-14 | E-MTAB-6461 | biostudies-arrayexpress

Benchmarking Bulk and Single-cell Variant Calling Approaches on Chromium scRNA-seq and scATAC-seq Libraries

Project description:Single-cell sequencing methodologies such as scRNA-seq and scATAC-seq have become widespread and effective tools to interrogate tissue composition. Increasingly, variant callers are being applied to these methodologies to resolve the genetic heterogeneity of a sample, especially in the case of detecting the clonal architecture of a tumor. Typically, traditional bulk DNA variant callers are applied to the pooled reads of a single-cell library to detect candidate mutations. Recently, multiple studies have applied such callers on reads from individual cells, with some citing the ability to detect rare variants with higher sensitivity. Many studies apply these two approaches to the Chromium (10x Genomics) scRNA-seq and scATAC-seq methodologies. However, Chromium-based libraries may offer additional challenges to variant calling compared to existing single-cell methodologies, raising questions for the validity of variants obtained from such a workflow. To determine the merits and challenges of various variant-calling approaches on Chromium scRNA-seq and scATAC-seq libraries, we use sample libraries with matched bulk whole-genome-sequencing to evaluate the performance of callers. We review caller performance, finding that bulk callers applied on pooled reads significantly outperform individual-cell approaches. We also evaluate variants unique to scRNA-seq and scATAC-seq methodologies, finding patterns of noise but also potential capture of RNA-editing events. Finally, we review the notion that variant calling at the single-cell level can detect rare somatic variants, providing empirical results that suggest resolving such variants is infeasible in single-cell Chromium libraries.

2024-08-05 | GSE213503 | GEO

Canis lupus familiaris

Project description:Assessing impact of reference genome on variant calling accuracy

| PRJNA816174 | ENA

Benchmarking bulk and single-cell variant calling approaches on Chromium scRNA-seq and scATAC-seq libraries

2024-08-05 | GSE213338 | GEO