Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

Homo sapiens

ABSTRACT: Exome-wide benchmark of difficult-to-sequence regions using short-read next-generation DNA sequencing

PROVIDER: PRJDB14721 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
	DRR415794_1.fastq.gz	Fastqsanger.gz
	DRR415794_2.fastq.gz	Fastqsanger.gz

Items per page:

1 - 2 of 2

Similar Datasets

Homo sapiens

Project description:Short-read sequencing platform benchmark

| PRJNA600063 | ENA

Project description:Illumina metagenomic next-generation short-read data

| PRJNA1420433 | ENA

Mussaenda lancipetala

Project description:short-read next-generation sequencing for Mussaenda lancipetala (PRJCA040959)

| PRJDB36979 | ENA

Novel driver gene MDC1 confers homologous recombination repair deficiency and genomic instability in chemoresistant relapsing ovarian cancer

Project description:Mutational burdens and clonal compositions are established early and are maintained throughout recurrence. Using both next generation and ultra long read sequencing to analyze single nucleotide and structural variants (SVs) we discovered that although tumors from the same patient remained relatively stable, homologous recombination repair proficient (HRP) and homologous recombination repair deficient (HRD) tumors presented with distinct clonal profiles. SV signature analysis revealed three distinct classes: tumors defined by DNA losses, DNA gains, and copy number neutral changes. Each class displayed structural variation affecting distinct regions of the genome. Ultra long read sequencing validated most of the SVs identified in short read sequencing and identified additional SVs. A novel candidate driver gene from the HRP pathway, MDC1, was significantly mutated in patients with HRP tumors.

2026-04-22 | GSE316442 | GEO

Giant marker chromosomes in cancer

Project description:To unravel the fine architecture of neocentromeres found in three well-differentiated liposarcoma (WDLPS) cell lines as patchworks of multiple short amplified sequences, disclosing a much more higher complexity than previously reported. Next generation sequencing data (WGS, RNA-seq, CENP-A/ChIP-seq) are available at the Sequence Read Archive (BioProject ID: PRJNA378952).

2017-12-22 | E-MTAB-5625 | biostudies-arrayexpress

Genome-wide HP1a occupancy measured by DamID-seq

Project description:In order to map levels of genome-wide HP1a occupancy we applied DamID (van Steensel & Henikoff, Nat Biotech, 2000; PMID: 10748524) in combination with next-generation sequencing of methylated GATC fragments. Mapping by next-generation sequencing makes it possible to examine heterochromatic regions that were not covered by earlier datasets which were generated using microarrays.

2016-09-20 | GSE83713 | GEO

Chondrosarcoma Validation Study

Project description:Agilent whole exome hybridisation capture was performed on genomic DNA derived from Chondrosarcoma cancer and matched normal DNA from the same patients. Next Generation sequencing performed on the resulting exome libraries and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes. Now we aim to re find and validate the findings of those exome libraries using bespoke pulldown methods and sequencing the products.

2016-05-17 | E-ERAD-37 | biostudies-arrayexpress

LFQ Benchmark Dataset - Generation Beta: Assessing Modern Proteomics Instruments and Acquisition Workflows with High-Throughput LC Gradients

Project description:Recent advances in liquid chromatography–mass spectrometry (LC-MS) have accelerated the adoption of high-throughput workflows that deliver deep proteome coverage using minimal sample amounts. This trend is largely driven by single-cell proteomics, where sensitivity and reproducibility are essential. Here, we extend our previous benchmark dataset (PXD028735) that was generated using next-generation LC-MS platforms optimized for rapid proteome analysis. With shorter LC gradients and lower sample amounts, we generated an extensive DDA/DIA dataset on a standardized human-yeast-E. coli hybrid proteome. This new dataset includes data acquired by the Orbitrap Astral, which combines an Orbitrap with a time-of-flight (TOF) mass analyzer, and features new scanning quadrupole-based implementations, extending coverage across different instruments and acquisition strategies. Our comprehensive evaluation highlights how technological advances and reduced LC gradients affect proteome depth, quantitative precision, and cross-instrument consistency. The release of this benchmark dataset via ProteomeXchange (PXD070049), allows for the acceleration of cross-platform algorithm development, enhance data mining strategies, and support the continued standardization of short-gradient, high-throughput LC-MS-based proteomics. 

2025-10-31 | PXD070173 | panorama

Targeted MIP sequencing for healthy samples

Project description:We developed Del-Read, an algorithm targeting medium-sized deletions (6-100 BPs) in short-reads, which are challenging for current variant callers relying on alignment. Our focus was on Micro-Homology mediated End Joining deletions (MMEJ-dels), prevalent in myeloid malignancies. MMEJ-dels follow a distinct pattern, occurring between two homologies, allowing us to generate a comprehensive list of MMEJ-dels in the exome. Using Del-Read, we identified numerous novel germline and somatic MMEJ-dels in Beat AML and TCGA-breast datasets. Validation in 500 healthy individuals confirmed their presence.

2023-09-20 | E-MTAB-13306 | biostudies-arrayexpress

Multi-omic profiling of pathogen-stimulated primary immune cells

Project description:Objectives: To perform long-read transcriptome and proteome profiling of pathogen-stimulated peripheral blood mononuclear cells (PBMCs) from healthy donors. We aim to discover new transcripts and protein isoforms expressed during immune responses to diverse pathogens. Methods: PBMCs were exposed to four microbial stimuli for 24 hours: the TLR4 ligand lipopolysaccharide (LPS), the TLR3 ligand Poly(I:C), heat-inactivated Staphylococcus aureus, Candida albicans, and RPMI medium as negative controls. Long-read sequencing (PacBio) of one donor and secretome proteomics and short-read sequencing of five donors were performed. IsoQuant was used for transcriptome construction, Metamorpheus/FlashLFQ for proteome analysis, and Illumina short-read 3’-end mRNA sequencing for transcript quantification. Results: Long-read transcriptome profiling reveals the expression of novel sequences and isoform switching induced upon pathogen stimulation, including transcripts that are difficult to detect using traditional short-read sequencing. We observe widespread loss of intron retention as a common result of all pathogen stimulations. We highlight novel transcripts of NFKB1 and CASP1 that may indicate novel immunological mechanisms. In general, RNA expression differences did not result in differences in the amounts of secreted proteins. Interindividual differences in the proteome were larger than the differences between stimulated and unstimulated PBMCs. Clustering analysis of secreted proteins revealed a correlation between chemokine (receptor) expression on the RNA and protein levels in C. albicans- and Poly(I:C)-stimulated PBMCs. Conclusion: Isoform aware long-read sequencing of pathogen-stimulated immune cells highlights the potential of these methods to identify novel transcripts, revealing a more complex transcriptome landscape than previously appreciated.

2023-09-16 | PXD045237 | Pride

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data