Project description:The Yeonsan Ogye (Ogye) is the rare black chicken breed domesticated in Korean peninsula, which has been noted for entire black color upon its appearances including feather, skin, comb, eyes, shank, claws and internal organs. In this study, whole genome, transcriptome and epigenome sequencings of Ogye were performed using high-throughput NGS sequencing platforms. We have produced Illumina short-reads (Paired-End, Mate-Pair and FOSMID) and PacBio long-reads for whole genome sequencing (WGS), 1.4 billion reads for RNA-seq, and 123 million reads for RRBS (reduced representation bisulfite sequencing) data. Using WGS data, Ogye genome has been assembled, and coding/non-coding transcriptome maps were constructed on Ogye genome given largescale sequencing data. We have predicted 17,472 (3,550 newly annotated and 13,922 known) protein-coding transcripts, and 9,443 (6,689 novel and 2,754 known) long non-coding RNAs (lncRNAs).
Project description:The Yeonsan Ogye (Ogye) is the rare black chicken breed domesticated in Korean peninsula, which has been noted for entire black color upon its appearances including feather, skin, comb, eyes, shank, claws and internal organs. In this study, whole genome, transcriptome and epigenome sequencings of Ogye were performed using high-throughput NGS sequencing platforms. We have produced Illumina short-reads (Paired-End, Mate-Pair and FOSMID) and PacBio long-reads for whole genome sequencing (WGS), 1.4 billion reads for RNA-seq, and 123 million reads for RRBS (reduced representation bisulfite sequencing) data. Using WGS data, Ogye genome has been assembled, and coding/non-coding transcriptome maps were constructed on Ogye genome given largescale sequencing data. We have predicted 17,472 (3,550 newly annotated and 13,922 known) protein-coding transcripts, and 9,443 (6,689 novel and 2,754 known) long non-coding RNAs (lncRNAs).
Project description:Chromatin immunoprecipitation analysis of CENH3 in the Arabidopsis thaliana accessions Col-0, Ler-0, Cvi-0 and Tanz-1 was performed in order to align reads to PacBio HiFi genome assemblies which contain complete centromere repeat arrays.
Project description:We used PacBio data to identify more reliable transcripts from hESC, based on which we can estimate gene/transcript abundance better from Illumina data. PacBio long reads and Illumina short reads were generated from the same hESC cell line H1. PacBio reads were error-corrected by Illumina reads to identify transcripts. rSeq is used to estimate gene/transcript abundance of the identified transcriptome.
Project description:<p class='ql-align-justify'>Megasphaera hexanoica KCCM 43214T, isolated from cow rumen, is capable of producing medium-chain carboxylic acids such as n-caproate and n-caprylate. In this study, we present a high-quality genome assembly, along with intracellular metabolomic profiling and pangenomic analysis. Illumina sequencing generated 2.3 Mbp from 15,293,634 reads with a GC content of 49.5%, while PacBio HiFi sequencing produced 331.5 Mbp across 45,266 reads, with an average read length of 7,323 bp and a HiFi read N50 of 8,214 bp. Hybrid assembly of short and long reads resulted in a single 2.88 Mbp contig, containing 2,075-2,083 unique genes. A genome-scale metabolic model was constructed, to evaluate its metabolic capabilities under specific growth conditions. Intracellular metabolomic analysis of cells grown in fructose medium and lactate medium revealed key metabolic activities associated with chain elongation. Pangenomic analysis across nine annotated genomes identified 6,721 orthologous gene using OrthoMCL, emphasizing the genetic and functional diversity within the Megasphaera genus. This dataset offers valuable insights into the metabolism and biotechnological potential of M. hexanoica KCCM 43214T.</p>
Project description:Short-read WGS datasets of 15 eHHV-6B-positive Japanese subjects (Illumina WGS) and long-read WGS datasets of 3 eHHV-6B-positive Japanese subjects with SLE (PacBio 30x HiFi long-read sequencing).
Project description:Short-read WGS datasets of 15 eHHV-6B-positive Japanese subjects (Illumina WGS) and long-read WGS datasets of 3 eHHV-6B-positive Japanese subjects with SLE (PacBio 30x HiFi long-read sequencing).
Project description:The 2q31 region is commonly associated with pathogenic alleles of the HOXD cluster leading to various clinical phenotypes related to skeletal development. We present a patient with short stature, tetralogy of Fallot, and multiple congenital anomalies. Initial array comparative genomic hybridization (aCGH) revealed a de novo complex genomic rearrangement (CGR) spanning 2q31 in proband characterized as a triplication (TRP)-duplication (DUP)-triplication (TRP). Subsequent analysis applying combined next-generation sequencing methodologies including whole-genome sequencing (WGS) short-reads, PacBio WGS long-reads, and optical genome mapping (OGM) indicated two additional duplications on each end of the rearrangement, consisting of DUP-TRP-DUP-TRP-DUP. This involves three breakpoint junctions, of which each were resolved down to a nucleotide level. This genomic catastrophic event includes many disease-causing genes, and analysis of each gene and its known disorders was required in order to connect genotype to phenotype in a unique CNV.