Unknown

Dataset Information

0

IGLoo: Profiling the Immunoglobulin Heavy chain locus in Lymphoblastoid Cell Lines with PacBio High-Fidelity Sequencing reads.


ABSTRACT: New high-quality human genome assemblies derived from lymphoblastoid cell lines (LCLs) provide reference genomes and pangenomes for genomics studies. However, the characteristics of LCLs pose technical challenges to profiling immunoglobulin (IG) genes. IG loci in LCLs contain a mixture of germline and somatically recombined haplotypes, making them difficult to genotype or assemble accurately. To address these challenges, we introduce IGLoo, a software tool that implements novel methods for analyzing sequence data and genome assemblies derived from LCLs. IGLoo characterizes somatic V(D)J recombination events in the sequence data and identifies the breakpoints and missing IG genes in the LCL-based assemblies. Furthermore, IGLoo implements a novel reassembly framework to improve germline assembly quality by integrating information about somatic events and population structural variantions in the IG loci. We applied IGLoo to study the assemblies from the Human Pangenome Reference Consortium, providing new insights into the mechanisms, gene usage, and patterns of V(D)J recombination, causes of assembly fragmentation in the IG heavy chain (IGH) locus, and improved representation of the IGH assemblies.

SUBMITTER: Lin MJ 

PROVIDER: S-EPMC11291057 | biostudies-literature | 2024 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

IGLoo: Profiling the Immunoglobulin Heavy chain locus in Lymphoblastoid Cell Lines with PacBio High-Fidelity Sequencing reads.

Lin Mao-Jan MJ   Langmead Ben B   Safonova Yana Y  

bioRxiv : the preprint server for biology 20240723


New high-quality human genome assemblies derived from lymphoblastoid cell lines (LCLs) provide reference genomes and pangenomes for genomics studies. However, the characteristics of LCLs pose technical challenges to profiling immunoglobulin (IG) genes. IG loci in LCLs contain a mixture of germline and somatically recombined haplotypes, making them difficult to genotype or assemble accurately. To address these challenges, we introduce IGLoo, a software tool that implements novel methods for analy  ...[more]

Similar Datasets

| S-EPMC5025152 | biostudies-literature
2013-06-01 | E-GEOD-47129 | biostudies-arrayexpress
2013-06-01 | GSE47129 | GEO
| S-EPMC2268805 | biostudies-literature
| S-EPMC6380546 | biostudies-literature
| S-EPMC10362067 | biostudies-literature
| S-EPMC2212390 | biostudies-literature
2018-12-21 | GSE113938 | GEO
| S-EPMC4000994 | biostudies-literature
| S-EPMC3393677 | biostudies-literature