Dataset Information

Characterization of the immunoglobulin lambda chain locus from diverse populations reveals extensive genetic variation.


ABSTRACT: Immunoglobulins (IGs), crucial components of the adaptive immune system, are encoded by three genomic loci. However, the complexity of the IG loci severely limits the effective use of short-read sequencing and, with it, our knowledge of population diversity in these loci. We leveraged existing long-read whole-genome sequencing (WGS) data, fosmid technology, and IG-targeted single-molecule, real-time (SMRT) long-read sequencing (IG-Cap) to create haplotype-resolved assemblies of the IG Lambda (IGL) locus from 6 ethnically diverse individuals. In addition, we generated 10 diploid assemblies of IGL from a diverse cohort of individuals using IG-Cap. From these 16 individuals, we identified significant allelic diversity, including 36 novel IGLV alleles. We also observed highly elevated single nucleotide variant (SNV) density in IGLV genes relative to IGL intergenic and genomic background SNV density. By comparing SNV calls between our high-quality assemblies and existing short-read datasets from the same individuals, we show a high propensity for false positives in the short-read datasets. Finally, for the first time, we nucleotide-resolved common 5-10 kb duplications in the IGLC region that contain functional IGLJ and IGLC genes. Together, these data represent a significant advancement in our understanding of genetic variation and population diversity in the IGL locus.

SUBMITTER: Gibson WS 

PROVIDER: S-EPMC10041605 | biostudies-literature | 2023 Feb

REPOSITORIES: biostudies-literature

Publications

Characterization of the immunoglobulin lambda chain locus from diverse populations reveals extensive genetic variation.

Gibson WS, Rodriguez OL, Shields K, Silver CA, Dorgham A, Emery M, Deikus G, Sebra R, Eichler EE, Bashir A, Smith ML, Watson CT

Genes and Immunity, 2022-12-21, issue 1


Similar Datasets

| S-EPMC11327106 | biostudies-literature
| S-EPMC3695908 | biostudies-literature
| S-EPMC10362067 | biostudies-literature
2013-06-01 | E-GEOD-47129 | biostudies-arrayexpress
| S-EPMC4801776 | biostudies-literature
| S-EPMC5025152 | biostudies-literature
2013-06-01 | GSE47129 | GEO
| S-EPMC2742984 | biostudies-literature
| S-EPMC1206658 | biostudies-other
| S-EPMC4382588 | biostudies-literature