Unknown

Dataset Information

0

Limitations of lymphoblastoid cell lines for establishing genetic reference datasets in the immunoglobulin loci.


ABSTRACT: Lymphoblastoid cell lines (LCLs) have been critical to establishing genetic resources for biomedical science. They have been used extensively to study human genetic diversity, genome function, and inform the development of tools and methodologies for augmenting disease genetics research. While the validity of variant callsets from LCLs has been demonstrated for most of the genome, previous work has shown that DNA extracted from LCLs is modified by V(D)J recombination within the immunoglobulin (IG) loci, regions that harbor antibody genes critical to immune system function. However, the impacts of V(D)J on short read sequencing data generated from LCLs has not been extensively investigated. In this study, we used LCL-derived short read sequencing data from the 1000 Genomes Project (n = 2,504) to identify signatures of V(D)J recombination. Our analyses revealed sample-level impacts of V(D)J recombination that varied depending on the degree of inferred monoclonality. We showed that V(D)J associated somatic deletions impacted genotyping accuracy, leading to adulterated population-level estimates of allele frequency and linkage disequilibrium. These findings illuminate limitations of using LCLs and short read data for building genetic resources in the IG loci, with implications for interpreting previous disease association studies in these regions.

SUBMITTER: Rodriguez OL 

PROVIDER: S-EPMC8668129 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5667475 | biostudies-literature
| S-EPMC3613588 | biostudies-literature
| S-EPMC6747003 | biostudies-literature
| S-EPMC7430882 | biostudies-literature
| S-EPMC6804467 | biostudies-literature
| S-EPMC3370055 | biostudies-other
| S-EPMC2975793 | biostudies-literature
| S-EPMC3290781 | biostudies-literature
| S-EPMC6476876 | biostudies-literature