Unknown

Dataset Information

0

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets.


ABSTRACT: Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.

SUBMITTER: Lees WD 

PROVIDER: S-EPMC10310305 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

AIRR community curation and standardised representation for immunoglobulin and T cell receptor germline sets.

Lees William D WD   Christley Scott S   Peres Ayelet A   Kos Justin T JT   Corrie Brian B   Ralph Duncan D   Breden Felix F   Cowell Lindsay G LG   Yaari Gur G   Corcoran Martin M   Karlsson Hedestam Gunilla B GB   Ohlin Mats M   Collins Andrew M AM   Watson Corey T CT   Busse Christian E CE  

Immunoinformatics (Amsterdam, Netherlands) 20230219


Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changin  ...[more]

Similar Datasets

| S-EPMC10884231 | biostudies-literature
| S-EPMC6675132 | biostudies-literature
| S-EPMC6173121 | biostudies-literature
| S-EPMC3966476 | biostudies-literature
| S-EPMC8098023 | biostudies-literature
| S-EPMC6375602 | biostudies-literature
| S-EPMC8131366 | biostudies-literature
| S-EPMC6703969 | biostudies-literature
| S-EPMC1351202 | biostudies-literature
| S-EPMC8487578 | biostudies-literature