Unknown

Dataset Information

0

Pig pangenome graph reveals functional features of non-reference sequences.


ABSTRACT:

Background

The reliance on a solitary linear reference genome has imposed a significant constraint on our comprehensive understanding of genetic variation in animals. This constraint is particularly pronounced for non-reference sequences (NRSs), which have not been extensively studied.

Results

In this study, we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb. Our findings revealed that NRSs were more prevalent in breeds exhibiting greater genetic divergence from the reference genome. Furthermore, we observed that NRSs were rarely found within coding sequences, while NRS insertions were enriched in immune-related Gene Ontology terms. Notably, our investigation also unveiled a close association between novel genes and the immune capacity of pigs. We observed substantial differences in terms of frequencies of NRSs between Eastern and Western pigs, and the heat-resistant pigs exhibited a substantial number of NRS insertions in an 11.6 Mb interval on chromosome X. Additionally, we discovered a 665 bp insertion in the fourth intron of the TNFRSF19 gene that may be associated with the ability of heat tolerance in Southern Chinese pigs.

Conclusions

Our findings demonstrate the potential of a graph genome approach to reveal important functional features of NRSs in pig populations.

SUBMITTER: Miao J 

PROVIDER: S-EPMC10882747 | biostudies-literature | 2024 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Pig pangenome graph reveals functional features of non-reference sequences.

Miao Jian J   Wei Xingyu X   Cao Caiyun C   Sun Jiabao J   Xu Yuejin Y   Zhang Zhe Z   Wang Qishan Q   Pan Yuchun Y   Wang Zhen Z  

Journal of animal science and biotechnology 20240222 1


<h4>Background</h4>The reliance on a solitary linear reference genome has imposed a significant constraint on our comprehensive understanding of genetic variation in animals. This constraint is particularly pronounced for non-reference sequences (NRSs), which have not been extensively studied.<h4>Results</h4>In this study, we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb. Our findings revealed that NRSs were more prevalent in b  ...[more]

Similar Datasets

| S-EPMC10954445 | biostudies-literature
| S-EPMC11804092 | biostudies-literature
| S-EPMC11568064 | biostudies-literature
| S-EPMC10664547 | biostudies-literature
| S-EPMC11669723 | biostudies-literature
| S-EPMC10172123 | biostudies-literature
| S-EPMC3774778 | biostudies-literature
| S-EPMC8157972 | biostudies-literature
| S-EPMC6796347 | biostudies-literature
| S-EPMC10322713 | biostudies-literature