
Dataset Information


Privacy-preserving genotype imputation with fully homomorphic encryption.


ABSTRACT: Genotype imputation is the inference of unknown genotypes using known population structure observed in large genomic datasets; it can further our understanding of phenotype-genotype relationships and is useful for QTL mapping and GWASs. However, the compute-intensive nature of genotype imputation can overwhelm local servers for computation and storage. Hence, many researchers are moving toward using cloud services, raising privacy concerns. We address these concerns by developing an efficient, privacy-preserving algorithm called p-Impute. Our method uses homomorphic encryption, allowing calculations on ciphertext, thereby avoiding the decryption of private genotypes in the cloud. It is similar to k-nearest neighbor approaches, inferring missing genotypes in a genomic block based on the SNP genotypes of genetically related individuals in the same block. Our results demonstrate accuracy in agreement with the state-of-the-art plaintext solutions. Moreover, p-Impute is scalable to real-world applications as its memory and time requirements increase linearly with the increasing number of samples. p-Impute is freely available for download here: https://doi.org/10.5281/zenodo.5542001.
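The neighbor-based idea described in the abstract can be illustrated with a minimal plaintext sketch. This is not the p-Impute implementation: it runs on unencrypted data, and the 0/1/2 genotype coding, the mismatch-count distance, the majority vote, and the names impute_block, MISSING, and k are all assumptions made for illustration. In a homomorphic-encryption setting, the comparisons and votes would have to be evaluated on ciphertext instead.

```python
from collections import Counter

MISSING = None  # hypothetical marker for an untyped genotype


def impute_block(target, reference, k=3):
    """Fill missing genotypes in one genomic block (plaintext illustration).

    target:    list of genotypes (assumed coding 0/1/2), MISSING where unknown
    reference: list of fully typed samples covering the same SNPs of the block
    k:         number of closest reference samples allowed to vote
    """
    # Positions in the block where the target sample has an observed genotype
    known = [i for i, g in enumerate(target) if g is not MISSING]

    # Distance = number of observed SNPs where the reference sample differs
    def distance(ref):
        return sum(1 for i in known if ref[i] != target[i])

    # Keep the k reference samples most similar to the target in this block
    neighbors = sorted(reference, key=distance)[:k]

    imputed = list(target)
    for i, g in enumerate(target):
        if g is MISSING:
            # Majority vote among the neighbors at the missing position
            votes = Counter(ref[i] for ref in neighbors)
            imputed[i] = votes.most_common(1)[0][0]
    return imputed
```

For example, with three reference samples that agree with the target on every observed SNP, the missing genotype is taken from the majority value those samples carry at that position. A real pipeline would also need to handle ties and phased haplotypes, which this sketch ignores.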

SUBMITTER: Gursoy G 

PROVIDER: S-EPMC8857019 | biostudies-literature | 2022 Feb

REPOSITORIES: biostudies-literature


Publications

Privacy-preserving genotype imputation with fully homomorphic encryption.

Gürsoy G, Chielle E, Brannon CM, Maniatakos M, Gerstein M

Cell Systems, 2021-11-09, issue 2



Similar Datasets

| S-EPMC9886900 | biostudies-literature
| S-EPMC11529865 | biostudies-literature
| S-EPMC10437415 | biostudies-literature
| S-EPMC9632244 | biostudies-literature
| S-EPMC8505638 | biostudies-literature
| S-EPMC9898842 | biostudies-literature
| S-EPMC11262700 | biostudies-literature
| S-EPMC8542641 | biostudies-literature
| S-EPMC7755539 | biostudies-literature
| S-EPMC9041281 | biostudies-literature