Unknown

Dataset Information

0

Scaling up DNA digital data storage by efficiently predicting DNA hybridisation using deep learning.


ABSTRACT: Deoxyribonucleic acid (DNA) has shown great promise in enabling computational applications, most notably in the fields of DNA digital data storage and DNA computing. Information is encoded as DNA strands, which will naturally bind in solution, thus enabling search and pattern-matching capabilities. Being able to control and predict the process of DNA hybridisation is crucial for the ambitious future of Hybrid Molecular-Electronic Computing. Current tools are, however, limited in terms of throughput and applicability to large-scale problems. We present the first comprehensive study of machine learning methods applied to the task of predicting DNA hybridisation. For this purpose, we introduce an in silico-generated hybridisation dataset of over 2.5 million data points, enabling the use of deep learning. Depending on hardware, we achieve a reduction in inference time ranging from one to over two orders of magnitude compared to the state-of-the-art, while retaining high fidelity. We then discuss the integration of our methods in modern, scalable workflows.

SUBMITTER: Buterez D 

PROVIDER: S-EPMC8519920 | biostudies-literature | 2021 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Scaling up DNA digital data storage by efficiently predicting DNA hybridisation using deep learning.

Buterez David D  

Scientific reports 20211015 1


Deoxyribonucleic acid (DNA) has shown great promise in enabling computational applications, most notably in the fields of DNA digital data storage and DNA computing. Information is encoded as DNA strands, which will naturally bind in solution, thus enabling search and pattern-matching capabilities. Being able to control and predict the process of DNA hybridisation is crucial for the ambitious future of Hybrid Molecular-Electronic Computing. Current tools are, however, limited in terms of through  ...[more]

Similar Datasets

| S-EPMC10858265 | biostudies-literature
| S-EPMC10700131 | biostudies-literature
| S-EPMC8252642 | biostudies-literature
| S-EPMC8612674 | biostudies-literature
| S-EPMC6107285 | biostudies-literature
| S-EPMC8501764 | biostudies-literature
| S-EPMC9271978 | biostudies-literature
| S-EPMC6300887 | biostudies-other
| S-EPMC11558088 | biostudies-literature
| S-EPMC7644757 | biostudies-literature