Unknown

Dataset Information

0

A cloud-based pipeline for analysis of FHIR and long-read data.


ABSTRACT:

Motivation

As genome sequencing becomes cheaper and more accurate, it is becoming increasingly viable to merge this data with electronic health information to inform clinical decisions.

Results

In this work, we demonstrate a full pipeline for working with both PacBio sequencing data and clinical FHIR® data, from initial data to tertiary analysis. The electronic health records are stored in FHIR® (Fast Healthcare Interoperability Resource) format, the current leading standard for healthcare data exchange. For the genomic data, we perform variant calling on long-read PacBio HiFi data using Cromwell on Azure. Both data formats are parsed, processed and merged in a single scalable pipeline which securely performs tertiary analyses using cloud-based Jupyter notebooks. We include three example applications: exporting patient information to a database, clustering patients and performing a simple pharmacogenomic study.

Availability and implementation

https://github.com/microsoft/genomicsnotebook/tree/main/fhirgenomics.

Supplementary information

Supplementary data are available at Bioinformatics Advances online.

SUBMITTER: Dunn T 

PROVIDER: S-EPMC9872570 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

altmetric image

Publications

A cloud-based pipeline for analysis of FHIR and long-read data.

Dunn Tim T   Cosgun Erdal E  

Bioinformatics advances 20230120 1


<h4>Motivation</h4>As genome sequencing becomes cheaper and more accurate, it is becoming increasingly viable to merge this data with electronic health information to inform clinical decisions.<h4>Results</h4>In this work, we demonstrate a full pipeline for working with both PacBio sequencing data and clinical FHIR<sup>®</sup> data, from initial data to tertiary analysis. The electronic health records are stored in FHIR<sup>®</sup> (Fast Healthcare Interoperability Resource) format, the current  ...[more]

Similar Datasets

| S-EPMC9781625 | biostudies-literature
| S-EPMC3738164 | biostudies-literature
| S-EPMC4709609 | biostudies-literature
| S-EPMC11601321 | biostudies-literature
| S-EPMC6451199 | biostudies-literature
| S-EPMC3411592 | biostudies-literature
| S-EPMC6993992 | biostudies-literature
| S-EPMC3512137 | biostudies-literature
2024-03-15 | GSE243114 | GEO
| S-EPMC7931822 | biostudies-literature