Unknown

Dataset Information

0

An open source knowledge graph ecosystem for the life sciences.


ABSTRACT: Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG construction methods provide robust tooling at the cost of fixed or limited choices among knowledge representation models. PheKnowLator (Phenotype Knowledge Translator) is a semantic ecosystem for automating the FAIR (Findable, Accessible, Interoperable, and Reusable) construction of ontologically grounded KGs with fully customizable knowledge representation. The ecosystem includes KG construction resources (e.g., data preparation APIs), analysis tools (e.g., SPARQL endpoint resources and abstraction algorithms), and benchmarks (e.g., prebuilt KGs). We evaluated the ecosystem by systematically comparing it to existing open-source KG construction methods and by analyzing its computational performance when used to construct 12 different large-scale KGs. With flexible knowledge representation, PheKnowLator enables fully customizable KGs without compromising performance or usability.

SUBMITTER: Callahan TJ 

PROVIDER: S-EPMC11009265 | biostudies-literature | 2024 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

An open source knowledge graph ecosystem for the life sciences.

Callahan Tiffany J TJ   Tripodi Ignacio J IJ   Stefanski Adrianne L AL   Cappelletti Luca L   Taneja Sanya B SB   Wyrwa Jordan M JM   Casiraghi Elena E   Matentzoglu Nicolas A NA   Reese Justin J   Silverstein Jonathan C JC   Hoyt Charles Tapley CT   Boyce Richard D RD   Malec Scott A SA   Unni Deepak R DR   Joachimiak Marcin P MP   Robinson Peter N PN   Mungall Christopher J CJ   Cavalleri Emanuele E   Fontana Tommaso T   Valentini Giorgio G   Mesiti Marco M   Gillenwater Lucas A LA   Santangelo Brook B   Vasilevsky Nicole A NA   Hoehndorf Robert R   Bennett Tellen D TD   Ryan Patrick B PB   Hripcsak George G   Kahn Michael G MG   Bada Michael M   Baumgartner William A WA   Hunter Lawrence E LE  

Scientific data 20240411 1


Translational research requires data at multiple scales of biological organization. Advancements in sequencing and multi-omics technologies have increased the availability of these data, but researchers face significant integration challenges. Knowledge graphs (KGs) are used to model complex phenomena, and methods exist to construct them automatically. However, tackling complex biomedical integration problems requires flexibility in the way knowledge is modeled. Moreover, existing KG constructio  ...[more]

Similar Datasets

| S-EPMC7077981 | biostudies-literature
| S-EPMC11829977 | biostudies-literature
| S-EPMC7815964 | biostudies-literature
| S-EPMC7198240 | biostudies-literature
| S-EPMC9940622 | biostudies-literature
| S-EPMC9595524 | biostudies-literature
| S-EPMC8219732 | biostudies-literature
| S-EPMC8130828 | biostudies-literature
| S-EPMC6153488 | biostudies-other