ABSTRACT:
SUBMITTER: Liu H
PROVIDER: S-EPMC10283113 | biostudies-literature | 2023
REPOSITORIES: biostudies-literature
Hao Liu, Nour Moustafa-Fahmy, Casey Ta, Chunhua Weng
AMIA Joint Summits on Translational Science Proceedings, 2023-06-16
This reproducibility study presents an algorithm that factors race distribution data from clinical research study samples into the training of biomedical embeddings. We extracted 12,864 PubMed abstracts published between January 1, 2000 and January 1, 2022 and weighted them based on the race distribution data extracted from their corresponding clinical trials registered on ClinicalTrials.gov. We trained Word2vec and BERT embeddings and evaluated their performance on predicting len ...