
Dataset Information


Detection of emerging drugs involved in overdose via diachronic word embeddings of substances discussed on social media.


ABSTRACT: Substances involved in overdose deaths have shifted over time and continue to undergo transition. Early detection of emerging drugs involved in overdose is a major challenge for traditional public health data systems. While novel social media data have shown promise, there is a continued need for robust natural language processing approaches that can identify emerging substances. Consequently, we developed a new metric, the relative similarity ratio, based on diachronic word embeddings to measure movement in the semantic proximity of individual substance words to 'overdose' over time. Our analysis of 64,420,376 drug-related posts made between January 2011 and December 2018 on Reddit, the largest online forum site, reveals that this approach successfully identified fentanyl, the most significant emerging substance in the overdose epidemic, >1 year earlier than traditional public health data systems. Use of diachronic word embeddings may enable improved identification of emerging substances involved in drug overdose, thereby improving the timeliness of prevention and treatment activities.
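The abstract names the relative similarity ratio but does not define it in this record. The sketch below illustrates one plausible reading, under stated assumptions: each time period has its own embedding model, a substance's proximity to 'overdose' is the cosine similarity within that period, and the ratio divides each period's similarity by the similarity in a baseline period. The function names, the baseline choice, and the toy vectors are all hypothetical and are not taken from the publication.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def relative_similarity_ratio(embeddings_by_period, word, anchor="overdose", baseline=None):
    """Assumed definition: similarity(word, anchor) in each period divided by
    the same similarity in a baseline period (earliest period by default).

    embeddings_by_period maps a period label to a dict of word -> vector.
    Similarities are computed inside each period's own embedding space, so
    the spaces do not need to be aligned with one another.
    """
    periods = sorted(embeddings_by_period)
    if baseline is None:
        baseline = periods[0]
    base_vectors = embeddings_by_period[baseline]
    base = cosine(base_vectors[word], base_vectors[anchor])
    if base == 0:
        raise ValueError("baseline similarity is zero; the ratio is undefined")
    return {
        period: cosine(embeddings_by_period[period][word],
                       embeddings_by_period[period][anchor]) / base
        for period in periods
    }


# Toy 2-D vectors, invented for illustration only (not real data).
toy = {
    2011: {"overdose": [1.0, 0.0], "fentanyl": [3.0, 4.0]},
    2012: {"overdose": [0.0, 1.0], "fentanyl": [3.0, 4.0]},
    2013: {"overdose": [1.0, 0.0], "fentanyl": [1.0, 0.0]},
}
ratios = relative_similarity_ratio(toy, "fentanyl")
for year, value in ratios.items():
    print(year, round(value, 3))
```

Under this assumed definition, a ratio that rises above 1 over successive periods would indicate that the substance word is moving closer to 'overdose' than it was at baseline. The published metric may normalize differently.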

SUBMITTER: Wright AP 

PROVIDER: S-EPMC10901232 | biostudies-literature | 2021 Jul

REPOSITORIES: biostudies-literature


Publications

Detection of emerging drugs involved in overdose via diachronic word embeddings of substances discussed on social media.

Austin P. Wright, Christopher M. Jones, Duen Horng Chau, R. Matthew Gladden, Steven A. Sumner

Journal of Biomedical Informatics, 2021-05-26



Similar Datasets

| S-EPMC8520005 | biostudies-literature
| S-EPMC8800511 | biostudies-literature
| S-EPMC7861243 | biostudies-literature
| S-EPMC6510737 | biostudies-literature
| S-EPMC7309261 | biostudies-literature
| S-EPMC7856086 | biostudies-literature
| S-EPMC5910851 | biostudies-literature