Unknown

Dataset Information

0

Can accurate demographic information about people who use prescription medications nonmedically be derived from Twitter?


ABSTRACT: Traditional substance use (SU) surveillance methods, such as surveys, incur substantial lags. Due to the continuously evolving trends in SU, insights obtained via such methods are often outdated. Social media-based sources have been proposed for obtaining timely insights, but methods leveraging such data cannot typically provide fine-grained statistics about subpopulations, unlike traditional approaches. We address this gap by developing methods for automatically characterizing a large Twitter nonmedical prescription medication use (NPMU) cohort (n = 288,562) in terms of age-group, race, and gender. Our natural language processing and machine learning methods for automated cohort characterization achieved 0.88 precision (95% CI:0.84 to 0.92) for age-group, 0.90 (95% CI: 0.85 to 0.95) for race, and 94% accuracy (95% CI: 92 to 97) for gender, when evaluated against manually annotated gold-standard data. We compared automatically derived statistics for NPMU of tranquilizers, stimulants, and opioids from Twitter with statistics reported in the National Survey on Drug Use and Health (NSDUH) and the National Emergency Department Sample (NEDS). Distributions automatically estimated from Twitter were mostly consistent with the NSDUH [Spearman r: race: 0.98 (< 0.005); age-group: 0.67 (< 0.005); gender: 0.66 (= 0.27)] and NEDS, with 34/65 (52.3%) of the Twitter-based estimates lying within 95% CIs of estimates from the traditional sources. Explainable differences (e.g., overrepresentation of younger people) were found for age-group-related statistics. Our study demonstrates that accurate subpopulation-specific estimates about SU, particularly NPMU, may be automatically derived from Twitter to obtain earlier insights about targeted subpopulations compared to traditional surveillance approaches.

SUBMITTER: Yang YC 

PROVIDER: S-EPMC9974473 | biostudies-literature | 2023 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Can accurate demographic information about people who use prescription medications nonmedically be derived from Twitter?

Yang Yuan-Chi YC   Al-Garadi Mohammed Ali MA   Love Jennifer S JS   Cooper Hannah L F HLF   Perrone Jeanmarie J   Sarker Abeed A  

Proceedings of the National Academy of Sciences of the United States of America 20230214 8


Traditional substance use (SU) surveillance methods, such as surveys, incur substantial lags. Due to the continuously evolving trends in SU, insights obtained via such methods are often outdated. Social media-based sources have been proposed for obtaining timely insights, but methods leveraging such data cannot typically provide fine-grained statistics about subpopulations, unlike traditional approaches. We address this gap by developing methods for automatically characterizing a large Twitter n  ...[more]

Similar Datasets

| S-EPMC11000278 | biostudies-literature
| S-EPMC6128140 | biostudies-literature
| S-EPMC11270238 | biostudies-literature
| S-EPMC6468981 | biostudies-literature
| S-EPMC4805522 | biostudies-literature
| S-EPMC5399219 | biostudies-literature
| S-EPMC10123905 | biostudies-literature
| S-EPMC8047635 | biostudies-literature
| S-EPMC5997528 | biostudies-literature
| S-EPMC2742307 | biostudies-other