Dataset Information

Key language markers of depression on social media depend on race.


ABSTRACT: Depression has robust natural language correlates and can increasingly be measured in language using predictive models. However, despite evidence that language use varies as a function of individual demographic features (e.g., age, gender), previous work has not systematically examined whether and how depression's association with language varies by race. We examine how race moderates the relationship between language features (i.e., first-person pronouns and negative emotions) from social media posts and self-reported depression, in a matched sample of Black and White English speakers in the United States. Our findings reveal moderating effects of race: While depression severity predicts I-usage in White individuals, it does not in Black individuals. White individuals use more belongingness and self-deprecation-related negative emotions. Machine learning models trained on similar amounts of data to predict depression severity performed poorly when tested on Black individuals, even when they were trained exclusively using the language of Black individuals. In contrast, analogous models tested on White individuals performed relatively well. Our study reveals surprising race-based differences in the expression of depression in natural language and highlights the need to understand these effects better, especially before language-based models for detecting psychological phenomena are integrated into clinical practice.

SUBMITTER: Rai S 

PROVIDER: S-EPMC10998627 | biostudies-literature | 2024 Apr

REPOSITORIES: biostudies-literature

Publications

Key language markers of depression on social media depend on race.

Sunny Rai, Elizabeth C. Stade, Salvatore Giorgi, Ashley Francisco, Lyle H. Ungar, Brenda Curtis, Sharath C. Guntuku

Proceedings of the National Academy of Sciences of the United States of America, 2024-03-26, issue 14


Similar Datasets

| S-EPMC9052361 | biostudies-literature
| S-EPMC10955894 | biostudies-literature
| S-EPMC8847554 | biostudies-literature
| S-EPMC9714561 | biostudies-literature
| S-EPMC7994843 | biostudies-literature
| S-EPMC9837664 | biostudies-literature
| S-EPMC7551727 | biostudies-literature
| S-EPMC8627225 | biostudies-literature
| S-EPMC10308542 | biostudies-literature
| S-EPMC9992514 | biostudies-literature