Unknown

Dataset Information

0

Enhancing the Classification of Aphasia: A Statistical Analysis Using Connected Speech.


ABSTRACT:

Background

Large shared databases and automated language analyses allow for the application of new data analysis techniques that can shed new light on the connected speech of people with aphasia (PWA).

Aims

To identify coherent clusters of PWA based on language output using unsupervised statistical algorithms and to identify features that are most strongly associated with those clusters.

Methods & procedures

Clustering and classification methods were applied to language production data from 168 PWA. Language samples were from a standard discourse protocol tapping four genres: free speech personal narratives, picture descriptions, Cinderella storytelling, procedural discourse.

Outcomes & results

Seven distinct clusters of PWA were identified by the K-means algorithm. Using the random forests algorithm, a classification tree was proposed and validated, showing 91% agreement with the cluster assignments. This representative tree used only two variables to divide the data into distinct groups: total words from free speech tasks and total closed class words from the Cinderella storytelling task.

Conclusion

Connected speech data can be used to distinguish PWA into coherent groups, providing insight into traditional aphasia classifications, factors that may guide discourse research and clinical work.

SUBMITTER: Fromm D 

PROVIDER: S-EPMC9708051 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhancing the Classification of Aphasia: A Statistical Analysis Using Connected Speech.

Fromm Davida D   Greenhouse Joel J   Pudil Mitchell M   Shi Yichun Y   MacWhinney Brian B  

Aphasiology 20210921 12


<h4>Background</h4>Large shared databases and automated language analyses allow for the application of new data analysis techniques that can shed new light on the connected speech of people with aphasia (PWA).<h4>Aims</h4>To identify coherent clusters of PWA based on language output using unsupervised statistical algorithms and to identify features that are most strongly associated with those clusters.<h4>Methods & procedures</h4>Clustering and classification methods were applied to language pro  ...[more]

Similar Datasets

| S-EPMC10793520 | biostudies-literature
| S-EPMC11370793 | biostudies-literature
| S-EPMC9381031 | biostudies-literature
| S-EPMC2892940 | biostudies-literature
| S-EPMC11848170 | biostudies-literature
| S-EPMC4522343 | biostudies-literature
| S-EPMC5560767 | biostudies-other
| S-EPMC6822609 | biostudies-literature
| S-EPMC2720937 | biostudies-literature
| S-EPMC7985793 | biostudies-literature