Dataset Information

Localization and recognition of human action in 3D using transformers.


ABSTRACT: Understanding a person's behavior from their 3D motion sequence is a fundamental problem in computer vision with many applications. An important component of this problem is 3D action localization, which involves recognizing what actions a person is performing, and when the actions occur in the sequence. To promote the progress of the 3D action localization community, we introduce a new, challenging, and more complex benchmark dataset, BABEL-TAL (BT), for 3D action localization. Important baselines and evaluation metrics, as well as human evaluations, are carefully established on this benchmark. We also propose a strong baseline model, i.e., Localizing Actions with Transformers (LocATe), that jointly localizes and recognizes actions in a 3D sequence. The proposed LocATe shows superior performance on BABEL-TAL as well as on the large-scale PKU-MMD dataset, achieving state-of-the-art performance by using only 10% of the labeled training data. Our research could advance the development of more accurate and efficient systems for human behavior analysis, with potential applications in areas such as human-computer interaction and healthcare.

SUBMITTER: Sun J 

PROVIDER: S-EPMC11372174 | biostudies-literature | 2024 Sep

REPOSITORIES: biostudies-literature

Publications

Localization and recognition of human action in 3D using transformers.

Jiankai Sun, Linjiang Huang, Hongsong Wang, Chuanyang Zheng, Jianing Qiu, Md Tauhidul Islam, Enze Xie, Bolei Zhou, Lei Xing, Arjun Chandrasekaran, Michael J Black

Communications Engineering, 2024-09-03, issue 1


Similar Datasets

| S-EPMC10383990 | biostudies-literature
| S-EPMC11622946 | biostudies-literature
| S-EPMC6720755 | biostudies-literature
| S-EPMC7501562 | biostudies-literature
| S-EPMC6477676 | biostudies-literature
| S-EPMC5579071 | biostudies-literature
| S-EPMC11557913 | biostudies-literature
| S-EPMC7727351 | biostudies-literature
| S-EPMC11622993 | biostudies-literature
| S-EPMC10280426 | biostudies-literature