PoseC3D (RGB + Pose)
|
Revisiting Skeleton-based Action Recognition
|
96.40
|
2021-04-28
|
|
π-ViT (RGB + Pose)
|
Just Add $π$! Pose Induced Video Transformers for…
|
96.10
|
2023-11-30
|
|
EPP-Net (Parsing + Pose)
|
Explore Human Parsing Modality for Action Recogni…
|
92.80
|
2024-01-04
|
|
STAR-Transformer (RGB + Pose)
|
STAR-Transformer: A Spatio-temporal Cross Attenti…
|
92.70
|
2022-10-14
|
|
EPAM-Net
|
EPAM-Net: An Efficient Pose-driven Attention-guid…
|
92.40
|
2024-08-10
|
|
π-ViT (RGB only)
|
Just Add $π$! Pose Induced Video Transformers for…
|
91.90
|
2023-11-30
|
|
IPP-Net (Parsing + Pose)
|
Integrating Human Parsing and Pose Network for Hu…
|
91.70
|
2023-07-16
|
|
3DA (RGB + Pose)
|
Cross-Modal Learning with 3D Deformable Attention…
|
91.40
|
2022-12-12
|
|
DSTSA-GCN
|
DSTSA-GCN: Advancing Skeleton-Based Gesture Recog…
|
90.97
|
2025-01-21
|
|
VPN++ (RGB + Pose)
|
VPN++: Rethinking Video-Pose embeddings for under…
|
90.70
|
2021-05-17
|
|
DVANet (RGB only)
|
DVANet: Disentangling View and Action Features fo…
|
90.40
|
2023-12-10
|
|
VPN (RGB + Pose)
|
VPN: Learning Video-Pose Embedding for Activities…
|
86.30
|
2020-07-06
|
|
ST-GCN + AS-GCN w/DH-TCN
|
Vertex Feature Encoding and Hierarchical Temporal…
|
78.30
|
2019-12-20
|
|
Gimme Signals (AIS)
|
Gimme Signals: Discriminative signal encoding for…
|
70.80
|
2020-03-13
|
|
TSRJI
|
Skeleton Image Representation for 3D Action Recog…
|
67.90
|
2019-09-11
|
|
Skelemotion + Yang et al. (skeleton only)
|
SkeleMotion: A New Representation of Skeleton Joi…
|
66.90
|
2019-07-30
|
|