ML Research Wiki / Benchmarks / Action Recognition / NTU RGB+D 120

NTU RGB+D 120

Action Recognition Benchmark

Performance Over Time

📊 Showing 16 results | 📏 Metric: Accuracy (Cross-Setup)

Top Performing Models

Rank Model Paper Accuracy (Cross-Setup) Date Code
1 PoseC3D (RGB + Pose) Revisiting Skeleton-based Action Recognition 96.40 2021-04-28 📦 open-mmlab/mmaction2 📦 kennymckormick/pyskl 📦 txyugood/PaddlePoseC3D 📦 sandman002/One-Style-is-All-You-Need-to-Generate-a-Video
2 π-ViT (RGB + Pose) 📚 Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living 96.10 2023-11-30 📦 dominickrei/pi-vit
3 EPP-Net (Parsing + Pose) Explore Human Parsing Modality for Action Recognition 92.80 2024-01-04 📦 liujf69/EPP-Net-Action
4 STAR-Transformer (RGB + Pose) STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition 92.70 2022-10-14 -
5 EPAM-Net EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition 92.40 2024-08-10 📦 ahmed-nady/multimodal-action-recognition
6 π-ViT (RGB only) 📚 Just Add $π$! Pose Induced Video Transformers for Understanding Activities of Daily Living 91.90 2023-11-30 📦 dominickrei/pi-vit
7 IPP-Net (Parsing + Pose) Integrating Human Parsing and Pose Network for Human Action Recognition 91.70 2023-07-16 📦 liujf69/ipp-net-parsing
8 3DA (RGB + Pose) Cross-Modal Learning with 3D Deformable Attention for Action Recognition 91.40 2022-12-12 -
9 DSTSA-GCN DSTSA-GCN: Advancing Skeleton-Based Gesture Recognition with Semantic-Aware Spatio-Temporal Topology Modeling 90.97 2025-01-21 📦 HuCui2022/DSTSA-GCN_Gesture
10 VPN++ (RGB + Pose) 📚 VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living 90.70 2021-05-17 📦 srijandas07/vpnplusplus

All Papers (16)