DreamVLA
|
DreamVLA: A Vision-Language-Action Model Dreamed …
|
4.44
|
2025-07-06
|
|
VPP
|
Video Prediction Policy: A Generalist Robot Polic…
|
4.29
|
2024-12-19
|
|
RoboVLMs
|
Towards Generalist Robot Policies: What Matters i…
|
4.25
|
2024-12-18
|
|
Openhelix
|
OpenHelix: A Short Survey, Empirical Analysis, an…
|
4.08
|
2025-05-06
|
|
UP-VLA
|
UP-VLA: A Unified Understanding and Prediction Mo…
|
4.08
|
2025-01-31
|
|
GR-MG
|
GR-MG: Leveraging Partially Annotated Data via Mu…
|
4.04
|
2024-08-26
|
|
MoDE
|
Efficient Diffusion Transformer Policies with Mix…
|
4.01
|
2024-12-17
|
|
RoboUniView
|
RoboUniView: Visual-Language Model with Unified V…
|
3.86
|
2024-06-27
|
|
UniVLA
|
UniVLA: Learning to Act Anywhere with Task-centri…
|
3.80
|
2025-05-09
|
|
RoboDual
|
Towards Synergistic, Generalized, and Efficient D…
|
3.66
|
2024-10-10
|
|
VidMan
|
VidMan: Exploiting Implicit Dynamics from Video D…
|
3.42
|
2024-11-14
|
|
3DDA
|
3D Diffuser Actor: Policy Diffusion with 3D Scene…
|
3.35
|
2024-02-18
|
|
OpenVLA
|
OpenVLA: An Open-Source Vision-Language-Action Mo…
|
3.27
|
2024-06-13
|
|
3D Diffusor Actor
|
3D Diffuser Actor: Policy Diffusion with 3D Scene…
|
3.27
|
2024-02-18
|
|
GR-1
|
Unleashing Large-Scale Video Generative Pre-train…
|
3.06
|
2023-12-20
|
|
Roboflamingo
|
Vision-Language Foundation Models as Effective Ro…
|
2.47
|
2023-11-02
|
|
LCB
|
From LLMs to Actions: Latent Codes as Bridges in …
|
1.78
|
2024-05-08
|
|
Uni-Pi
|
Learning Universal Policies via Text-Guided Video…
|
0.92
|
2023-01-31
|
|
RT-1
|
RT-1: Robotics Transformer for Real-World Control…
|
0.90
|
2022-12-13
|
|