Procgen Benchmark includes 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills.
Variants: ProcGen
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Reinforcement Learning (RL) | PPG | Phasic Policy Gradient | 2020-09-09 |
Reinforcement Learning (RL) | PPO | Phasic Policy Gradient | 2020-09-09 |
Recent papers with results on this dataset: