RealEstate10K is a large dataset of camera poses corresponding to 10 million frames derived from about 80,000 video clips, gathered from about 10,000 YouTube videos. For each clip, the poses form a trajectory where each pose specifies the camera position and orientation along the trajectory. These poses are derived by running SLAM and bundle adjustment algorithms on a large set of videos.
Variants: RealEstate10K
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Novel View Synthesis | Hybrid | Geometry-Free View Synthesis: Transformers and … | 2021-04-15 |
Novel View Synthesis | Impl.-depth | Geometry-Free View Synthesis: Transformers and … | 2021-04-15 |
Recent papers with results on this dataset: