(2025)
The paper presents DreamCube, a framework for generating RGB-D cubemaps using a method called multi-plane synchronization, which aims to improve 3D panorama generation from single-view inputs. DreamCube addresses the challenges faced by existing 2D diffusion models when applied to multi-plane panoramic representations. By adapting spatial operators to maintain translation equivariance, it enables seamless integration of multiple views without overlapping FoV techniques that can degrade image quality. Key contributions include a comprehensive analysis of existing methods' limitations and the introduction of a synchronized generation approach that enhances the quality of RGB-D scene generative outputs. Extensive experiments validate DreamCube’s effectiveness in RGB-D panorama generation, depth estimation, and 3D scene reconstruction, showcasing its superior performance compared to existing models.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: