Zhengyi Wang Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University ShengShu BeijingChina, Cheng Lu Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University, Yikai Wang [email protected], Fan Bao Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University ShengShu BeijingChina, Chongxuan Li [email protected] Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University Gaoling School of Artificial Intelligence Key Laboratory of Big Data, Hang Su [email protected] Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University Pazhou Laboratory (Huangpu) GuangzhouChina, Jun Zhu Dept. of Comp. Sci. & Tech Tsinghua-Bosch Joint ML Center BNRist Center Tsinghua University ShengShu BeijingChina Pazhou Laboratory (Huangpu) GuangzhouChina (2023)
The paper introduces ProlificDreamer, a framework for high-fidelity text-to-3D generation, addressing the limitations of existing methods like Score Distillation Sampling (SDS) that suffer from over-saturation and low-diversity. The authors propose Variational Score Distillation (VSD), which models 3D parameters as random variables to enhance the quality and diversity of generated 3D scenes. ProlificDreamer efficiently generates high-resolution Neural Radiance Fields (NeRF) and detailed textured meshes while allowing flexible configuration of guidance weights. The paper also presents improvements in design aspects like rendering resolution and distillation schedules, demonstrating superior results compared to existing methods through empirical evaluations and theoretical comparisons.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: