arXiv:2302.01330 Abstract | arXiv Analytics

arXiv:2302.01330 [cs.CV]Abstract References Reviews Resources

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Published 2023-02-02Version 1

In this work, we present SceneDreamer, an unconditional generative model for unbounded 3D scenes, which synthesizes large-scale 3D landscapes from random noises. Our framework is learned from in-the-wild 2D image collections only, without any 3D annotations. At the core of SceneDreamer is a principled learning paradigm comprising 1) an efficient yet expressive 3D scene representation, 2) a generative scene parameterization, and 3) an effective renderer that can leverage the knowledge from 2D images. Our framework starts from an efficient bird's-eye-view (BEV) representation generated from simplex noise, which consists of a height field and a semantic field. The height field represents the surface elevation of 3D scenes, while the semantic field provides detailed scene semantics. This BEV scene representation enables 1) representing a 3D scene with quadratic complexity, 2) disentangled geometry and semantics, and 3) efficient training. Furthermore, we propose a novel generative neural hash grid to parameterize the latent space given 3D positions and the scene semantics, which aims to encode generalizable features across scenes. Lastly, a neural volumetric renderer, learned from 2D image collections through adversarial training, is employed to produce photorealistic images. Extensive experiments demonstrate the effectiveness of SceneDreamer and superiority over state-of-the-art methods in generating vivid yet diverse unbounded 3D worlds.

Comments: Project Page https://scene-dreamer.github.io/

Categories: cs.CV, cs.GR

Keywords: 2d image collections, unbounded 3d scene generation, scenedreamer, novel generative neural hash grid, bev scene representation enables

Tags: github project

Related articles:

arXiv:2211.08610 [cs.CV] (Published 2022-11-16)

CoNFies: Controllable Neural Face Avatars

Heng Yu, Koichiro Niinuma, Laszlo A. Jeni

arXiv:2309.15830 [cs.CV] (Published 2023-09-27)

OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs

Honglin He, Zhuoqian Yang, Shikai Li, Bo Dai, Wayne Wu

arXiv Analytics

arXiv:2302.01330 [cs.CV]Abstract References Reviews Resources

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Links

Toolbox

arXiv:2302.01330 [cs.CV]AbstractReferencesReviewsResources

SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections

Links

Toolbox

arXiv:2302.01330 [cs.CV]Abstract References Reviews Resources