Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

06/21/2023
by Qiuyu Wang, et al.

Despite the rapid advance of 3D-aware image synthesis, existing studies usually adopt a mixture of techniques and tricks, leaving it unclear how much each part contributes to the final performance and how well it generalizes. Following the most popular and effective paradigm in this field, which incorporates a neural radiance field (NeRF) into the generator of a generative adversarial network (GAN), we build a well-structured codebase, dubbed Carver, by modularizing the generation process. This design allows researchers to develop and replace each module independently, and hence makes it possible to compare various approaches fairly and to identify their contributions at the module level. The reproduction of a range of cutting-edge algorithms demonstrates the usability of our modularized codebase. We also perform a variety of in-depth analyses, such as a comparison across different types of point feature, the necessity of the tailing upsampler in the generator, and the reliance on the camera pose prior, which deepen our understanding of existing methods and point to directions for future research. We release code and models at https://github.com/qiuyu96/Carver to facilitate the development and evaluation of this field.
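
To make the idea of a modularized generation process more concrete, the following is a minimal sketch of a generator assembled from swappable stages. It is not Carver's API: every name here (`ModularGenerator`, `uniform_sampler`, `toy_field`, `volume_render`, `nearest_upsampler`, `identity_upsampler`) is hypothetical, the "field" is a hand-written sphere rather than a learned network, the camera is orthographic, and there is no GAN training. The only standard piece is the NeRF-style alpha-compositing rule in `volume_render`.

```python
import math
from dataclasses import dataclass
from typing import Callable

def uniform_sampler(origin, direction, near, far, n):
    """Return n (point, depth) pairs evenly spaced in depth along one ray."""
    step = (far - near) / n
    depths = [near + (i + 0.5) * step for i in range(n)]
    return [(tuple(o + t * d for o, d in zip(origin, direction)), t)
            for t in depths]

def toy_field(point):
    """Stand-in for a learned point feature + MLP: a unit sphere.

    Returns (density, color); a real model would predict both.
    """
    x, y, z = point
    inside = x * x + y * y + z * z < 1.0
    return (5.0 if inside else 0.0), 0.8

def volume_render(samples, field):
    """NeRF-style quadrature: pixel = sum_i T_i * alpha_i * c_i (needs >= 2 samples)."""
    transmittance, pixel = 1.0, 0.0
    for i, (point, depth) in enumerate(samples):
        if i + 1 < len(samples):
            delta = samples[i + 1][1] - depth
        else:
            delta = depth - samples[i - 1][1]  # reuse the previous spacing
        density, color = field(point)
        alpha = 1.0 - math.exp(-density * delta)
        pixel += transmittance * alpha * color
        transmittance *= 1.0 - alpha
    return pixel

def nearest_upsampler(image, factor=2):
    """Non-learned stand-in for an upsampler at the end of the generator."""
    height, width = len(image), len(image[0])
    return [[image[r // factor][c // factor] for c in range(width * factor)]
            for r in range(height * factor)]

def identity_upsampler(image):
    """Drop-in replacement used to ablate the upsampler."""
    return image

@dataclass
class ModularGenerator:
    """Each stage is a plain callable, so any one can be swapped on its own."""
    sampler: Callable
    field: Callable
    renderer: Callable
    upsampler: Callable

    def synthesize(self, resolution, n_points=16):
        image = []
        for row in range(resolution):
            line = []
            for col in range(resolution):
                # Orthographic rays from the plane z = 2 looking down -z.
                x = (col + 0.5) / resolution * 2 - 1
                y = (row + 0.5) / resolution * 2 - 1
                samples = self.sampler((x, y, 2.0), (0.0, 0.0, -1.0),
                                       0.5, 3.5, n_points)
                line.append(self.renderer(samples, self.field))
            image.append(line)
        return self.upsampler(image)

if __name__ == "__main__":
    full = ModularGenerator(uniform_sampler, toy_field,
                            volume_render, nearest_upsampler)
    ablated = ModularGenerator(uniform_sampler, toy_field,
                               volume_render, identity_upsampler)
    print(len(full.synthesize(4)), len(ablated.synthesize(4)))
```

Because only the `upsampler` argument differs between the two generators, any difference in their outputs can be attributed to that one stage, which is the kind of module-level comparison the abstract describes.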
