Apple
University of Illinois, Urbana-Champaign
"What is really needed to make an existing 2D GAN 3D-aware?"
To answer this question, we modify a classical GAN, i.e., StyleGANv2, as little as possible. We find that only two modifications are absolutely necessary: 1) a multiplane-image-style generator branch that produces a set of alpha maps conditioned on their depth, and 2) a pose-conditioned discriminator.
We refer to the generated output as a 'generative multiplane image' (GMPI) and emphasize that its renderings are not only high-quality but also guaranteed to be view-consistent, which makes GMPIs different from many prior works. Importantly, the number of alpha maps can be dynamically adjusted and can differ between training and inference, alleviating memory concerns and enabling fast training of GMPIs in less than half a day at a resolution of 1024².
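For readers unfamiliar with multiplane images, the sketch below illustrates the back-to-front alpha compositing typically used to render an MPI. It is a minimal illustration under our own assumptions, not the paper's implementation: we assume a single RGB image shared across all planes and alpha maps ordered from near to far.

import torch

def composite_mpi(rgb, alphas):
    """Back-to-front over-compositing of a multiplane image (MPI).

    rgb:    (3, H, W)    color image; for simplicity this sketch assumes
                         it is shared by every plane
    alphas: (N, 1, H, W) per-plane alpha maps, ordered near -> far
    returns (3, H, W)    composited image
    """
    out = torch.zeros_like(rgb)
    # walk from the farthest plane to the nearest one,
    # blending each plane over the accumulated result
    for alpha in alphas.flip(0):
        out = alpha * rgb + (1.0 - alpha) * out
    return out

# toy usage: 32 planes at 256x256
# image = composite_mpi(torch.rand(3, 256, 256), torch.rand(32, 1, 256, 256))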
Click each figure to play or pause it; drag the separator to reveal the pixel-aligned geometry while the video plays.
Click here to reset.
A desktop browser is recommended for these controls to work properly.
We present several generated scenes in an interactive viewer. Please click each image to open it.
The Chrome browser is recommended.
Here are brief instructions for using the viewer.
We would like to thank the DeepView authors for their interactive MPI web viewer.
The following videos show the appearance and geometry of 3D content generated by GMPI from randomly sampled latent codes. We use Marching Cubes to extract geometry from the predicted alpha maps.
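As a rough illustration of this step, the snippet below extracts a mesh from a stack of predicted alpha maps with scikit-image's Marching Cubes. The function name, the iso-level of 0.5, and the use of trimesh are assumptions of this sketch, not the authors' pipeline, and plane spacing is ignored.

from skimage import measure
import trimesh

def alpha_maps_to_mesh(alpha_volume, level=0.5):
    """Extract a surface mesh from stacked alpha maps via Marching Cubes.

    alpha_volume: (D, H, W) array in [0, 1], one slice per MPI plane.
    level:        iso-value at which the surface is extracted (0.5 is
                  our choice for this sketch, not taken from the paper).
    Plane depth is ignored here; a real pipeline would scale the vertices
    along the depth axis by the actual plane depths.
    """
    verts, faces, normals, _ = measure.marching_cubes(alpha_volume, level=level)
    return trimesh.Trimesh(vertices=verts, faces=faces, vertex_normals=normals)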
@inproceedings{zhao2022gmpi,
title = {Generative Multiplane Images: Making a 2D GAN 3D-Aware},
author = {Xiaoming Zhao
and Fangchang Ma
and David Güera
and Zhile Ren
and Alexander G. Schwing
and Alex Colburn},
booktitle = {Proc. ECCV},
year = {2022},
}
Work done as part of Xiaoming Zhao's internship at Apple. Supported in part by NSF grants 1718221, 2008387, 2045586, and 2106825, MRI #1725729, and NIFA award 2020-67021-32799.