Mask CycleGAN: Unpaired Multi-modal Domain Translation with Interpretable Latent Variable

05/14/2022
by   Minfa Wang, et al.
0

We propose Mask CycleGAN, a novel architecture for unpaired image domain translation built based on CycleGAN, with an aim to address two issues: 1) unimodality in image translation and 2) lack of interpretability of latent variables. Our innovation in the technical approach is comprised of three key components: masking scheme, generator and objective. Experimental results demonstrate that this architecture is capable of bringing variations to generated images in a controllable manner and is reasonably robust to different masks.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset