Scenimefy: Learning to Craft Anime Scene via Semi-Supervised Image-to-Image Translation

08/24/2023
by   Yuxin Jiang, et al.
0

Automatic high-quality rendering of anime scenes from complex real-world images is of significant practical value. The challenges of this task lie in the complexity of the scenes, the unique features of anime style, and the lack of high-quality datasets to bridge the domain gap. Despite promising attempts, previous efforts are still incompetent in achieving satisfactory results with consistent semantic preservation, evident stylization, and fine details. In this study, we propose Scenimefy, a novel semi-supervised image-to-image translation framework that addresses these challenges. Our approach guides the learning with structure-consistent pseudo paired data, simplifying the pure unsupervised setting. The pseudo data are derived uniquely from a semantic-constrained StyleGAN leveraging rich model priors like CLIP. We further apply segmentation-guided data selection to obtain high-quality pseudo supervision. A patch-wise contrastive style loss is introduced to improve stylization and fine details. Besides, we contribute a high-resolution anime scene dataset to facilitate future research. Our extensive experiments demonstrate the superiority of our method over state-of-the-art baselines in terms of both perceptual quality and quantitative performance.

READ FULL TEXT

page 14

page 15

page 16

page 17

page 18

page 19

page 20

page 21

research
04/07/2022

Unsupervised Image-to-Image Translation with Generative Prior

Unsupervised image-to-image translation aims to learn the translation be...
research
04/29/2019

Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning

Unpaired Image-to-Image Translation (UIT) focuses on translating images ...
research
11/27/2017

Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image Translation

Image-to-image translation has been made much progress with embracing Ge...
research
08/05/2022

Memory-Guided Collaborative Attention for Nighttime Thermal Infrared Image Colorization

Nighttime thermal infrared (NTIR) image colorization, also known as tran...
research
04/18/2019

A Novel BiLevel Paradigm for Image-to-Image Translation

Image-to-image (I2I) translation is a pixel-level mapping that requires ...
research
03/16/2021

Semi-Supervised Graph-to-Graph Translation

Graph translation is very promising research direction and has a wide ra...
research
09/05/2018

Semantic Human Matting

Human matting, high quality extraction of humans from natural images, is...

Please sign up or login with your details

Forgot password? Click here to reset