CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator

by   Yugang Chen, et al.

Instance based photo cartoonization is one of the challenging image stylization tasks which aim at transforming realistic photos into cartoon style images while preserving the semantic contents of the photos. State-of-the-art Deep Neural Networks (DNNs) methods still fail to produce satisfactory results with input photos in the wild, especially for photos which have high contrast and full of rich textures. This is due to that: cartoon style images tend to have smooth color regions and emphasized edges which are contradict to realistic photos which require clear semantic contents, i.e., textures, shapes etc. Previous methods have difficulty in satisfying cartoon style textures and preserving semantic contents at the same time. In this work, we propose a novel "CartoonRenderer" framework which utilizing a single trained model to generate multiple cartoon styles. In a nutshell, our method maps photo into a feature model and renders the feature model back into image space. In particular, cartoonization is achieved by conducting some transformation manipulation in the feature space with our proposed Soft-AdaIN. Extensive experimental results show our method produces higher quality cartoon style images than prior arts, with accurate semantic content preservation. In addition, due to the decoupling of whole generating process into "Modeling-Coordinating-Rendering" parts, our method could easily process higher resolution photos, which is intractable for existing methods.


page 2

page 10

page 11


Photo style transfer with consistency losses

We address the problem of style transfer between two photos and propose ...

3D Virtual Garment Modeling from RGB Images

We present a novel approach that constructs 3D virtual garment models fr...

Personalized Image Enhancement Featuring Masked Style Modeling

We address personalized image enhancement in this study, where we enhanc...

Theme Aware Aesthetic Distribution Prediction with Full Resolution Photos

Aesthetic quality assessment (AQA) of photos is a challenging task due t...

Inferring Restaurant Styles by Mining Crowd Sourced Photos from User-Review Websites

When looking for a restaurant online, user uploaded photos often give pe...

Neural Rerendering in the Wild

We explore total scene capture -- recording, modeling, and rerendering a...

Enhancing Underexposed Photos using Perceptually Bidirectional Similarity

This paper addresses the problem of enhancing underexposed photos. Exist...

Please sign up or login with your details

Forgot password? Click here to reset