Multimodal Prediction and Personalization of Photo Edits with Deep Generative Models

04/17/2017
by   Ardavan Saeedi, et al.
0

Professional-grade software applications are powerful but complicated-expert users can achieve impressive results, but novices often struggle to complete even basic tasks. Photo editing is a prime example: after loading a photo, the user is confronted with an array of cryptic sliders like "clarity", "temp", and "highlights". An automatically generated suggestion could help, but there is no single "correct" edit for a given image-different experts may make very different aesthetic decisions when faced with the same image, and a single expert may make different choices depending on the intended use of the image (or on a whim). We therefore want a system that can propose multiple diverse, high-quality edits while also learning from and adapting to a user's aesthetic preferences. In this work, we develop a statistical model that meets these objectives. Our model builds on recent advances in neural network generative modeling and scalable inference, and uses hierarchical structure to learn editing patterns across many diverse users. Empirically, we find that our model outperforms other approaches on this challenging multimodal prediction task.

READ FULL TEXT

page 3

page 8

page 9

research
02/08/2020

Correction of Chromatic Aberration from a Single Image Using Keypoints

In this paper, we propose a method to correct for chromatic aberration i...
research
11/02/2019

Self-supervised Deformation Modeling for Facial Expression Editing

Recent advances in deep generative models have demonstrated impressive r...
research
12/18/2022

Internal Diverse Image Completion

Image completion is widely used in photo restoration and editing applica...
research
06/01/2015

User Preferences Modeling and Learning for Pleasing Photo Collage Generation

In this paper we consider how to automatically create pleasing photo col...
research
07/04/2023

Identifying Professional Photographers Through Image Quality and Aesthetics in Flickr

In our generation, there is an undoubted rise in the use of social media...
research
02/24/2022

CAISE: Conversational Agent for Image Search and Editing

Demand for image editing has been increasing as users' desire for expres...
research
12/02/2021

Sample-Efficient Generation of Novel Photo-acid Generator Molecules using a Deep Generative Model

Photo-acid generators (PAGs) are compounds that release acids (H^+ ions)...

Please sign up or login with your details

Forgot password? Click here to reset