Color selection plays a critical role in graphic document design and req...
Human evaluation is critical for validating the performance of text-to-i...
Creative workflows for generating graphical documents involve complex
in...
Controllable layout generation aims at synthesizing plausible arrangemen...
Color is a critical design factor for web pages, affecting important fac...
Video summarization aims to select the most informative subset of frames...
With the broad growth of video capturing devices and applications on the...
Vector graphic documents present multiple visual elements, such as image...
Is more data always better to train vision-and-language models? We study...
As clean ImageNet accuracy nears its ceiling, the research community is
...
Evaluation measures have a crucial impact on the direction of research.
...
Mean Average Precision (mAP) is the primary evaluation measure for objec...
Video question answering (VideoQA) is designed to answer a given questio...
It is common in graphic design humans visually arrange various elements
...
How far can we go with textual representations for understanding picture...
Learning from implicit feedback is challenging because of the difficult
...
Learning from implicit user feedback is challenging as we can only obser...
Solving cold-start problems is indispensable to provide meaningful
recom...
The query-based moment retrieval is a problem of localising a specific c...
Answering questions related to art pieces (paintings) is a difficult tas...
We propose a novel video understanding task by fusing knowledge-based an...
We propose a novel video understanding task by fusing knowledge-based an...
Video summarization is a technique to create a short skim of the origina...
A paraphrase is a restatement of the meaning of a text in other words.
P...
This paper presents a video summarization technique for an Internet vide...
Our objective is video retrieval based on natural language queries. In
a...