The task of shape abstraction with semantic part consistency is challeng...
We are concerned with a challenging scenario in unpaired multiview video...
Prototype, as a representation of class embeddings, has been explored to...
Despite the success of fully-supervised human skeleton sequence modeling...
Multimodal data collected from the real world are often imperfect due to...
Optical flow estimation is a fundamental task in computer vision. Recent...
Group Activity Recognition (GAR) detects the activity performed by a gro...
We study a worst-case scenario in generalization: Out-of-domain
generali...
This work proposes a method of wind farm scenario generation to support
...
Attention-based models, exemplified by the Transformer, can effectively ...
Although hierarchical structures are popular in recent vision transforme...
Attention mechanisms have been widely applied to cross-modal tasks such ...
A common assumption in multimodal learning is the completeness of traini...
False positive is one of the most serious problems brought by agnostic d...
We introduce a novel representation learning method to disentangle
pose-...
Recognition of human poses and activities is crucial for autonomous syst...
Adversarial data augmentation has shown promise for training robust deep...
Search engine has become a fundamental component in various web and mobi...
Cross-modal knowledge distillation deals with transferring knowledge fro...
We are concerned with a worst-case scenario in model generalization, in ...
Graph kernels are kernel methods measuring graph similarity and serve as...
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) met...
In this paper, we study the problem of learning Graph Convolutional Netw...
To efficiently support safety-related vehicular applications, the
ultra-...
Short-term road traffic prediction (STTP) is one of the most important
m...
In a mixed-traffic scenario where both autonomous vehicles and human-dri...
We consider the problem of image-to-video translation, where an input im...
Generating multi-view images from a single-view input is an essential ye...
Driven by the rapid development of wireless communication system, more a...
In recent years, wireless networks are undergoing the paradigm transform...
We address the problem of using hand-drawn sketch to edit facial identit...
Weakly-supervised object detection (WOD) is a challenging problems in
co...