In this paper, we propose a novel time-frequency joint learning method f...
Transformer has emerged in speech emotion recognition (SER) at present.
...
Score distillation sampling (SDS) has shown great promise in text-to-3D
...
Diffusion models have exhibited excellent performance in various domains...
Guided sampling is a vital approach for applying diffusion models in
rea...
Recently, diffusion probabilistic models (DPMs) have achieved promising
...
Performance of trimap-free image matting methods is limited when trying ...
Diffusion probabilistic models (DPMs) have achieved impressive success i...
Spectrogram is commonly used as the input feature of deep neural network...
In offline reinforcement learning, weighted regression is a common metho...
Active Domain Adaptation (ADA) queries the label of selected target samp...
Enhancing model prediction confidence on unlabeled target data is an
imp...
The waive of labels in the target domain makes Unsupervised Domain Adapt...
Brain tumor segmentation (BTS) in magnetic resonance image (MRI) is cruc...
We tackle the essential task of finding dense visual correspondences bet...
Score-based generative models have excellent performance in terms of
gen...
Diffusion probabilistic models (DPMs) are emerging powerful generative
m...
The circular cartogram, also known as the Dorling map, is a widely used ...
Most automatic matting methods try to separate the salient foreground fr...
Unsupervised Domain Adaptation (UDA) aims to leverage a label-rich sourc...
Lung cancer is the leading cause of cancer death worldwide, and
adenocar...
Most existing CNN-based salient object detection methods can identify lo...
Unsupervised Domain Adaptation (UDA), a branch of transfer learning wher...
Deaf and Hard-of-Hearing (DHH) audiences have long complained about capt...
Several video-based 3D pose and shape estimation algorithms have been
pr...
Most existing human matting algorithms tried to separate pure human-only...
In this paper, we present a thorough study of maximizing a regularized
n...
Normalizing flows define a probability distribution by an explicit inver...
Recently, facial expression recognition (FER) in the wild has gained a l...
We study a class of fused lasso problems where the estimated parameters ...
Vision is often used as a complementary modality for audio speech recogn...
Unconstrained remote gaze estimation remains challenging mostly due to i...
Generative flows are promising tractable models for density modeling tha...
The multiple-input multiple-output (MIMO) detection problem is a fundame...
In this paper, we consider a class of nonconvex complex quadratic progra...