Changsheng Xu

research

∙ 09/05/2023

A Survey on Interpretable Cross-modal Reasoning

In recent years, cross-modal reasoning (CMR), the process of understandi...

0 Dizhan Xue, et al. ∙

research

∙ 08/30/2023

Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection

In this paper, we for the first time explore helpful multi-modal context...

0 Yifan Xu, et al. ∙

research

∙ 07/13/2023

Introducing Foundation Models as Surrogate Models: Advancing Towards More Practical Adversarial Attacks

Recently, the no-box adversarial attack, in which the attacker lacks acc...

0 Jiaming Zhang, et al. ∙

research

∙ 07/05/2023

Multimodal Imbalance-Aware Gradient Modulation for Weakly-supervised Audio-Visual Video Parsing

Weakly-supervised audio-visual video parsing (WS-AVVP) aims to localize ...

0 Jie Fu, et al. ∙

research

∙ 05/30/2023

Multi-modal Queried Object Detection in the Wild

We introduce MQ-Det, an efficient architecture and pre-training strategy...

0 Yifan Xu, et al. ∙

research

∙ 05/25/2023

ProSpect: Expanded Conditioning for the Personalization of Attribute-aware Image Generation

Personalizing generative models offers a way to guide image generation w...

0 Yuxin Zhang, et al. ∙

research

∙ 05/25/2023

Camera-Incremental Object Re-Identification with Identity Knowledge Evolution

Object Re-identification (ReID) aims to retrieve the probe object from m...

0 Hantao Yao, et al. ∙

research

∙ 05/15/2023

CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding

Visual Grounding (VG) refers to locating a region described by expressio...

0 Linhui Xiao, et al. ∙

research

∙ 03/09/2023

A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning

We present Unified Contrastive Arbitrary Style Transfer (UCAST), a novel...

0 Yuxin Zhang, et al. ∙

research

∙ 03/01/2023

Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias

With the swift advancement of deep learning, state-of-the-art algorithms...

0 Shangxi Wu, et al. ∙

research

∙ 02/23/2023

Region-Aware Diffusion for Zero-shot Text-driven Image Editing

Image manipulation under the guidance of textual descriptions has recent...

0 Nisha Huang, et al. ∙

research

∙ 01/30/2023

GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis

Synthesizing high-fidelity complex images from text is challenging. Base...

0 Ming Tao, et al. ∙

research

∙ 12/31/2022

Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples

There is a growing interest in developing unlearnable examples (UEs) aga...

0 Jiaming Zhang, et al. ∙

research

∙ 11/28/2022

SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification

Although significant progress has been made in few-shot learning, most o...

0 Fang Peng, et al. ∙

research

∙ 11/23/2022

Inversion-Based Style Transfer with Diffusion Models

The artistic style within a painting is the means of expression, which i...

1 Yuxin Zhang, et al. ∙

research

∙ 11/19/2022

DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization

Despite the impressive results of arbitrary image-guided style transfer ...

0 Nisha Huang, et al. ∙

research

∙ 09/27/2022

Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion

Digital art synthesis is receiving increasing attention in the multimedi...

0 Nisha Huang, et al. ∙

research

∙ 05/22/2022

Learning Muti-expert Distribution Calibration for Long-tailed Video Classification

Most existing state-of-the-art video classification methods assume the t...

0 Yufan Hu, et al. ∙

research

∙ 05/19/2022

Domain Enhanced Arbitrary Image Style Transfer via Contrastive Learning

In this work, we tackle the challenging problem of arbitrary image style...

4 Yuxin Zhang, et al. ∙

research

∙ 04/05/2022

MGDCF: Distance Learning via Markov Graph Diffusion for Neural Collaborative Filtering

Collaborative filtering (CF) is widely used by personalized recommendati...

2 Jun Hu, et al. ∙

research

∙ 04/04/2022

Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding

Grounding temporal video segments described in natural language queries ...

0 Ziyue Wu, et al. ∙

research

∙ 03/31/2022

Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization

We target at the task of weakly-supervised action localization (WSAL), w...

0 Junyu Gao, et al. ∙

research

∙ 01/26/2022

Mitigating the Mutual Error Amplification for Semi-Supervised Object Detection

Semi-supervised object detection (SSOD) has achieved substantial progres...

0 Chengcheng Ma, et al. ∙

research

∙ 12/09/2021

Dual Cluster Contrastive learning for Person Re-Identification

Recently, cluster contrastive learning has been proven effective for per...

0 Hantao Yao, et al. ∙

research

∙ 12/05/2021

SSAGCN: Social Soft Attention Graph Convolution Network for Pedestrian Trajectory Prediction

Pedestrian trajectory prediction is an important technique of autonomous...

0 Pei Lv, et al. ∙

research

∙ 12/02/2021

Contrastive Adaptive Propagation Graph Neural Networks for Efficient Graph Learning

Graph Neural Networks (GNNs) have achieved great success in processing g...

0 Jun Hu, et al. ∙

research

∙ 12/01/2021

Weakly-Supervised Video Object Grounding via Causal Intervention

We target at the task of weakly-supervised video object grounding (WSVOG...

0 Wei Wang, et al. ∙

research

∙ 11/19/2021

GRecX: An Efficient and Unified Benchmark for GNN-based Recommendation

In this paper, we present GRecX, an open-source TensorFlow framework for...

0 Desheng Cai, et al. ∙

research

∙ 10/18/2021

Learning to Learn a Cold-start Sequential Recommender

The cold-start recommendation is an urgent problem in contemporary onlin...

0 Xiaowen Huang, et al. ∙

research

∙ 10/14/2021

Contrastive Proposal Extension with LSTM Network for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD) has attracted more and more at...

0 Pei Lv, et al. ∙

research

∙ 08/03/2021

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Vision transformers have recently received explosive popularity, but the...

0 Yifan Xu, et al. ∙

research

∙ 07/10/2021

DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering

Video question answering is a challenging task, which requires agents to...

0 Jianyu Wang, et al. ∙

research

∙ 06/16/2021

ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning

Recently, the transductive graph-based methods have achieved great succe...

0 Chaofan Chen, et al. ∙

research

∙ 06/14/2021

User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning

Personalized image aesthetic assessment (PIAA) has recently become a hot...

0 Pei Lv, et al. ∙

research

∙ 05/30/2021

StyTr^2: Unbiased Image Style Transfer with Transformers

The goal of image style transfer is to render an image with artistic fea...

0 Yingying Deng, et al. ∙

research

∙ 04/21/2021

Towards Corruption-Agnostic Robust Domain Adaptation

Big progress has been achieved in domain adaptation in decades. Existing...

0 Yifan Xu, et al. ∙

research

∙ 03/23/2021

Health Status Prediction with Local-Global Heterogeneous Behavior Graph

Health management is getting increasing attention all over the world. Ho...

0 Xuan Ma, et al. ∙

research

∙ 03/08/2021

Unveiling the Potential of Structure-Preserving for Weakly Supervised Object Localization

Weakly supervised object localization remains an open problem due to the...

7 Xingjia Pan, et al. ∙

research

∙ 01/27/2021

Efficient Graph Deep Learning in TensorFlow with tf_geometric

We introduce tf_geometric, an efficient and friendly library for graph d...

0 Jun Hu, et al. ∙

research

∙ 12/04/2020

Effective Label Propagation for Discriminative Semi-Supervised Domain Adaptation

Semi-supervised domain adaptation (SSDA) methods have demonstrated great...

0 Zhiyong Huang, et al. ∙

research

∙ 09/17/2020

Arbitrary Video Style Transfer via Multi-Channel Correlation

Video style transfer is getting more attention in AI community for its n...

0 Yingying Deng, et al. ∙

research

∙ 06/18/2020

MMCGAN: Generative Adversarial Network with Explicit Manifold Prior

Generative Adversarial Network(GAN) provides a good generative framework...

0 Guanhua Zheng, et al. ∙

research

∙ 06/02/2020

Distribution Aligned Multimodal and Multi-Domain Image Stylization

Multimodal and multi-domain stylization are two important problems in th...

0 Minxuan Lin, et al. ∙

research

∙ 05/31/2020

Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning

Transductive Zero-shot learning (ZSL) targets to recognize the unseen ca...

0 Hantao Yao, et al. ∙

research

∙ 05/30/2020

Joint Person Objectness and Repulsion for Person Search

Person search targets to search the probe person from the unconstrainted...

0 Hantao Yao, et al. ∙

research

∙ 05/27/2020

Arbitrary Style Transfer via Multi-Adaptation Network

Arbitrary style transfer is a significant topic with both research value...

4 Yingying Deng, et al. ∙

research

∙ 05/25/2020

Adaptive Adversarial Logits Pairing

Adversarial examples provide an opportunity as well as impose a challeng...

3 Shangxi Wu, et al. ∙

research

∙ 05/20/2020

Dynamic Refinement Network for Oriented and Densely Packed Object Detection

Object detection has achieved remarkable progress in the past decade. Ho...

0 Xingjia Pan, et al. ∙

research

∙ 02/26/2020

Multi-Attribute Guided Painting Generation

Controllable painting generation plays a pivotal role in image stylizati...

0 Minxuan Lin, et al. ∙

research

∙ 11/28/2019

A Generalization Theory based on Independent and Task-Identically Distributed Assumption

Existing generalization theories analyze the generalization performance ...

0 Guanhua Zheng, et al. ∙

