Lu Sheng

research

∙ 06/11/2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark

Large language models have become a potential pathway toward achieving a...

0 Zhenfei Yin, et al. ∙

research

∙ 03/31/2023

Siamese DETR

Recent self-supervised methods are mainly designed for representation le...

3 Zeren Chen, et al. ∙

research

∙ 03/25/2023

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud

The task of 3D semantic scene graph (3DSSG) prediction in the point clou...

0 Ziqin Wang, et al. ∙

research

∙ 01/29/2023

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Recently, perception task based on Bird's-Eye View (BEV) representation ...

0 Yangguang Li, et al. ∙

research

∙ 08/14/2022

SketchSampler: Sketch-based 3D Reconstruction via View-dependent Depth Sampling

Reconstructing a 3D shape based on a single sketch image is challenging ...

0 Chenjian Gao, et al. ∙

research

∙ 03/16/2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation

In computer vision, pre-training models based on largescale supervised l...

0 Yinan He, et al. ∙

research

∙ 03/15/2022

Bamboo: Building Mega-Scale Vision Dataset Continually with Human-Machine Synergy

Large-scale datasets play a vital role in computer vision. Existing data...

15 Yuanhan Zhang, et al. ∙

research

∙ 12/15/2021

ForgeryNet – Face Forgery Analysis Challenge 2021: Methods and Results

The rapid progress of photorealistic synthesis techniques has reached a ...

0 Yinan He, et al. ∙

research

∙ 10/17/2021

VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds

3D human mesh recovery from point clouds is essential for various tasks,...

0 Guanze Liu, et al. ∙

research

∙ 04/13/2021

Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

3D object detection in point clouds is a challenging vision task that be...

8 Bowen Cheng, et al. ∙

research

∙ 03/18/2021

DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer

In this work, we propose a novel deep learning framework that can genera...

0 Buyu Li, et al. ∙

research

∙ 03/09/2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis

The rapid progress of photorealistic synthesis techniques has reached at...

0 Yinan He, et al. ∙

research

∙ 11/02/2020

PV-NAS: Practical Neural Architecture Search for Video Recognition

Recently, deep learning has been utilized to solve video recognition pro...

0 Zihao Wang, et al. ∙

research

∙ 10/21/2020

Adaptive Gradient Method with Resilience and Momentum

Several variants of stochastic gradient descent (SGD) have been proposed...

0 Jie Liu, et al. ∙

research

∙ 07/18/2020

Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues

As realistic facial manipulation technologies have achieved remarkable p...

11 Yuyang Qian, et al. ∙

research

∙ 05/26/2020

Unsupervised Domain Expansion from Multiple Sources

Given an existing system learned from previous source domains, it is des...

5 Jing Zhang, et al. ∙

research

∙ 05/21/2020

Powering One-shot Topological NAS with Stabilized Share-parameter Proxy

One-shot NAS method has attracted much interest from the research commun...

5 Ronghao Guo, et al. ∙

research

∙ 11/30/2019

Morphing and Sampling Network for Dense Point Cloud Completion

3D point cloud completion, the task of inferring the complete geometric ...

16 Minghua Liu, et al. ∙

research

∙ 10/10/2019

Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization

Pedestrian attribute recognition has been an emerging research topic in ...

23 Chufeng Tang, et al. ∙

research

∙ 09/12/2019

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval

Text-image cross-modal retrieval is a challenging task in the field of l...

0 Zihao Wang, et al. ∙

research

∙ 05/06/2019

Visibility Constrained Generative Model for Depth-based 3D Facial Pose Tracking

In this paper, we propose a generative framework that unifies depth-base...

7 Lu Sheng, et al. ∙

research

∙ 04/02/2019

Semantics Disentangling for Text-to-Image Generation

Synthesizing photo-realistic images from text descriptions is a challeng...

0 Guojun Yin, et al. ∙

research

∙ 04/02/2019

Context and Attribute Grounded Dense Captioning

Dense captioning aims at simultaneously localizing semantic regions and ...

0 Guojun Yin, et al. ∙

research

∙ 03/26/2019

GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving

We present an efficient 3D object detection framework based on a single ...

0 Buyu Li, et al. ∙

research

∙ 03/11/2019

Video Generation from Single Semantic Label Map

This paper proposes the novel task of video generation conditioned on a ...

16 Junting Pan, et al. ∙

research

∙ 03/03/2019

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot

Imagining multiple consecutive frames given one single snapshot is chall...

6 Lu Sheng, et al. ∙

research

∙ 09/16/2018

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Multi-label image classification is a fundamental but challenging task t...

2 Yongcheng Liu, et al. ∙

research

∙ 07/13/2018

Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition

Recognizing visual relationships <subject-predicate-object> among any pa...

4 Guojun Yin, et al. ∙

research

∙ 05/10/2018

Avatar-Net: Multi-scale Zero-shot Style Transfer by Feature Decoration

Zero-shot artistic style transfer is an important image synthesis proble...

2 Lu Sheng, et al. ∙

research

∙ 04/10/2018

Exploring Disentangled Feature Representation Beyond Face Identification

This paper proposes learning disentangled but complementary face feature...

2 Yu Liu, et al. ∙

research

∙ 11/29/2017

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition

Motion representation plays a vital role in human action recognition in ...

0 Shuyang Sun, et al. ∙

research

∙ 09/28/2017

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

Pedestrian analysis plays a vital role in intelligent video surveillance...

0 Xihui Liu, et al. ∙

Lu Sheng

Featured Co-authors

Sign in with Google

Consider DeepAI Pro