Thomas H Li

research

∙ 08/15/2023

A^2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models

We study the task of zero-shot vision-and-language navigation (ZS-VLN), ...

0 Peihao Chen, et al. ∙

research

∙ 07/22/2023

Learning Vision-and-Language Navigation from YouTube Videos

Vision-and-language navigation (VLN) requires an embodied agent to navig...

0 Kunyang Lin, et al. ∙

research

∙ 03/24/2023

Hard Sample Matters a Lot in Zero-Shot Quantization

Zero-shot quantization (ZSQ) is promising for compressing and accelerati...

0 Huantong Li, et al. ∙

research

∙ 03/21/2023

Detecting the open-world objects with the help of the Brain

Open World Object Detection (OWOD) is a novel computer vision task with ...

0 Shuailei Ma, et al. ∙

research

∙ 02/28/2023

LIO-PPF: Fast LiDAR-Inertial Odometry via Incremental Plane Pre-Fitting and Skeleton Tracking

As a crucial infrastructure of intelligent mobile robots, LiDAR-Inertial...

0 Xingyu Chen, et al. ∙

research

∙ 01/26/2023

Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring

Image-text pretrained models, e.g., CLIP, have shown impressive general ...

0 Ruyang Liu, et al. ∙

research

∙ 01/05/2023

CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection

Open-world object detection (OWOD), as a more general and challenging go...

0 Shuailei Ma, et al. ∙

research

∙ 10/14/2022

Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation

We address a practical yet challenging problem of training robot agents ...

0 Peihao Chen, et al. ∙

research

∙ 10/14/2022

Learning Active Camera for Multi-Object Navigation

Getting robots to navigate to multiple objects autonomously is essential...

0 Peihao Chen, et al. ∙

research

∙ 10/12/2022

M^3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning

We study self-supervised video representation learning that seeks to lea...

0 Xinyu Sun, et al. ∙

research

∙ 10/11/2022

Frequency-Aware Self-Supervised Monocular Depth Estimation

We present two versatile methods to generally enhance self-supervised mo...

0 Xingyu Chen, et al. ∙

research

∙ 10/02/2022

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem

Self-supervised monocular depth estimation (MDE) models universally suff...

0 Xingyu Chen, et al. ∙

research

∙ 04/29/2022

Deep Geometry Post-Processing for Decompressed Point Clouds

Point cloud compression plays a crucial role in reducing the huge cost o...

0 Xiaoqing Fan, et al. ∙

research

∙ 04/13/2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis

We deal with the controllable person image synthesis task which aims to ...

5 Yurui Ren, et al. ∙

research

∙ 09/17/2021

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering

Generating portrait images by controlling the motions of existing faces ...

5 Yurui Ren, et al. ∙

research

∙ 08/04/2021

Combining Attention with Flow for Person Image Synthesis

Pose-guided person image synthesis aims to synthesize person images by t...

0 Yurui Ren, et al. ∙

research

∙ 08/27/2020

Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation

Pose-guided person image generation and animation aim to transform a sou...

35 Yurui Ren, et al. ∙

research

∙ 03/02/2020

Deep Image Spatial Transformation for Person Image Generation

Pose-guided person image generation is to transform a source person imag...

0 Yurui Ren, et al. ∙

research

∙ 08/11/2019

StructureFlow: Image Inpainting via Structure-aware Appearance Flow

Image inpainting techniques have shown significant improvements by using...

5 Yurui Ren, et al. ∙

research

∙ 04/18/2019

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds

Point cloud is a fundamental 3D representation which is widely used in r...

0 Wei Yan, et al. ∙

research

∙ 03/18/2019

Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection

Video anomaly detection under weak labels is formulated as a typical mul...

0 Jia-Xing Zhong, et al. ∙

research

∙ 07/09/2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Weakly supervised temporal action detection is a Herculean task in under...

0 Jia-Xing Zhong, et al. ∙

research

∙ 05/14/2018

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection

Saliency detection aims to detect the most attractive objects in images ...

0 Chunbiao Zhu, et al. ∙

research

∙ 03/23/2018

PDNet: Prior-model Guided Depth-enhanced Network for Salient Object Detection

Fully convolutional neural networks (FCNs) have shown outstanding perfor...

0 Chunbiao Zhu, et al. ∙

Thomas H Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro