Shiwei Zhang

research

∙ 09/15/2023

Unleashing Potential of Evidence in Knowledge-Intensive Dialogue Generation

Incorporating external knowledge into dialogue generation (KIDG) is cruc...

0 Xianjie Wu, et al. ∙

research

∙ 09/14/2023

Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Recently, large-scale pre-trained language-image models like CLIP have s...

0 Zhiwu Qing, et al. ∙

research

∙ 08/20/2023

Towards Real-World Visual Tracking with Temporal Contexts

Visual tracking has made significant improvements in the past few decade...

0 Ziang Cao, et al. ∙

research

∙ 08/18/2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training

Relational Language-Image Pre-training (RLIP) aims to align vision repre...

0 Hangjie Yuan, et al. ∙

research

∙ 08/12/2023

ModelScope Text-to-Video Technical Report

This paper introduces ModelScopeT2V, a text-to-video synthesis model tha...

0 Jiuniu Wang, et al. ∙

research

∙ 08/10/2023

Temporally-Adaptive Models for Efficient Video Understanding

Spatial convolutions are extensively used in numerous deep video models....

0 Ziyuan Huang, et al. ∙

research

∙ 06/03/2023

VideoComposer: Compositional Video Synthesis with Motion Controllability

The pursuit of controllability as a higher standard of visual content cr...

0 Xiang Wang, et al. ∙

research

∙ 04/03/2023

MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition

Current state-of-the-art approaches for few-shot action recognition achi...

0 Xiang Wang, et al. ∙

research

∙ 03/06/2023

CLIP-guided Prototype Modulating for Few-shot Action Recognition

Learning from large-scale contrastive language-image pre-training like C...

0 Xiang Wang, et al. ∙

research

∙ 02/16/2023

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform

We present Rhino, a system for accelerating tensor programs with automat...

0 Shiwei Zhang, et al. ∙

research

∙ 02/13/2023

Expediting Distributed DNN Training with Device Topology-Aware Graph Deployment

This paper presents TAG, an automatic system to derive optimized DNN tra...

0 Shiwei Zhang, et al. ∙

research

∙ 11/02/2022

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

Recent incremental learning for action recognition usually stores repres...

0 Yixuan Pei, et al. ∙

research

∙ 09/26/2022

Optimizing DNN Compilation for Distributed Training with Joint OP and Tensor Fusion

This paper proposes DisCo, an automatic deep learning compilation module...

0 Xiaodong Yi, et al. ∙

research

∙ 09/14/2022

Prompt Combines Paraphrase: Teaching Pre-trained Models to Understand Rare Biomedical Words

Prompt-based fine-tuning for pre-trained models has proven effective for...

5 Haochun Wang, et al. ∙

research

∙ 07/24/2022

MAR: Masked Autoencoders for Efficient Action Recognition

Standard approaches for video recognition usually operate on the full in...

10 Zhiwu Qing, et al. ∙

research

∙ 07/04/2022

Open-world Semantic Segmentation for LIDAR Point Clouds

Current methods for LIDAR semantic segmentation are not robust enough fo...

0 Jun Cen, et al. ∙

research

∙ 06/18/2022

Context-aware Proposal Network for Temporal Action Detection

This technical report presents our first place winning solution for temp...

0 Xiang Wang, et al. ∙

research

∙ 03/03/2022

TCTrack: Temporal Contexts for Aerial Tracking

Temporal contexts among consecutive frames are far from being fully util...

24 Ziang Cao, et al. ∙

research

∙ 12/30/2021

Does QA-based intermediate training help fine-tuning language models for text classification?

Fine-tuning pre-trained language models for downstream tasks has become ...

0 Shiwei Zhang, et al. ∙

research

∙ 10/12/2021

TAda! Temporally-Adaptive Convolutions for Video Understanding

Spatial convolutions are widely used in numerous deep video models. It f...

0 Ziyuan Huang, et al. ∙

research

∙ 08/24/2021

Support-Set Based Cross-Supervision for Video Grounding

Current approaches for video grounding propose kinds of complex architec...

2 Xinpeng Ding, et al. ∙

research

∙ 08/24/2021

ParamCrop: Parametric Cubic Cropping for Video Contrastive Learning

The central idea of contrastive learning is to discriminate between diff...

7 Zhiwu Qing, et al. ∙

research

∙ 06/24/2021

Exploring Stronger Feature for Temporal Action Localization

Temporal action localization aims to localize starting and ending time w...

0 Zhiwu Qing, et al. ∙

research

∙ 06/21/2021

OadTR: Online Action Detection with Transformers

Most recent approaches for online action detection tend to apply Recurre...

0 Xiang Wang, et al. ∙

research

∙ 06/20/2021

Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling

Weakly-Supervised Temporal Action Localization (WS-TAL) task aims to rec...

0 Xiang Wang, et al. ∙

research

∙ 06/20/2021

Proposal Relation Network for Temporal Action Detection

This technical report presents our solution for temporal action detectio...

0 Xiang Wang, et al. ∙

research

∙ 06/15/2021

Relation Modeling in Spatio-Temporal Action Localization

This paper presents our solution to the AVA-Kinetics Crossover Challenge...

0 Yutong Feng, et al. ∙

research

∙ 06/13/2021

A Stronger Baseline for Ego-Centric Action Detection

This technical report analyzes an egocentric video action detection meth...

0 Zhiwu Qing, et al. ∙

research

∙ 06/09/2021

Towards Training Stronger Video Vision Transformers for EPIC-KITCHENS-100 Action Recognition

With the recent surge in the research of vision transformers, they have ...

0 Ziyuan Huang, et al. ∙

research

∙ 04/07/2021

Self-Supervised Learning for Semi-Supervised Temporal Action Proposal

Self-supervised learning presents a remarkable performance to utilize un...

0 Xiang Wang, et al. ∙

research

∙ 08/07/2020

Multi-Level Temporal Pyramid Network for Action Detection

Currently, one-stage frameworks have been widely applied for temporal ac...

0 Xiang Wang, et al. ∙

research

∙ 07/09/2020

Less is More: Rejecting Unreliable Reviews for Product Question Answering

Promptly and accurately answering questions on products is important for...

0 Shiwei Zhang, et al. ∙

research

∙ 06/13/2020

CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)

In this report, we present our solution for the task of temporal action ...

0 Xiang Wang, et al. ∙

research

∙ 06/13/2020

Temporal Fusion Network for Temporal Action Localization: Submission to ActivityNet Challenge 2020 (Task E)

This technical report analyzes a temporal action localization method we ...

0 Zhiwu Qing, et al. ∙

research

∙ 01/07/2020

A3: An Automatic Topology-Aware Malfunction Detection and Fixation System in Data Center Networks

Link failures and cable miswirings are not uncommon in building data cen...

0 Che Zhang, et al. ∙

research

∙ 09/20/2019

Scalable Traffic Engineering for Higher Throughput in Heavily-loaded Software Defined Networks

Existing traffic engineering (TE) solutions perform well for software d...

0 Che Zhang, et al. ∙

research

∙ 05/31/2019

TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection

Current state-of-the-art approaches for spatio-temporal action detection...

0 Lin Song, et al. ∙

