b'Xiaoming Wei'

research

∙ 08/14/2023

Orthogonal Temporal Interpolation for Zero-Shot Video Recognition

Zero-shot video recognition (ZSVR) is a task that aims to recognize vide...

0 Yan Zhu, et al. ∙

research

∙ 02/01/2023

EfficientRep:An Efficient Repvgg-style ConvNets with Hardware-aware Neural Network Design

We present a hardware-efficient architecture of convolutional neural net...

0 Kaiheng Weng, et al. ∙

research

∙ 11/30/2022

Uncertainty-Aware Image Captioning

It is well believed that the higher uncertainty in a word of the caption...

0 Zhengcong Fei, et al. ∙

research

∙ 10/05/2022

Progressive Denoising Model for Fine-Grained Text-to-Image Generation

Recently, vector quantized autoregressive (VQ-AR) models have shown rema...

0 Zhengcong Fei, et al. ∙

research

∙ 10/05/2022

Meta-Ensemble Parameter Learning

Ensemble of machine learning models yields improved performance as well ...

0 Zhengcong Fei, et al. ∙

research

∙ 09/30/2022

Rethinking skip connection model as a learnable Markov chain

Over past few years afterward the birth of ResNet, skip connection has b...

0 Dengsheng Chen, et al. ∙

research

∙ 09/07/2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

For years, the YOLO series has been the de facto industry-level standard...

0 Chuyi Li, et al. ∙

research

∙ 08/11/2022

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding

Panoptic Narrative Grounding (PNG) is an emerging task whose goal is to ...

1 Zihan Ding, et al. ∙

research

∙ 07/22/2022

Efficient Modeling of Future Context for Image Captioning

Existing approaches to image captioning usually generate the sentence wo...

0 Zhengcong Fei, et al. ∙

research

∙ 06/08/2022

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

Referring video object segmentation aims to predict foreground labels fo...

0 Zihan Ding, et al. ∙

research

∙ 05/12/2021

Structure Guided Lane Detection

Recently, lane detection has made great progress with the rapid developm...

6 Jinming Su, et al. ∙

research

∙ 04/27/2021

Rethinking BiSeNet For Real-time Semantic Segmentation

BiSeNet has been proved to be a popular two-stream network for real-time...

0 Mingyuan Fan, et al. ∙

research

∙ 03/30/2021

Large Scale Visual Food Recognition

Food recognition plays an important role in food choice and intake, whic...

0 Weiqing Min, et al. ∙

research

∙ 08/13/2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

Food recognition has received more and more attention in the multimedia ...

0 Weiqing Min, et al. ∙

research

∙ 05/09/2019

Grand Challenge of 106-Point Facial Landmark Localization

Facial landmark localization is a very crucial step in numerous face rel...

0 Yinglu Liu, et al. ∙

Xiaoming Wei

Featured Co-authors

Sign in with Google

Consider DeepAI Pro