Jiajun Deng

research

∙ 08/18/2023

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

We tackle the data scarcity challenge in few-shot point cloud recognitio...

0 Xuanyu Yi, et al. ∙

research

∙ 08/17/2023

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

In fisheye images, rich distinct distortion patterns are regularly distr...

0 Hao Feng, et al. ∙

research

∙ 08/14/2023

Masked Motion Predictors are Strong 3D Action Representation Learners

In 3D human action recognition, limited supervised data makes it challen...

0 Yunyao Mao, et al. ∙

research

∙ 08/11/2023

Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

Recent progress in weakly supervised object detection is featured by a c...

0 Yufei Yin, et al. ∙

research

∙ 07/06/2023

Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition

Accurate recognition of cocktail party speech containing overlapping spe...

0 Guinan Li, et al. ∙

research

∙ 06/27/2023

Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition

Automatic recognition of disordered and elderly speech remains highly ch...

0 Tianzi Wang, et al. ∙

research

∙ 06/26/2023

Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems

Rich sources of variability in natural speech present significant challe...

0 Jiajun Deng, et al. ∙

research

∙ 06/23/2023

Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems

Current ASR systems are mainly trained and evaluated at the utterance le...

0 Mingyu Cui, et al. ∙

research

∙ 06/02/2023

Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

LiDAR and Radar are two complementary sensing approaches in that LiDAR s...

0 Yingjie Wang, et al. ∙

research

∙ 05/18/2023

Use of Speech Impairment Severity for Dysarthric Speech Recognition

A key challenge in dysarthric speech recognition is the speaker-level di...

0 Mengzhe Geng, et al. ∙

research

∙ 04/18/2023

Deep Unrestricted Document Image Rectification

In recent years, tremendous efforts have been made on document image rec...

0 Hao Feng, et al. ∙

research

∙ 02/28/2023

Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition

Automatic recognition of disordered and elderly speech remains a highly ...

0 Shujie Hu, et al. ∙

research

∙ 02/15/2023

Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems

Speaker adaptation techniques provide a powerful solution to customise a...

0 Jiajun Deng, et al. ∙

research

∙ 01/21/2023

Recurrent Contour-based Instance Segmentation with Progressive Learning

Contour-based instance segmentation has been actively studied, thanks to...

0 Hao Feng, et al. ∙

research

∙ 01/13/2023

OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object Detection

The recent trend for multi-camera 3D object detection is through the uni...

0 Xiaomeng Chu, et al. ∙

research

∙ 11/03/2022

Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition

Automatic recognition of disordered speech remains a highly challenging ...

0 Zengrui Jin, et al. ∙

research

∙ 10/29/2022

Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection

Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating p...

0 Yi Wang, et al. ∙

research

∙ 10/15/2022

Geometric Representation Learning for Document Image Rectification

In document image rectification, there exist rich geometric constraints ...

0 Hao Feng, et al. ∙

research

∙ 08/26/2022

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

In 3D action recognition, there exists rich complementary information be...

0 Yunyao Mao, et al. ∙

research

∙ 06/24/2022

Confidence Score Based Conformer Speaker Adaptation for Speech Recognition

A key challenge for automatic speech recognition (ASR) systems is to mod...

0 Jiajun Deng, et al. ∙

research

∙ 06/23/2022

Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection

Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating p...

2 Tianzi Wang, et al. ∙

research

∙ 06/23/2022

Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems

Fundamental modelling differences between hybrid and end-to-end (E2E) au...

0 Mingyu Cui, et al. ∙

research

∙ 06/15/2022

Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition

Articulatory features are inherently invariant to acoustic signal distor...

0 Shujie Hu, et al. ∙

research

∙ 06/14/2022

TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

In this work, we explore neat yet effective Transformer-based frameworks...

19 Jiajun Deng, et al. ∙

research

∙ 05/13/2022

Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition

Despite the rapid progress of automatic speech recognition (ASR) technol...

1 Zengrui Jin, et al. ∙

research

∙ 04/05/2022

Audio-visual multi-channel speech separation, dereverberation and recognition

Despite the rapid advance of automatic speech recognition (ASR) technolo...

0 Guinan Li, et al. ∙

research

∙ 01/08/2022

Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks

State-of-the-art automatic speech recognition (ASR) system development i...

0 Shoukang Hu, et al. ∙

research

∙ 11/29/2021

VPFNet: Improving 3D Object Detection with Virtual Point based LiDAR and Stereo Data Fusion

It has been well recognized that fusing the complementary information fr...

0 Hanqi Zhu, et al. ∙

research

∙ 10/28/2021

DocScanner: Robust Document Image Rectification with Progressive Learning

Compared to flatbed scanners, portable smartphones are much more conveni...

0 Hao Feng, et al. ∙

research

∙ 10/25/2021

DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction

In this work, we propose a new framework, called Document Image Transfor...

0 Hao Feng, et al. ∙

research

∙ 07/30/2021

From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection

As an emerging data modal with precise distance sensing, LiDAR point clo...

0 Jiajun Deng, et al. ∙

research

∙ 07/06/2021

Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting

As cameras are increasingly deployed in new application domains such as ...

0 Xiaomeng Chu, et al. ∙

research

∙ 06/30/2021

Weakly Supervised Temporal Adjacent Network for Language Grounding

Temporal language grounding (TLG) is a fundamental and challenging probl...

0 Yuechen Wang, et al. ∙

research

∙ 04/17/2021

TransVG: End-to-End Visual Grounding with Transformers

In this paper, we present a neat yet effective transformer-based framewo...

0 Jiajun Deng, et al. ∙

research

∙ 01/31/2021

PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection

3D object detection is receiving increasing attention from both industry...

5 Shaoshuai Shi, et al. ∙

research

∙ 12/31/2020

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Recent advances on 3D object detection heavily rely on how the 3D data a...

0 Jiajun Deng, et al. ∙

research

∙ 10/15/2020

Masked Contrastive Representation Learning for Reinforcement Learning

Improving sample efficiency is a key research problem in reinforcement l...

0 Jinhua Zhu, et al. ∙

research

∙ 07/07/2020

Single Shot Video Object Detector

Single shot detectors that are potentially faster and simpler than two-s...

0 Jiajun Deng, et al. ∙

research

∙ 03/07/2020

Adaptive Offline Quintuplet Loss for Image-Text Matching

Existing image-text matching approaches typically leverage triplet loss ...

2 Tianlang Chen, et al. ∙

research

∙ 08/26/2019

Relation Distillation Networks for Video Object Detection

It has been well recognized that modeling object-to-object relations wou...

0 Jiajun Deng, et al. ∙

Jiajun Deng

Featured Co-authors

Sign in with Google

Consider DeepAI Pro