We tackle the data scarcity challenge in few-shot point cloud recognitio...
In fisheye images, rich distinct distortion patterns are regularly
distr...
In 3D human action recognition, limited supervised data makes it challen...
Recent progress in weakly supervised object detection is characterized by a
c...
Accurate recognition of cocktail party speech containing overlapping
spe...
Automatic recognition of disordered and elderly speech remains highly
ch...
Rich sources of variability in natural speech present significant challe...
Current ASR systems are mainly trained and evaluated at the utterance le...
LiDAR and Radar are two complementary sensing approaches in that LiDAR
s...
A key challenge in dysarthric speech recognition is the speaker-level
di...
In recent years, tremendous efforts have been made on document image
rec...
Automatic recognition of disordered and elderly speech remains a highly
...
Speaker adaptation techniques provide a powerful solution to customise
a...
Contour-based instance segmentation has been actively studied, thanks to...
The recent trend for multi-camera 3D object detection is through the uni...
Automatic recognition of disordered speech remains a highly challenging ...
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating
p...
In document image rectification, there exist rich geometric constraints
...
In 3D action recognition, there exists rich complementary information be...
A key challenge for automatic speech recognition (ASR) systems is to mod...
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating
p...
Fundamental modelling differences between hybrid and end-to-end (E2E)
au...
Articulatory features are inherently invariant to acoustic signal distor...
In this work, we explore neat yet effective Transformer-based frameworks...
Despite the rapid progress of automatic speech recognition (ASR) technol...
Despite the rapid advance of automatic speech recognition (ASR) technolo...
State-of-the-art automatic speech recognition (ASR) system development i...
It has been well recognized that fusing the complementary information fr...
Compared to flatbed scanners, portable smartphones are much more conveni...
In this work, we propose a new framework, called Document Image Transfor...
As an emerging data modality with precise distance sensing, LiDAR point clo...
As cameras are increasingly deployed in new application domains such as
...
Temporal language grounding (TLG) is a fundamental and challenging probl...
In this paper, we present a neat yet effective transformer-based framewo...
3D object detection is receiving increasing attention from both industry...
Recent advances in 3D object detection heavily rely on how the 3D data a...
Improving sample efficiency is a key research problem in reinforcement
l...
Single shot detectors that are potentially faster and simpler than two-s...
Existing image-text matching approaches typically leverage triplet loss ...
It has been well recognized that modeling object-to-object relations wou...