Vision Transformer (ViT) has shown great potential for various visual ta...
Form 10-K report is a financial report disclosing the annual financial s...
Novel artificial intelligence (AI) technology has expedited various
scie...
Large transformer models display promising performance on a wide range o...
Self-supervised pre-training techniques have achieved remarkable progres...
A creative image-and-text generative AI system mimics humans' extraordin...
We study the joint learning of image-to-text and text-to-image generatio...
In this paper, a parallel structured divide-and-conquer (PSDC) eigensolv...
Video temporal action detection aims to temporally localize and recogniz...