Although vision transformers (ViTs) have achieved great success in compu...
Video creation has been an attractive yet challenging task for artists t...
Very recently, Window-based Transformers, which computed self-attention
...
Multi-person human pose estimation and tracking in the wild is important...