Object localization in general environments is a fundamental part of vis...
The Segment Anything Model (SAM) has established itself as a powerful
ze...
The recent advancement in Video Instance Segmentation (VIS) has largely ...
Segmenting highly-overlapping image objects is challenging, because ther...
While Video Instance Segmentation (VIS) has seen rapid progress, current...
Two-stage and query-based instance segmentation methods have achieved
re...
Conventional video inpainting is neither object-oriented nor occlusion-a...
Multiple object tracking and segmentation requires detecting, tracking, ...
Segmenting highly-overlapping objects is challenging, because typically ...
We present a novel end-to-end framework named as GSNet (Geometric and
Sc...
Partially supervised instance segmentation aims to perform learning on
l...
End-to-end deep representation learning has achieved remarkable accuracy...
State-of-the-art image captioning methods mostly focus on improving visu...
Typical techniques for video captioning follow the encoder-decoder frame...