Knowledge Combination to Learn Rotated Detection Without Rotated Annotation

04/05/2023
by Tianyu Zhu, et al.

Rotated bounding boxes drastically reduce the output ambiguity of elongated objects, making them superior to axis-aligned bounding boxes. Despite their effectiveness, rotated detectors are not widely employed. Annotating rotated bounding boxes is so laborious that many detection datasets provide only axis-aligned annotations. In this paper, we propose a framework that allows the model to predict precise rotated boxes while requiring only the cheaper axis-aligned annotation of the target dataset. To achieve this, we leverage the fact that neural networks are capable of learning a richer representation of the target domain than what is utilized by the task. The under-utilized representation can be exploited to address a more detailed task. Our framework combines task knowledge from an out-of-domain source dataset with stronger annotation and domain knowledge from the target dataset with weaker annotation. A novel assignment process and a projection loss enable co-training on the source and target datasets. As a result, the model is able to solve the more detailed task in the target domain without additional computational overhead during inference. We extensively evaluate the method on various target datasets, including a fresh-produce dataset, HRSC2016, and SSDD. Results show that the proposed method consistently performs on par with the fully supervised approach.
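
The abstract does not spell out the projection loss, so the following is only a minimal sketch of the general idea it suggests: a predicted rotated box can be projected onto its tightest enclosing axis-aligned box and compared against the axis-aligned annotation. The function names, the (cx, cy, w, h, theta) parameterization, and the use of a mean L1 distance are assumptions for illustration, not the authors' formulation.

```python
import math

def rotated_to_axis_aligned(cx, cy, w, h, theta):
    """Tightest axis-aligned box (x_min, y_min, x_max, y_max) enclosing a
    rotated box with center (cx, cy), size (w, h), and angle theta in radians."""
    c, s = abs(math.cos(theta)), abs(math.sin(theta))
    half_w = 0.5 * (w * c + h * s)  # half extent along the x axis
    half_h = 0.5 * (w * s + h * c)  # half extent along the y axis
    return (cx - half_w, cy - half_h, cx + half_w, cy + half_h)

def projection_l1(pred_rotated, target_aabb):
    """Hypothetical projection loss: mean L1 distance between the projected
    prediction and an axis-aligned annotation (assumed form, not the paper's)."""
    projected = rotated_to_axis_aligned(*pred_rotated)
    return sum(abs(p - t) for p, t in zip(projected, target_aabb)) / 4.0

# Usage: a 4x2 box rotated by 90 degrees projects to a 2x4 axis-aligned box.
box = rotated_to_axis_aligned(0.0, 0.0, 4.0, 2.0, math.pi / 2)
loss = projection_l1((0.0, 0.0, 4.0, 2.0, 0.0), (-2.0, -1.0, 2.0, 1.0))
```

Because many rotated boxes share the same axis-aligned projection, a term like this cannot determine the angle by itself, which is consistent with the abstract's point that task knowledge from a source dataset with rotated annotation is also needed.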


research
10/04/2022

Centerpoints Are All You Need in Overhead Imagery

Labeling data to use for training object detectors is expensive and time...
research
12/23/2020

Efficient video annotation with visual interpolation and frame selection guidance

We introduce a unified framework for generic video annotation with bound...
research
08/19/2021

Box-Adapt: Domain-Adaptive Medical Image Segmentation using Bounding Box Supervision

Deep learning has achieved remarkable success in medical image segmentati...
research
08/17/2020

Video Region Annotation with Sparse Bounding Boxes

Video analysis has been moving towards more detailed interpretation (e.g...
research
10/29/2018

A Coarse-to-fine Pyramidal Model for Person Re-identification via Multi-Loss Dynamic Training

Most existing Re-IDentification (Re-ID) methods are highly dependent on ...
research
06/26/2020

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In th...
research
07/22/2022

Evaluation of Different Annotation Strategies for Deployment of Parking Spaces Classification Systems

When using vision-based approaches to classify individual parking spaces...
