PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy

by Li Xiao, et al.

Many recently developed object detectors follow a coarse-to-fine framework, in which several stages classify and regress proposals at progressively finer granularity and so gradually obtain more accurate detections. Multi-resolution models such as the Feature Pyramid Network (FPN) integrate information from different resolution levels and effectively improve performance. Previous research has also shown that localization can be further improved by 1) using fine-grained information, which is more translation-variant, and 2) refining local areas, which focuses more on local boundary information. Based on these principles, we design a novel boundary refinement architecture, named the Pyramidal Bounding Box Refinement network (PBRnet), that improves localization accuracy by combining the coarse-to-fine framework with a feature pyramid structure. PBRnet parameterizes gradually focused boundary areas of objects and leverages lower-level feature maps to extract finer local information when refining the predicted bounding boxes. Extensive experiments are performed on the MS-COCO dataset. PBRnet brings a significant performance gain of roughly 3 points of mAP when added to FPN or Libra R-CNN. Moreover, treating Cascade R-CNN as a coarse-to-fine detector and replacing its localization branch with the regressor of PBRnet yields an extra improvement of 1.5 mAP, for a total performance boost of as much as 5 points of mAP.
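The idea of refining a box through gradually narrower boundary areas can be illustrated with a toy sketch. This is not the PBRnet implementation: the names `boundary_strips`, `refine`, and `toy_regressor` are hypothetical, the strip ratios are arbitrary, and the learned regression heads that would read finer feature pyramid levels are replaced by a stand-in that simply moves each border toward a known target, limited to the current strip.

```python
from typing import Callable, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Strips = Tuple[Box, Box, Box, Box]       # left, top, right, bottom
Regressor = Callable[[Box, Strips], Tuple[float, float, float, float]]


def boundary_strips(box: Box, ratio: float) -> Strips:
    """Return four strips centred on the box borders.

    `ratio` is the strip half-width as a fraction of the box size
    (an assumed parameterization, chosen for illustration only).
    """
    x1, y1, x2, y2 = box
    dw = (x2 - x1) * ratio
    dh = (y2 - y1) * ratio
    left = (x1 - dw, y1, x1 + dw, y2)
    top = (x1, y1 - dh, x2, y1 + dh)
    right = (x2 - dw, y1, x2 + dw, y2)
    bottom = (x1, y2 - dh, x2, y2 + dh)
    return left, top, right, bottom


def refine(box: Box, regressors: Sequence[Regressor],
           ratios: Sequence[float]) -> Box:
    """Apply one regressor per stage, each looking at narrower strips."""
    for regress, ratio in zip(regressors, ratios):
        strips = boundary_strips(box, ratio)
        dl, dt, dr, db = regress(box, strips)
        x1, y1, x2, y2 = box
        box = (x1 + dl, y1 + dt, x2 + dr, y2 + db)
    return box


def toy_regressor(target: Box) -> Regressor:
    """Stand-in for a learned head: step toward `target`, but never
    farther than the half-width of the strip it is allowed to see."""
    def regress(box: Box, strips: Strips):
        half_w = (strips[0][2] - strips[0][0]) / 2
        half_h = (strips[1][3] - strips[1][1]) / 2
        limits = (half_w, half_h, half_w, half_h)
        return tuple(max(-lim, min(lim, t - b))
                     for t, b, lim in zip(target, box, limits))
    return regress


if __name__ == "__main__":
    target = (30.0, 0.0, 100.0, 100.0)
    stages = [toy_regressor(target)] * 2
    print(refine((0.0, 0.0, 100.0, 100.0), stages, ratios=[0.2, 0.1]))
```

In a real detector each stage would pool features from its strips, with later stages drawing on lower, higher-resolution pyramid levels; the shrinking `ratios` here only mimic that progressive focus on the boundary.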




Acquisition of Localization Confidence for Accurate Object Detection

Modern CNN-based object detectors rely on bounding box regression and no...

RepPoints: Point Set Representation for Object Detection

Modern object detectors rely heavily on rectangular bounding boxes, such...

P2P-Loc: Point to Point Tiny Person Localization

Bounding-box annotation form has been the most frequently used method fo...

Side-Aware Boundary Localization for More Precise Object Detection

Current object detection frameworks mainly rely on bounding box regressi...

Precise Temporal Action Localization by Evolving Temporal Proposals

Locating actions in long untrimmed videos has been a challenging problem...

Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism

The loss function for bounding box regression (BBR) is essential to obje...

IMMVP: An Efficient Daytime and Nighttime On-Road Object Detector

It is hard to detect on-road objects under various lighting conditions. ...
