Hard-Attention for Scalable Image Classification

02/20/2021
by   Athanasios Papadopoulos, et al.
9

Deep neural networks (DNNs) are typically optimized for a specific input resolution (e.g. 224 × 224 px) and their adoption to inputs of higher resolution (e.g., satellite or medical images) remains challenging, as it leads to excessive computation and memory overhead, and may require substantial engineering effort (e.g., streaming). We show that multi-scale hard-attention can be an effective solution to this problem. We propose a novel architecture, TNet, which traverses an image pyramid in a top-down fashion, visiting only the most informative regions along the way. We compare our model against strong hard-attention baselines, achieving a better trade-off between resources and accuracy on ImageNet. We further verify the efficacy of our model on satellite images (fMoW dataset) of size up to 896 × 896 px. In addition, our hard-attention mechanism guarantees predictions with a degree of interpretability, without extra cost beyond inference. We also show that we can reduce data acquisition and annotation cost, since our model attends only to a fraction of the highest resolution content, while using only image-level labels without bounding boxes.

READ FULL TEXT

page 2

page 8

page 18

page 19

research
08/20/2019

Saccader: Improving Accuracy of Hard Attention Models for Vision

Although deep convolutional neural networks achieve state-of-the-art per...
research
03/31/2021

Channel-Based Attention for LCC Using Sentinel-2 Time Series

Deep Neural Networks (DNNs) are getting increasing attention to deal wit...
research
07/31/2020

A Novel Global Spatial Attention Mechanism in Convolutional Neural Network for Medical Image Classification

Spatial attention has been introduced to convolutional neural networks (...
research
02/19/2023

Mimicking a Pathologist: Dual Attention Model for Scoring of Gigapixel Histology Images

Some major challenges associated with the automated processing of whole ...
research
06/06/2018

Attention Incorporate Network: A network can adapt various data size

In traditional neural networks for image processing, the inputs of the n...
research
10/16/2019

Large-Scale Landslides Detection from Satellite Images with Incomplete Labels

Earthquakes and tropical cyclones cause the suffering of millions of peo...
research
12/26/2018

Region Proposal Networks with Contextual Selective Attention for Real-Time Organ Detection

State-of-the-art methods for object detection use region proposal networ...

Please sign up or login with your details

Forgot password? Click here to reset