Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation

04/22/2021
by   Ryuhei Hamaguchi, et al.
6

This paper proposes a novel heterogeneous grid convolution that builds a graph-based image representation by exploiting heterogeneity in the image content, enabling adaptive, efficient, and controllable computations in a convolutional architecture. More concretely, the approach builds a data-adaptive graph structure from a convolutional layer by a differentiable clustering method, pools features to the graph, performs a novel direction-aware graph convolution, and unpool features back to the convolutional layer. By using the developed module, the paper proposes heterogeneous grid convolutional networks, highly efficient yet strong extension of existing architectures. We have evaluated the proposed approach on four image understanding tasks, semantic segmentation, object localization, road extraction, and salient object detection. The proposed method is effective on three of the four tasks. Especially, the method outperforms a strong baseline with more than 90 segmentation, and achieves the state-of-the-art result for road extraction. We will share our code, model, and data.

READ FULL TEXT

page 1

page 3

page 6

page 15

page 16

page 17

page 18

page 19

research
03/17/2023

Adaptive Graph Convolution Module for Salient Object Detection

Salient object detection (SOD) is a task that involves identifying and s...
research
12/25/2021

Cyberattack Detection in Large-Scale Smart Grids using Chebyshev Graph Convolutional Networks

As a highly complex and integrated cyber-physical system, modern power g...
research
11/29/2018

Grid R-CNN

This paper proposes a novel object detection framework named Grid R-CNN,...
research
11/11/2022

Dual Complementary Dynamic Convolution for Image Recognition

As a powerful engine, vanilla convolution has promoted huge breakthrough...
research
11/27/2020

Road Scene Graph: A Semantic Graph-Based Scene Representation Dataset for Intelligent Vehicles

Rich semantic information extraction plays a vital role on next-generati...
research
02/17/2023

3D Human Pose Lifting with Grid Convolution

Existing lifting networks for regressing 3D human poses from 2D single-v...
research
12/07/2020

CARAFE++: Unified Content-Aware ReAssembly of FEatures

Feature reassembly, i.e. feature downsampling and upsampling, is a key o...

Please sign up or login with your details

Forgot password? Click here to reset