Dense Transformer Networks

05/24/2017
by   Jun Li, et al.
0

The key idea of current deep learning methods for dense prediction is to apply a model on a regular patch centered on each pixel to make pixel-wise predictions. These methods are limited in the sense that the patches are determined by network architecture instead of learned from data. In this work, we propose the dense transformer networks, which can learn the shapes and sizes of patches from data. The dense transformer networks employ an encoder-decoder architecture, and a pair of dense transformer modules are inserted into each of the encoder and decoder paths. The novelty of this work is that we provide technical solutions for learning the shapes and sizes of patches from data and efficiently restoring the spatial correspondence required for dense prediction. The proposed dense transformer modules are differentiable, thus the entire network can be trained. We apply the proposed networks on natural and biological image segmentation tasks and show superior performance is achieved in comparison to baseline methods.

READ FULL TEXT

page 5

page 7

page 8

research
01/26/2021

CPTR: Full Transformer Network for Image Captioning

In this paper, we consider the image captioning task from a new sequence...
research
03/26/2023

Contrastive Transformer: Contrastive Learning Scheme with Transformer innate Patches

This paper presents Contrastive Transformer, a contrastive learning sche...
research
09/27/2021

Sparse Spatial Transformers for Few-Shot Learning

Learning from limited data is a challenging task since the scarcity of d...
research
03/30/2023

PMatch: Paired Masked Image Modeling for Dense Geometric Matching

Dense geometric matching determines the dense pixel-wise correspondence ...
research
04/16/2019

SparseMask: Differentiable Connectivity Learning for Dense Image Prediction

In this paper, we aim at automatically searching an efficient network ar...
research
11/09/2020

Detecting Outliers with Foreign Patch Interpolation

In medical imaging, outliers can contain hypo/hyper-intensities, minor d...
research
12/16/2020

CompositeTasking: Understanding Images by Spatial Composition of Tasks

We define the concept of CompositeTasking as the fusion of multiple, spa...

Please sign up or login with your details

Forgot password? Click here to reset