An Algorithm for Routing Capsules in All Domains

11/02/2019
by   Franz A. Heinsen, et al.
55

Building on recent work on capsule networks, we propose a new form of "routing by agreement" that activates output capsules in a layer as a function of their net benefit to use and net cost to ignore input capsules from earlier layers. As sample applications, we present two capsule networks that use our algorithm without change in different domains: vision and language. The first network achieves new state-of-the-art accuracy of 99.1 recognition task with fewer parameters and an order of magnitude less training than previous capsule models, and we find evidence that it learns to perform a form of "reverse graphics." The second network achieves new state-of-the-art accuracies on the root sentences of the Stanford Sentiment Treebank: 58.5 fine-grained and 95.6 frozen embeddings from a pretrained transformer as capsules. Both networks are trained with the same regime. Code is available at https://github.com/glassroom/heinsen_routing along with replication instructions.

READ FULL TEXT
research
02/12/2020

Capsules with Inverted Dot-Product Attention Routing

We introduce a new routing algorithm for capsule networks, in which a ch...
research
09/19/2021

Capsule networks with non-iterative cluster routing

Capsule networks use routing algorithms to flow information between cons...
research
08/20/2018

CapsDeMM: Capsule network for Detection of Munro' s Microabscess in skin biopsy images

This paper presents an approach for automatic detection of Munro' s Micr...
research
08/27/2018

Generalized Capsule Networks with Trainable Routing Procedure

CapsNet (Capsule Network) was first proposed by capsule and later anothe...
research
01/26/2022

Momentum Capsule Networks

Capsule networks are a class of neural networks that achieved promising ...
research
11/20/2022

An Algorithm for Routing Vectors in Sequences

We propose a routing algorithm that takes a sequence of vectors and comp...
research
05/24/2022

History Compression via Language Models in Reinforcement Learning

In a partially observable Markov decision process (POMDP), an agent typi...

Please sign up or login with your details

Forgot password? Click here to reset