This paper analyzes three formal models of Transformer encoders that dif...
A regular language is almost fully characterized by its right congruence...
This paper analyzes the behavior of stack-augmented recurrent neural net...
We present the first polynomial-time algorithm to learn nontrivial class...