We develop a generalised notion of disentanglement in Variational
Auto-E...
Deep latent-variable models learn representations of high-dimensional da...
We present an approach to labeling short video clips with English verbs ...
We present a framework that allows an observer to determine occluded por...
We present a system that produces sentential descriptions of video: who ...
The common internal structure and algorithmic organization of object
det...