We propose a novel multimodal video benchmark, the Perception Test, to...
The unabated mystique of large-scale neural networks, such as the CLIP d...
Whilst there are perhaps only a few scientific methods, there seem to be...
For artificial general intelligence (AGI), it would be efficient if multi...
In this work we introduce a differentiable version of the Compositional
...