The Multimedia and Computer Vision Lab of the University of Augsburg par...
Video-to-Text (VTT) is the task of automatically generating descriptions...
Automatic medical report generation from chest X-ray images is one possi...
Automatically generating descriptive captions for images is a well-resea...
Automatically captioning images with natural language sentences is an im...