Large language models (LLMs) are at the forefront of transforming numero...
We propose Retrieval Augmented Generation (RAG) as an approach for autom...
Generative AI models have impressive performance on many Natural Languag...
Infographics are often an integral component of scientific documents for...
We present very early results on using GPT-3 to perform question answeri...
Leveraging shared learning through Massively Multilingual Models,
state-...
Visual cues such as structure, emphasis, and icons play an important rol...
Printed documents continue to be a challenge for blind, low-vision, and ...
Accessing daily news content still remains a big challenge for people wi...
Recent advancements in NLP have given us models like mBERT and XLMR that...
Executing computer vision models on streaming visual data, or streaming
...
In the context of the ongoing Covid-19 pandemic, several reports and stu...