Supervised visual captioning models typically require a large scale of i...
Generating an informative and attractive title for the product is a cruc...
Automatic radiology report generation has attracted enormous research
in...
Natural Language Generation (NLG) accepts input data in the form of imag...
The "Patient Instruction" (PI), which contains critical instructional
in...
Gene Ontology (GO) is the primary gene function knowledge base that enab...
For video captioning, "pre-training and fine-tuning" has become a de fac...
Video captioning combines video understanding and language generation.
D...
Existing state-of-the-art autoregressive video captioning methods (ARVC)...