The excellent text-to-image synthesis capability of diffusion models has...
Commonsense question answering (QA) research requires machines to answer...
Online Social Network (OSN) has become a hotbed of fake news due to the ...
Visual entailment (VE) is to recognize whether the semantics of a hypoth...