GINet: Graph Interaction Network for Scene Parsing

09/14/2020
by   Tianyi Wu, et al.
15

Recently, context reasoning using image regions beyond local convolution has shown great potential for scene parsing. In this work, we explore how to incorporate the linguistic knowledge to promote context reasoning over image regions by proposing a Graph Interaction unit (GI unit) and a Semantic Context Loss (SC-loss). The GI unit is capable of enhancing feature representations of convolution networks over high-level semantics and learning the semantic coherency adaptively to each sample. Specifically, the dataset-based linguistic knowledge is first incorporated in the GI unit to promote context reasoning over the visual graph, then the evolved representations of the visual graph are mapped to each local representation to enhance the discriminated capability for scene parsing. GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph. We perform full ablation studies to demonstrate the effectiveness of each component in our approach. Particularly, the proposed GINet outperforms the state-of-the-art approaches on the popular benchmarks, including Pascal-Context and COCO Stuff.

READ FULL TEXT
research
08/06/2019

Aligning Linguistic Words and Visual Semantic Units for Image Captioning

Image captioning attempts to generate a sentence composed of several lin...
research
07/10/2019

Modeling Semantic Compositionality with Sememe Knowledge

Semantic compositionality (SC) refers to the phenomenon that the meaning...
research
07/29/2019

Consensus Feature Network for Scene Parsing

Scene parsing is challenging as it aims to assign one of the semantic ca...
research
04/17/2019

CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing

Objects in an image exhibit diverse scales. Adaptive receptive fields ar...
research
10/17/2017

Scene Parsing with Global Context Embedding

We present a scene parsing method that utilizes global context informati...
research
04/09/2019

Graphonomy: Universal Human Parsing via Graph Transfer Learning

Prior highly-tuned human parsing models tend to fit towards each dataset...
research
12/08/2022

Latent Graph Representations for Critical View of Safety Assessment

Assessing the critical view of safety in laparoscopic cholecystectomy re...

Please sign up or login with your details

Forgot password? Click here to reset