FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation

10/18/2021
by   Fuqin Deng, et al.
17

The RGB-Thermal (RGB-T) information for semantic segmentation has been extensively explored in recent years. However, most existing RGB-T semantic segmentation usually compromises spatial resolution to achieve real-time inference speed, which leads to poor performance. To better extract detail spatial information, we propose a two-stage Feature-Enhanced Attention Network (FEANet) for the RGB-T semantic segmentation task. Specifically, we introduce a Feature-Enhanced Attention Module (FEAM) to excavate and enhance multi-level features from both the channel and spatial views. Benefited from the proposed FEAM module, our FEANet can preserve the spatial information and shift more attention to high-resolution features from the fused RGB-T images. Extensive experiments on the urban scene dataset demonstrate that our FEANet outperforms other state-of-the-art (SOTA) RGB-T methods in terms of objective metrics and subjective visual comparison (+2.6 For the 480 x 640 RGB-T test images, our FEANet can run with a real-time speed on an NVIDIA GeForce RTX 2080 Ti card.

READ FULL TEXT

page 1

page 3

page 4

page 6

research
08/24/2023

Channel and Spatial Relation-Propagation Network for RGB-Thermal Semantic Segmentation

RGB-Thermal (RGB-T) semantic segmentation has shown great potential in h...
research
04/09/2020

Spatial Information Guided Convolution for Real-Time RGBD Semantic Segmentation

3D spatial information is known to be beneficial to the semantic segment...
research
05/24/2019

ACNet: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation

Compared to RGB semantic segmentation, RGBD semantic segmentation can ac...
research
04/27/2021

Rethinking BiSeNet For Real-time Semantic Segmentation

BiSeNet has been proved to be a popular two-stream network for real-time...
research
12/24/2021

Multi-Scale Feature Fusion: Learning Better Semantic Segmentation for Road Pothole Detection

This paper presents a novel pothole detection approach based on single-m...
research
07/01/2023

Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

Domain-generalized urban-scene semantic segmentation (USSS) aims to lear...
research
11/08/2021

D-Flow: A Real Time Spatial Temporal Model for Target Area Segmentation

Semantic segmentation has attracted a large amount of attention in recen...

Please sign up or login with your details

Forgot password? Click here to reset