CiT-Net: Convolutional Neural Networks Hand in Hand with Vision Transformers for Medical Image Segmentation

06/06/2023
by Tao Lei, et al.

Hybrid architectures that combine convolutional neural networks (CNNs) and Transformers are very popular for medical image segmentation. However, they suffer from two challenges. First, although a CNN branch can capture local image features using vanilla convolution, it cannot achieve adaptive feature learning. Second, although a Transformer branch can capture global features, it ignores channel and cross-dimensional self-attention, resulting in low segmentation accuracy on images with complex content. To address these challenges, we propose a novel hybrid architecture of convolutional neural networks hand in hand with vision Transformers (CiT-Net) for medical image segmentation. Our network has two advantages. First, we design a dynamic deformable convolution and apply it to the CNN branch, which overcomes the weak feature extraction ability caused by fixed-size convolution kernels and the rigid design of sharing kernel parameters among different inputs. Second, we design a shifted-window adaptive complementary attention module and a compact convolutional projection, and apply them to the Transformer branch to learn cross-dimensional long-term dependencies in medical images. Experimental results show that CiT-Net provides better medical image segmentation results than popular SOTA methods. In addition, CiT-Net requires fewer parameters and lower computational cost, and does not rely on pre-training. The code is publicly available at https://github.com/SR0920/CiT-Net.
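
To make the distinction between spatial and channel self-attention concrete, the following is a minimal sketch in plain Python. It is not the CiT-Net attention module: the function names, the toy `tokens` values, and the use of identity query/key/value projections are all assumptions made for illustration. It only shows that spatial attention relates tokens to each other (an N x N map), while channel attention relates feature channels to each other (a C x C map).

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def spatial_attention(x):
    """x has N tokens (rows) and C channels (columns).

    Assumption: identity Q/K/V projections. Returns the N x N attention
    map and the N x C output, where each token is a mix of all tokens.
    """
    scale = math.sqrt(len(x[0]))
    scores = matmul(x, transpose(x))  # N x N token-to-token affinities
    attn = [softmax([s / scale for s in row]) for row in scores]
    return attn, matmul(attn, x)


def channel_attention(x):
    """Same input layout, but attention is computed across channels.

    Assumption: identity Q/K/V projections. Returns the C x C attention
    map and the N x C output, where each channel is a mix of all channels.
    """
    xt = transpose(x)  # C x N
    scale = math.sqrt(len(x))
    scores = matmul(xt, x)  # C x C channel-to-channel affinities
    attn = [softmax([s / scale for s in row]) for row in scores]
    return attn, transpose(matmul(attn, xt))


# Toy feature map (hypothetical values): 4 tokens, 3 channels.
tokens = [
    [1.0, 0.0, 2.0],
    [0.5, 1.0, 0.0],
    [0.0, 2.0, 1.0],
    [1.5, 0.5, 0.5],
]
spatial_map, spatial_out = spatial_attention(tokens)
channel_map, channel_out = channel_attention(tokens)
print(len(spatial_map), len(spatial_map[0]))  # size of the token-to-token map
print(len(channel_map), len(channel_map[0]))  # size of the channel-to-channel map
```

Both functions return an output with the same layout as the input, so in principle the two results could be combined. How CiT-Net actually fuses them is described in the full paper and the linked repository, not in this sketch.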
