MANet: Multimodal Attention Network based Point-View fusion for 3D Shape Recognition

by Yaxin Zhao, et al.

3D shape recognition has attracted growing attention in 3D vision research, and the proliferation of 3D data has encouraged a variety of deep learning methods. Many existing deep learning models rely on point-cloud data or multi-view data alone. However, integrating the two modalities into a unified 3D shape descriptor can be expected to improve recognition accuracy. This paper therefore proposes a fusion network based on a multimodal attention mechanism for 3D shape recognition. Considering the limitations of multi-view data, we introduce a soft attention scheme that uses the global point-cloud features to filter the multi-view features and thereby fuses the two feature types effectively. More specifically, we obtain enhanced multi-view features by mining the contribution of each view image to the overall shape recognition, and then fuse the point-cloud features with the enhanced multi-view features to obtain a more discriminative 3D shape descriptor. Experiments on the ModelNet40 dataset verify the effectiveness of our method.
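The abstract does not give the network's equations, so the following is only a minimal sketch of the general idea: a global point-cloud feature scores each view, the scores become soft attention weights, and the re-weighted view features are pooled and concatenated with the point-cloud feature. The function name `attention_fuse`, the projection matrix `proj`, the scaled dot-product scoring, the residual re-weighting, and the max-pooling step are all assumptions made for illustration, not the authors' architecture.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(point_feat, view_feats, proj):
    """Fuse one global point-cloud feature with per-view features.

    point_feat: list of d_p floats (global point-cloud feature)
    view_feats: list of n_views lists, each with d_v floats
    proj:       d_v x d_p matrix (hypothetical learned projection)
    Returns (fused_descriptor, view_weights).
    """
    # Project the point-cloud feature into the view-feature space.
    query = [sum(w * p for w, p in zip(row, point_feat)) for row in proj]

    # Score each view by its scaled dot product with the projected query.
    scale = math.sqrt(len(query))
    scores = [sum(q * v for q, v in zip(query, view)) / scale
              for view in view_feats]
    weights = softmax(scores)

    # Residual re-weighting (an assumption): views are emphasized, never zeroed.
    enhanced = [[v * (1.0 + w) for v in view]
                for view, w in zip(view_feats, weights)]

    # Max-pool across views, then concatenate with the point-cloud feature.
    pooled = [max(column) for column in zip(*enhanced)]
    return point_feat + pooled, weights


# Toy inputs standing in for features from real point-cloud and view encoders.
random.seed(0)
d_p, d_v, n_views = 4, 3, 5
point_feat = [random.gauss(0, 1) for _ in range(d_p)]
view_feats = [[random.gauss(0, 1) for _ in range(d_v)] for _ in range(n_views)]
proj = [[random.gauss(0, 1) for _ in range(d_p)] for _ in range(d_v)]

descriptor, weights = attention_fuse(point_feat, view_feats, proj)
print(len(descriptor), [round(w, 3) for w in weights])
```

In this sketch the attention weights sum to one across views, so a view that agrees more strongly with the point-cloud feature contributes more to the pooled descriptor. A trained model would learn `proj` jointly with the two feature extractors rather than sampling it at random.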


