VisTaNet: Attention Guided Deep Fusion for Surface Roughness Classification

09/18/2022
by Prasanna Kumar Routray, et al.

Human texture perception is a weighted average of multi-sensory inputs: visual and tactile. While the visual sensing mechanism extracts global features, the tactile mechanism complements it by extracting local features. The lack of coupled visuotactile datasets in the literature makes it difficult to study multimodal fusion strategies analogous to human texture perception. This paper presents a visual dataset that augments an existing tactile dataset. We propose a novel deep fusion architecture that fuses visual and tactile data using four types of fusion strategies: summation, concatenation, max-pooling, and attention. Our model shows a significant performance improvement (97.22% surface roughness classification accuracy) over tactile-only (SVM, 92.60%) and visual-only (FENet-50, 85.01%) approaches. Among the fusion techniques, the attention-guided architecture results in better classification accuracy. Our study shows that, analogous to human texture perception, the proposed model chooses a weighted combination of the two modalities (visual and tactile), resulting in higher surface roughness classification accuracy; it maximizes the weight of the tactile modality where the visual modality fails, and vice versa.
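The four fusion strategies named in the abstract can be illustrated on plain feature vectors. The following is a minimal sketch, not the paper's architecture: the abstract does not describe the layers, feature dimensions, or attention formulation, so the function names, the dot-product scoring, and the `score_weights` vector are all assumptions introduced here for illustration.

```python
import math
from typing import List, Sequence, Tuple


def _check_same_length(visual: Sequence[float], tactile: Sequence[float]) -> None:
    # Element-wise fusion only makes sense for equally sized feature vectors.
    if len(visual) != len(tactile):
        raise ValueError("visual and tactile features must have the same length")


def fuse_sum(visual: Sequence[float], tactile: Sequence[float]) -> List[float]:
    """Summation fusion: add the two feature vectors element-wise."""
    _check_same_length(visual, tactile)
    return [v + t for v, t in zip(visual, tactile)]


def fuse_concat(visual: Sequence[float], tactile: Sequence[float]) -> List[float]:
    """Concatenation fusion: stack the features end to end."""
    return list(visual) + list(tactile)


def fuse_max(visual: Sequence[float], tactile: Sequence[float]) -> List[float]:
    """Max-pooling fusion: keep the larger activation at each position."""
    _check_same_length(visual, tactile)
    return [max(v, t) for v, t in zip(visual, tactile)]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a short list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_attention(
    visual: Sequence[float],
    tactile: Sequence[float],
    score_weights: Sequence[float],
) -> Tuple[List[float], Tuple[float, float]]:
    """Attention fusion (assumed form): score each modality with a dot product
    against a hypothetical learned vector, normalise the two scores with a
    softmax, and return the weighted combination plus the modality weights."""
    _check_same_length(visual, tactile)
    _check_same_length(visual, score_weights)
    score_v = sum(w * v for w, v in zip(score_weights, visual))
    score_t = sum(w * t for w, t in zip(score_weights, tactile))
    weight_v, weight_t = softmax([score_v, score_t])
    fused = [weight_v * v + weight_t * t for v, t in zip(visual, tactile)]
    return fused, (weight_v, weight_t)
```

In this sketch the two attention weights always sum to one, so the fused feature is a convex combination of the modalities. That mirrors the abstract's description of a weighted combination that can lean toward the tactile modality when the visual one is uninformative, and vice versa; in a trained model the scoring parameters would be learned rather than supplied by hand.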


