On the Linguistic and Computational Requirements for Creating Face-to-Face Multimodal Human-Machine Interaction

11/24/2022
by João Ranhel, et al.

In this study, we analyze the linguistic, organizational, and structural features of conversations between humans and avatars, focusing on what is necessary for creating face-to-face multimodal interfaces for machines. We video-recorded thirty-four human-avatar interactions, performed complete linguistic microanalysis on video excerpts, and marked all occurrences of multimodal actions and events. Applying statistical inference to the data allowed us to determine not only how often multimodal actions occur but also how multimodal events are distributed between the speaker (emitter) and the listener (recipient). We also observed the distribution of multimodal occurrences for each modality. The data show evidence that double-loop feedback is established during a face-to-face conversation. This led us to propose that knowledge from Conversation Analysis (CA), cognitive science, and Theory of Mind (ToM), among others, should be incorporated into the body of knowledge used for describing human-machine multimodal interactions. Face-to-face interfaces require a control layer in addition to the multimodal fusion layer. This layer has to organize the flow of conversation, integrate the social context into the interaction, and plan 'what' to do and 'how' to do it as the interaction progresses. This higher level is best understood if we incorporate insights from CA and ToM into the interface system.
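
To make the idea of a control layer above multimodal fusion more concrete, the following is a minimal Python sketch. It is not the authors' implementation: the class names (`FusedEvent`, `Plan`, `ControlLayer`), the signal labels (`"frown"`, `"nod"`, `"turn_yield"`), and the two social-context values are all hypothetical, chosen only to illustrate how such a layer might manage conversational flow, take the social context into account, and decide 'what' to do and 'how' to do it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FusedEvent:
    """Hypothetical output of a multimodal fusion layer."""
    human_role: str   # "speaker" or "listener" (role of the human right now)
    modality: str     # e.g. "speech", "face", "gesture"
    signal: str       # fused interpretation, e.g. "nod", "frown", "turn_yield"


@dataclass(frozen=True)
class Plan:
    """What the machine should do next, and in which manner."""
    what: str
    how: str


class ControlLayer:
    """Toy control layer sitting above fusion (illustrative assumption only)."""

    def __init__(self, social_context: str = "informal") -> None:
        self.social_context = social_context

    def plan(self, event: FusedEvent) -> Plan:
        # 'How': adapt the manner of the response to the social context.
        how = "casual" if self.social_context == "informal" else "polite"

        if event.human_role == "listener":
            # The machine is speaking; the human's feedback closes one loop.
            if event.signal in {"frown", "head_tilt"}:
                return Plan("rephrase", how)
            return Plan("continue", how)

        # The human is speaking; the machine either takes the turn
        # or gives listener feedback, closing the second loop.
        if event.signal == "turn_yield":
            return Plan("take_turn", how)
        return Plan("backchannel", how)


if __name__ == "__main__":
    layer = ControlLayer(social_context="formal")
    print(layer.plan(FusedEvent("listener", "face", "frown")))
    print(layer.plan(FusedEvent("speaker", "speech", "turn_yield")))
```

In this sketch the two branches of `plan` stand in for the two feedback loops: one in which the machine, as speaker, reacts to the listener's signals, and one in which the machine, as listener, responds to the speaker. A real system would need far richer state than a single event and a context label.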


research
01/29/2019

Guidelines for creating man-machine multimodal interfaces

Understanding details of human multimodal interaction can elucidate many...
research
07/29/2022

Face-to-Face Contrastive Learning for Social Intelligence Question-Answering

Creating artificial social intelligence - algorithms that can understand...
research
01/13/2020

Detecting depression in dyadic conversations with multimodal narratives and visualizations

Conversations contain a wide spectrum of multimodal information that giv...
research
03/15/2022

DialogueNeRF: Towards Realistic Avatar Face-to-face Conversation Video Generation

Conversation is an essential component of virtual avatar activities in t...
research
06/21/2023

Visual-Aware Text-to-Speech

Dynamically synthesizing talking speech that actively responds to a list...
research
08/29/2023

Sequential annotations for naturally-occurring HRI: first insights

We explain the methodology we developed for improving the interactions a...
research
10/29/2019

Smartphone and the changing practices of face-to-face interaction

Smartphone use has grown rapidly, but the ways it shapes concurrent face...
