The State of Speech in HCI: Trends, Themes and Challenges

by   Leigh Clark, et al.

Speech interfaces are growing in popularity. Through a review of 68 research papers this work maps the trends, themes, findings and methods of empirical research on speech interfaces in HCI. We find that most studies are usability/theory-focused or explore wider system experiences, evaluating Wizard of Oz, prototypes, or developed systems by using self-report questionnaires to measure concepts like usability and user attitudes. A thematic analysis of the research found that speech HCI work focuses on nine key topics: system speech production, modality comparison, user speech production, assistive technology & accessibility, design insight, experiences with interactive voice response (IVR) systems, using speech technology for development, people's experiences with intelligent personal assistants (IPAs) and how user memory affects speech interface interaction. From these insights we identify gaps and challenges in speech research, notably the need to develop theories of speech interface interaction, grow critical mass in this domain, increase design work, and expand research from single to multiple user interaction contexts so as to reflect current use contexts. We also highlight the need to improve measure reliability, validity and consistency, in the wild deployment and reduce barriers to building fully functional speech interfaces for research.


page 1

page 2

page 3

page 4


The Partner Modelling Questionnaire: A validated self-report measure of perceptions toward machines as dialogue partners

Recent work has looked to understand user perceptions of speech agent ca...

Mapping Perceptions of Humanness in Speech-Based Intelligent Personal Assistant Interaction

Humanness is core to speech interface design. Yet little is known about ...

Towards Universal Interaction for Extended Reality

Extended Reality (XR) is a rapidly growing field offering unique immersi...

An Analysis of the Recent Visibility of the SigDial Conference

Automated speech and text interfaces are continuing to improve, resultin...

The Challenges of Studying Misinformation on Video-Sharing Platforms During Crises and Mass-Convergence Events

Mis- and disinformation can spread rapidly on video-sharing platforms (V...

What Do We See in Them? Identifying Dimensions of Partner Models for Speech Interfaces Using a Psycholexical Approach

Perceptions of system competence and communicative ability, termed partn...

Quantifying the Impact of Making and Breaking Interface Habits

The frequency with which people interact with technology means that user...

Please sign up or login with your details

Forgot password? Click here to reset