Audio-Visual Evaluation of Oratory Skills

09/30/2021
by   Tzvi Michelson, et al.
0

What makes a talk successful? Is it the content or the presentation? We try to estimate the contribution of the speaker's oratory skills to the talk's success, while ignoring the content of the talk. By oratory skills we refer to facial expressions, motions and gestures, as well as the vocal features. We use TED Talks as our dataset, and measure the success of each talk by its view count. Using this dataset we train a neural network to assess the oratory skills in a talk through three factors: body pose, facial expressions, and acoustic features. Most previous work on automatic evaluation of oratory skills uses hand-crafted expert annotations for both the quality of the talk and for the identification of predefined actions. Unlike prior art, we measure the quality to be equivalent to the view count of the talk as counted by TED, and allow the network to automatically learn the actions, expressions, and sounds that are relevant to the success of a talk. We find that oratory skills alone contribute substantially to the chances of a talk being successful.

READ FULL TEXT
research
05/06/2021

Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues

Public speaking and presentation competence plays an essential role in m...
research
10/29/2018

Audiovisual speaker conversion: jointly and simultaneously transforming facial expression and acoustic characteristics

An audiovisual speaker conversion method is presented for simultaneously...
research
09/11/2017

Automated Identification of Trampoline Skills Using Computer Vision Extracted Pose Estimation

A novel method to identify trampoline skills using a single video camera...
research
10/21/2020

Synthetic Expressions are Better Than Real for Learning to Detect Facial Actions

Critical obstacles in training classifiers to detect facial actions are ...
research
12/18/2017

IMU2Face: Real-time Gesture-driven Facial Reenactment

We present IMU2Face, a gesture-driven facial reenactment system. To this...
research
01/22/2020

VoiceCoach: Interactive Evidence-based Training for Voice Modulation Skills in Public Speaking

The modulation of voice properties, such as pitch, volume, and speed, is...
research
12/24/2019

Audio-based automatic mating success prediction of giant pandas

Giant pandas, stereotyped as silent animals, make significantly more voc...

Please sign up or login with your details

Forgot password? Click here to reset