Modeling and Inferring Human Intents and Latent Functional Objects for Trajectory Prediction

by Dan Xie, et al.

This paper addresses detecting functional objects and inferring human intentions in surveillance videos of public spaces. People in the videos are expected to intentionally take shortest paths, subject to obstacles, toward functional objects where they can satisfy certain needs (e.g., a vending machine can quench thirst). They do so by following one of three possible intent behaviors: reaching a single functional object and stopping, sequentially visiting several functional objects, or initially moving toward one goal and then changing intent to move toward another. Since detecting functional objects in low-resolution surveillance videos is typically unreliable, we call them "dark matter", characterized by the functionality to attract people. We formulate Agent-based Lagrangian Mechanics, wherein human trajectories are probabilistically modeled as motions of agents in many layers of "dark-energy" fields; each agent can select a particular force field to affect its motions, and thus define the minimum-energy Dijkstra path toward the corresponding source "dark matter". For evaluation, we compiled and annotated a new dataset. The results demonstrate the effectiveness of our approach in predicting human intent behaviors and trajectories, localizing functional objects, and discovering distinct functional classes of objects by clustering human motion behavior in the vicinity of functional objects.
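To make the idea of a Dijkstra path toward a functional object under obstacles concrete, the following is a minimal sketch and not the paper's implementation. It assumes a small 4-connected grid in which `#` marks an obstacle and every step has a uniform cost of 1; the abstract does not specify the field-dependent energy, so the uniform cost, the grid layout, and the names `dijkstra_path` and `GRID` are illustrative assumptions.

```python
import heapq

# Hypothetical scene: '.' is walkable ground, '#' is an obstacle.
GRID = [
    "..#.",
    "..#.",
    "....",
]


def dijkstra_path(grid, start, goal):
    """Return a minimum-cost list of (row, col) cells from start to goal.

    Assumption: each step costs 1.0. A field-based model would replace this
    constant with a position-dependent energy term.
    Returns None if the goal cannot be reached.
    """
    rows, cols = len(grid), len(grid[0])
    dist = {start: 0.0}   # best known cost to each visited cell
    prev = {}             # back-pointers for path reconstruction
    heap = [(0.0, start)]
    while heap:
        d, cell = heapq.heappop(heap)
        if cell == goal:
            break
        if d > dist.get(cell, float("inf")):
            continue  # stale heap entry
        r, c = cell
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue
            if grid[nr][nc] == "#":
                continue  # obstacles cannot be entered
            nd = d + 1.0
            if nd < dist.get((nr, nc), float("inf")):
                dist[(nr, nc)] = nd
                prev[(nr, nc)] = cell
                heapq.heappush(heap, (nd, (nr, nc)))
    if goal not in dist:
        return None
    # Walk the back-pointers from the goal to the start.
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    path.reverse()
    return path


if __name__ == "__main__":
    # An agent at the top-left heads for a goal at the top-right,
    # which is only reachable by going around the obstacle column.
    print(dijkstra_path(GRID, (0, 0), (0, 3)))
```

In this toy scene the obstacle column forces the agent through the bottom row, so the returned path is longer than the straight-line distance. Swapping the constant step cost for a per-cell energy would be one way to let different fields produce different paths for different goals.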




