Policy Learning for Active Target Tracking over Continuous SE(3) Trajectories

12/03/2022
by   Pengzhi Yang, et al.

This paper proposes a novel model-based policy gradient algorithm for tracking dynamic targets with a mobile robot equipped with an onboard sensor that has a limited field of view. The task is to obtain a continuous control policy that lets the robot collect sensor measurements which reduce uncertainty in the target states, measured by the entropy of the target distribution. We design a neural network control policy that takes as inputs the robot SE(3) pose and the mean vector and information matrix of the joint target distribution, and uses attention layers to handle variable numbers of targets. We also derive the gradient of the target entropy with respect to the network parameters explicitly, allowing efficient model-based policy gradient optimization.
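To make the entropy objective concrete, here is a minimal sketch of how a measurement taken inside the field of view lowers the entropy of a target distribution. It assumes the target distribution is Gaussian and uses a standard information-filter update with a linear measurement model; the abstract states neither of these, and the function names, `H`, `R`, and the `in_fov` flag are hypothetical illustrations, not the paper's formulation.

```python
import numpy as np


def gaussian_entropy_from_information(Y):
    """Differential entropy (in nats) of a Gaussian with information matrix Y.

    Assumption: the target distribution is Gaussian, so its entropy is
    0.5 * (n * log(2*pi*e) - log det Y), where Y is the inverse covariance.
    """
    n = Y.shape[0]
    sign, logdet = np.linalg.slogdet(Y)
    if sign <= 0:
        raise ValueError("information matrix must be positive definite")
    return 0.5 * (n * np.log(2.0 * np.pi * np.e) - logdet)


def information_update(Y, H, R, in_fov):
    """Hypothetical information-filter update for one measurement.

    A measurement adds H^T R^{-1} H to the information matrix, but only
    when the target lies inside the sensor's field of view.
    """
    if not in_fov:
        return Y
    return Y + H.T @ np.linalg.solve(R, H)


# Illustrative values only: a 2-D target state observed directly.
Y0 = np.eye(2)            # prior information matrix
H = np.eye(2)             # assumed linear measurement Jacobian
R = 0.5 * np.eye(2)       # assumed measurement noise covariance

Y_seen = information_update(Y0, H, R, in_fov=True)
Y_missed = information_update(Y0, H, R, in_fov=False)

h_prior = gaussian_entropy_from_information(Y0)
h_seen = gaussian_entropy_from_information(Y_seen)
h_missed = gaussian_entropy_from_information(Y_missed)
```

In this sketch the entropy drops only when the target is observed, which is why a policy that steers the sensor toward the targets reduces the objective. A differentiable version of the same quantity is what a model-based policy gradient would optimize.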

research
03/10/2021

Active Exploration and Mapping via Iterative Covariance Regulation over Continuous SE(3) Trajectories

This paper develops iterative Covariance Regulation (iCR), a novel metho...
research
05/27/2019

Policy Search by Target Distribution Learning for Continuous Control

We observe that several existing policy gradient methods (such as vanill...
research
03/05/2018

Learning Sample-Efficient Target Reaching for Mobile Robots

In this paper, we propose a novel architecture and a self-supervised pol...
research
04/12/2023

Neural Network Algorithm for Intercepting Targets Moving Along Known Trajectories by a Dubins' Car

The task of intercepting a target moving along a rectilinear or circular...
research
12/14/2019

Active Object Tracking using Context Estimation: Handling Occlusions and Detecting Missing Targets

When performing visual servoing or object tracking tasks, active sensor ...
research
08/29/2023

Robot Manipulation Task Learning by Leveraging SE(3) Group Invariance and Equivariance

This paper presents a differential geometric control approach that lever...
research
02/21/2022

Guided Visual Attention Model Based on Interactions Between Top-down and Bottom-up Information for Robot Pose Prediction

Learning to control a robot commonly requires mapping between robot stat...
