Nonsmooth optimal value and policy functions for mechanical systems subject to unilateral constraints

10/18/2017
by   Bora S. Banjanin, et al.
0

State-of-the-art approaches to optimal control of contact-rich robot dynamics use smooth approximations of value and policy functions and gradient-based algorithms for improving approximator parameters. Unfortunately, the dynamics of mechanical systems subject to unilateral constraints--i.e. robot locomotion and manipulation--are generally nonsmooth. We show that value and policy functions generally inherit regularity properties like (non)smoothness from the underlying system's dynamics, and demonstrate this effect in a simple mechanical system. We conclude with a discussion of implications for the use of gradient-based algorithms for optimal control of contact-rich robot dynamics.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro