research
          
      
      ∙
      11/12/2014
    On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence
We provide non-asymptotic bounds for the well-known temporal difference ...
          
            research
          
      
      ∙
      01/08/2014
    Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games
We consider the problem of finding stationary Nash equilibria (NE) in a ...
          
            research
          
      
      ∙
      06/11/2013
     
             
  
  
     
                             
                             
                             share
 share