Structured Monte Carlo Sampling for Nonisotropic Distributions via Determinantal Point Processes

05/29/2019
by   Krzysztof Choromanski, et al.
8

We propose a new class of structured methods for Monte Carlo (MC) sampling, called DPPMC, designed for high-dimensional nonisotropic distributions where samples are correlated to reduce the variance of the estimator via determinantal point processes. We successfully apply DPPMCs to problems involving nonisotropic distributions arising in guided evolution strategy (GES) methods for RL, CMA-ES techniques and trust region algorithms for blackbox optimization, improving state-of-the-art in all these settings. In particular, we show that DPPMCs drastically improve exploration profiles of the existing evolution strategy algorithms. We further confirm our results, analyzing random feature map estimators for Gaussian mixture kernels. We provide theoretical justification of our empirical results, showing a connection between DPPMCs and structured orthogonal MC methods for isotropic distributions.

READ FULL TEXT
research
05/29/2019

Variance Reduction for Evolution Strategies via Structured Control Variates

Evolution Strategies (ES) are a powerful class of blackbox optimization ...
research
12/26/2022

MC-Nonlocal-PINNs: handling nonlocal operators in PINNs via Monte Carlo sampling

We propose, Monte Carlo Nonlocal physics-informed neural networks (MC-No...
research
07/18/2021

Compressed Monte Carlo with application in particle filtering

Bayesian models have become very popular over the last years in several ...
research
06/07/2016

Reducing the error of Monte Carlo Algorithms by Learning Control Variates

Monte Carlo (MC) sampling algorithms are an extremely widely-used techni...
research
01/28/2021

Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application

In this paper, we present a Model-Based Reinforcement Learning algorithm...
research
05/12/2022

Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

The Plackett-Luce (PL) model is ubiquitous in learning-to-rank (LTR) bec...
research
02/19/2022

Graph Reparameterizations for Enabling 1000+ Monte Carlo Iterations in Bayesian Deep Neural Networks

Uncertainty estimation in deep models is essential in many real-world ap...

Please sign up or login with your details

Forgot password? Click here to reset