Asymptotic optimality of adaptive importance sampling

06/04/2018
by   Bernard Delyon, et al.

Adaptive importance sampling (AIS) uses past samples to update the sampling policy q_t at each stage t. Each stage t consists of two steps: (i) explore the space with n_t points drawn according to q_t, and (ii) exploit the information gathered so far to update the sampling policy. The fundamental question raised in this paper concerns the behavior of empirical sums based on AIS. Because no assumption is made on the allocation policy n_t, the theory places no restriction on how computational resources are split between the explore step (i) and the exploit step (ii). It is shown that AIS is asymptotically optimal: its asymptotic behavior is the same as that of an "oracle" strategy that knows the targeted sampling policy from the beginning. From a practical perspective, weighted AIS is introduced, a new method that allows poor samples from early stages to be forgotten.
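
The explore/exploit loop described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the Gaussian policy family, the weighted-moment update rule, the integrand `phi`, and all function names are assumptions chosen only for illustration. The estimate is the empirical sum of importance weights phi(x)/q_t(x), averaged over all samples from all stages.

```python
import math
import random


def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def phi(x):
    # Hypothetical integrand (an assumption, not from the paper).
    # Its exact integral over the real line is sqrt(2 * pi).
    return math.exp(-0.5 * (x - 3.0) ** 2)


def ais_estimate(allocation, mu=0.0, sigma=3.0, seed=0):
    """Estimate the integral of phi with a simple AIS loop.

    allocation: list of n_t, the number of points drawn at each stage.
    (mu, sigma): parameters of the initial Gaussian policy q_0 (assumed family).
    """
    rng = random.Random(seed)
    xs, ws = [], []
    for n_t in allocation:
        # (i) Explore: draw n_t points from the current policy q_t and
        # record the importance weight phi(x) / q_t(x) of each point.
        for _ in range(n_t):
            x = rng.gauss(mu, sigma)
            xs.append(x)
            ws.append(phi(x) / normal_pdf(x, mu, sigma))
        # (ii) Exploit: refit the Gaussian policy by weighted moments
        # computed over all samples gathered so far (assumed update rule).
        total = sum(ws)
        if total > 0.0:
            mu = sum(w * x for w, x in zip(ws, xs)) / total
            var = sum(w * (x - mu) ** 2 for w, x in zip(ws, xs)) / total
            sigma = max(math.sqrt(var), 0.1)  # floor keeps the policy from collapsing
    # Empirical sum over every stage, divided by the total sample count.
    return sum(ws) / len(ws), mu, sigma


if __name__ == "__main__":
    estimate, mu, sigma = ais_estimate([200] * 20)
    print(estimate, math.sqrt(2.0 * math.pi), mu, sigma)
```

In this sketch the allocation `[200] * 20` is arbitrary; any other split of the same budget across stages can be passed in. Every sample, including those from the poorly tuned first stage, receives equal weight in the final average, which is the situation the weighted AIS variant mentioned in the abstract is meant to improve.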


research
06/04/2018

Efficiency of adaptive importance sampling

The sampling policy of stage t, formally expressed as a probability dens...
research
10/02/2017

Oracle Importance Sampling for Stochastic Simulation Models

We consider the problem of estimating an expected outcome from a stochas...
research
03/20/2019

Adaptive importance sampling by kernel smoothing

A key determinant of the success of Monte Carlo simulation is the sampli...
research
10/16/2019

Conditional Importance Sampling for Off-Policy Learning

The principal contribution of this paper is a conceptual framework for o...
research
06/11/2019

Importance Resampling for Off-policy Prediction

Importance sampling (IS) is a common reweighting strategy for off-policy...
research
03/21/2022

Lean Evolutionary Reinforcement Learning by Multitasking with Importance Sampling

Studies have shown evolution strategies (ES) to be a promising approach ...
research
06/12/2020

A general framework for label-efficient online evaluation with asymptotic guarantees

Achieving statistically significant evaluation with passive sampling of ...
