Video streaming usage has seen a significant rise as entertainment,
educ...
Adversarial training suffers from robust overfitting, a phenomenon where...
Recent work argues that robust training requires substantially larger
da...
Modern neural networks excel at image classification, yet they remain
vu...
Adversarial training suffers from robust overfitting, a phenomenon where...
Many real-world physical control systems are required to satisfy constra...
Deploying Reinforcement Learning (RL) agents to solve real-world applica...
Adversarial training and its variants have become de facto standards for...
Online recommender systems often face long delays in receiving feedback,...
Recent research has made the surprising finding that state-of-the-art de...
Adversarial testing methods based on Projected Gradient Descent (PGD) ar...
We consider the core reinforcement-learning problem of on-policy value
f...
We provide a framework for incorporating robustness -- to perturbations ...
Robust Markov Decision Processes (RMDPs) intend to ensure robustness wit...
Recent works have shown that it is possible to train models that are
ver...
Temporal-Difference learning (TD) [Sutton, 1988] with function approxima...
This paper addresses the problem of formally verifying desirable propert...
Being able to reason in an environment with a large number of discrete
a...
Off-policy learning in dynamic decision problems is essential for provid...