Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints

06/09/2021
by   Shuo Li, et al.

Decision makers who manage civil assets and systems usually act under constraints imposed by societal regulations. Some of these constraints concern epistemic quantities, such as the probability of failure events and the corresponding risks. Sensors and inspectors can provide useful information to support the control process (e.g., the maintenance process of an asset), and decisions about collecting this information should rely on an analysis of its cost and value. When societal regulations encode an economic perspective that is not aligned with that of the decision makers, the Value of Information (VoI) can be negative (i.e., information sometimes hurts), and almost irrelevant information can have a significant value (either positive or negative) for agents acting under these epistemic constraints. We refer to these phenomena as Information Avoidance (IA) and Information OverValuation (IOV). In this paper, we illustrate how to assess VoI in sequential decision making under epistemic constraints (such as those imposed by societal regulations) by modeling the problem as a Partially Observable Markov Decision Process (POMDP) and evaluating non-optimal policies via Finite State Controllers (FSCs). We focus on the value of collecting information at the current time and on the value of collecting sequential information. We illustrate how these values are related, and we discuss how IA and IOV can occur in those settings.
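
To make the idea of negative VoI concrete, the following is a minimal single-step sketch in Python, not the POMDP/FSC method of the paper. Everything in it is an illustrative assumption: a two-state asset (intact or damaged), two actions (wait or repair), a binary sensor, the hypothetical names `expected_loss` and `value_of_information`, and all numerical values. The regulation is modeled as a rule that forces repair whenever the probability of damage exceeds `p_max`, a threshold stricter than the one the agent would choose on its own.

```python
def expected_loss(p_damage, c_repair, c_failure, p_max=None):
    """Expected loss of the best permitted action at belief p_damage.

    If p_max is given (assumed regulation), repair is mandatory whenever
    p_damage exceeds it; otherwise the agent picks the cheaper action.
    """
    loss_wait = p_damage * c_failure   # risk of failure if nothing is done
    loss_repair = c_repair             # repair removes the risk at a fixed cost
    if p_max is not None and p_damage > p_max:
        return loss_repair
    return min(loss_wait, loss_repair)


def value_of_information(prior, p_alarm_damaged, p_alarm_intact,
                         c_repair, c_failure, p_max=None):
    """VoI of a free binary sensor: prior loss minus expected posterior loss."""
    loss_without = expected_loss(prior, c_repair, c_failure, p_max)
    loss_with = 0.0
    # Loop over the two outcomes: alarm, then no alarm.
    outcomes = ((p_alarm_damaged, p_alarm_intact),
                (1.0 - p_alarm_damaged, 1.0 - p_alarm_intact))
    for like_damaged, like_intact in outcomes:
        p_obs = prior * like_damaged + (1.0 - prior) * like_intact
        if p_obs == 0.0:
            continue
        posterior = prior * like_damaged / p_obs   # Bayes' rule
        loss_with += p_obs * expected_loss(posterior, c_repair, c_failure, p_max)
    return loss_without - loss_with


# Assumed numbers, chosen only for illustration.
params = dict(prior=0.05, p_alarm_damaged=0.8, p_alarm_intact=0.2,
              c_repair=1.0, c_failure=5.0)
voi_unconstrained = value_of_information(**params)
voi_constrained = value_of_information(**params, p_max=0.1)
print(voi_unconstrained, voi_constrained)
```

With these assumed numbers, the unconstrained VoI is approximately zero, while the constrained VoI is approximately -0.03. An alarm raises the probability of damage to about 0.17, which is above the regulatory limit of 0.1 but below the agent's own break-even point of 0.2, so the agent is forced into a repair it considers uneconomical and would rather not observe the sensor at all.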

research
03/07/2023

Exploration via Epistemic Value Estimation

How to efficiently explore in reinforcement learning is an open problem....
research
12/05/2021

Differentiating Approach and Avoidance from Traditional Notions of Sentiment in Economic Contexts

There is growing interest in the role of sentiment in economic decision-...
research
03/12/2021

Optimal sequential decision making with probabilistic digital twins

Digital twins are emerging in many industries, typically consisting of s...
research
07/16/2022

ChronosPerseus: Randomized Point-based Value Iteration with Importance Sampling for POSMDPs

In reinforcement learning, agents have successfully used environments mo...
research
02/26/2019

Information Gathering in Decentralized POMDPs by Policy Graph Improvement

Decentralized policies for information gathering are required when multi...
research
08/19/2019

Evaluating Hierarchies through A Partially Observable Markov Decision Processes Methodology

Hierarchical clustering has been shown to be valuable in many scenarios,...
research
07/05/2021

Neyman-Pearson Hypothesis Testing, Epistemic Reliability and Pragmatic Value-Laden Asymmetric Error Risks

Neyman and Pearson's theory of testing hypotheses does not warrant minim...
