Online Residential Demand Response via Contextual Multi-Armed Bandits

03/07/2020
by   Xin Chen, et al.
8

Residential load demands have huge potential to be exploited to enhance the efficiency and reliability of power system operation through demand response (DR) programs. This paper studies the strategies to select the right customers for residential DR from the perspective of load service entities (LSEs). One of the main challenges to implement residential DR is that customer responses to the incentives are uncertain and unknown, which are influenced by various personal and environmental factors. To address this challenge, this paper employs the contextual multi-armed bandit (CMAB) method to model the optimal customer selection problem with uncertainty. Based on Thompson sampling framework, an online learning and decision-making algorithm is proposed to learn customer behaviors and select appropriate customers for load reduction. This algorithm takes the contextual information into consideration and is applicable to complicated DR settings. Numerical simulations are performed to demonstrate the efficiency and learning effectiveness of the proposed algorithm.

READ FULL TEXT
research
10/11/2020

Online Learning and Distributed Control for Residential Demand Response

This paper studies the automated control method for regulating air condi...
research
10/26/2018

A Data-Driven Approach for Estimating Customer Contribution to System Peak Demand

The increasing penetration of smart meters (SMs) provides an opportunity...
research
10/02/2018

Contextual Multi-Armed Bandits for Causal Marketing

This work explores the idea of a causal contextual multi-armed bandit ap...
research
07/10/2019

Productization Challenges of Contextual Multi-Armed Bandits

Contextual Multi-Armed Bandits is a well-known and accepted online optim...
research
09/23/2020

Demand Responsive Dynamic Pricing Framework for Prosumer Dominated Microgrids using Multiagent Reinforcement Learning

Demand Response (DR) has a widely recognized potential for improving gri...
research
02/24/2023

A Novel Demand Response Model and Method for Peak Reduction in Smart Grids – PowerTAC

One of the widely used peak reduction methods in smart grids is demand r...
research
09/08/2022

A Nonparametric Contextual Bandit with Arm-level Eligibility Control for Customer Service Routing

Amazon Customer Service provides real-time support for millions of custo...

Please sign up or login with your details

Forgot password? Click here to reset