Sales Time Series Analytics Using Deep Q-Learning

by   Bohdan M. Pavlyshenko, et al.

The article describes the use of deep Q-learning models in the problems of sales time series analytics. In contrast to supervised machine learning which is a kind of passive learning using historical data, Q-learning is a kind of active learning with goal to maximize a reward by optimal sequence of actions. Model free Q-learning approach for optimal pricing strategies and supply-demand problems was considered in the work. The main idea of the study is to show that using deep Q-learning approach in time series analytics, the sequence of actions can be optimized by maximizing the reward function when the environment for learning agent interaction can be modeled using the parametric model and in the case of using the model which is based on the historical data. In the pricing optimizing case study environment was modeled using sales dependence on extras price and randomly simulated demand. In the pricing optimizing case study, the environment was modeled using sales dependence on extra price and randomly simulated demand. In the supply-demand case study, it was proposed to use historical demand time series for environment modeling, agent states were represented by promo actions, previous demand values and weekly seasonality features. Obtained results show that using deep Q-learning, we can optimize the decision making process for price optimization and supply-demand problems. Environment modeling using parametric models and historical data can be used for the cold start of learning agent. On the next steps, after the cold start, the trained agent can be used in real business environment.


Analytics of Business Time Series Using Machine Learning and Bayesian Inference

In the survey we consider the case studies on sales time series forecast...

Retail Demand Forecasting: A Comparative Study for Multivariate Time Series

Accurate demand forecasting in the retail industry is a critical determi...

Deep-Gap: A deep learning framework for forecasting crowdsourcing supply-demand gap based on imaging time series and residual learning

Mobile crowdsourcing has become easier thanks to the widespread of smart...

Leveraging Elastic Demand for Forecasting

Demand variance can result in a mismatch between planned supply and actu...

Measuring Market Performance with Stochastic Demand: Price of Anarchy and Price of Uncertainty

Globally operating suppliers face the rising challenge of wholesale pric...

A unified decision making framework for supply and demand management in microgrid networks

This paper considers two important problems - on the supply-side and dem...

Learn to Earn: Enabling Coordination within a Ride Hailing Fleet

The problem of optimizing social welfare objectives on multi sided ride ...

Please sign up or login with your details

Forgot password? Click here to reset