Learning to Solve Multiple-TSP with Time Window and Rejections via Deep Reinforcement Learning

09/13/2022
by   Rongkai Zhang, et al.
0

We propose a manager-worker framework based on deep reinforcement learning to tackle a hard yet nontrivial variant of Travelling Salesman Problem (TSP),  multiple-vehicle TSP with time window and rejections (mTSPTWR), where customers who cannot be served before the deadline are subject to rejections. Particularly, in the proposed framework, a manager agent learns to divide mTSPTWR into sub-routing tasks by assigning customers to each vehicle via a Graph Isomorphism Network (GIN) based policy network. A worker agent learns to solve sub-routing tasks by minimizing the cost in terms of both tour length and rejection rate for each vehicle, the maximum of which is then fed back to the manager agent to learn better assignments. Experimental results demonstrate that the proposed framework outperforms strong baselines in terms of higher solution quality and shorter computation time. More importantly, the trained agents also achieve competitive performance for solving unseen larger instances.

READ FULL TEXT

page 1

page 4

page 7

page 12

research
02/13/2020

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Multi-vehicle routing problem with soft time windows (MVRPSTW) is an ind...
research
02/12/2018

Deep Reinforcement Learning for Solving the Vehicle Routing Problem

We present an end-to-end framework for solving Vehicle Routing Problem (...
research
09/27/2018

Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation

We propose a method to efficiently learn diverse strategies in reinforce...
research
11/24/2019

Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning

Customer services are critical to all companies, as they may directly co...
research
06/05/2021

Reinforcement Learning for Assignment Problem with Time Constraints

We present an end-to-end framework for the Assignment Problem with multi...
research
10/06/2021

Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem

Existing deep reinforcement learning (DRL) based methods for solving the...
research
08/23/2020

DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets

Inverse Distance Weighted models (IDW) have been widely used for predict...

Please sign up or login with your details

Forgot password? Click here to reset