Cache Allocation in Multi-Tenant Edge Computing via online Reinforcement Learning

by Ayoub Ben-Ameur, et al.

We consider Edge Computing (EC) in a multi-tenant environment: the resource owner, i.e., the Network Operator (NO), virtualizes the resources and lets third-party Service Providers (SPs, the tenants) run their services, which can be diverse and have heterogeneous requirements. Due to confidentiality guarantees, the NO cannot observe the nature of the SPs' traffic, which is encrypted. This makes resource allocation decisions challenging, since they must be taken based solely on observed monitoring information. We focus on one specific resource, cache space, deployed in an edge node such as a base station. We study how the NO should partition the cache among several SPs in order to minimize upstream traffic. Our goal is to optimize cache allocation using purely data-driven, model-free Reinforcement Learning (RL). Unlike most applications of RL, in which the decision policy is learned offline on a simulator, we assume that no prior knowledge is available to build such a simulator. We thus apply RL in an online fashion, i.e., the policy is learned by directly perturbing the actual system and monitoring how its performance changes. Since perturbations generate spurious traffic, we also limit them. We show in simulation that our method rapidly converges toward the theoretical optimum. We also study its fairness and its sensitivity to several scenario characteristics, and compare it with a state-of-the-art method.
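To make the idea of online, perturbation-based learning concrete, the following is a minimal sketch and not the authors' algorithm, which the abstract does not specify. It assumes tabular Q-learning over two SPs, an idealized cache that always holds each SP's most popular objects, and Zipf-distributed requests. All names and parameters (`zipf_hit_ratio`, `observe_misses`, `learn_allocation`, `chunk`, the request rates) are hypothetical. The learner only sees the total number of misses per time slot, and each action moves at most one `chunk` of cache between SPs, which is one simple way to bound the perturbation.

```python
import random


def zipf_hit_ratio(slots, catalog=200, alpha=0.8):
    """Hit ratio of an idealized cache holding the `slots` most popular objects
    of a Zipf(alpha) catalog (a simulation assumption, hidden from the learner)."""
    weights = [1.0 / (rank ** alpha) for rank in range(1, catalog + 1)]
    return sum(weights[:slots]) / sum(weights)


def observe_misses(alloc, rates, rng):
    """Simulated monitoring: total misses (upstream requests) in one time slot."""
    misses = 0
    for slots, rate in zip(alloc, rates):
        p_miss = 1.0 - zipf_hit_ratio(slots)
        misses += sum(rng.random() < p_miss for _ in range(rate))
    return misses


def learn_allocation(capacity=100, chunk=10, rates=(300, 100), steps=2000,
                     eps=0.1, lr=0.1, gamma=0.9, seed=0):
    """Online tabular Q-learning over cache partitions between two SPs.

    State s means SP 0 holds s * chunk slots and SP 1 holds the rest.
    Actions move one chunk toward SP 1 (-1), keep the split (0),
    or move one chunk toward SP 0 (+1).
    """
    rng = random.Random(seed)
    n_states = capacity // chunk + 1
    actions = (-1, 0, 1)
    q = [[0.0] * len(actions) for _ in range(n_states)]
    s = n_states // 2  # start from an even split
    for _ in range(steps):
        # Epsilon-greedy choice: occasionally perturb the running system.
        if rng.random() < eps:
            a = rng.randrange(len(actions))
        else:
            a = max(range(len(actions)), key=lambda k: q[s][k])
        s_next = min(max(s + actions[a], 0), n_states - 1)
        alloc = (s_next * chunk, capacity - s_next * chunk)
        # Reward is the negative upstream traffic observed after the change.
        reward = -observe_misses(alloc, rates, rng)
        q[s][a] += lr * (reward + gamma * max(q[s_next]) - q[s][a])
        s = s_next
    return (s * chunk, capacity - s * chunk), q


if __name__ == "__main__":
    final_alloc, _ = learn_allocation()
    print("final allocation (SP 0, SP 1):", final_alloc)
```

In this sketch a smaller `chunk` means gentler perturbations but slower movement through the allocation space; the values used here are illustrative only.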


Multiple Resource Allocation in Multi-Tenant Edge Computing via Sub-modular Optimization

Edge Computing (EC) allows users to access computing resources at the ne...

Scaling Serverless Functions in Edge Networks: A Reinforcement Learning Approach

With rapid advances in containerization techniques, the serverless compu...

Reinforcement Learning Based Resource Allocation for Network Slices in O-RAN Midhaul

Network slicing envisions the 5th generation (5G) mobile network resourc...

Model-Based Reinforcement Learning Framework of Online Network Resource Allocation

Online Network Resource Allocation (ONRA) for service provisioning is a ...

Optimizing Replacement Policies for Content Delivery Network Caching: Beyond Belady to Attain A Seemingly Unattainable Byte Miss Ratio

When facing objects/files of differing sizes in content delivery network...

Learning from Peers: Transfer Reinforcement Learning for Joint Radio and Cache Resource Allocation in 5G Network Slicing

Radio access network (RAN) slicing is an important part of network slici...

Learning-to-Dispatch: Reinforcement Learning Based Flight Planning under Emergency

The effectiveness of resource allocation under emergencies especially hu...
