AHPA: Adaptive Horizontal Pod Autoscaling Systems on Alibaba Cloud Container Service for Kubernetes

03/07/2023
by Zhiqiang Zhou, et al.

The existing resource allocation policy for application instances in Kubernetes cannot adjust dynamically to business requirements, which causes an enormous waste of resources when load fluctuates. Moreover, the emergence of new cloud services places higher demands on resource management. This paper discusses horizontal pod resource management in Alibaba Cloud Container Service with a newly deployed AI algorithm framework named AHPA, the adaptive horizontal pod auto-scaling system. Based on a robust decomposition forecasting algorithm and a performance training model, AHPA offers an optimal pod number adjustment plan that can reduce pod resources while maintaining business stability. Since being deployed in April 2021, the system has expanded to multiple customer scenarios, including logistics, social networks, AI audio and video, and e-commerce. Compared with previous algorithms, AHPA solves the elastic lag problem, increasing CPU usage by 10% and reducing resource cost by more than 20%. In addition, AHPA can automatically perform flexible planning according to the predicted business volume without manual intervention, significantly saving operation and maintenance costs.
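
To make the "elastic lag" contrast concrete, the following minimal sketch compares a reactive replica rule with a forecast-driven one. The reactive function follows the replica formula documented for the standard Kubernetes Horizontal Pod Autoscaler. The proactive function is only an illustrative assumption of how a predicted load curve could be turned into a pod count; it is not the AHPA algorithm, and the names `proactive_replicas`, `forecast_load`, and `per_pod_capacity` are hypothetical.

```python
import math


def reactive_replicas(current_replicas: int, current_util_pct: int, target_util_pct: int) -> int:
    """Standard HPA rule: ceil(currentReplicas * currentMetric / targetMetric).

    Utilization is passed as integer percentages to keep the arithmetic exact.
    The rule reacts only after the metric has already moved.
    """
    return max(1, math.ceil(current_replicas * current_util_pct / target_util_pct))


def proactive_replicas(forecast_load, per_pod_capacity, min_replicas=1, max_replicas=100):
    """Hypothetical predictive rule (an assumption, not AHPA itself).

    Sizes the deployment for the peak of the forecast window so that pods
    are ready before the load arrives.
    """
    peak = max(forecast_load)
    needed = math.ceil(peak / per_pod_capacity)
    # Clamp to the configured replica bounds.
    return min(max_replicas, max(min_replicas, needed))


if __name__ == "__main__":
    # Reactive: 4 pods at 80% utilization with a 50% target.
    print(reactive_replicas(4, 80, 50))
    # Proactive: forecast requests/s for the next window, 50 requests/s per pod.
    print(proactive_replicas([120, 180, 260, 210], per_pod_capacity=50))
```

In this sketch the forecast itself is taken as given; in the paper's setting it would come from the decomposition forecasting step, and the per-pod capacity from the performance model.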

research
05/29/2020

AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments

Serverless computing has emerged as a compelling new paradigm of cloud c...
research
05/18/2018

ALVEC: Auto-scaling by Lotka Volterra Elastic Cloud: A QoS aware Non Linear Dynamical Allocation Model

Elasticity in resource allocation is still a relevant problem in cloud c...
research
08/07/2018

Performance-Aware Management of Cloud Resources: A Taxonomy and Future Directions

Dynamic nature of the cloud environment has made distributed resource ma...
research
06/30/2018

Harnessing constrained resources in service industry via video analytics

Service industries contribute significantly to many developed and develo...
research
11/06/2019

Developing a Process in Architecting Microservice Infrastructure with Docker, Kubernetes, and Istio

As an application usage grows, its owner scales up vertically by replaci...
research
02/19/2018

Dynamic Cloud Network Control under Reconfiguration Delay and Cost

Network virtualization and programmability allow operators to deploy a w...
research
02/21/2018

Analyzing Real-Time Multimedia Content From Network Cameras: Using CPUs and GPUs in the Cloud

Millions of network cameras are streaming real-time multimedia content (...