Evaluating the Effectiveness of Autoscaling Mechanisms in Dynamic Cloud Workload Environments

Authors

  • Dimas Hidayatullah Cloud Performance Engineer, Indonesia Author

Keywords:

Cloud Computing, Autoscaling, Resource Management, Workload Prediction, Elasticity, Cloud Infrastructure

Abstract

Autoscaling is a critical feature in cloud computing that enables automatic resource provisioning in response to fluctuating workload demands. With the exponential growth in cloud-native applications, ensuring system responsiveness while minimizing resource waste has become an operational priority. This paper evaluates various autoscaling mechanisms, focusing on their adaptability, cost-effectiveness, and performance consistency under dynamic workloads. By comparing rule-based, reactive, and predictive approaches, this study identifies key trade-offs in latency, scalability, and provisioning efficiency. A mixed-method analysis is employed, using both simulated and real-world cloud workload traces to assess the efficacy of autoscaling strategies under volatile traffic conditions..

References

Chieu, Trieu C., et al. “Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment.” IEEE International Conference on e-Business Engineering, 2009, pp. 281–286.

Padala, Pradeep, et al. “Automated Control of Multiple Virtualized Resources.” Proceedings of the 4th ACM European Conference on Computer Systems, 2007, pp. 13–26.

Islam, Shariful, et al. “Empirical Prediction Models for Adaptive Resource Provisioning in the Cloud.” Future Generation Computer Systems, vol. 28, no. 1, 2012, pp. 155–162.

Mao, Hongzi, et al. “Resource Management with Deep Reinforcement Learning.” Proceedings of the 15th ACM Workshop on Hot Topics in Networks, 2016, pp. 50–56.

Kim, Hojin, et al. “Efficient Autoscaling of Cloud Resources through Predictive Control.” IEEE Transactions on Parallel and Distributed Systems, vol. 26, no. 1, 2015, pp. 198–207.

Lorido-Botran, Tania, Jose Miguel-Alonso, and Jose A. Lozano. “A Review of Autoscaling Techniques for Elastic Applications in Cloud Environments.” Journal of Grid Computing, vol. 12, no. 4, 2014, pp. 559–592.

Pratinav, A. (2025). Handling Long-Running Tasks in a Serverless Architecture. ISCSITR–International Journal of Cloud Computing (ISCSITR-IJCC), 6(5), 1–5. https://doi.org/10.63397/ISCSITR-IJCC_2025_06_05_001

Calheiros, Rodrigo N., et al. “CloudSim: A Toolkit for Modeling and Simulation of Cloud Computing Environments and Evaluation of Resource Provisioning Algorithms.” Software: Practice and Experience, vol. 41, no. 1, 2011, pp. 23–50.

Roy, Nilabja, Abhishek Dubey, and Aniruddha Gokhale. “Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting.” Proceedings of IEEE International Conference on Cloud Computing, 2011, pp. 500–507.

Herbst, Nikolas R., Samuel Kounev, and Ralf Reussner. “Elasticity in Cloud Computing: What It Is, and What It Is Not.” Proceedings of the 10th International Conference on Autonomic Computing, 2013, pp. 23–27.

Kalyvianaki, Evangelia, et al. “Self-Adaptive, Elastic Resource Provisioning for Virtualized Servers Using Kalman Filters.” Proceedings of the 6th ACM International Conference on Autonomic Computing, 2009, pp. 117–126.

Downloads

Published

2025-11-13