Self-Adaptive SLA-Driven Capacity Management for Internet Services
- 1 January 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 6 (15421201) , 557-568
- https://doi.org/10.1109/noms.2006.1687584
Abstract
This work considers the problem of hosting multiple third-party Internet services in a cost-effective manner so as to maximize a provider's business objective. For this purpose, we present a dynamic capacity management framework based on an optimization model, which links a cost model based on SLA contracts with an analytical queuing-based performance model, in an attempt to adapt the platform to changing capacity needs in real time. In addition, we propose a two-level SLA specification for different operation modes, namely, normal and surge, which allows for per-use service accounting with respect to requirements of throughput and tail distribution response time. The cost model proposed is based on penalties, incurred by the provider due to SLA violation, and rewards, received when the service level expectations are exceeded. Finally, we evaluate approximations for predicting the performance of the hosted services under two different scheduling disciplines, namely FCFS and processor sharing. Through simulation, we assess the effectiveness of the proposed approach as well as the level of accuracy resulting from the performance model approximationsKeywords
This publication has 15 references indexed in Scilit:
- Resource Allocation for Autonomic Data Centers using Analytic Performance ModelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Adaptive entitlement control of resource containers on shared serversPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Provisioning servers in the application tier for E-commerce systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- The utility business model and the future of computing servicesIBM Systems Journal, 2004
- Utility computing SLA management based upon business objectivesIBM Systems Journal, 2004
- Using MIMO feedback control to enforce policies for interrelated metrics with application to the Apache Web serverPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- On maximizing service-level-agreement profitsPublished by Association for Computing Machinery (ACM) ,2001
- An SQP method for general nonlinear programs using only equality constrained subproblemsMathematical Programming, 1998
- Wide area traffic: the failure of Poisson modelingIEEE/ACM Transactions on Networking, 1995
- Processor-sharing queues: Some progress in analysisQueueing Systems, 1987