Integrated resource management for cluster-based Internet services
- 31 December 2002
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGOPS Operating Systems Review
- Vol. 36 (SI), 225-238
- https://doi.org/10.1145/844128.844150
Abstract
Client request rates for Internet services tend to be bursty and thus it is important to maintain efficient resource utilization under a wide range of load conditions. Network service clients typically seek services interactively and maintaining reasonable response time is often imperative for such services. In addition, providing differentiated service qualities and resource allocation to multiple service classes can also be desirable at times. This paper presents an integrated resource management framework (part of Neptune system ) that provides flexible service quality specification, efficient resource utilization, and service differentiation for cluster-based services. This framework introduces the metric of quality-aware service yield to combine the overall system efficiency and individual service response time in one flexible model. Resources are managed through a two-level request distribution and scheduling scheme. At the cluster level, a fully decentralized request distribution architecture is employed to achieve high scalability and availability. Inside each service node, an adaptive scheduling policy maintains efficient resource utilization under a wide range of load conditions. Our trace-driven evaluations demonstrate the performance, scalability, and service differentiation achieved by the proposed techniques.Keywords
This publication has 19 references indexed in Scilit:
- On maximizing service-level-agreement profitsPublished by Association for Computing Machinery (ACM) ,2001
- Web server support for tiered servicesIEEE Network, 1999
- Locality-aware request distribution in cluster-based network serversPublished by Association for Computing Machinery (ACM) ,1998
- Self-similarity in World Wide Web traffic: evidence and possible causesIEEE/ACM Transactions on Networking, 1997
- CPU reservations and time constraintsPublished by Association for Computing Machinery (ACM) ,1997
- Cluster-based scalable network servicesPublished by Association for Computing Machinery (ACM) ,1997
- Exploiting process lifetime distributions for dynamic load balancingACM Transactions on Computer Systems, 1997
- Lazy receiver processing (LRP)Published by Association for Computing Machinery (ACM) ,1996
- Open issues and challenges in providing quality of service guarantees in high-speed networksACM SIGCOMM Computer Communication Review, 1993
- Reducibility among Combinatorial ProblemsPublished by Springer Nature ,1972