Multi-class latency-bounded Web services

7 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 231-239
https://doi.org/10.1109/iwqos.2000.847959

Abstract

Two recent advances have resulted in significant improvements in Web server quality of service. First, both centralized and distributed Web servers can provide isolation among service classes by fairly distributing system resources. Second, session admission control can protect classes from performance degradation due to overload. The goal of this work is to design a general "front-end" algorithm that uses these two building blocks to support a new Web service model, namely, multi-class services which control response latencies to within pre-specified targets. Our key technique is to devise a general service abstraction to adaptively control not only the latency of a particular class, but also to assess the inter-class relationships. In this way, we capture the extent to which classes are isolated or share system resources (as determined by the server architecture and system internals) and hence their effects on each other's QoS. For example, if the server provides class isolation (i.e., a minimum fraction of system resources independent of other classes), yet also allows a class to utilize unused resources from other classes, the algorithm infers and exploits this behavior without an explicit low level model of the server. Thus, as new functionalities are incorporated into Web servers, the approach naturally exploits their properties to efficiently satisfy the classes' performance targets. We validate the scheme with trace driven simulations.

Keywords

This publication has 9 references indexed in Scilit:

Effective envelopes: statistical bounds on multiplexed traffic in packet networks
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
A measurement-based admission-controlled Web server
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Cluster reserves
Published by Association for Computing Machinery (ACM) ,2000
Web server support for tiered services
IEEE Network, 1999
Admission control for statistical QoS: theory and practice
IEEE Network, 1999
Inter-class resource sharing using statistical service envelopes
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
Locality-aware request distribution in cluster-based network servers
Published by Association for Computing Machinery (ACM) ,1998
Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking, 1997
Quality of service guarantees in virtual circuit switched networks
IEEE Journal on Selected Areas in Communications, 1995

Cited by 30 articles