Analysis and modeling of job arrivals in a production grid
- 1 March 2007
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review
- Vol. 34 (4), 59-70
- https://doi.org/10.1145/1243401.1243402
Abstract
In this paper we present an initial analysis of job arrivals in a production data-intensive Grid and investigate several traffic models for the interarrival time processes. Our analysis focuses on the heavy-tail behavior and autocorrelations, and the modeling is carried out at three different levels: Grid, Virtual Organization (VO) , and region . A set of m-state Markov modulated Poisson processes (MMPP) is investigated, while Poisson processes and hyperexponential renewal processes are evaluated for comparison studies. We apply the transportation distance metric from dynamical systems theory to further characterize the differences between the data trace and the simulated time series, and estimate errors by bootstrapping . The experimental results show that MMPPs with a certain number of states are successful to a certain extent in simulating the job traffic at different levels, fitting both the interarrival time distribution and the autocorrelation function. However, MMPPs are not able to match the autocorrelations for certain VOs, in which strong deterministic semi-periodic patterns are observed. These patterns are further characterized using different representations. Future work is needed to model both deterministic and stochastic components in order to better capture the correlation structure in the series.Keywords
This publication has 16 references indexed in Scilit:
- The origin of bursts and heavy tails in human dynamicsNature, 2005
- Detection of nonlinearity and chaoticity in time series using the transportation distance functionPhysics Letters A, 2002
- Bayesian Methods for Hidden Markov ModelsJournal of the American Statistical Association, 2002
- The elusive goal of workload characterizationACM SIGMETRICS Performance Evaluation Review, 1999
- The impact of job arrival patterns on parallel schedulingACM SIGMETRICS Performance Evaluation Review, 1999
- On the self-similar nature of Ethernet traffic (extended version)IEEE/ACM Transactions on Networking, 1994
- Parameter estimation for Markov modulated poisson processesCommunications in Statistics. Stochastic Models, 1994
- The Markov-modulated Poisson process (MMPP) cookbookPerformance Evaluation, 1993
- The Jackknife and the Bootstrap for General Stationary ObservationsThe Annals of Statistics, 1989
- A Markov Modulated Characterization of Packetized Voice and Data Traffic and Related Statistical Multiplexer PerformanceIEEE Journal on Selected Areas in Communications, 1986