Analysis and modeling of job arrivals in a production grid

Abstract
In this paper we present an initial analysis of job arrivals in a production data-intensive Grid and investigate several traffic models for the interarrival time processes. Our analysis focuses on the heavy-tail behavior and autocorrelations, and the modeling is carried out at three different levels: Grid, Virtual Organization (VO) , and region . A set of m-state Markov modulated Poisson processes (MMPP) is investigated, while Poisson processes and hyperexponential renewal processes are evaluated for comparison studies. We apply the transportation distance metric from dynamical systems theory to further characterize the differences between the data trace and the simulated time series, and estimate errors by bootstrapping . The experimental results show that MMPPs with a certain number of states are successful to a certain extent in simulating the job traffic at different levels, fitting both the interarrival time distribution and the autocorrelation function. However, MMPPs are not able to match the autocorrelations for certain VOs, in which strong deterministic semi-periodic patterns are observed. These patterns are further characterized using different representations. Future work is needed to model both deterministic and stochastic components in order to better capture the correlation structure in the series.

This publication has 16 references indexed in Scilit: