Self-similarity in file systems
- 1 June 1998
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review
- Vol. 26 (1), 141-150
- https://doi.org/10.1145/277858.277894
Abstract
We demonstrate that high-level file system events exhibit self-similar behaviour, but only over short time scales of under approximately a day. We do so through the analysis of four sets of traces that span time scales of milliseconds through months, and that differ in the trace collection method, the file systems being traced, and the dates on which tracing took place. Two sets of detailed, short-term file system trace data are analyzed; both are shown to exhibit self-similar-like behaviour, with consistent Hurst parameters (a measure of self-similarity) for all file system traffic as well as for individual classes of file system events. Long-term file system trace data is then analyzed, and we discover that the traces' high variability and self-similar behaviour do not persist across time scales of days, weeks, and months. Using the short-term trace data, we show that sources of file system traffic exhibit ON/OFF source behaviour, which is characterized by bursts of activity of highly variable length, followed by periods of inactivity of similarly variable length. This ON/OFF behaviour is used to motivate a simple technique for synthesizing a stream of events that exhibits the same self-similar short-term behaviour as was observed in the file system traces.
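The ON/OFF idea in the abstract can be illustrated with a minimal sketch. It is not the paper's own synthesis technique, whose details the abstract does not give. The sketch assumes that each source alternates between ON and OFF periods whose lengths are drawn from a Pareto distribution, that the per-slot event count is the number of sources currently ON, and that the Hurst parameter is estimated with the aggregated-variance method. All function names, parameter values, and the choice of distribution are assumptions made for illustration.

```python
import math
import random


def on_off_source(rng, n_slots, alpha=1.4):
    """Return a 0/1 list: 1 while the source is in an ON burst, 0 while OFF.

    Period lengths are heavy-tailed (Pareto with shape alpha), an assumed
    stand-in for the "highly variable" lengths described in the abstract.
    """
    slots = []
    active = rng.random() < 0.5  # random initial state
    while len(slots) < n_slots:
        length = max(1, int(round(rng.paretovariate(alpha))))
        slots.extend([1 if active else 0] * length)
        active = not active
    return slots[:n_slots]


def synthesize_events(n_sources=50, n_slots=16384, alpha=1.4, seed=1):
    """Superpose independent ON/OFF sources into one per-slot event count."""
    rng = random.Random(seed)
    total = [0] * n_slots
    for _ in range(n_sources):
        for i, state in enumerate(on_off_source(rng, n_slots, alpha)):
            total[i] += state
    return total


def hurst_aggregated_variance(series, block_sizes=(1, 2, 4, 8, 16, 32, 64, 128)):
    """Estimate the Hurst parameter H with the aggregated-variance method.

    For a self-similar series, the variance of block means of size m scales
    like m**(2H - 2), so H = 1 + slope / 2 on a log-log plot.
    Assumes the series is not constant (at least two usable block sizes).
    """
    xs, ys = [], []
    for m in block_sizes:
        n_blocks = len(series) // m
        means = [sum(series[k * m:(k + 1) * m]) / m for k in range(n_blocks)]
        mu = sum(means) / n_blocks
        var = sum((x - mu) ** 2 for x in means) / n_blocks
        if var > 0:
            xs.append(math.log(m))
            ys.append(math.log(var))
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return 1 + slope / 2


if __name__ == "__main__":
    events = synthesize_events()
    print("estimated Hurst parameter:", hurst_aggregated_variance(events))
```

An estimate clearly above 0.5 would suggest long-range dependence in the synthetic stream; an estimate near 0.5 would suggest that it behaves like short-range-dependent traffic. The aggregated-variance estimator is only one of several possible estimators and is used here for its simplicity.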