An analysis of XML compression efficiency
- 13 June 2007
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
XML simplifies data exchange among heterogeneous computers, but it is notoriously verbose and has spawned the development of many XML-specific compressors and binary formats. We present an XML test corpus and a combined efficiency metric integrating compression ratio and execution speed. We use this corpus and linear regression to assess 14 general-purpose and XML-specific compressors relative to the proposed metric. We also identify key factors when selecting a compressor. Our results show, XMill or WBXML may be useful in some instances, but a general-purpose compressor is often the best choice.Keywords
This publication has 10 references indexed in Scilit:
- Comparative Analysis of XML Compression TechnologiesWorld Wide Web, 2005
- AXECHOP: A Grammar-based Compressor for XMLPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- XPRESSPublished by Association for Computing Machinery (ACM) ,2003
- Algorithms and programming models for efficient representation of XML for Internet applicationsPublished by Association for Computing Machinery (ACM) ,2001
- XMillPublished by Association for Computing Machinery (ACM) ,2000
- Arithmetic coding revisitedACM Transactions on Information Systems, 1998
- Implementing the PPM data compression schemeIEEE Transactions on Communications, 1990
- Data Compression Using Adaptive Coding and Partial String MatchingIEEE Transactions on Communications, 1984
- A universal algorithm for sequential data compressionIEEE Transactions on Information Theory, 1977
- Communication in the Presence of NoiseProceedings of the IRE, 1949