XML INDEX COMPRESSION BY DTD SUBTRACTION

Stefan Böttcher, Rita Steinmetz, Niklas Klein

2007

Abstract

When XML is used as a format for exchanging large amounts of data, or even for data streams, its verbosity is one of the bottlenecks. Compressing XML data seems to be a way out, but for many applications it is essential that the compressed result can still be queried efficiently. Furthermore, efficient path query evaluation calls for an index, which usually requires an additional data structure. To address this, we have developed a compression technique that uses structure information found in the DTD to perform a structure-preserving compression of XML data, and that also compresses an index allowing efficient search in the compressed data. Our evaluation shows that compression factors close to those of gzip are possible, while the structural part of XML files can be compressed even better.
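The general idea of removing DTD-implied structure can be illustrated with a toy sketch. This is not the authors' algorithm, only a minimal illustration under stated assumptions: the `DTD` dictionary, the tuple-based tree representation, and the `encode`/`decode` helpers are all hypothetical, content models are restricted to ordered sequences with occurrence markers `1`, `?` and `*`, and text content is ignored. Children that the DTD requires cost nothing; only optional and repeated children emit bits.

```python
# Hypothetical toy DTD: each element maps to an ordered list of
# (child, occurrence); "1" = exactly once, "?" = optional, "*" = repeated.
DTD = {
    "library": [("book", "*")],
    "book": [("title", "1"), ("author", "*"), ("isbn", "?")],
    "title": [], "author": [], "isbn": [],
}

def encode(node, out):
    """Append to `out` only the structure bits the DTD cannot predict."""
    name, children = node
    pos = 0
    for child, occ in DTD[name]:
        if occ == "1":
            # Required child: its presence is implied, so nothing is stored.
            encode(children[pos], out)
            pos += 1
        elif occ == "?":
            present = pos < len(children) and children[pos][0] == child
            out.append(1 if present else 0)
            if present:
                encode(children[pos], out)
                pos += 1
        else:  # "*": one 1-bit per occurrence, then a terminating 0-bit
            while pos < len(children) and children[pos][0] == child:
                out.append(1)
                encode(children[pos], out)
                pos += 1
            out.append(0)
    return out

def decode(name, bits):
    """Rebuild the element tree for `name` from an iterator of bits."""
    children = []
    for child, occ in DTD[name]:
        if occ == "1":
            children.append(decode(child, bits))
        elif occ == "?":
            if next(bits):
                children.append(decode(child, bits))
        else:
            while next(bits):
                children.append(decode(child, bits))
    return (name, children)

# A document is a nested (name, children) tuple; text is left out.
doc = ("library", [
    ("book", [("title", []), ("author", []), ("author", []), ("isbn", [])]),
    ("book", [("title", [])]),
])
bits = encode(doc, [])
print(bits)  # [1, 1, 1, 0, 1, 1, 0, 0, 0]
assert decode("library", iter(bits)) == doc
```

In this toy setting, nine bits are enough to recover the nesting of nine elements, because the DTD supplies every element name and every required child. How the paper handles choices, attributes, text, and the compressed index is not shown here.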

Paper Citation


in Harvard Style

Böttcher S., Steinmetz R. and Klein N. (2007). XML INDEX COMPRESSION BY DTD SUBTRACTION. In Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-972-8865-88-7, pages 86-94. DOI: 10.5220/0002365900860094

in Bibtex Style

@conference{iceis07,
author={Stefan Böttcher and Rita Steinmetz and Niklas Klein},
title={XML INDEX COMPRESSION BY DTD SUBTRACTION},
booktitle={Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2007},
pages={86-94},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002365900860094},
isbn={978-972-8865-88-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Ninth International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - XML INDEX COMPRESSION BY DTD SUBTRACTION
SN - 978-972-8865-88-7
AU - Böttcher S.
AU - Steinmetz R.
AU - Klein N.
PY - 2007
SP - 86
EP - 94
DO - 10.5220/0002365900860094