PAPER DIGEST
Most Influential CIKM 2003 Paper · 2026-03 edition

XML Parsing: A Threat To Database Performance

Matthias Nicola; Jasmi John

Venue: ACM Conference on Information and Knowledge Management (CIKM) 2003
Recognition: Most Influential CIKM 2003 Paper (Rank No. 11)
Edition: 2026-03
Impact factor: 5
Certificate ID: 37b5adcc782e73f8

Abstract

XML parsing is generally known to have poor performance characteristics relative to transactional database processing. Yet, its potentially fatal impact on overall database performance is being underestimated. We report real-world database applications where XML parsing performance is a key obstacle to a successful XML deployment. There is a considerable share of XML database applications which are prone to fail at an early and simple road block: XML parsing. We analyze XML parsing performance and quantify the extra overhead of DTD and schema validation. Comparison with relational database performance shows that the desired response times and transaction rates over XML data cannot be achieved without major improvements in XML parsing technology. Thus, we identify research topics which are most promising for XML parser performance in database systems.
