PAPER DIGEST
Most Influential SIGMOD 2004 Paper · 2026-03 edition

Robust Query Processing Through Progressive Optimization

Volker Markl, Vijayshankar Raman, David Simmen, Guy Lohman, Hamid Pirahesh, Miso Cilimdzic

Venue
ACM SIGMOD Conference (SIGMOD) 2004
Recognition
Most Influential SIGMOD 2004 Paper (Rank No. 14)
Edition
2026-03
Impact factor
6
Certificate ID
be27b94fddf9cfbe

Abstract

Virtually every commercial query optimizer chooses the best plan for a query using a cost model that relies heavily on accurate cardinality estimation. Cardinality estimation errors can occur due to the use of inaccurate statistics, invalid assumptions about attribute independence, parameter markers, and so on. Cardinality estimation errors may cause the optimizer to choose a sub-optimal plan. We present an approach to query processing that is extremely robust because it is able to detect and recover from cardinality estimation errors. We call this approach "progressive query optimization" (POP). POP validates cardinality estimates against actual values as measured during query execution. If there is significant disagreement between estimated and actual values, execution might be stopped and re-optimization might occur. Oscillation between optimization and execution steps can occur any number of times. A re-optimization step can exploit both the actual cardinality and partial results, computed during a previous execution step. Checkpoint operators (CHECK) validate the optimizer's cardinality estimates against actual cardinalities. Each CHECK has a <i>condition</i> that indicates the cardinality bounds within which a plan is valid. We compute this <i>validity range</i> through a novel <i>sensitivity analysis</i> of query plan operators. If the CHECK condition is violated, CHECK triggers <i>re-optimization.</i> POP has been prototyped in a leading commercial DBMS. An experimental evaluation of POP using TPC-H queries illustrates the robustness POP adds to query processing, while incurring only negligible overhead. A case-study applying POP to a real-world database and workload shows the potential of POP, accelerating complex OLAP queries by almost two orders of magnitude.

Download PDF certificate