PAPER DIGEST
Most Influential SIGMOD 2001 Paper · 2026-03 edition

STHoles: A Multidimensional Workload-aware Histogram

Nicolas Bruno; Surajit Chaudhuri; Luis Gravano

Venue
ACM SIGMOD Conference (SIGMOD) 2001
Recognition
Most Influential SIGMOD 2001 Paper (Rank No. 6)
Edition
2026-03
Impact factor
6
Certificate ID
7f287adb7636b72f

Abstract

Attributes of a relation are not typically independent. Multidimensional histograms can be an effective tool for accurate multiattribute query selectivity estimation. In this paper, we introduce <i>STHoles</i>, a “workload-aware” histogram that allows bucket nesting to capture data regions with reasonably uniform tuple density. <i>STHoles</i> histograms are built without examining the data sets, but rather by just analyzing query results. Buckets are allocated where needed the most as indicated by the workload, which leads to accurate query selectivity estimations. Our extensive experiments demonstrate that <i>STHoles</i> histograms consistently produce good selectivity estimates across synthetic and real-world data sets and across query workloads, and, in many cases, outperform the best multidimensional histogram techniques that require access to and processing of the full data sets during histogram construction.

Download PDF certificate