PAPER DIGEST
Most Influential SIGMOD 2014 Paper · 2026-03 edition

Druid: A Real-time Analytical Data Store

Fangjin Yang, Eric Tschetter, Xavier Léauté, Nelson Ray, Gian Merlino, Deep Ganguli

Venue
ACM SIGMOD Conference (SIGMOD) 2014
Recognition
Most Influential SIGMOD 2014 Paper (Rank No. 14)
Edition
2026-03
Impact factor
5
Certificate ID
c9b47da8c388cf0c

Abstract

Druid is an open source data store designed for real-time exploratory analytics on large data sets. The system combines a column-oriented storage layout, a distributed, shared-nothing architecture, and an advanced indexing structure to allow for the arbitrary exploration of billion-row tables with sub-second latencies. In this paper, we describe Druid's architecture, and detail how it supports fast aggregations, flexible filters, and low latency data ingestion.

Download PDF certificate