PAPER DIGEST
Most Influential ICDE 2025 Paper · 2026-03 edition

BLEND: A Unified Data Discovery System

Mahdi Esmailoghli; Christoph Schnell; Renée J. Miller; Ziawasch Abedjan

Venue
IEEE International Conference on Data Engineering (ICDE) 2025
Recognition
Most Influential ICDE 2025 Paper (Rank No. 6)
Edition
2026-03
Impact factor
3
Certificate ID
9ec9c988c6a1ced9

Abstract

Most research on data discovery has so far focused on improving individual discovery operators such as join, correlation, or union discovery. However, in practice, a combination of these techniques and their corresponding indexes may be necessary to support arbitrary discovery tasks. We propose BLEND, a comprehensive data discovery system that supports existing operators and enables their flexible pipelining. BLEND is based on a set of lower-level operators that serve as fundamental building blocks for more complex and sophisticated user tasks. To reduce the execution runtime of discovery pipelines, we propose a unified index structure and a rule- and cost-based optimizer that rewrites SQL statements into low-level operators when possible. We show the superior flexibility and efficiency of our system compared to ad-hoc discovery pipelines and stand-alone solutions.

Download PDF certificate