PAPER DIGEST
Most Influential SIGMOD 2019 Paper · 2026-03 edition

CECI: Compact Embedding Cluster Index For Scalable Subgraph Matching

Bibek Bhattarai; Hang Liu; H. Howie Huang

Venue
ACM SIGMOD Conference (SIGMOD) 2019
Recognition
Most Influential SIGMOD 2019 Paper (Rank No. 10)
Edition
2026-03
Impact factor
4
Certificate ID
021af1b4e9c576ee

Abstract

Subgraph matching finds all distinct isomorphic embeddings of a query graph on a data graph. For large graphs, current solutions face the scalability challenge due to expensive joins, excessive false candidates, and workload imbalance. In this paper, we propose a novel framework for subgraph listing based on Compact Embedding Cluster Index (\idx), which divides the data graph into multiple embedding clusters for parallel processing. The \sub has three unique techniques: utilizing the BFS-based filtering and reverse-BFS-based refinement to prune the unpromising candidates early on, replacing the edge verification with set intersection to speed up the candidate verification, and using search cardinality based cost estimation for detecting and dividing large embedding clusters in advance. The experiments performed on several real and synthetic datasets show that the \sub outperforms state-of-the-art solutions on average by 20.4� for listing all embeddings and by 2.6� for enumerating the first 1,024 embeddings.

Download PDF certificate