PAPER DIGEST
Most Influential CIKM 2010 Paper · 2026-03 edition

Fast And Accurate Estimation Of Shortest Paths In Large Graphs

Andrey Gubichev; Srikanta Bedathur; Stephan Seufert; Gerhard Weikum

Venue: ACM Conference on Information and Knowledge Management (CIKM) 2010
Recognition: Most Influential CIKM 2010 Paper (Rank No. 13)
Edition: 2026-03
Impact factor: 4
Certificate ID: c6fd932bde331097

Abstract

Computing shortest paths between two given nodes is a fundamental operation over graphs, but is known to be nontrivial over large disk-resident instances of graph data. While a number of techniques exist for answering reachability queries and approximating node distances efficiently, determining actual shortest paths (i.e., the sequence of nodes involved) is often neglected. However, in applications arising in massive online social networks, biological networks, and knowledge graphs, it is often essential to find many, if not all, shortest paths between two given nodes. In this paper, we address this problem and present a scalable sketch-based index structure that not only supports estimation of node distances, but also computes the corresponding shortest paths themselves. Generating the actual path information allows for further improvements to the estimation accuracy of distances (and paths), leading to near-exact shortest-path approximations in real-world graphs. We evaluate our techniques, implemented within a fully functional RDF graph database system, over large real-world social and biological networks of sizes ranging from tens of thousands to millions of nodes and edges. Experiments on several datasets show that we can achieve query response times that provide a speedup of several orders of magnitude over traditional path computations while keeping the estimation errors between 0% and 1% on average.
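
The abstract does not describe the internals of the index, so the following Python sketch is only a rough illustration of the general idea of answering path queries from precomputed information, not the authors' method. It assumes a simplified landmark scheme on an unweighted, undirected in-memory graph: a breadth-first search tree is stored for each chosen landmark, and a query joins the two tree paths at the first node they share. All names (`bfs_parents`, `build_index`, `estimate_path`) and the example graph are hypothetical.

```python
from collections import deque


def bfs_parents(graph, root):
    """Return {node: parent} for a BFS tree rooted at `root` (root maps to None)."""
    parents = {root: None}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return parents


def build_index(graph, landmarks):
    """Precompute one BFS tree per landmark (assumed index layout, for illustration)."""
    return {landmark: bfs_parents(graph, landmark) for landmark in landmarks}


def path_to_root(parents, node):
    """Follow parent pointers from `node` up to the tree root."""
    path = []
    while node is not None:
        path.append(node)
        node = parents[node]
    return path


def estimate_path(index, source, target):
    """Return an approximate shortest path as a list of nodes, or None.

    The result is always a valid path in the graph, so its length is an
    upper bound on the true distance.
    """
    best = None
    for parents in index.values():
        if source not in parents or target not in parents:
            continue  # this landmark cannot reach both endpoints
        up = path_to_root(parents, source)
        down = path_to_root(parents, target)
        position = {node: i for i, node in enumerate(up)}
        # Join the two tree paths at the first node they have in common;
        # the root is on both paths, so a common node always exists.
        for j, node in enumerate(down):
            if node in position:
                candidate = up[:position[node] + 1] + down[:j][::-1]
                break
        if best is None or len(candidate) < len(best):
            best = candidate
    return best


# Hypothetical example: a cycle of five nodes.
graph = {
    "a": ["b", "e"],
    "b": ["a", "c"],
    "c": ["b", "d"],
    "d": ["c", "e"],
    "e": ["d", "a"],
}
```

With a single landmark the estimate can be far from optimal (routing from `a` to `e` through landmark `c` walks the long way around the cycle), and adding landmarks can only shorten the returned path. This trade-off between index size and accuracy is the kind of error the paper's evaluation measures.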
