PAPER DIGEST
Most Influential CIKM 2012 Paper · 2026-03 edition

KORE: Keyphrase Overlap Relatedness For Entity Disambiguation

Johannes Hoffart; Stephan Seufert; Dat Ba Nguyen; Martin Theobald; Gerhard Weikum

Venue
ACM Conference on Information and Knowledge Management (CIKM) 2012
Recognition
Most Influential CIKM 2012 Paper (Rank No. 7)
Edition
2026-03
Impact factor
5
Certificate ID
1b7e785cb7505d80

Abstract

Measuring the semantic relatedness between two entities is the basis for numerous tasks in IR, NLP, and Web-based knowledge extraction. This paper focuses on disambiguating names in a Web or text document by jointly mapping all names onto semantically related entities registered in a knowledge base. To this end, we have developed a novel notion of semantic relatedness between two entities represented as sets of weighted (multi-word) keyphrases, with consideration of partially overlapping phrases. This measure improves the quality of prior link-based models, and also eliminates the need for (usually Wikipedia-centric) explicit interlinkage between entities. Thus, our method is more versatile and can cope with long-tail and newly emerging entities that have few or no links associated with them. For efficiency, we have developed approximation techniques based on min-hash sketches and locality-sensitive hashing. Our experiments on semantic relatedness and on named entity disambiguation demonstrate the superiority of our method compared to state-of-the-art baselines.

Download PDF certificate