PAPER DIGEST
Most Influential SIGIR 2002 Paper · 2026-03 edition

Generic Summarization And Keyphrase Extraction Using Mutual Reinforcement Principle And Sentence Clustering

Hongyuan Zha

Venue
ACM SIGIR Conference (SIGIR) 2002
Recognition
Most Influential SIGIR 2002 Paper (Rank No. 7)
Edition
2026-03
Impact factor
6
Certificate ID
6f985a6d8d89fcda

Abstract

A novel method for <i>simultaneous</i> keyphrase extraction and generic text summarization is proposed by modeling text documents as weighted undirected and weighted bipartite graphs. Spectral graph clustering algorithms are useed for partitioning sentences of the documents into topical groups with sentence link priors being exploited to enhance clustering quality. Within each topical group, saliency scores for keyphrases and sentences are generated based on a mutual reinforcement principle. The keyphrases and sentences are then ranked according to their saliency scores and selected for inclusion in the top keyphrase list and summaries of the document. The idea of building a hierarchy of summaries for documents capturing different levels of granularity is also briefly discussed. Our method is illustrated using several examples from news articles, news broadcast transcripts and web documents.

Download PDF certificate