PAPER DIGEST
Most Influential SIGIR 1993 Paper · 2026-03 edition

Cluster Analysis For Hypertext Systems

Rodrigo A. Botafogo

Venue
ACM SIGIR Conference (SIGIR) 1993
Recognition
Most Influential SIGIR 1993 Paper (Rank No. 12)
Edition
2026-03
Impact factor
5
Certificate ID
420a2a6e2112a493

Abstract

Identifying nodes of information that are highly related has many applications in any information systems, and in particular in hypertext systems. In this paper we present a technique to identify “natural” clusters in a hypertext. A natural cluster is a cluster that is not arbitrary, but depends only on intrinsic properties of the hypertext. In our case, the property we will use to identify the clusters is the number of independent paths between nodes. Using the graph theoretic definition of <i>k</i>-edge-components we present an aggregation technique to cluster the nodes. We then use this techniques to cluster three medium sized hypertexts that were developed by different authors for different users, using different methodologies. We also show how to use clustering to improve data display, browsing and retrieval.

Download PDF certificate