PAPER DIGEST
Most Influential SIGIR 2013 Paper · 2026-03 edition

A General Evaluation Measure For Document Organization Tasks

Enrique Amigó; Julio Gonzalo; Felisa Verdejo

Venue: ACM SIGIR Conference (SIGIR) 2013
Recognition: Most Influential SIGIR 2013 Paper (Rank No. 13)
Edition: 2026-03
Impact factor: 4
Certificate ID: dcf0e8ad9128bc81

Abstract

A number of key Information Access tasks -- Document Retrieval, Clustering, Filtering, and their combinations -- can be seen as instances of a generic document organization problem that establishes priority and relatedness relationships between documents (in other words, a problem of forming and ranking clusters). As far as we know, no analysis has yet been made of the evaluation of these tasks from a global perspective. In this paper we propose two complementary evaluation measures -- Reliability and Sensitivity -- for the generic Document Organization task, which are derived from a proposed set of formal constraints (properties that any suitable measure must satisfy). In addition to being the first measures that can be applied to any mixture of ranking, clustering and filtering tasks, Reliability and Sensitivity satisfy more formal constraints than previously existing evaluation metrics for each of the subsumed tasks. Besides their formal properties, their most salient feature from an empirical point of view is their strictness: a high score according to the harmonic mean of Reliability and Sensitivity ensures a high score with any of the most popular evaluation metrics in all the Document Retrieval, Clustering and Filtering datasets used in our experiments.
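The abstract combines the two measures through their harmonic mean. A minimal sketch of that combination step is shown here, assuming Reliability and Sensitivity have already been computed as scores in [0, 1]; the function name harmonic_mean and the zero-handling convention are illustrative assumptions, not taken from the paper, and the sketch does not compute Reliability or Sensitivity themselves.

```python
def harmonic_mean(reliability: float, sensitivity: float) -> float:
    """Combine two scores in [0, 1] with the harmonic mean.

    Hypothetical helper for illustration only. Returning 0.0 when both
    inputs are zero is an assumed convention.
    """
    if not (0.0 <= reliability <= 1.0 and 0.0 <= sensitivity <= 1.0):
        raise ValueError("scores must lie in [0, 1]")
    total = reliability + sensitivity
    if total == 0.0:
        return 0.0
    return 2.0 * reliability * sensitivity / total
```

Because the harmonic mean never exceeds twice the smaller of its two inputs, a combined score can only be high when both Reliability and Sensitivity are high. For example, inputs of 0.9 and 0.1 give a combined score of about 0.18.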
