PAPER DIGEST
Most Influential SIGIR 1998 Paper · 2026-03 edition

How Reliable Are The Results Of Large-scale Information Retrieval Experiments?

Justin Zobel

Venue
ACM SIGIR Conference (SIGIR) 1998
Recognition
Most Influential SIGIR 1998 Paper (Rank No. 10)
Edition
2026-03
Impact factor
7
Certificate ID
4379fd31574bf1f0

Abstract

Two stages in measurement of techniques for information retrieval are gathering of documents for relevance assessment and use of the assessments to numerically evaluate effectiveness. We consider both of these stages in the context of the TREC experiments, to determine whether they lead to measurements that are trustworthy and fair. Our detailed empirical investigation of the TREC results shows that the measured relative performance of systems appears to be reliable, but that recall is overestimated: it is likely that many relevant documents have not been found. We propose a new pooling strategy that can significantly in- crease the number of relevant documents found for given effort, without compromising fairness.

Download PDF certificate