PAPER DIGEST
Most Influential SIGIR 2012 Paper · 2026-03 edition

Time-based Calibration Of Effectiveness Measures

Mark D. Smucker; Charles L.A. Clarke

Venue
ACM SIGIR Conference (SIGIR) 2012
Recognition
Most Influential SIGIR 2012 Paper (Rank No. 9)
Edition
2026-03
Impact factor
5
Certificate ID
e1b966f3f027cf0d

Abstract

Many current effectiveness measures incorporate simplifying assumptions about user behavior. These assumptions prevent the measures from reflecting aspects of the search process that directly impact the quality of retrieval results as experienced by the user. In particular, these measures implicitly model users as working down a list of retrieval results, spending equal time assessing each document. In reality, even a careful user, intending to identify as much relevant material as possible, must spend longer on some documents than on others. Aspects such as document length, duplicates and summaries all influence the time required. In this paper, we introduce a time-biased gain measure, which explicitly accommodates such aspects of the search process. By conducting an appropriate user study, we calibrate and validate the measure against the TREC 2005 Robust Track test collection. We examine properties of the measure, contrasting it to traditional effectiveness measures, and exploring its extension to other aspects and environments. As its primary benefit, the measure allows us to evaluate system performance in human terms, while maintaining the simplicity and repeatability of system-oriented tests. Overall, we aim to achieve a clearer connection between user-oriented studies and system-oriented tests, allowing us to better transfer insights and outcomes from one to the other.

Download PDF certificate