PAPER DIGEST
Most Influential CIKM 2006 Paper · 2026-03 edition

Estimating Average Precision With Incomplete And Imperfect Judgments

Emine Yilmaz; Javed A. Aslam

Venue
ACM Conference on Information and Knowledge Management (CIKM) 2006
Recognition
Most Influential CIKM 2006 Paper (Rank No. 2)
Edition
2026-03
Impact factor
6
Certificate ID
f02c16b959927e7d

Abstract

We consider the problem of evaluating retrieval systems using incomplete judgment information. Buckley and Voorhees recently demonstrated that retrieval systems can be efficiently and effectively evaluated using incomplete judgments via the bpref measure [6]. When relevance judgments are complete, the value of bpref is an approximation to the value of average precision using complete judgments. However, when relevance judgments are incomplete, the value of bpref deviates from this value, though it continues to <i>rank</i> systems in a manner similar to average precision evaluated with a complete judgment set. In this work, we propose three evaluation measures that (1) are approximations to average precision even when the relevance judgments are incomplete and (2) are more robust to incomplete or imperfect relevance judgments than bpref. The proposed estimates of average precision are simple and accurate, and we demonstrate the utility of these estimates using TREC data.

Download PDF certificate