PAPER DIGEST
Most Influential CIKM 2008 Paper · 2026-03 edition

Proactive Learning: Cost-sensitive Active Learning With Multiple Imperfect Oracles

Pinar Donmez; Jaime G. Carbonell

Venue
ACM Conference on Information and Knowledge Management (CIKM) 2008
Recognition
Most Influential CIKM 2008 Paper (Rank No. 9)
Edition
2026-03
Impact factor
5
Certificate ID
ee344dbb93d9ea44

Abstract

Proactive learning is a generalization of active learning designed to relax unrealistic assumptions and thereby reach practical applications. Active learning seeks to select the most informative unlabeled instances and ask an omniscient oracle for their labels, so as to retrain the learning algorithm maximizing accuracy. However, the oracle is assumed to be infallible (never wrong), indefatigable (always answers), individual (only one oracle), and insensitive to costs (always free or always charges the same). Proactive learning relaxes all four of these assumptions, relying on a decision-theoretic approach to jointly select the optimal oracle and instance, by casting the problem as a utility optimization problem subject to a budget constraint. Results on multi-oracle optimization over several data sets demonstrate the superiority of our approach over the single-imperfect-oracle baselines in most cases.

Download PDF certificate