PAPER DIGEST
Most Influential SIGIR 2004 Paper · 2026-03 edition

Discriminative Models For Information Retrieval

Ramesh Nallapati

Venue
ACM SIGIR Conference (SIGIR) 2004
Recognition
Most Influential SIGIR 2004 Paper (Rank No. 11)
Edition
2026-03
Impact factor
6
Certificate ID
4f2883a22248ff38

Abstract

Discriminative models have been preferred over generative models in many machine learning problems in the recent past owing to some of their attractive theoretical properties. In this paper, we explore the applicability of discriminative classifiers for IR. We have compared the performance of two popular discriminative models, namely the maximum entropy model and support vector machines with that of language modeling, the state-of-the-art generative model for IR. Our experiments on ad-hoc retrieval indicate that although maximum entropy is significantly worse than language models, support vector machines are on par with language models. We argue that the main reason to prefer SVMs over language models is their ability to learn arbitrary features automatically as demonstrated by our experiments on the home-page finding task of TREC-10.

Download PDF certificate