Probabilistic Document Indexing From Relevance Feedback Data

N. Fuhr; C. Buckley

Venue: ACM SIGIR Conference (SIGIR) 1990
Recognition: Most Influential SIGIR 1990 Paper (Rank No. 11)
Edition: 2026-03
Impact factor: 3
Certificate ID: d5b6aa4fc4aa90c1

Abstract

Based on the binary independence indexing model, we apply three new concepts for probabilistic document indexing from relevance feedback data: <ul> <li>Abstraction from specific terms and documents, which overcomes the restriction of limited relevance information for parameter estimation.</li> <li>Flexibility of the representation, which allows the integration of new text analysis and knowledge-based methods in our approach as well as the consideration of more complex document structures or different types of terms (e.g. single words and noun phrases).</li> <li>Probabilistic learning or classification methods for the estimation of the indexing weights making better use of the available relevance information.</li> </ul> We give experimental results for five test collections which show improvements over other indexing methods.

Download PDF certificate