PAPER DIGEST
Most Influential SIGIR 2004 Paper · 2026-03 edition

Text Classification And Named Entities For New Event Detection

Giridhar Kumaran; James Allan

Venue
ACM SIGIR Conference (SIGIR) 2004
Recognition
Most Influential SIGIR 2004 Paper (Rank No. 6)
Edition
2026-03
Impact factor
6
Certificate ID
97eb2d5fa5e5a28c

Abstract

New Event Detection is a challenging task that still offers scope for great improvement after years of effort. In this paper we show how performance on New Event Detection (NED) can be improved by the use of text classification techniques as well as by using named entities in a new way. We explore modifications to the document representation in a vector space-based NED system. We also show that addressing named entities preferentially is useful only in certain situations. A combination of all the above results in a multi-stage NED system that performs much better than baseline single-stage NED systems.

Download PDF certificate