PAPER DIGEST
Most Influential SIGIR 2000 Paper · 2026-03 edition

Automatic Generation Of Overview Timelines

Russell Swan; James Allan

Venue
ACM SIGIR Conference (SIGIR) 2000
Recognition
Most Influential SIGIR 2000 Paper (Rank No. 9)
Edition
2026-03
Impact factor
5
Certificate ID
9f706628e6d9f2e9

Abstract

We present a statistical model of feature occurrence over time, and develop tests based on classical hypothesis testing for significance of term appearance on a given date. Using additional classical hypothesis testing we are able to combine these terms to generate “topics” as defined by the Topic Detection and Tracking study. The groupings of terms obtained can be used to automatically generate an interactive timeline displaying the major events and topics covered by the corpus. To test the validity of our technique we extracted a large number of these topics from a test corpus and had human evaluators judge how well the selected features captured the gist of the topics, and how they overlapped with a set of known topics from the corpus. The resulting topics were highly rated by evaluators who compared them to known topics.

Download PDF certificate