PAPER DIGEST
Most Influential ICML 2010 Paper · 2026-03 edition

Distance Dependent Chinese Restaurant Processes

David Blei; Peter Frazier

Venue
International Conference on Machine Learning (ICML) 2010
Recognition
Most Influential ICML 2010 Paper (Rank No. 12)
Edition
2026-03
Impact factor
6
Certificate ID
b6c08204a602a23f

Abstract

We develop the distance dependent Chinese restaurant process (CRP), a flexible class of distributions over partitions that allows for non-exchangeability. This class can be used to model dependencies between data in infinite clustering models, including dependencies across time or space. We examine the properties of the distance dependent CRP, discuss its connections to Bayesian nonparametric mixture models, and derive a Gibbs sampler for both observed and mixture settings. We study its performance with time-dependent models and three text corpora. We show that relaxing the assumption of exchangeability with distance dependent CRPs can provide a better fit to sequential data. We also show its alternative formulation of the traditional CRP leads to a faster-mixing Gibbs sampling algorithm than the one based on the original formulation.

Download PDF certificate