PAPER DIGEST
Most Influential SIGIR 2016 Paper · 2026-03 edition

Composite Correlation Quantization For Efficient Multimodal Retrieval

Mingsheng Long; Yue Cao; Jianmin Wang; Philip S. Yu

Venue: ACM SIGIR Conference (SIGIR) 2016
Recognition: Most Influential SIGIR 2016 Paper (Rank No. 11)
Edition: 2026-03
Impact factor: 5
Certificate ID: 04613de8008e5aed

Abstract

Efficient similarity retrieval from large-scale multimodal databases is pervasive in modern search engines and social networks. To support queries across content modalities, the system should enable cross-modal correlation and computation-efficient indexing. While hashing methods have shown great potential in achieving this goal, current attempts generally fail to learn isomorphic hash codes in a seamless scheme; that is, they embed multiple modalities in a continuous isomorphic space and separately threshold the embeddings into binary codes, which incurs a substantial loss of retrieval accuracy. In this paper, we approach seamless multimodal hashing by proposing a novel Composite Correlation Quantization (CCQ) model. Specifically, CCQ jointly finds correlation-maximal mappings that transform different modalities into an isomorphic latent space, and learns composite quantizers that convert the isomorphic latent features into compact binary codes. An optimization framework is devised to preserve both intra-modal similarity and inter-modal correlation by minimizing both reconstruction and quantization errors, and it can be trained from both paired and partially paired data in linear time. A comprehensive set of experiments clearly shows the superior effectiveness and efficiency of CCQ against state-of-the-art hashing methods for both unimodal and cross-modal retrieval.
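
The two ingredients named in the abstract, mapping each modality into a shared latent space and then representing a latent vector as a sum of codewords drawn from several codebooks, can be illustrated with a toy sketch. This is not the authors' implementation: the projection matrices W_IMG and W_TXT, the two codebooks, and the greedy residual encoder are all hypothetical values and simplifications chosen for illustration, whereas CCQ learns the mappings and quantizers jointly through its own optimization.

```python
from typing import List, Sequence

Vector = List[float]


def project(W: Sequence[Sequence[float]], x: Sequence[float]) -> Vector:
    """Linear map z = W x into the shared latent space."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def encode(z: Sequence[float], codebooks) -> List[int]:
    """Greedy stand-in for composite encoding (an assumption, not CCQ's solver):
    from each codebook in turn, pick the codeword nearest the current residual."""
    residual = list(z)
    code = []
    for book in codebooks:
        k = min(range(len(book)), key=lambda j: sq_dist(residual, book[j]))
        code.append(k)
        residual = [r - c for r, c in zip(residual, book[k])]
    return code


def decode(code: Sequence[int], codebooks) -> Vector:
    """Reconstruct the latent vector as the sum of the selected codewords."""
    out = [0.0] * len(codebooks[0][0])
    for k, book in zip(code, codebooks):
        out = [o + c for o, c in zip(out, book[k])]
    return out


# Hypothetical, hand-picked parameters (a real system would learn these).
W_IMG = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]  # 3-D image feature -> 2-D latent
W_TXT = [[1.0, 0.0], [0.0, 1.0]]            # 2-D text feature  -> 2-D latent
CODEBOOKS = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],      # coarse codewords
    [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [0.25, 0.25]],  # fine codewords
]

# Database side: an image is projected, then stored only as codeword indices.
image_code = encode(project(W_IMG, [1.0, 0.2, 5.0]), CODEBOOKS)

# Query side: a text query is projected and compared against the
# reconstruction of the stored code (cross-modal, asymmetric distance).
query_latent = project(W_TXT, [0.9, 0.3])
distance = sq_dist(query_latent, decode(image_code, CODEBOOKS))

print(image_code)  # [1, 2]
print(distance)
```

With two codebooks of four codewords each, every database item is stored as two 2-bit indices, which is where the compact binary code comes from in this toy setting.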
