PAPER DIGEST
Most Influential SIGMOD 2013 Paper · 2026-03 edition

Inter-media Hashing For Large-scale Retrieval From Heterogeneous Data Sources

Jingkuan Song; Yang Yang; Yi Yang; Zi Huang; Heng Tao Shen

Venue
ACM SIGMOD Conference (SIGMOD) 2013
Recognition
Most Influential SIGMOD 2013 Paper (Rank No. 1)
Edition
2026-03
Impact factor
7
Certificate ID
b2fe0714f2476b37

Abstract

In this paper, we present a new multimedia retrieval paradigm to innovate large-scale search of heterogenous multimedia data. It is able to return results of different media types from heterogeneous data sources, e.g., using a query image to retrieve relevant text documents or images from different data sources. This utilizes the widely available data from different sources and caters for the current users' demand of receiving a result list simultaneously containing multiple types of data to obtain a comprehensive understanding of the query's results. To enable large-scale inter-media retrieval, we propose a novel <i>inter-media hashing</i> (IMH) model to explore the correlations among multiple media types from different data sources and tackle the scalability issue. To this end, multimedia data from heterogeneous data sources are transformed into a common Hamming space, in which fast search can be easily implemented by XOR and bit-count operations. Furthermore, we integrate a linear regression model to learn hashing functions so that the hash codes for new data points can be efficiently generated. Experiments conducted on real-world large-scale multimedia datasets demonstrate the superiority of our proposed method compared with state-of-the-art techniques.

Download PDF certificate