ECCV 2012 - LNCS 7572-7578 and 7583-7585

Efficient Mining of Repetitions in Large-Scale TV Streams with Product Quantization Hashing

Jiangbo Yuan¹, Guillaume Gravier², Sébastien Campion², Xiuwen Liu¹, and Hervé Jégou²

¹Florida State University, Tallahassee, FL 32306, USA

²INRIA-IRISA, 35042, Rennes Cedex, France

Abstract. Duplicates or near-duplicates mining in video sequences is of broad interest to many multimedia applications. How to design an effective and scalable system, however, is still a challenge to the community. In this paper, we present a method to detect recurrent sequences in large-scale TV streams in an unsupervised manner and with little a priori knowledge on the content. The method relies on a product k-means quantizer that efficiently produces hash keys adapted to the data distribution for frame descriptors. This hashing technique combined with a temporal consistency check allows the detection of meaningful repetitions in TV streams. When considering all frames (about 47 millions) of a 22-day long TV broadcast, our system detects all repetitions in 15 minutes, excluding the computation of the frame descriptors. Experimental results show that our approach is a promising way to deal with very large video databases.

LNCS 7583, p. 271 ff.

Full article in PDF | BibTeX