Article type: Research Article
Authors: Mena, Francisco [a, *] | Ñanculef, Ricardo [a] | Valle, Carlos [b]
Affiliations: [a] Department of Informatics, Federico Santa María University, RM, Chile | [b] Department of Computer Science and Informatics, Playa Ancha University, VA, Chile
Correspondence: [*] Corresponding author: Francisco Mena, Department of Informatics, Federico Santa María University, RM, Chile. E-mail: francisco.menat@usm.cl.
Abstract: Due to the rapid increase in the amount of data generated in many fields of science and engineering, information retrieval methods tailored to large-scale datasets have become increasingly important in recent years. Semantic hashing is an emerging technique for this purpose that works on the idea of representing complex data objects, like images and text, using similarity-preserving binary codes that are then used for indexing and search. In this paper, we investigate a hashing algorithm that uses a deep variational auto-encoder to learn and predict the codes. Unlike previous approaches of this type, which learn a continuous (Gaussian) representation and then project the embedding to obtain hash codes, our method employs Bernoulli latent variables in both the training and the prediction stage. Constraining the model to use a binary encoding allows us to obtain a more interpretable representation for hashing: each factor in the generative model represents a bit that should help to reconstruct and thus identify the input pattern. Interestingly, we found that the binary constraint does not lead to a loss but rather an increase in search accuracy. We argue that continuous formulations learn a representation that can differ significantly from the code used for search. Minding this gap in the design of the auto-encoder can translate into more accurate retrieval results. Extensive experiments on seven datasets involving image and text data illustrate these findings and demonstrate the advantages of our approach.
Keywords: Hashing, variational autoencoders, deep learning, Gumbel-Softmax distribution, neural information retrieval
DOI: 10.3233/IDA-200013
Journal: Intelligent Data Analysis, vol. 24, no. S1, pp. 141-166, 2020
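The pipeline the abstract describes, relaxed Bernoulli (binary Gumbel-Softmax) sampling during training, hard binary codes at prediction time, and Hamming-distance search over those codes, can be sketched roughly as follows. This is an illustrative NumPy sketch under assumed simplifications, not the authors' implementation; all function names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_binary_concrete(logits, tau=0.5):
    """Relaxed Bernoulli sample (binary Gumbel-Softmax), as used during
    training: add logistic noise to the logits and squash with a
    temperature-scaled sigmoid, giving values in (0, 1)."""
    u = rng.uniform(1e-8, 1 - 1e-8, size=np.shape(logits))
    logistic_noise = np.log(u) - np.log(1.0 - u)
    return 1.0 / (1.0 + np.exp(-(np.asarray(logits) + logistic_noise) / tau))

def to_hash_code(logits):
    """Deterministic binary code at prediction time: threshold each
    Bernoulli logit at zero (probability 0.5)."""
    return (np.asarray(logits) > 0).astype(np.uint8)

def hamming_search(query_code, db_codes, k=3):
    """Rank database items by Hamming distance to the query code and
    return the indices of the k nearest."""
    dists = np.count_nonzero(db_codes != query_code, axis=1)
    return np.argsort(dists, kind="stable")[:k]

# Toy usage: encode a query and search a tiny 3-bit database.
query_logits = np.array([1.2, -0.5, 3.0])
query_code = to_hash_code(query_logits)            # hard bits for search
db_codes = np.array([[1, 0, 1],
                     [0, 0, 0],
                     [1, 1, 1]], dtype=np.uint8)
nearest = hamming_search(query_code, db_codes, k=1)
```

The point the paper makes is visible in the split between `sample_binary_concrete` and `to_hash_code`: a Gaussian encoder would train on a representation quite unlike the thresholded code used at search time, whereas the relaxed-Bernoulli formulation keeps the training-time representation close to the binary code actually indexed.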