Deep Hashing with Hash-Consistent Large Margin Proxy Embeddings

Abstract

Image hash codes are produced by binarizing the embeddings of convolutional neural networks (CNN) trained for either classification or retrieval. While proxy embeddings achieve good performance on both tasks, they are non-trivial to binarize, due to a rotational ambiguity that encourages non-binary embeddings. The use of a fixed set of proxies (weights of the CNN classification layer) is proposed to eliminate this ambiguity, and a procedure to design proxy sets that are nearly optimal for both classification and hashing is introduced. The resulting hash-consistent large margin (HCLM) proxies are shown to encourage saturation of hashing units, thus guaranteeing a small binarization error, while producing highly discriminative hash-codes. A semantic extension (sHCLM), aimed to improve hashing performance in a transfer scenario, is also proposed. Extensive experiments show that sHCLM embeddings achieve significant improvements over state-of-the-art hashing procedures on several small and large datasets, both within and beyond the set of training classes.

Published at: International Journal of Computer Vision (IJCV), 2020.

Paper

Bibtex

@article{MorgadoProxyHashing,
 author = {Morgado, Pedro and Li, Yunsheng and Costa Pereira, Jose and Saberian, Mohammad and Vasconcelos, Nuno},
 journal = {International Journal of Computer Vision},
 title = {Deep Hashing with Hash-Consistent Large Margin Proxy Embeddings},
 year = {2020},
 doi = {10.1007/s11263-020-01362-7},
 isbn = {1573-1405},
 url = {https://doi.org/10.1007/s11263-020-01362-7}
}